VISUAL ANALYTICS ATIF FARID MOHAMMAD CS/SIS/DSBA 2015 INTRODUCTION WHO YOU ARE? WHO I AM? INTRODUCTION TO VISUAL ANALYTICS BOOKS: INFORMATION VISUALIZATION ROBERT SPENCE . MODERN ANALYTICS METHODOLOGIES CHAMBERS & DINSMORE . STATISTICS (FREE BOOK FOR REFERENCE ONLY) . HTTP://ONLINESTATBOOK.COM/ONLINE_STATISTICS_EDUCATION.PDF R FOR EVERYONE (REFERENCE ONLY) LANDER HOWEVER, THIS IS NOT STATISTICS COURSE… SINCE, THERE IS NO VISUAL ANALYTICS COURSE BOOK AVAILABLE… WE WILL WORK TOGETHER WITH AVAILABLE MATERIAL DO I KNOW ALL ? NOT NECESSARY I CAN CERTAINLY ASSIST TO FIND ANSWERS COLLECT & CONNECT THE DOTS Work Work Learning Social Life Work Learning Social Life Shopping Work Learning YOUR INTEREST IS IMPORTANT COMPUTER PROGRAMMING – R/SPSS GETTING DATA STATISTICS CRITICAL THINKING COMMUNICATING YOUR STORIES UNDERGRAD – PROJECT OR RESEARCH PAPER GRAD – RESEARCH PAPER FINAL PROJECT/RESEARCH PAPER SHOULD HAVE SIX (6) MAJOR ASPECTS AND THESE ARE… 1. YOUR INTEREST IS IMPORTANT 2. COMPUTER PROGRAMMING – R/SPSS 3. GETTING DATA 4. STATISTICS 5. CRITICAL THINKING 6. COMMUNICATING YOUR STORIES FINAL PROJECT CAN HAVE ANY OF THE GIVEN: ASTRONOMY FINANCIAL FITBIT – YOUR CHOICE DATA GENOMICS MOVIE RECOMMENDATIONS REAL ESTATE SOCIAL MEDIA ANALYTICS ETC… FINAL PROJECT/RESEARCH PAPER TOPIC IS YOUR CHOICE… ALL OF YOUR EFFORT WILL BE IN TERMS OF RESEARCH PAPER YOU CAN SEND FOR PUBLICATION IT IS YOUR CHOICE… HIGHLY RECOMMENDED HOMEWORK – 30% QUIZZES – 15% ATTENDANCE – 5% MID TERM – 25% FINAL PROJECT – 25% OR RESEARCH PAPER 6 PAGES – 25% MAIN IDEA --- IS --- PERCEPTION AMPLIFICATION OF PERCEPTION WHAT IS ALREADY KNOWN ? WHAT IS NOT KNOWN OR UNKNOWN ? WHAT IS META DATA AND DATA ? ------------ VISUAL PERCEPTION ------------ • HIGH BANDWIDTH • FAST SCREENING OF A LOT OF DATA • PATTERN RECOGNITION • HIGHER-LEVEL COGNITION INTERACTION • DIRECT MANIPULATION • TWO-WAY COMMUNICATION SPATIAL PRE-ATTENTIVE PROCESSING VISION IS A MASSIVELY PARALLEL PROCESSOR DEDICATED TO • DETECT • ANALYZE • RECOGNIZE • REASON WITH VISUAL INPUT WHAT IS DIFFERENT? BLUE DOT IS THE DIFFERENCE LET US LOOK AT LIVE HEAT MAP OF CYBER ATTACKS HTTP://MAP.IPVIKING.COM/ TECHNIQUES AND TECHNOLOGIES • A wide variety of techniques and technologies has been developed and adapted for • Data aggregation • Data manipulation • Data analysis • Data visualization • These techniques and technologies draw from several fields including • Statistics • Computer science • Applied mathematics • Economics. TECHNOLOGIES • Database and Data warehouse • Google File System and MapReduce: Big Table • Hadoop: HBase and MapReduce, open source Apache project, Cassandra: An open source (free) NoSQL • Data warehouse: ETL (extract, transform, and load) tools and business intelligence tools. • Business intelligence (BI): Data Warehouse, Reporting, Real-Time Management Dashboards • Cloud computing: Services, SOA, etc. • Metadata: XML, JSON, BSON • Stream processing • R, SAS and SPSS • Visualization: Tag cloud, History Flow, Tree Map, Heat Map & UnKNOWN ORIGIN OF INFORMATION VISUALIZATION 01/14/2015 SCATTERPLOT AND SCATTERPLOT MATRIX TREE VISUALIZATION(1) Node-Link Diagrams Sunburst Dendrogram TREE VISUALIZATION(2) Circle-packing layouts Treemap NETWORK VISUALIZATION Force-Directed Layout Matrix Views Arc Diagrams PARALLEL COORDINATES STACKED GRAPHS SMART MONEY MAP BIG DATA AND VISUAL ANALYTICS Big Data Environment – Cloudera/Hortonworks/MapR/EMC/Amazon EMR… HBase HBase Database Admin Domain HBase Database Admin Domain Mid-Level Development Domain Impala Java Mid-Level Development Python Hive Scala Pig Latin C Sharp Visual Analytics Mid-Level Development Impala Java Hive Python Pig Latin Scala R/SPSS HBase Visual Analytics Many more QUESTIONS ???