Pacific Visualization Conference

advertisement
Challenges for
Information Visualization Research:
Visual Quality and Data Quantity
Ben Shneiderman
ben@cs.umd.edu
@benbendc
Founding Director (1983-2000), Human-Computer Interaction Lab
Professor, Department of Computer Science
Member, Institute for Advanced Computer Studies
University of Maryland
College Park, MD 20742
Interdisciplinary research community
- Computer Science & Info Studies
- Psych, Socio, Poli Sci & MITH
(www.cs.umd.edu/hcil)
Design Issues
•
•
•
•
•
Input devices & strategies
• Keyboards, pointing devices, voice
• Direct manipulation
• Menus, forms, commands
Output devices & formats
• Screens, windows, color, sound
• Text, tables, graphics
• Instructions, messages, help
Collaboration & Social Media
Help, tutorials, training
• Visualization
Search
www.awl.com/DTUI
Fifth Edition: 2010
Information Visualization
•
Visual bandwidth is enormous
• Human perceptual skills are remarkable
• Trend, cluster, gap, outlier...
• Color, size, shape, proximity...
•
Two Challenges
• Visual Quality
• Data Quantity
Business takes action
•
•
•
•
•
•
•
•
•
General Dynamics buys MayaViz
Agilent buys GeneSpring
Google buys Gapminder
Oracle buys Hyperion
Microsoft buys Proclarity
InfoBuilders buys Advizor Solutions
SAP buys (Business Objects buys
Xcelsius & Inxight & Crystal Reports )
IBM buys (Cognos buys Celequest) & ILOG
TIBCO buys Spotfire
Spotfire: Retinol’s role in embryos & vision
Spotfire: Sales data, filtered by purchases
Spotfire: DC natality data
http://registration.spotfire.com/eval/default_edu.asp
10M - 100M pixels
Large displays
for single or multiple users
100M-pixels & more
1M-pixels & less
Small mobile devices
Information Visualization: Mantra
•
•
•
•
•
•
•
•
•
•
Overview, zoom & filter, details-on-demand
Overview, zoom & filter, details-on-demand
Overview, zoom & filter, details-on-demand
Overview, zoom & filter, details-on-demand
Overview, zoom & filter, details-on-demand
Overview, zoom & filter, details-on-demand
Overview, zoom & filter, details-on-demand
Overview, zoom & filter, details-on-demand
Overview, zoom & filter, details-on-demand
Overview, zoom & filter, details-on-demand
SciViz .
•
•
•
1-D Linear
2-D Map
3-D World
Document Lens, SeeSoft, Info Mural
•
•
•
•
Multi-Var
Temporal
Tree
Network
Spotfire, Tableau, GGobi, TableLens, ParCoords,
InfoViz
Information Visualization: Data Types
GIS, ArcView, PageMaker, Medical imagery
CAD, Medical, Molecules, Architecture
LifeLines, TimeSearcher, Palantir, DataMontage
Cone/Cam/Hyperbolic, SpaceTree, Treemap
Pajek, JUNG, UCINet, SocialAction, NodeXL
infosthetics.com
flowingdata.com
infovis.org
www.infovis.net/index.php?lang=2
Anscombe’s Quartet
1
x
2
y
3
x
y
x
4
y
x
y
10.0
8.04
10.0
9.14
10.0
7.46
8.0
6.58
8.0
6.95
8.0
8.14
8.0
6.77
8.0
5.76
13.0
7.58
13.0
8.74
13.0
12.74
8.0
7.71
9.0
8.81
9.0
8.77
9.0
7.11
8.0
8.84
11.0
8.33
11.0
9.26
11.0
7.81
8.0
8.47
14.0
9.96
14.0
8.10
14.0
8.84
8.0
7.04
6.0
7.24
6.0
6.13
6.0
6.08
8.0
5.25
4.0
4.26
4.0
3.10
4.0
5.39
19.0
12.50
12.0
10.84
12.0
9.13
12.0
8.15
8.0
5.56
7.0
4.82
7.0
7.26
7.0
6.42
8.0
7.91
5.0
5.68
5.0
4.74
5.0
5.73
8.0
6.89
Anscombe’s Quartet
1
x
2
y
3
x
y
x
4
y
x
y
10.0
8.04
10.0
9.14
10.0
7.46
8.0
6.58
8.0
6.95
8.0
8.14
8.0
6.77
8.0
5.76
13.0
7.58
13.0
8.74
13.0
12.74
8.0
7.71
9.0
8.81
9.0
8.77
9.0
7.11
8.0
8.84
11.0
8.33
11.0
9.26
11.0
7.81
8.0
8.47
14.0
9.96
14.0
8.10
14.0
8.84
8.0
7.04
6.0
7.24
6.0
6.13
6.0
6.08
8.0
5.25
4.0
4.26
4.0
3.10
4.0
5.39
19.0
12.50
12.0
10.84
12.0
9.13
12.0
8.15
8.0
5.56
7.0
4.82
7.0
7.26
7.0
6.42
8.0
7.91
5.0
5.68
5.0
4.74
5.0
5.73
8.0
6.89
Property
Value
Mean of x
9.0
Variance of x
11.0
Mean of y
7.5
Variance of y
4.12
Correlation
0.816
Linear regression
y = 3 + 0.5x
Anscombe’s Quartet
Visual Quality & Data Quantity
??????
Visual Quality & Data Quantity
The goal of visualization is insight,
not pictures
Visual Quality & Data Quantity
The goal of visualization is insight,
not pictures
 User-controlled Filtering
 Meaningful Aggregation
Temporal Data: TimeSearcher 1.3
•
•
•
Time series
• Stocks
• Weather
• Genes
User-specified
patterns
Rapid search
Temporal Data: TimeSearcher 2.0
•
•
•
Long Time series (>10,000 time points)
Multiple variables
Controlled precision in match
(Linear, offset, noise, amplitude)
LifeLines: Patient Histories
www.cs.umd.edu/hcil/lifelines
LifeLines2: Align-Rank-Filter & Summarize
Aggregation: SpaceTree
www.cs.umd.edu/hcil/spacetree
Treemap: Gene Ontology
+ Space filling
+ Space limited
+ Color coding
+ Size coding
- Requires learning
(Shneiderman, ACM Trans. on Graphics, 1992 & 2003)
www.cs.umd.edu/hcil/treemap/
Treemap: Smartmoney MarketMap
www.smartmoney.com/marketmap
Market falls steeply Feb 27, 2007, with one exception
Market mixed, February 8, 2008
Energy & Technology up, Financial & Health Care down
Market rises, September 1, 2010, Gold contrarians
Market rises, March 21, 2011, Sprint declines
Treemap: Newsmap (Marcos Weskamp)
newsmap.jp
Treemap: Nutritional Analysis
www.hivegroup.com
Treemap: Spotfire Bond Portfolio Analysis
www.spotfire.com
Treemap: NY Times – Car&Truck Sales
www.cs.umd.edu/hcil/treemap/
Treemap (Voronoi): NY Times - Inflation
www.nytimes.com/interactive/2008/05/03/business/20080403_SPENDING_GRAPHIC.html
State-of-the-art network visualization
Network from Database Tables
www.centrifugesystems.com
NodeXL:
Network Overview for Discovery & Exploration in Excel
www.codeplex.com/nodexl
NodeXL:
Network Overview for Discovery & Exploration in Excel
www.codeplex.com/nodexl
NodeXL: Import Dialogs
www.codeplex.com/nodexl
Tweets at #WIN09 Conference: 2 groups
‘GOP’ tweets, clustered (red-Republicans)
WWW2010 Twitter Community
WWW2011 Twitter Community: Grouped
No Location
Philadelphia
Innovation Clusters: People, Locations, Companies
11,000 nodes
26,000 links
Pharmaceutical/Medical
Pittsburgh Metro
Westinghouse Electric
No Location
Philadelphia
Innovation Clusters: People, Locations, Companies
Pharmaceutical/Medical
Pittsburgh Metro
Westinghouse Electric
No Location
Philadelphia
Innovation Clusters: People, Locations, Companies
Patent
Tech
Navy
SBIR (federal)
PA DCED (state)
Related patent
2: Federal agency
Pharmaceutical/Medical
Pittsburgh Metro
3: Enterprise
5: Inventors
9: Universities
10: PA DCED
11/12: Phil/Pitt metro cnty
13-15: Semi-rural/rural cnty
17: Foreign countries
Westinghouse Electric
19: Other states
CHI2010 Twitter Community
www.codeplex.com/nodexl/
Flickr clusters for “mouse”
Computer
Mickey
Animal
Flickr networks
Analyzing Social Media Networks with NodeXL
I. Getting Started with Analyzing Social Media Networks
1. Introduction to Social Media and Social Networks
2. Social media: New Technologies of Collaboration
3. Social Network Analysis
II. NodeXL Tutorial: Learning by Doing
4. Layout, Visual Design & Labeling
5. Calculating & Visualizing Network Metrics
6. Preparing Data & Filtering
7. Clustering &Grouping
III Social Media Network Analysis Case Studies
8. Email
9. Threaded Networks
10. Twitter
11. Facebook
12. WWW
13. Flickr
14. YouTube
15. Wiki Networks
www.elsevier.com/wps/find/bookdescription.cws_home/723354/description
Social Media Research Foundation
Researchers who want to
- create open tools
- generate & host open data
- support open scholarship
Map, measure & understand
social media
Support tool projects to
collection, analyze & visualize
social media data.
smrfoundation.org
Social Media Research Foundation
Researchers who want to
- create open tools
- generate & host open data
- support open scholarship
Map, measure & understand
social media
Support tool projects to
collection, analyze & visualize
social media data.
smrfoundation.org
Visual Quality & Data Quantity
The goal of visualization is insight,
not pictures
 User-controlled Filtering
 Meaningful Aggregation
UN Millennium Development Goals
To be achieved by 2015
• Eradicate extreme poverty and hunger
• Achieve universal primary education
• Promote gender equality and empower women
• Reduce child mortality
• Improve maternal health
• Combat HIV/AIDS, malaria and other diseases
• Ensure environmental sustainability
• Develop a global partnership for development
29th Annual Symposium
May 22-23, 2012
www.cs.umd.edu/hcil
For More Information
•
Visit the HCIL website for 400 papers & info on videos
www.cs.umd.edu/hcil
•
•
•
Conferences & resources: www.infovis.org
See Chapter 14 on Info Visualization
Shneiderman, B. and Plaisant, C., Designing the User Interface:
Strategies for Effective Human-Computer Interaction:
Fifth Edition (2010) www.awl.com/DTUI
Edited Collections:
Card, S., Mackinlay, J., and Shneiderman, B. (1999)
Readings in Information Visualization: Using Vision to Think
Bederson, B. and Shneiderman, B. (2003)
The Craft of Information Visualization: Readings and Reflections
For More Information
•
•
•
•
•
Treemaps
• HiveGroup: www.hivegroup.com
• Smartmoney: www.smartmoney.com/marketmap
• HCIL Treemap 4.0: www.cs.umd.edu/hcil/treemap
Spotfire: www.spotfire.com
TimeSearcher: www.cs.umd.edu/hcil/timesearcher
NodeXL: nodexl.codeplex.com
Hierarchical Clustering Explorer:
www.cs.umd.edu/hcil/hce
•
•
LifeLines2:
Similan:
www.cs.umd.edu/hcil/lifelines2
www.cs.umd.edu/hcil/similan
Download