Improving the Readability of Network Visualizations Cody Dunne

advertisement
Improving the Readability of
Network Visualizations
Cody Dunne
cdunne@cs.umd.edu
FDA OBI – April 8, 2013
Why Visualization?
Anscombe’s Quartet
I
x
II
y
x
III
y
x
IV
y
x
y
10.00
8.04
10.00
9.14
10.00
7.46
8.00
6.58
8.00
6.95
8.00
8.14
8.00
6.77
8.00
5.76
13.00
7.58
13.00
8.74
13.00
12.74
8.00
7.71
9.00
8.81
9.00
8.77
9.00
7.11
8.00
8.84
11.00
8.33
11.00
9.26
11.00
7.81
8.00
8.47
14.00
9.96
14.00
8.10
14.00
8.84
8.00
7.04
6.00
7.24
6.00
6.13
6.00
6.08
8.00
5.25
4.00
4.26
4.00
3.10
4.00
5.39
19.00
12.50
12.00
10.84
12.00
9.13
12.00
8.15
8.00
5.56
7.00
4.82
7.00
7.26
7.00
6.42
8.00
7.91
5.00
5.68
5.00
4.74
5.00
5.73
8.00
6.89
Anscombe’s Quartet - Statistics
Property
Value
Equality
Mean of x in each case
9
Exact
Variance of x in each case
11
Exact
Mean of y in each case
7.50
To 2 decimal places
Variance of y in each case
4.122 or 4.127
To 3 decimal places
Correlation between x and
0.816
y in each case
Linear regression line in
each case
To 3 decimal places
To 2 and 3 decimal
y = 3.00 + 0.500x
places, respectively
Anscombe’s Quartet - Scatterplots
Networks!
Tweets of the #Win09 Workshop
Who Uses Network Analysis
Sociology
Scientometrics
Biology
Urban
Planning
Politics
Archaeology
WWW
NodeXL
Collect data, Excel analysis, statistics, visualization, layout
algorithms, filtering, clustering, attribute mapping…
NodeXL Graph Gallery
smrfoundation.org
NodeXL as a Teaching Tool
I. Getting Started with Analyzing Social Media Networks
1. Introduction to Social Media and Social Networks
2. Social media: New Technologies of Collaboration
3. Social Network Analysis
II. NodeXL Tutorial: Learning by Doing
4. Layout, Visual Design & Labeling
5. Calculating & Visualizing Network Metrics
6. Preparing Data & Filtering
7. Clustering &Grouping
III Social Media Network Analysis Case Studies
8. Email
9. Threaded Networks
10. Twitter
11. Facebook
12. WWW
13. Flickr
14. YouTube
15. Wiki Networks
http://www.elsevier.com/wps/find/bookdescription.cws_home/723354/description
NodeXL as a Research Tool
Bonsignore EM, Dunne C, Rotman D, Smith M, Capone T, Hansen DL and Shneiderman B (2009), "First
steps to NetViz Nirvana: Evaluating social network analysis with NodeXL", In CSE '09. pp. 332-339.
DOI:10.1109/CSE.2009.120
Mohammad S, Dunne C and Dorr B (2009), "Generating high-coverage semantic orientation lexicons from
overtly marked words and a thesaurus", In EMNLP '09. pp. 599-608.
Smith M, Shneiderman B, Milic-Frayling N, Rodrigues EM, Barash V, Dunne C, Capone T, Perer A and Gleave
E (2009), "Analyzing (social media) networks with NodeXL", In C&T '09. pp. 255-264.
DOI:0.1145/1556460.1556497
Outline
Motif
Simplification
Group-in-a-Box
Layouts
Sample of Other
Projects
Lostpedia articles
Observations
1: There are repeating patterns in
networks (motifs)
2: Motifs often dominate the
visualization
3: Motifs members can be
functionally equivalent
Motif Simplification
Fan Motif
2-Connector Motif
Lostpedia articles
Lostpedia articles
Glyph Design: Fan
Glyph Design: Connector
Cliques too!
Interactivity
Fan motif: 133 leaf vertices
with head vertex “Theory”
Senate Co-Voting: 65%
Agreement
Senate Co-Voting: 70%
Agreement
Senate Co-Voting: 80%
Agreement
Discussion
• Motif simplification effective for
• Reducing complexity
• Understanding larger or hidden relationships
• Qualitative and task-based user studies
• Available now in NodeXL: nodexl.codeplex.com
Dunne C and Shneiderman B (2013), "Motif simplification: improving network visualization readability with
fan, connector, and clique glyphs", In CHI '13.
Shneiderman B and Dunne C (2012), "Interactive network exploration to derive insights: Filtering, clustering,
grouping, and simplification", In Graph Drawing ‘12. pp. 2-18. DOI:10.1007/978-3-642-36763-2_2
Outline
Motif
Simplification
Group-in-a-Box
Layouts
Sample of Other
Projects
Group-in-a-Box Meta-Layouts
• Squarified Treemap
• Croissant-Donut
• Force-Directed
Risk Movements
Plain Layout
with Clusters
Risk Movements
GIB Treemap
Risk Movements
GIB Croissant
Risk Movements
GIB Force-Directed
Pennsylvania
Innovation
Pennsylvania
Innovation
GIB Treemap
Discussion
• Three Group-in-a-Box layouts for dissecting networks
• Improved group and overview visualization
• Tradeoffs: Filling space vs. showing relationships
• Empirical Twitter evaluation
• Available now/soon in NodeXL: nodexl.codeplex.com
Shneiderman B and Dunne C (2012), "Interactive network exploration to derive insights: Filtering, clustering,
grouping, and simplification", In Graph Drawing ‘12. pp. 2-18. DOI:10.1007/978-3-642-36763-2_2
Chaturvedi S, Ashktorab Z, Dunne C, Zacharia R, and Shneiderman B (2013), "Group-in-a-Box layouts for
visualizing network communities and their ties", Under submission.
Rodrigues EM, Milic-Frayling N, Smith M, Shneiderman B, and Hansen (2011), “Group-in-a-Box layout for
multi-faceted analysis of communities”, In SocialCom ’11. pp. 354-361.
DOI:10.1109/PASSAT/SocialCom.2011.139
Outline
Motif
Simplification
Group-in-a-Box
Layouts
Sample of Other
Projects
Performance Improvement
• Nvidia CUDA 240-core
• Fruchterman-Reingold
• Up to 802X speedup
• Eigenvector Centrality
• Up to 17,972X speedup
Sharma P, Khurana U, Shneiderman B, Scharrenbroich M, and Locke J (2011), Speeding up network layout
and centrality measures for social computing goals. In SBP '11. pp. 244-251. DOI:10.1007/978-3-642-196560_35
Evolving Twitter Topics
Malik S, Smith A, Papadatos P, Li J, Dunne C, and Shneiderman B. “TopicFlow: Visualizing topic alignment of
Twitter data over time”. Under submission.
Evolving Network Attributes
Gove R, Gramsky N, Kirby R, Sefer E, Sopan A, Dunne C, Shneiderman B and Taieb-Maimon M (2011),
"NetVisia: Heat map & matrix visualization of dynamic social network statistics & content", In SocialCom '11.
pp. 19-26. DOI:10.1109/PASSAT/SocialCom.2011.216
Document Collections
Dunne C, Shneiderman B, Gove R, Klavans J and Dorr B (2012), "Rapid understanding of scientific paper
collections: Integrating statistics, text analytics, and visualization", JASIST. Vol. 63(12). pp. 2351-2369.
Gove R, Dunne C, Shneiderman B, Klavans J and Dorr B (2011), "Evaluating visual and statistical exploration
of scientific literature networks", In VL/HCC '11. pp. 217-224.
Exposing Exploration History
Dunne C, Riche NH, Lee B, Metoyer RA and Robertson GG (2012), "GraphTrail: Analyzing large multivariate,
heterogeneous networks while supporting exploration history", In CHI '12. pp. 1663-1672.
Riche N, Lee B and Dunne C (2011), "Interactive visualization for exploring multi-modal, multi-relational, and
multivariate graph data". U.S. Patent Application No. (13/041474).
Funders & Collaborators
Funding
• NSF grants SBE 0915645, IIS 0705832, IIS 0968521
• HHS SHARP grant 10510592
• Social Media Research Foundation, Connected Action Consulting Group,
Microsoft External Research, Microsoft Research, National Cancer
Institute
Co-Authors
• Ben Shneiderman, Marc Smith, Snigdha Chaturvedi, Zahra Ashktorab,
Rajan Zacharia, Tony Capone, Eduarda Mendes Rodrigues, Natasa MilicFrayling, Nathalie Riche, Bongshin Lee, Ron Metoyer, George Robertson,
Robert Gove, Bonnie Dorr, Judith Klavans, Saif Mohammad, Puneet
Sharma, Ping Wang, Awalin Sopan, Nick Gramsky, Rose Kirby, Emre Sefer,
Meirav Taieb-Maimon, Vladimir Barash, Adam Perer, Eric Gleave, Derek
Hansen, Elizabeth Bonsignore, Dana Rotman, Ryan Blue, Adam Fuchs, Kyle
King, and Aaron Schulman
Collaborators
• Catherine Plaisant, Jon Froehlich, Leah Findlater, Yiyan Liu
Discussion
Effective node-link visualizations in NodeXL:
• Motif simplification to reduce complexity
• Group-in-a-Box Layouts to show groups and ties
Additional projects targeted at specific use cases
• Twitter, evolving networks, document collections,
shared explorations
Cody Dunne
cdunne@cs.umd.edu
www.cs.umd.edu/~cdunne/
Download