Visualizing Change over Time: the Pipes Metaphor
Jim Blythe
USC Information Sciences Institute
1
Talk outline
Why pipes??
Visualizing, interacting with and animating flow.
Scaling up: flow in citation and hats data with 1m nodes+links
The emerging semantics/provenance of a visualization
Wrap-up
2
Motion-based visualization for relational spatio-temporal (RST) data
Motion is a natural tool for visualizing RST data
Draws human attention to change in complex data
Can reflect temporal aspects directly
Natural way to provide feedback on analyst ‘what-if’ tests
Interaction essential to avoid false impressions due to dimension reduction.
But we don’t yet have a theory or conventions for using motion in network visualizations
Users interpret visual cues in rigid ways, both from cognitive aspects and via a rich set of pre-established conventions
Fewer experimental results for motion (but see work by
Ware and others)
3
Our solution: visualizations based on physical analogies
By appealing to well-known physical analogies, provide interpretable images and ways to interact with the data.
Users will bring expectations about how to interpret observed motion and change
Examples studied: fluid flow through elastic pipes and wave motion through a graph
4
Why pipes??
Fluid flow in pipes is a simple and common model , not fully exploited for interacting with networks.
Everyone is familiar with it: intuitive and powerful
Provides natural interactions with a network
Puts focus on links, rather than well-studied nodes
A general method – for oil/gas, money, information or ideas, actual or max flow, ..
5
Impact for analysts
Facilitate what-if testing of actions on networks
Bring computed network properties to life through interaction with fast feedback
Facilitate understanding of observed and possible future dynamics of underlying system
Draw attention to widely distributed effects of action as network changes
Enable rapid identification of relevant data, regardless of how they are distributed relationally, spatially, or temporally
General techniques that can be applied to, e.g., disease outbreaks, terrorist networks, flow of money
6
Talk outline
Why pipes??
Visualizing, interacting with and animating flow.
Scaling up: flow in citation and hats data with 1m nodes+links
The emerging semantics/provenance of a visualization
Wrap-up
7
Example: trade in widgets
Links show capacities.
E.g. “What is the greatest number of widgets F can receive from A and B?”
Standard max flow problems are easy to solve, but are NP-hard if e.g.
probabilities are involved.
3
9
3
3
2
3
8
Simple use of pipes
2d ‘pipe’ shows capacity and flow.
Red shows flow.
Black shows unused capacity.
If I must remove one link, which has the least effect? The greatest?
If I can increase one link in capacity, which should I choose?
3
9
2
5
Min cut
3
3
2
3
9
Scaling up: satellite data on oil and gas pipelines
~3000 nodes, 3000 links, each with up to 80 line segments
Flow updated ~50 times per second (on 1.6 GHz centrino, 512M RAM)
Helps users pinpoint areas most vulnerable to attack, or high-payoff areas to improve capacity in real time.
See our demo.
10
11
12
13
14
15
16
Generic action API for 3 rd -party code
Users can
Add and modify nodes, links and capacities
Set node properties, e.g. source, sink, …
Temporarily block nodes and links
3 rd -party systems can
Declare node or link properties of interest
Register for user modifications
Provide updated data on links and nodes
17
Talk outline
Why pipes??
Visualizing, interacting with and animating flow.
Scaling up: flow in citation and hats data with 1m nodes+links
The emerging semantics/provenance of a visualization
Wrap-up
18
Scaling up: Cora publication dataset
Dataset of citations with 220k nodes and 700k links
32k papers have year, title, journal/conference
Links are ‘person a is an author of paper b’, or ‘paper a cites paper b’
Complete graph loads in seconds in KrackPlot.
Define filters and graph manipulations for visualization on the fly.
19
Timeline view of single author
20
Timeline view of single author
1992: Author moves from CMU to ISI
21
Flow applications in publication data
Identify keywords or key phrases in part of the text
Have access to title in CORA
Visualize flow from paper to citing paper when rare phrases are repeated
Uncover influence between individuals and communities, or track the emergence of a sub-field
22
Provenance and semantics of a visualization
A number of manipulations must be choreographed to create the most useful visualizations
Some are at data/filtering level – what entities are shown as nodes and as links
Some are at the visual level – assigning visual cues to properties of the data
Together they form the provenance, or semantics, of the visualization
When another viewer sees the visualization, they will need to know the semantics to understand it
23
Example from timeline display
Goal:
Show co-authors and cocitations on a timeline for “name”
Actions required to create timeline display:
Search $name -> $node display only current selection show neighborhood size 2 around $node unselect $node add child links define nbrhd "focal + children + parents|entity=person“ around $node map year to time in anneal layout lay out by annealing show classification as color and shape
24
Maintaining provenance to create shareable visualizations – and reusing them
The visual system captures the user’s actions and generalizes them by analyzing dependencies
Allows replay on other points of interest
- e.g. a new reusable process to visualize author timelines
Can be encapsulated with the visualization to provide provenance
- another viewer can inspect the filtering and visual cues used
25
Hats output datasets
Tested loading of output with 20k hats and ~ 1.3million links
Interesting sequences of meetings can be viewed as a flow of capabilities
26
Impact for analysts
Facilitate what-if testing of actions on networks
Bring computed network properties to life through interaction with fast feedback
Facilitate understanding of observed and possible future dynamics of underlying system
Draw attention to widely distributed effects of action as network changes
Enable rapid identification of relevant data, regardless of how they are distributed relationally, spatially, or temporally
General techniques that can be applied to, e.g., disease outbreaks, terrorist networks, flow of money
27
Highlighting change
Motion:
Node/link motion to reflect structural change
Highlighting links: moving bulges, links that fade, radiate, wiggle or flash
Static:
Show the difference with overlays using color, background shading, …
Build on our work on motion in social network viz
[McGrath & Blythe 04] and work in general viz
[Bartram et al 01].
28
Status and future work
Completed tasks on initial animation and interaction
Defining user tasks to test utility – uncover vulnerabilities or leverage points in flow
Or discover influential actors in publication space
Next tasks: use animations to show consequences of interaction or change in structure over time.
Push on interesting ideas on emerging semantics of visualizations in collaborative environments.
29
Back-up slides
30