How Human Brain Understand Visualisation and Graph Visualisation Evaluation Kai Xu National ICT Australia Sydney, Australia Part I: How Human Braid Understand Visualisation Acknowledgement • Most of the content can be found in the book “Information Visualization – Perception for Design”. • It’s author Colin Ware at University of New Hampshire is a close friend of the information visualisation research group at School of IT. Table of Contents • The human visual perception system. • Visual Attention and pre-attentive patterns. Visual Perception System Information flow Visualisation Eyes Brain A B C D Parallel processing of orientation, texture, Object Identification, color and motion Working Memory features Detection of 2D patterns, contours and regions Visual Perception System A B C D Parallel processing of orientation, texture, Object Identification, color and motion Working Memory features Detection of 2D patterns, contours and regions Brain processing speed Task complexity Visual Perception System Stage 1 A B C D Parallel processing of orientation, texture, Object Identification, color and motion Working Memory features Detection of 2D patterns, contours and regions • Rapid parallel processing: billions of neurons; • Extraction of orientation, texture, color, and motion features. Visual Perception System A B C D Parallel processing of orientation, texture, Object Identification, color and motion Working Memory features Detection of 2D patterns, contours and regions Stage 2 • Slower processing than stage 1; • Detection of 2D patterns, contours and regions. Visual Perception System Stage 3 A B C D Parallel processing of orientation, texture, Object Identification, color and motion Working Memory features Detection of 2D patterns, contours and regions • Slow serial processing; • Involve both working and long-term memory; • Object identification and eye-hand coordination. Visual Angle • Visual angle: the angle subtended by an object at the eye of an observer. – A thumbnail held at arm length subtends about 1 degree of visual angle. Human Visual Field 100 • Visual field of view when gazing straight ahead. 80 60 – Can see slightly more than 180 degrees horizontally; – Much less angle vertically. 40 20 LEFT RIGHT • The irregular boundary of left and right fields are caused by facial features such as nose. • The dark grey area is the region of binocular overlap. Visual Acuity • If focus on the central point, every character is about equally distinct. • This is because the visual acuity decreases quickly with the distance from fovea. Acuity test Line test Points test Acuity distribution Pixels and Brain Pixels Brain pixel screen pixel 1 bp Small Screen Big Screen Ultimate Display High resolution binocular insert (25x19) Left eye background (82.5x66.7) Right eye background (82.5x66.7) Binocular overlap Visual Attention and Pre-Attentive Patterns • In the stage 1 of perception system, the whole visual field is processed in parallel and very fast; • The information that can be captured in this stage are easily distinguished. • Pre-attentive patterns (pop-out effects). • Should be considered when designing visualisation. • Some examples. Color Orientation Motion Size Simple shading Conjunction (does not pop out) Compound features (do not pop out) Surrounded colors do not pop out Laws of pre attentive display • Must stand out on some simple dimension – – – – color, simple shape = orientation, size motion, depth • Conjunctions of pre-attentive dimensions do not always work. Lessons for Information Visualisation • Can be used for individual symbols or areas; • Avoid possible negative effect: – Do not use large areas of strong color. • Orthogonality: use a different channel for a different type of information. • Example: Mapping high dimensional data to display variables. – – – – – – – Position (2) Orientation (1) Size (1) Motion (2)++ Blinking (1) Color (3) … Part II: Graph Visualisation Evaluation Table of Contents • • • • • • Quick review Introduction Evaluation of graph drawing aesthetics Evaluation of graph layout methods Evaluation of large graph visualization Conclusions Review - Drawing conventions Review - Graph Layout • Important part of graph visualization • Finds “good” positions for nodes and edges – To improve “graph readability”, i.e., facilitate people's understanding of the graph structure • Review: Layout algorithm covered in the previous lectures: – Tree layout – Layered layout (Sugiyama method) – Force-directed layout (spring algorithm) Tree • Layered drawing – nodes are placed on horizontal layers • Radial drawing – the layers are mapped to concentric circles • HV drawing – places the edges horizontally or vertically • Space-filling methods (Treemap) – Inclusion indicates parentchild relationship; – Improved space efficiency Directed Graph Layered drawing (framework) 1. Cycle removal: if there is directed cycles, temporarily reverses the direction of some to make the graph acyclic; 2. Layer assignment: nodes are assigned to horizontal layers, and thus determines their ycoordinate; 3. Crossing reduction: within each layer the nodes are ordered to reduce the number of crossings; 4. Horizontal coordinate assignment: the x-coordinates of each vertex is determined. Undirected Graph • Force-directed methods – A graph is treated as a system of entities with force acting between them. – The algorithm seeks a configuration with locally minimal energy, i.e. , a position for every entity such that the sum of the forces on each entity is zero. – Common example • Spring embedder Introduction • What’s covered so far: – Various Graph layout algorithms • This lecture: how these affect people's understanding of the graph. – Are they effective at all? – Which one is relatively more effective? • Also: visualization of large graphs – Where the traditional aesthetics and layout algorithms do not really work Graph Drawing Aesthetics Aesthetics are the graphic properties layout algorithm try to optimise. • Crossings: – Minimization of the total crossing number • Area – Minimization of drawing area – Only meaningful to some layout. Example, grid drawing with integer coordinates • Aspect ratio – The ratio of the long and short edge length of its covering rectangle – Ideal case is to obtain any aspect ratio in a given range (so the drawing can fit into differently shaped screen space) Graph Drawing Aesthetics • Edge length (several variations): – – – – minimization of the sum of the edge length; minimization of the maximum edge length; minimization of the variance of the edge length; only meaningful to some layout algorithm. • Bends (several variations): – – – – minimization of the total number of bends; minimization of maximum number of bends on an edge; minimization of the variance of the number of bends on the edge; trivially satisfied by straight-line drawing. Graph Drawing Aesthetics • Angular resolution: – maximization of the smallest angle; – especially relevant for straight-line drawing. • Symmetry: – display the symmetries of the graph in drawing – reflective and rotational symmetry • Orthogonality: – how well the edges are parallel to the axes, and how well the nodes match to a grid; • Upward flow: – for directed graph only, – how well edges are pointing to a specified direction (usually upward); Evaluation • Measuring the performance of subjects (users) completing certain task(s). • Graph-related tasks are used for graph visualization. – Example: find the shortest path between 2 nodes in a graph. • Performance is usually measured by – Accuracy – Completion time Graph-Related Task Performance There are many factors affect the performance • The difficulty level of the task – Simple: find all the neighbors of a node – Hard: find all the nodes have graph distance 2 to two given nodes. • Size of the graphs – Small – Large • Subjects background – Whether they are familiar with graph visualization or not – Whether they are familiar with certain application domain (for domainspecific tasks). • And many more … • Should consider/control as many as possible when doing a test. Table of Contents • • • • • Introduction Evaluation of graph drawing aesthetics Evaluation of graph layout methods Evaluation of large graph visualization Conclusions Do Aesthetics Affect Graph Readability ? • Problem: Most aesthetics are proposed based on experience – Later becomes something the research community agree on • Study: readability of abstract graph – Tasks are not domain specific Purchase, H.C. et al. (1996) Dataset • Three aesthetics are tested: 1. Minimizing edge crossings, 2. Minimizing bends, and 3. Showing symmetry. • Two planar graphs are used – one with 16 nodes and 18 edges – the other with 16 nodes and 28 edges • Nine drawings are produced for each graph, – with three levels (few, some and many) of bends, crossings, and symmetry respectively. • To isolate the effect of each aesthetic, the drawings with different bend levels shows no crossings or symmetry. – the same for the two other aesthetics (manual layout). • Tasks: 1. Shortest path: the length of the shortest path between two nodes; 2. Connections between nodes: minimum number of nodes needs to be removed to disconnect two nodes; 3. Connections between subgraphs: minimum number of edges needs to be removed to disconnect two subgraphs. • Results – Effective: increasing bends or crossings decreases readability; – Not clear: symmetry. Caveats • Dataset is fairly simple – Small planar graph • The selected tasks are similar – All focus on path between nodes or subgraphs, – this hardly cover all the information a graph structure can possibly convey. • It is possible that change in the dataset and/or tasks can alter the results Which Aesthetic is the most important? • • The relative importance among aesthetics Including 5 aesthetics: 1. 2. 3. 4. 5. minimizing edge crossings, minimizing bends, symmetry. minimum angle orthogonality Purchase, H.C (1997) Dataset (Similar to last work) • Planar graph with 16 nodes and 28 edges is used • 5 aesthetics and 10 drawings – 2 for each aesthetics: representing a strong or weak presence. • b: bends, c: crossings, m: minimal angle, o: Orthogonality, s: symmetry • Tasks (the same as the last work) 1. Shortest path: between two nodes; 2. Connections between nodes: number of nodes to disconnect two nodes; 3. Connections between subgraphs: number of nodes to disconnect two subgraphs. • Results – Most important: reducing the number of crossing; – Less effective: minimizing the number of bends and maximizing symmetry; – Not obvious: maximizing the minimum angle and orthogonality. Does Aesthetics Affect Cognitive Load? • From a cognitive psychology angle • Testing aesthetics that affect shortest path task performance: – Continuity (path bendiness): the angular deviation from a straight line. – Number of crossings and average crossing angles: the crossings on the shortest path, and the angle of crossing. – Number of branches: the number of edges connect to the nodes on the shortest path but not part of the path. – Shortest path length and total edge length Ware, C. et al. (2002) • Task – find the shortest path between 2 given nodes • Dataset – 180 drawings with per-defined parameters. – 42 nodes in each graph, – 2 examples Results: • Important: path continuity. • Edge crossings – Neutral: the total number of edge crossings in the graph. – Important: those cross the shortest path. • Important: the number of branches emanating from nodes on the path. Table of Contents • • • • • Introduction Evaluation of graph drawing aesthetics Evaluation of graph layout methods Evaluation of large graph visualization Conclusions How People Read Graph? • Eye movement: when people reading graph . – using eye-tracking device. • Dataset: 3 small social networks. • Drawings: – Four drawings each graph ; – two circular layout and two radial layout; – The pair of same layout: one has more crossings than the other. • Task: shortest path between nodes • Results: – Graph layout can affect: • slow down • trigger extra eye movements. – Caused by edges: • incident to nodes concerned, • going toward to the target node • alongside the paths. Huang, W., Eades, P. (2005) Layout vs. Graph Readability • Comparing different layout methods. • A planar graph of 17 nodes and 29 edges . • maximum node degree is 4: – So it is applicable to orthogonal drawing ; – A quite strong constraint. • 3 layout algorithms: – Force-directed: 3 variations; – Planar orthogonal grid drawing: 2 variation; – Planar grid drawing: 3 variations. Tasks: 1. Shortest path between two nodes; 2. Disconnect two nodes; 3. Disconnect two subgraphs. Results: • One planar grid drawing method (SEIS) produced significantly more errors than the rest; • For the rest, the average response times were not significantly different. – So there is not much difference between layout algorithms! Purchase, H.C. (1998) Are the Drawings Actually Different? • For one drawing method, there are usually many variations – How different are the drawings? • But first, what are the measurements? • A study to find good measurements for orthogonal drawing. • What’s interesting: – The authors tried to find out the measurements through evaluation, rather than just pick some. Bridgeman, S.S. and Tamassia, R. (2001) What difference needs to be measured? • Rotation: the minimum angle needs to be rotated to make one drawing the same as the other; • Ordering: which drawing is more similar to the original; • Magnitude: one drawing is k times more similar to the original than the other drawing. The tests: • Rotation: – Left: original drawing; right: same drawing with different orientations. – The user chooses the one that looks most similar. • Ordering: – Left: original drawing; the other two: modified drawings; – The user chooses the more similar one. Magnitude • Measures response times: – Assumption: user will complete the task quicker if the drawings are similar. • Left: original drawing; Right: modified drawing; • Identify missing vertex in the right drawing: – Vertices have labels, otherwise the task is too difficult. Evaluated Measures (a lot) • • • • • • • • • • • Corresponding Objects. Point Set Selection. Drawing Alignment. Suitability for Ordering vs. Rotation and Ordering. Notation. Undirected Hausdorff Distance. Paired Hausdorff Distance. Average Distance. Nearest Neighbor between. Orthogonal Ordering. Ranking. • • • • • • … Average Relative Distance λ-Matrix Nearest Neighbor Within ε-Clustering Separation-Based Clustering Shape. Results • Rotation: absolute and relative point positions are important. • Ordering: point positions are less significant. • Magnitude: no clear measurement is found. • To recognize the graph as similar, the most important are : – the perimeter of the drawing; – The position and shape of few key features. • To find a specific change (magnitude) – The drawings need to look very much alike, or – Some other cues (color change, more distinctive vertex names, etc.) are needed to highlight the change. Table of Contents • • • • • Introduction Evaluation of graph drawing aesthetics Evaluation of graph layout methods Evaluation of large graph visualization Conclusions Is Large Graph Readable? • A graph with 3200 nodes. • Laid out using force-directed method. • The “readability” will be more or less the same with other layout methods. • Many real-world networks are much larger. • What can we do? Clustered Graph • Clustering can show the structure of a large graph. – The nodes are partitioned into clusters; – The connections between clusters reveal the underlying graph structure. • Example: a multi-level 2.5D drawing for hierarchically-clustered graphs. • Difficult to evaluate: – Tightly coupled with interaction. Extending Layout to 3D Many 2D layout methods can be extended to 3D: • Natural extension: – Force-directed method; – Orthogonal drawing. • Layered tree drawing – cone tree • Poly-plane drawing – Placing subtree on poly-planes in 3D • Layered drawing of directed graphs – Placing nodes on the parallel circles on the surface of a cone or cylinder – Placing nodes on parallel layers in vertical planes (“walls”) Does 3D Really Improve Readability? Comparing various approaches to visualize a graph in 2D and 3D • 2D: orthographic (parallel) projection • Static Perspective: perspective projection • Stereo: shutter glasses • Passive rotation: automatic • Hand coupled rotation: mouse-controlled rotation; • Head coupled perspective: head-controlled perspective; • Combination of previous approach: stereo, head coupled perspective. “Stereo, head coupled perspective” setup • Task: whether two nodes are connected by a path of length 2. • Dataset: randomly laid out graph. Ware, C., Franck, G. (1996) Results: • A static perspective is only slightly better than a 2D diagram; • 3D motion and stereo viewing both help but not particularly important; – Both are more significant than stereo cues. • Stereo viewing alone increases the understandable graph size by a factor of 1.6; • Head coupling alone increases by a factor of 2.2; • Combine the two (head-coupled stereo viewing) increases by a factor of 3; What Else Can We Do With 3D? • Improve graph readability with “geon diagram” • Geon: simple 3D shape with color and texture. • Geon diagram: – Nodes: geons; – Edges: connections between geons. – Attributes of nodes and edges: geon color and texture. • Experiment 1 – UML diagrams: geon vs. 2D; – The subjects were first shown a structure in either geon or UML form, and later asked to identify it in a series of diagrams. • Results: – Geon diagrams have half the errors and significantly faster. – Geon diagrams can be recalled much more reliably. • Experiment 2: – Geon diagram vs. 2D geon • Dataset and tasks are the same. • Results: – Geon is much more accurate and reliable compare to 2D geon. Motion • Using the forth dimension — time — to improve graph readability • Different from dynamic or time-series graph visualization – Graph is not changed – Improving rather than reducing readability: it is usually more difficult to understand the graph when it changes. C. Ware and R. Bobrow. 2004. Motion highlighting • Circular: – A circular motion around the center position • Jolt: – Moves in pulses; – Similar to an object oscillating from being struck briefly. • Crawl: – Animated sawtooth patterns radiating out from the selected node • Expanding nodes: – Grow larger and smaller periodically Comparing Different Motion Highlighting • Experiment 1 – Task: whether there are two red nodes within two links from a circled node. – Also included “no highlighting” and “static highlighting”, i.e., the selected subgraph is marked with different color. • Results: – Motion highlighting requires half times and more accurate comparing to “no highlighting”; – Static highlighting is the most accurate, but slower than motion highlighting. Experiment 2: Node and Edge Motion Highlighting • Tasks: – Are there at least two red nodes within two links of the specified subgraph? – Are there at least two red links within two links of the specified subgraph? • Results: – Motion highlighting both nodes and links is as good as or better than separate highlighting of nodes and links; – The static highlighting: • As good as the motion techniques for revealing links • Not as good as circular motion for revealing nodes. Experiment 3: Complex Pattern • Task: – Identify a chain of three red nodes connected by blue edges. • Results: – Motion highlighting had similar performance as static highlighting. – The reason may be the pattern was relatively easy to identify, because it always started at the ringed node. Motion in Large Graph • Comparing 4 highlighting methods: 1. 2. 3. 4. • Static highlighting: – – • No highlighting; Static highlighting; Motion highlighting; Static and motion highlighting. increase node size; Change edge color. Combined highlighting – – Combine static highlighting with pulse highlighting; The larger nodes and links are in pulse highlighting mode. • Dataset: – Graph of five sizes are used: 32, 100, 320, 1000, and 3200. • Task 1: – If there was a red node within 2 links of a specified node. • Results 1: – Motion highlighting is accurate even for the largest diagrams. – Without highlighting, error rates were high even for the smallest network. – Motion highlighting and static highlighting were equally effective. • Task 2: if 2 subgraphs had nodes in common. • Highlighting methods: – One with static highlighting and one with motion highlighting, or – Two with different motion highlighting. • Results 2: – Best performer: one with motion highlighting and the other with static highlighting. Ware, C., Bobrow, R. (2005) Beyond Node-Link Diagram • Matrix Representation • A graph can be represented by a connectivity matrix. • Advantage: – no edge crossing • Disadvantage: – large empty space for spare graph A comparison between node-link diagram and matrix representation • Tasks: – estimating the number of nodes; – estimating the number of links; – finding the most connected node; – finding a node with a given label; – finding a link between two specified nodes; – finding a common neighbor between two specified nodes; – finding a path between two nodes. (more variety of tasks) • Dataset – Random graph of size: 20, 50, and 100 nodes; – For each size, different link density: 0.2, 0.4 and 0.6. Ghoniem, M. et al. (2004) Results: • When graphs are bigger than 20 vertices, matrix outperforms nodelink diagrams on most tasks. – Only path finding is consistently in favor of node-link diagrams. • For small graphs: – Node-link diagrams are always more readable than matrices; • For larger graphs: – Matrices are 30% more accurate; – Matrices have comparable or better answer time. • For more complex tasks such as “path finding”, interaction is needed: – For example, displaying all the possible paths after selecting two nodes; – For matrix, path can be displayed by connecting cells using curves (mix matrix with node-link diagram). Summary • Effectiveness of aesthetics: – Most effective: edge crossings; – Less effective: bends and symmetry; – Not obvious: minimum angle and orthogonality. • Effectiveness of layout: – No significant difference between force-directed, orthogonal and planar layout methods. • Large graphs: – 3D is only better when coupled with interactions (and stereo vision if possible). – The 3D Geon diagram works better than 2D version; – Motion highlighting is better than static highlighting in some tasks. – Matrix representation is better than node-link diagram except path finding. Caveat • These conclusion are only valid in their test setup. • For instance, “different layout methods have little impact on graph readability” when – Graph is small and simple – For shortest path task only • It is possible that change in any test conditions can alter the results.