Evaluating Modern Graphics – New Standards or Old? Hilary Green Macquarie University, Sydney Summary This paper investigates whether there is a need for new criteria for the evaluation of new data display methods. Two modern graphical display methods are examined using well documented guidelines from the literature of statistical graphics. It appears that established criteria, while still valid, do not extend to important features of some modern visual displays. Keywords: dynamic graphs, information visualisation, graphical perception. Introduction Due to rapid advances in computer technology and to the increasing ability to obtain and display some datasets on demand, development of new graphics is inevitable. In particular, the use of colour and movement are fascinating features of modern graphs. Their dynamic features and ability to offer instantaneous feedback make them invaluable as a tool for data mining or prospecting. Typically, the engaged user needs to focus on the task of discovery without the distraction of superfluous detail. On the other hand, the user will often wish to capture pertinent findings for reexamination at some later stage. And still later, the user may wish to communicate discoveries to an audience, thus requiring a static graphical display. Can we evaluate modern graphics using historical criteria - or is it necessary to develop new criteria? To address this issue, two examples of modern graphics, a mosaic plot and a Grapharti are presented and examined in the light of historic criteria for good graphics. I conclude that the historic criteria remain valid; however, modern graphics require additional standards for the evaluation of features that were not formerly present. 968 Two Examples of New Graphics An example of a recent mosaic plot, originally developed by Hartigan & Kleiner in 1981, is shown in Figure 1. Mosaic displays show the frequencies in n-way contingency tables by nested rectangular regions whose areas are proportional to cell frequencies. Interactive mosaic plots permit the addition and deletion of variables as well as the ordering of the variables and choice of colours. Figure 1 shows there is an association between sex, hair and eye colour: females with brown hair are most likely to have brown eyes, while males are unlikely to have both brown hair and brown eyes. Brown Standardized residuals: Hazel Green Blue <4 4:2 2:0 0:2 2:4 >4 Model (Hair) (Eye) (Sex) Male Female Black Brown Red Blond Fig. 1: Mosaic display for hair color, eye color, and sex. (from http://www.math.yorku.ca/SCS/mosaics.html). Grapharti, is designed to display data from surveys, opinion polls, financial data - wherever there is a long list of items each associated with at least two quantitative variables. The method is demonstrated using data from daily opinion polls carried out on the Australian ninemsn website (ninemsn.com.au). The dates, questions and the numbers of yes and no responses are stored in a database and retrieved from a web page in a format as shown in the Grapharti of Figure 2. A rectangle represents each question; the width is determined by the number of responses and the colour by the proportion of yes responses. These images are tiled alongside each other. By rolling the mouse over the image the user extracts further infor- 969 mation. The user has the opportunity to explore the data by variable selection, grouping, ordering and filtration. “The spatial representation may be visually browsed and analyzed in ways that avoid language processing and thus reduce the analyst’s mental workload”. In the example in Figure 2, the mouse was over the tile representing the outcome of the poll of the 16th October, 2002, showing a large and positive response. Both these graphs summarize proportions. The proportions of each of gender, eye colour and hair colour are simultaneously depicted on the mosaic plot in Figure 1. The mosaic plot reveals information about a single sample. In Grapharti each tile reveals information from different sources. Both these displays are fairly recent innovations; mosaic plots in 1981, Grapharti in 2005. However, the mosaic plot considered in Figure 1 includes two enhancements designed to facilitate the perception of relations among the variables: the shading of each tile and the re-ordering of the categories [1]. Fig. 2: Screen shot of Grapharti from ninemsn daily news polls. Background The search for the understanding of the processes that produce a successful graphic has been taken up by many individuals from a variety of fields, including statisticians, psychologists and computer scientists. In the visualization of information, there must be two component processes. The first is that the brain translates visual representations to arrive at cognition, and the second, that visual representations assist in the amplification of cognition. The first of these processes is sometimes is called perceptual organization – “the issue of how the brain really makes us see, in the sense of 970 how does it produce conscious experience, remains a mystery”. The approach to the second process, where visual representations assist in the amplification of cognition, relies on the fact that visual representations are tools. These tools are most useful because they “facilitate the comprehension of huge amounts of data, allow the perception of emergent properties that were not anticipated, enable problems with the data to become apparent, facilitate understanding of both large scale and small scale features and patterns in the data and facilitate hypothesis formation”. There has been much research into discovering how the tools work and most importantly, which tools are effective: some of this research has been carried out by statisticians. The works of Jaques Bertin, John Tukey, Edward Tufte and William Cleveland are seminal and together they have compiled a set of guidelines by which we can judge the efficacy of statistical graphics. In the following section, mosaic plots and Grapharti are investigated within the framework of these guidelines. Evaluation/Assessing New Graphics William Playfair (1759–1823) is often regarded as the progenitor of modern statistical graphics, The French king, Louis VI said of Playfair’s graphs that “they spoke all languages and were very clear and easily understood”. His appraisal of Playfair’s graphs initiated a recurring theme; graphs must speak a universal language, they “must have immediate and strong impact and show phenomena not numbers”. Applying these notions to our two examples, it seems relatively easy to discover the method of reading mosaic plots and Grapharti. However, deciphering the meaning of the graphs may take a little more concentration. The use of colour in the mosaic graph refers to scaled residuals, a concept that needs to be learned, and the rectangles in the mosaic plots vary in both width and length, making comparison of areas difficult. With a Grapharti, we need to explore the detail before the essence can be perceived. Graphs are encoded with objects that are perceived at the preattentional level such as points, lines and shapes, so it is not surprising that they can be deciphered across all cultures and all languages. Cleveland and McGill have determined, experimentally, a ranking of elements that assist with processing elementary tasks and thus facilitate accurate decoding of graphs. These are, from best to worst; position on a common scale, position on identical non-aligned scales, length, direction, angle, area, volume, curvature, shading, and colour saturation. These rankings, though not rigid, provide a basis on which to judge graphs. The mosaic plot employs area 971 and colour saturation, while Grapharti makes use of length and colour saturation. Both the displays feature lower level pre-attentive elements. A graphic should “serve a reasonably clear purpose; description, exploration, tabulation or decoration”. The main purpose of the mosaic plot is description, whereas that of a Grapharti is to support exploration. Both graphs use rectangles to depict quantitative information: the size, shape, colour and position of each rectangle show variation in data. They are examples of Tufte’s small multiples: “Information slices positioned within the eyespan, … Constancy of design puts the emphasis on changes in data”. Small multiples are effective because they invite comparison and can show shifts in relationships. The mosaic in Figure 1 shows, for example, that most people had brown hair (width of the horizontal brown category). By using small multiples, the mosaic plot is able to achieve its purpose - to describe the data and facilitate comparisons. The main purpose of Grapharti is to encourage exploration of the data, or data mining. By lining up tiles in date or colour order the “multiples create visual lists … helping viewers to analyze, compare, differentiate, decide”, and thus the user is encouraged to discover what sort of questions provoke a large number of responses, and how strongly respondents feel about certain issues. Tukey also maintained that we should be clear about two distinct functions of visual displays – ‘prospecting’ and ‘transference’, and they must be used quite differently in each situation. Prospecting, or searching the data for phenomena, would typically require many displays. The transference of “a recognition of the appearances that indicate the phenomena that deserve report”, would require a clearly labeled static graph. Thus the function of the mosaic plot is transference, while that of Grapharti is prospecting. It follows that dynamic graphs are more useful for prospecting and static graphs for transference. Other salient features of statistical graphs are the use of gridlines, tick marks, legends and labels. Whilst neither of the examples uses gridlines or tick marks, each employs separators; white spaces in the mosaic plots separate groups within categories and fine muted borders separate tiles in Grapharti. These are necessary to distinguish different events by enhancing boundary detection, and so do not constitute “chart junk” nor unnecessary “data ink”. Comparisons of scale are facilitated by the dimensions of the rectangles. The legend in the mosaic plot clearly provides an ordinal value to the colour scheme, and so enables the reader to interpret this variable in the context of the data. In the same way, a simple legend for Grapharti should be provided. In each graph, the necessity of constantly referring to a legend to decode the colours is eliminated by the use 972 of sensible colour coding. The categories in the mosaic plot have been clearly labeled. In Figure 2, labeling is not required as the tiles are ordered horizontally and wrapped within the data frame; the title provides adequate description of what is shown. However, Grapharti can be organized to display more complex results, thus necessitating the use of axes and labels. Tufte suggests that the display should “reflect a balance, a proportion, a sense of relevant scale”. Balance has been achieved in the mosaic plot in Figure 1, by the ordering of the hair/eye categories, so that the most prominent effects are displayed in diagonally opposite extremes of the display. This has the effect of highlighting one of the main conclusions of the investigation– that the three variables are not independent. From an aesthetic point of view as well, there is a sense of balance in the variety and intensity of colours, and the sizes of the rectangles. In the same way, reordering the tiles in Grapharti, Figure 2, has produced a sense of balance suggesting that the questions have evoked a variable proportion of yes voters. Data density is defined as the number of data entries per square centimeter. Grapharti is clearly more “data dense” than the mosaic plot (19.31 to 1.33) but far less meaningful in its static form. Further guidelines needed The mosaic plot appears to satisfy most criteria for an effective graphic while a static Grapharti reveals little meaningful information. However, as the purpose of Grapharti is to facilitate data exploration, the use of dynamic features is an essential component of this graphic. A list of guidelines for dynamic graphs would be useful. Statisticians have contributed to this burgeoning area, and, again, they propose that common sense should prevail. “For a dynamic method to be useful the visual and manual tasks must be easy to perform”. For dynamic graphs, “The basic ideas – impact, easy comparison, balanced emphasis, etc, can and still should be the same”. “Dynamic graphics can be viewed as methods that allow us to move rapidly through a long sequence of static views until the desired or static view is found, having found it we can present it to others”. To discover some guidelines for the display of dynamic graphs, we can investigate the research of Ben Shneiderman relating to human-computer interaction and user interface. In 1997, he proposed “well designed, effective computer systems generate positive feelings of success, competence, mastery, and clarity” - “users have to be seen as heroes”. A generally accepted Visual Information Seeking strategy is: “Overview first, zoom and filter, then details-on-demand”. A particularly relevant goal for designers of in- 973 terfaces that enable prospecting or visual information seeking is to “offer usability plus reliability. Guidelines to achieve this goal are, briefly; to strive for consistency, cater to universal usability, offer informative feedback, permit easy reversal of actions, and reduce short-term memory load. I investigate Grapharti in the light of these guidelines. Consistency is achieved by employing the same design features such as colour scheme for each display of a Grapharti image. On the page introducing Grapharti, the user is able to select some or all of the data to view in table format or as Grapharti, and sort in various ways. The use of familiar features such as buttons ensures the user is not distracted by the interface. Such features have become widely known and intuitive for users and possibly speak a universal language alluded to previously. Grapharti provides informative feedback by the information yielded on rolling the cursor over the tiles. This feature is further enhanced by being able to click on tiles to retrieve further information. However, in the format shown in this paper, Grapharti goes no further in assisting with the task of prospecting. To successfully use a dynamic visualization for prospecting it is essential that a user have the ability save useful instances that may provide insight. “Users should be able to bundle search sessions as well as save individual portions of a given search session, and flexibly access and modify each”. Incorporating such features into Grapharti would further reduce short term memory load. In particular, a search option and a shopping basket facility would help generate positive feelings of success and clarity in users. Conclusion For graphs that are generated dynamically, then, it is apparent that some further guidelines need to be substantiated in the literature. Some graphs, like the mosaic plot shown in Figure 1, survive extremely well in their static form. This is largely because of adherence to details such as careful labelling and appropriate legends that are shown at each rendition. The historic criteria referred to previously seem to suffice for evaluating such graphs. These graphs are extremely useful for transference of information. Dynamic graphs are far less informative in their static form but function well as the tools for data visualization. For these graphs, well researched and functional guidelines can be gathered from others, particularly from computer scientists. We argue that dynamic graphs need not adhere to the traditional guidelines applicable to static graphics, particularly when the manipulation of the interactive features on the graph is intuitive. However, the user of dynamic graphs needs the ability to reorganise, retrieve and 974 store potentially useful discoveries. It is reasonable to conclude that static graphics may need one set of criteria; while dynamic graphics may need another (closely related but distinct) set! Reference List Cleveland, W. S. and M. E. McGill (1988). Dynamic Graphics for Statistics. FL, USA, CRC Press, Inc. Cleveland, W. S. and R. McGill (1985). "Graphical Perception and Graphical Methods for Analyzing Scientific Data." Science, New Series 229(4716): 828-833. Friendly, M. (1994). "Mosaic displays for n-way contingency tables."Journal of the American Statistical Association 89: 190200. Green, H. (2005). Grapharti. Statistics Education and the Communication of Statistics, Sydney, Australia. Hearst, M. (1999). User Interfaces and Visualization. Modern Information Retrieval. R. Baeza-Yates and B. Ribeiro-Neto. New York, Addison Wesley Longman Publishing Co. Inc. McLaughlin, J. (2005). Looking for the Mind's Eye. 2005. Shneiderman, B. (1997). "Designing Information-Abundant Websites: Issues and Recommendations." International Journal of Human-Computer Studies (1997) 47(1): 5-29. Tufte, E. R. (1983). The visual display of quantitative information. Cheshire,Conn., Graphics Press. Tufte, E. R. (1997). Visual explanations : images and quantities, evidence and narrative. Cheshire, Conn., Graphics Press. Tukey, J. (1990). "Data-Based Graphics: Visual Display in the Decades to Come." Statistical Science 5(3): 327-339. Wainer, H. (2004). Graphic Discovery: A Trout in the Milk and Other Visual Adventures. Princeton, New Jersey, Princeton University Press. Ware, C. (2004). Information visualization : perception for design. San Francisco, Morgan Kaufman. Wise, J. A., Thomas, J.J., Pennock, K., Lantrip, D., Pottier, M., Schur, A., & Crow, V. (1995). Visualizing the Non-Visual: Spatial analysis and interaction with information from text documents. Readings in Information Visualization: using vision to think. S. K. M. S.K. Card, and B. Shneiderman. San Francisco, Calif., Morgan Kaufmann Publishers.