Scientific Figure Design v2.0 Simon Andrews, Anne Segonds-Pichon, Boo Virk simon.andrews@babraham.ac.uk anne.segonds-pichon@babraham.ac.uk bhupinder.virk@babraham.ac.uk What this course covers… • Theory of data visualisation – Why do some figures work better than others? • • • • • Ethics of data representation Elements of graphic design Editing bitmap images in GIMP Vector editing and compositing in Inkscape Journal submissions What this course doesn’t cover… • How to draw graphs in specific programs – R Introduction – Statistics with R – Statistics with GraphPad • Network representations • Spatial data representations Timetable • Morning – Introduction – Data Visualisation Theory • Coffee – Data Representation Practical – Ethics talk • Afternoon – – – – Design theory talk Ethics practical GIMP Tutorial GIMP Practical • Coffee – Inkscape Tutorial – Inkscape Practical – Final practical Data Visualisation Process Collect Raw Data Process and Filter Data Clean Dataset Exploratory Analysis Generate Conclusion Types of figure • Exploration – Understanding your data • Reference – No specific point to make, a resource • Illustration – A way to present the data to support a specific conclusion Exploratory visualisation • • • • • Understand your data Multiple ways to present and summarise Crude representations Interactive Not intended for final publication – Can be adapted for publication T r e a tm e n t 2 T r e a tm e n t 3 T r e a tm e n t 2 T r e a tm e n t 3 1500 1500 Frequency 0 5 10 15 10 15 0 1500 Histogram of log2(full.counts[[x]]) Frequency Histogram of log2(full.counts[[x]]) 20 0 5 10 15 Histogram of log2(full.counts[[x]]) Histogram of log2(full.counts[[x]]) 5 10 0 15 0 5 10 15 log2(full.counts[[x]]) log2(full.counts[[x]]) Histogram of log2(full.counts[[x]]) Histogram of log2(full.counts[[x]]) 5 10 15 20 20 1500 log2(full.counts[[x]]) Frequency log2(full.counts[[x]]) log2(full.counts[[x]]) T r e a tm e n t 1 0 Frequency 1500 1500 0 C o n tr o l 0 1500 0 15 log2(full.counts[[x]]) 5 20 0 0 10 log2(full.counts[[x]]) Frequency 20 T r e a tm e n t 1 5 1500 20 15 Histogram of log2(full.counts[[x]]) 0 40 10 Histogram of log2(full.counts[[x]]) 1500 60 5 log2(full.counts[[x]]) 80 40 C o n tr o l 0 Frequency 60 0 1500 100 80 15 0 100 Frequency 120 V a lu e V a lu e 0 120 10 log2(full.counts[[x]]) 0 Frequency 0 140 5 0 Frequency 0 140 Histogram of log2(full.counts[[x]]) 0 Frequency Histogram of log2(full.counts[[x]]) 0 5 10 log2(full.counts[[x]]) 15 20 Reference visualisation • • • • Using your data as a resource Allows users to look up data of interest Tabular / Configurable Interactive Illustrative visualisation • • • • Intended to convey a specific point Carefully chosen subset of data Optimised presentation Good design • Used for figures in papers What makes a good figure? • Has a clear message – Helps to tell a story – Adds to the text, and links to it • Is focussed – Don’t confuse one message with another • Is easy to interpret correctly – Good data visualisation – Good design • Is an honest and true reflection of the data