The theory of data visualisation

Slides:



Advertisements
Similar presentations
1 Human-Computer Interaction Screen Layout and Colour.
Advertisements

Color Physics –Light is E-M radiation of different frequencies. –Superposition principle Perception –3 cones -> 3D color space. (Metamers). –Convex subset.
The theory of data visualisation v2.0 Simon Andrews, Phil Ewels
Kira Jones Oral Communication Instructor
Geographic Information Systems GIS Output. 1. Color Theory Additive primaries blue, green, and red Subtractive primaries yellow, cyan, and magenta.
What is color for?.
Images and colour Colour - colours - colour spaces - colour models Raster data - image representations - single and multi-band (multi-channel) images -
Elements of Design. What are they? Line Colour Attributes Shape Categories Space Form.
The Principles of Design
Charts and Graphs V
Digital Images The digital representation of visual information.
Presentation Design: Starting Out. Color Basics There are two types of color models -- Reflective 1. CMYK Model Used in printing Stands for Cyan, Magenta,
CMPT 880/890 Writing labs. Outline Presenting quantitative data in visual form Tables, charts, maps, graphs, and diagrams Information visualization.
In this lesson you will learn about the Elements of Art
Scientific Figure Design v2.0 Simon Andrews, Anne Segonds-Pichon, Boo Virk
Chapter 3 Space. Three Kinds of Space Space as format: size, scale, and presentation. Space as the relationships among objects and the areas surrounding.
Human Computer Interaction Design and graphics design in computer human interaction by Sylvia Ward.
By Emilio Dixon. Line  Definition: A line is a mark made by a moving point and having psychological impact according to its direction, weight and the.
Perception, Cognition and the Visual Seeing, thinking, knowing (link to optical video) (link to optical video) (link to optical video)
©2007 by the McGraw-Hill Companies, Inc. All rights reserved. 2/e PPTPPT.
StatisticsStatistics Graphic distributions. What is Statistics? Statistics is a collection of methods for planning experiments, obtaining data, and then.
Geographic Information Systems GIS Output. 1. Color Theory ► Additive primaries blue, green, and red ► Subtractive primaries yellow, cyan, and magenta.
Design Elements of Graphical Representation, (Factors supporting appearance and functionality of solutions). P0CCUAA.
(c) John Dempsey, University of Central Lancashire Using colours.
How to create a Scientific poster for the Group 4 presentation.
By: Ashley. Spot Color Spot color refers to the process of selecting text or a graphic object such as a circle and then adding a spot of color to it.
Introduction to Computer Graphics
CONFIDENTIAL Data Visualization Katelina Boykova 15 October 2015.
Elements of Design 1.02 Investigate Design Principles and Elements.
Intro to Color Theory. Objectives Identify and discuss various color models including RGB, CMYK, Black/white and spot color. Investigate color mixing.
Excel Part 4 Working with Charts and Graphics. XP Objectives Create an embedded chart Work with chart titles and legends Create and format a pie chart.
MIS 420: Data Visualization, Representation, and Presentation Content adapted from Chapter 2 and 3 of
DATA VISUALIZATION BOB MARSHALL, MD MPH MISM FAAFP FACULTY, DOD CLINICAL INFORMATICS FELLOWSHIP.
Integrating Graphics, Illustrations, Figures, Charts.
1 of 32 Computer Graphics Color. 2 of 32 Basics Of Color elements of color:
Graphics and Desktop Publishing Objective 1.02: Investigate Design Principles and Elements.
Plot type specific considerations
Scientific Figure Design
Color.
Chapter 12 Visual Representation of Data
Infographics Template types.
Tennessee Adult Education 2011 Curriculum Math Level 3
(c) University of Wisconsin, CS559 Spring 2002
Color Theory.
Yonglei Tao School of Computing & Info Systems GVSU
Excel Part 4 Working with Charts and Graphics
Graphics – Day 1.
Excel Part 4 Working with Charts and Graphics
Pengkayaan Materi: Guidelines, Principles, and Theories -> COLOR
Proposal: Preliminary Results and Discussion
graphical representation of data
Color, Depth, and Space.
Module 6: Presenting Data: Graphs and Charts
CSc4730/6730 Scientific Visualization
Scientific Figure Design
CSc4730/6730 Scientific Visualization
Presenting Categorical & Discrete Data
2.01 Understand Digital Raster Graphics
Colour theory.
Pitfalls and misuses of statistics and graphs
Art and Design – Formal Elements Miss Brompton
Communication of Color
Visual & Media Literacy
Color Model By : Mustafa Salam.
Primary and secondary There are two theories about how we can organize the different colours. Physicists explain color as a function of light. This is.
Color Models l Ultraviolet Infrared 10 Microwave 10
Charts A chart is a graphic or visual representation of data
COMPARING TWO SIDES OF THE SAME COIN
Excel Part 4 Working with Charts and Graphics
Colour Theory.
Presentation transcript:

The theory of data visualisation Simon Andrews, Phil Ewels simon.andrews@babraham.ac.uk phil.ewels@scilifelab.se

Data Visualisation A scientific discipline involving the creation and study of the visual representation of data whose goal is to communicate information clearly and efficiently to users. Data Visualisation is both an art and a science.

ISBN-10: 1466508914 http://www.cs.ubc.ca/~tmm/talks.html

Data Viz Process Collect Raw Data Process and Filter Data Clean Dataset Exploratory Analysis Generate Visualisation Generate Conclusion

A data visualisation should… Show the data Not distort the data Summarise to make things clearer Serve a clear purpose Link to the accompanying text and statistics

Things you can illustrate

Graphical Representations Basic questions How are you going to turn the data into a graphical form (weight becomes length etc.) How are you going to arrange things in space How are you going to use colours, shapes etc. to clarify the point you want to make

Marks and Channels Marks Channels Geometric primitives Lines Points Areas Used to represent data sets Channels Graphical appearance of a mark Colour Length Position Angle Used to encode data

Figures are a combination of marks and channels 1 Mark = Rectangle 1 Channel = Length of longest side 1 Mark = Circle segment 1 Channel = Angle 1 Mark = Diamond shape 2 Channels = X position, Y position 1 Mark = Circle 4 Channels: X position Y position Area Colour

Golden Rules Effectiveness Expressiveness Encode the most important information with the most effective channel Expressiveness Match the properties of the data and channel

Types of channel Quantitative Qualitative Position on scale Length Angle Area Colour (saturation) Colour (lightness) Qualitative Spatial Grouping Colour (hue) Shape

Colour Technical representations of colour Red + Green + Blue (RGB) Cyan + Magenta + Yellow + Black (CMYK) Perceptual representation of colour Hue + Saturation + Lightness (HSL)

HSL Representation Hue = Shade of colour = Qualitative Saturation = Amount of colour = Quantitative Lightness = Amount of white = Quantitative Humans have no innate quantitative perception of hue but we have learned some (cold – hot, rainbow etc.) Our perception of hue is not linear

Types of channel Quantitative Qualitative Position on scale Length Angle Area Colour (saturation) Colour (lightness) Qualitative Spatial Grouping Colour (hue) Shape

Data Types Quantitative Ordered Categorical Height, Length, Weight, Expression etc. Ordered Small, Medium, Large January, February, March Categorical WT, Mutant1, Mutant2 GeneA, GeneB, GeneC

Golden Rules Effectiveness Expressiveness Encode the most important information with the most effective channel Expressiveness Match the properties of the data and channel

Golden Rules Effectiveness Expressiveness Encode the most important information with the most effective channel Expressiveness Match the properties of the data and channel

Effectiveness of quantitation 2X 7X 4.5X 1.8X 16X 3.4X

Quantitation Perception

Golden Rules Effectiveness Expressiveness Encode the most important information with the most effective channel Expressiveness Match the properties of the data and channel

Most Quantitative Representations Good quantitation Bar chart Stacked bar chart with common start Stacked bar chart with different starts Pie charts Bubble plots (circular area) Rectangular area Colour (luminance) Colour (saturation) Poor quantitation

Discriminability If you encode categorical data are the differences between categories easy for the user to perceive correctly?

Qualitative Discrimination How many colours can you discriminate?

Qualitative Discrimination How many (fillable) shapes can you discriminate? Can combine with colour, but need to maintain similar fillable areas

Separability The effectiveness of a channel does not always survive being combined with a second channel. There are large variations in how much two different channels interfere with each other Trying to put too much information on a figure can erode the impact of the main point you’re trying to make

Separability There is no confusion between the two channels Larger points are easier to discriminate than smaller ones We tend to focus on the area of the shape rather than the height/width separately Humans are very bad at separating combined colours

Popout A distinct item immediately stands out from the others Triggered by our low level visual system You don’t need to actively look at every point (slow!) to see it

Popout (find the red circle)

Popout Speed of identification is independent of the number of distracting points

Popout (Find the circle)

Popout Colour pops out more than shape

Popout Mixing channels removes the effect (Find the red circle)

Use of space Where you want a viewer to focus on specific subsets of data you can help their perception by using the layout or highlighting of data to draw their attention to the point you’re making

Grouping

Grouping Exon CGI Intron Repeat

Containment

Containment

Containment Wild Type Mutant

Linking 1 1 2 2 3 3

Linking

Ordering Is a monkey heavier than a dog?

Validation Always try to validate plots you create You have seen your data too often to get an unbiased view Show the plot to someone not familiar with the data What does this plot tell you? Is this the message you wanted to convey? If they pick multiple points, do they choose the most important one first?

General Rules No unnecessary figures One point per figure Does a graphical representation make things clearer? Would a table be better? One point per figure Design each figure to illustrate a single point Adding complexity compromises the effectiveness of the main point No absolute reliance on colour Figures should ideally still work in black and white Colour should help perception No 3D 3D is hardly ever justified and makes things less clear Figures should be self-contained Must be understandable without additional information