Data Encoding Fundamentals. Visual Attributes Important things to consider before making design decisions –Who is your audience? –What is the purpose.

Slides:



Advertisements
Similar presentations
Info 2040 Foundation of Quantitative Analysis
Advertisements

Statistics for the Social Sciences Psychology 340 Fall 2006 Distributions.
Unit 8: Presenting Data in Charts, Graphs and Tables
Reading Graphs and Charts are more attractive and easy to understand than tables enable the reader to ‘see’ patterns in the data are easy to use for comparisons.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 2-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Module 4: Data presentation & interpretation. Module 4: Learning objectives  Understand different ways to best summarize data  Choose the right table/graph.
Jamie Starke.  Sizing the Horizon: The Effects of Chart Size and Layering on the Graphical Perception of Time Series Visualizations ◦ J. Heer, N. Kong,
DESCRIPTIVE STATISTICS: GRAPHICAL AND NUMERICAL SUMMARIES
Module 3: Data presentation & interpretation. Module 3: Learning Objectives  Understand different ways to best summarize data  Choose the right table/graph.
CHAPTER 2 Basic Descriptive Statistics: Percentages, Ratios and rates, Tables, Charts and Graphs.
Quantitative Data Analysis Definitions Examples of a data set Creating a data set Displaying and presenting data – frequency distributions Grouping and.
Chapter 7 Correlational Research Gay, Mills, and Airasian
Presentation of Data.
Designing Great Visualizations Jock D. Mackinlay Director, Visual Analysis, Tableau Software.
1. Statistics 2. Frequency Table 3. Graphical Representations  Bar Chart, Pie Chart, and Histogram 4. Median and Quartiles 5. Box Plots 6. Interquartile.
Statistics - Descriptive statistics 2013/09/23. Data and statistics Statistics is the art of collecting, analyzing, presenting, and interpreting data.
Chapter 3: Graphic Presentation
Higher Biology Unit 1: Cell biology Unit 2: Genetics & Adaptations
Southampton Education School Southampton Education School Dissertation Studies Quantitative Data Analysis.
Jargon & Basic Concepts Howell Statistical Methods for Psychology.
Graphic Presentation The Pie Chart The Bar Graph The Statistical Map
Copyright © 2012 by Nelson Education Limited.2-1 Chapter 2 Basic Descriptive Statistics: Percentages, Ratios and Rates, Tables, Charts, and Graphs.
Biostatistics ZMP 602 E_Mail:
Census A survey to collect data on the entire population.   Data The facts and figures collected, analyzed, and summarized for presentation and.
Graphical Analysis. Why Graph Data? Graphical methods Require very little training Easy to use Massive amounts of data can be presented more readily Can.
Fundamentals of Measurement by Michael Everton (mxe06u)
Statistics, Data, and Statistical Thinking
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 2 Section 1 – Slide 1 of 27 Chapter 2 Section 1 Organizing Qualitative Data.
UNDERSTANDING RESEARCH RESULTS: DESCRIPTION AND CORRELATION © 2012 The McGraw-Hill Companies, Inc.
Application of Number INSET June Application of Number What is Application of Number?
Measures of central tendency are statistics that express the most typical or average scores in a distribution These measures are: The Mode The Median.
Handout week 1 course Renske Doorenspleet 1 Chapter 1 -A. The role of statistics in the research process -B. Statistical applications -C. Types of variables.
Experimental Research Methods in Language Learning Chapter 9 Descriptive Statistics.
Qualitative Data: consists of attributes, labels or non-numerical entries Examples: Quantitative Data: consists of numerical measurements or counts Examples:
Chapter 2: Behavioral Variability and Research Variability and Research 1. Behavioral science involves the study of variability in behavior how and why.
Psy 230 Jeopardy Measurement Research Strategies Frequency Distributions Descriptive Stats Grab Bag $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500.
Bell Ringer Using female = 0 and male = 1, calculate the average maleness in this classroom.
GRAPHICS GUIDELINES MUSE/CE 11B Anagnos/Williamson From Pfeiffer, W.S Technical Writing: A Practical Approach. 5th Edition. Prentice Hall. New Jersey.
1 Statistics Statistics can be found in all aspects of life:
CONFIDENTIAL Data Visualization Katelina Boykova 15 October 2015.
STATISTICS AND OPTIMIZATION Dr. Asawer A. Alwasiti.
Logistic Regression. Linear regression – numerical response Logistic regression – binary categorical response eg. has the disease, or unaffected by the.
Presenting Data in Charts, Graphs and Tables #1-8-1.
Surveillance and Population-based Prevention Department for Prevention of Noncommunicable Diseases Displaying data and interpreting results.
Cartography: Communicating Spatial Information Scott Bell GIS Institute.
Measurement Experiment - effect of IV on DV. Independent Variable (2 or more levels) MANIPULATED a) situational - features in the environment b) task.
Statistical Fundamentals: Using Microsoft Excel for Univariate and Bivariate Analysis Alfred P. Rovai Charts Overview PowerPoint Prepared by Alfred P.
Visualizing Collections Data Why Pie Charts Aren't Always the Answer.
Basic Statistics for Testing. Why we need statistics Types of scales Frequency distributions Percentile ranks.
Data Coaching Services Chart Interpretation 1. o Bar o Stacked Bar o Pie o Line o Scatter plot 2.
Basic terms.
INTRODUCTION AND DEFINITIONS
Graphs Graph Interpretation.
Reliability & Validity
Module 6: Presenting Data: Graphs and Charts
CSc4730/6730 Scientific Visualization
Stats Club Marnie Brennan
Introduction to Statistics
Visual Variables for Information Visualization
CSc4730/6730 Scientific Visualization
Biostatistics College of Medicine University of Malawi 2011.
Course code:- PGPPA2F007T STATISTICAL METHODS AND COMPUTER APPLICATIONS.
Module 3: Data presentation & interpretation
Statistical Reasoning
Purpose of Displaying Data
Understanding Statistical Inferences
Chapter 3: Graphic Presentation
Descriptive Statistics
Biostatistics Lecture (2).
Basic Anthropometric data quality checks
Presentation transcript:

Data Encoding Fundamentals

Visual Attributes Important things to consider before making design decisions –Who is your audience? –What is the purpose of your visualization? –How many variables do you have? –Have you looked at the data? Is it clean? Are there potentially erroneous data? –What types of data are your variables? Quantitative –Fixed zero (ratio)? (e.g., mass, $) –Zero arbitrary (interval)? (e.g., date, long/lat) Ordinal (e.g., grades) Nominal (apples & oranges)

Visual Attributes Previous work has developed suggested “best practices” for encoding data Based on –Theory (psychology, human perception) –Research –Experience Playfair, Tukey, Bertin, Cleveland & McGill, Few, Schniederman, Mackinlay, Card, Heer & Bostock, etc.

Bertin Bertin, Jacques. Semiology of Graphics: Diagrams, Networks, Maps. Madison, WI: University of Wisconsin Press, Based on theory & experience

WS Cleveland, R McGill. Graphical perception: Theory, experimentation, and application to the development of graphical methods. Journal of the American statistical association. 1984;79(387): More accurate Less accurate Cleveland & McGill Based on research & theory

Mackinlay & Winslow. Designing Great Visualizations More accurate Less accurate Mackinlay. Automating the design of graphical presentations of relational information. Acm Transactions On Graphics. 1986;5(2): Mackinlay Based on research & theory

There are 5 quantitative values labeled A-E Is A bigger than C? WS Cleveland, R McGill. Graphical perception: Theory, experimentation, and application to the development of graphical methods. Journal of the American statistical association. 1984;79(387): Mackinlay & Winslow Designing Great Visualizations ng-great-visualizations.pdf

The Science Behind Optimal Encoding Cleveland & McGill WS Cleveland, R McGill. Graphical perception: Theory, experimentation, and application to the development of graphical methods. Journal of the American statistical association. 1984;79(387): Most accurate Least accurate

Methods n=55 Participants “fell into 2 categories” –Females, “mostly housewives, without substantial training & working in technical jobs” –Males & females “with substantial training & working in technical jobs” Assessed position-angle then position-length graphs Cleveland & McGill

Participants judged Which dotted bar was smaller what percent the smaller dotted bar is of the larger dotted bar Common scalePosition-length

Cleveland & McGill Participants judged Which slice/bar segment was largest The percentages of the smaller slices/bar segments AngleLength

Cleveland & McGill Analyses Accuracy Large absolute errors Mean of errors You can check the paper to see the equations used to calculate these

Findings No differences in accuracy by technical experience Position judgments – x more accurate than length judgments –~2x more accurate than angle judgments Cleveland & McGill

Recent build onto Cleveland & McGill’s work Used mTurk to assess proportional judgment tasks from Cleveland & McGill + circular area Also tested –Rectangular area judgment –Other things I won’t cover here (e.g., gridline spacing) Heer CSE) & Bostock

Methods Proportional judgment experiment n=50 ID’d the smaller of two marked values & estimated what % the smaller was of the larger Analyses same equations as Cleveland & McGill Heer & Bostock

Methods Rectangular judgment n=24 ID’d the smaller of two marked rectangles & estimated what % the smaller was of the larger Analyses same equations as Cleveland & McGill Heer & Bostock

Findings Bar > stacked bar > pie > bubble/treemap Heer & Bostock

The Science Behind Optimal Encoding More research is needed to validate previous research –Are findings universal? –Generalizability? –Are there particulars in biomedical & health informatics where these findings may not apply? Stakes higher, users vary (clinicians, researchers, patients), purposes vary (e.g., point-of care instant decision-making aids, developing prediction models, daily health management)

Summary Important things to consider before making design decisions There is guidance on “best practices” for how to encode data More research is needed to validate “best practices” especially in biomedical & health informatics