Introduction to Data Analytics

Slides:



Advertisements
Similar presentations
Lecture 06: Design II February 5, 2013 COMP Visualization.
Advertisements

Theory of Data Graphics Part 1 Most of a graphic’s ink should vary in response to data variation (see chapters 4-6)
Lecture 1: Beautiful graphics in R
Making effective plots: 1.Don’t use default Excel plots! 2.Figure should highlight the key relationships in the data. 3.Should be clear - no extraneous.
Statistics for the Behavioral Sciences Second Edition Chapter 3: Visual Displays of Data iClicker Questions Copyright © 2012 by Worth Publishers Susan.
Section 2-4 Statistical Graphics.
Reading Graphs and Charts are more attractive and easy to understand than tables enable the reader to ‘see’ patterns in the data are easy to use for comparisons.
Data Presentation A guide to good graphics Bureau of Justice Statistics Marianne W. Zawitz.
Source: Tufte E. (2001) The Visual Display of Quantitiative Information. 2 nd Ed. Cheshire: Graphics Press Originally published in American Education,
Scientific Communication and Technological Failure presentation for ILTM, July 9, 1998 Dan Little.
Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 3.1 Chapter Three Art and Science of Graphical Presentations.
2007 會計資訊系統計學 ( 一 ) 上課投影片 3.1 Chapter Three Art and Science of Graphical Presentations.
Data Visualization.
1 Visualization Solutions for Effective Communication Warren C. Weber California State Polytechnic University, Pomona.
Analyzing and Visualizing Data Dr. Lam TECM 4180.
3D vs. 2D Graphs in Representing Lower Dimensional Data Do Irrelevant depth cues affect the comprehension of bar graphs? -Martin M. Fisher(2000) The use.
Using Publicly Available Data to Engage IV-E Students in Research and Statistics: Instructional Modules MODULE 4 SLIDE DECK: PRESENTING DATA GRAPHICALLY.
Tufte’s Design Principles
Jeffrey Nichols Displaying Quantitative Information May 2, 2003 Slide 0 Displaying Quantitative Information An exploration of Edward R. Tufte’s The Visual.
Graphics and visual information English 314 Technical communication Note: To hide or reveal these lecture notes, go to VIEW and click COMMENTS. This lecture.
Making Graphs. The Basics … Graphical Displays Should: induce the viewer to think about the substance rather than about the methodology, graphic design,
Mark P. Baldwin Northwest Research Associates, USA Cargese UTLS Summer School, 6 Oct Data Graphics AndTypography.
The Center for IDEA Early Childhood Data Systems April 25, 2014 Data Visualization: A Picture’s Worth a Thousand Numbers Nick Ortiz, Alice Ridgway and.
Mark P. Baldwin Northwest Research Associates, USA Cargese UTLS Summer School, 6 Oct Data Graphics AndTypography.
CMPT 880/890 Writing labs. Outline Presenting quantitative data in visual form Tables, charts, maps, graphs, and diagrams Information visualization.
Choose between Access and Excel Right questions, right program If you’re having trouble choosing between Access and Excel, take a moment to answer an important.
CS 235: User Interface Design December 3 Class Meeting Department of Computer Science San Jose State University Fall 2014 Instructor: Ron Mak
Graphical Display and Presentation of Quantitative Information 13 February 2006.
1 Bacon – T. A. Webinar – 7 March 2012 Transforming Assessment with Adaptive Questions Dick Bacon Department of Physics University of Surrey
POLITICAL CARTOONS What they are, what they mean and how we can use them.
Gary Klass Department of Politics and Government Illinois State University.
November 6-9, Seattle, WA Bad Reports: Fixing Their Mistakes Roger Noble Consultant LobsterPot Solutions.
1 Eric Rasmusen, March 10, 2014 Graphs and Tables.
Graphics for Macroeconomics. Principles Graphing is done best when it clearly communicates ideas about data Focus on the main point while preventing distractions.
Visualizing Data in Excel Geof Hileman, FSA Kennell & Associates, Inc June 4, 2012.
How to read a scientific paper
Department of Politics and Government Illinois State University
MIS2502: Data Analytics Principles of Data Visualization David Schuff
Copyright 2010, The World Bank Group. All Rights Reserved. Analysis, Presentation, and Uses of Data, Part II.
COMMUNICATING DATA USING GRAPHICS MIS2502 Data Analytics.
14-1 © 2014 by McGraw-Hill Education. This is proprietary material solely for authorized instructor use. Not authorized for sale or distribution in any.
ANALYTIC HIERARCHY PROCESS MATRIX ABOUT AHP. The AHP Matrix.
1 CSE 2337 Chapter 3 Data Visualization With Excel.
Information Design Trends Unit Three: Information Visualization Lecture 1: Escaping Flatland.
Proposal: Preliminary Results and Discussion. Dos and Don’ts DoDon’t Include initial results if you have them You can also conduct and report on informal.
Data Presentation Adapted by Joanna Wolfe from Marianne W. Zawitz, Bureau of Justice Statistics, October 11, 2000 Presenting effective Tables and Figures.
MIS2502: Data Analytics Principles of Data Visualization.
Recap Iterative and Combination of Data Visualization Unique Requirements of Project Avoid to take much Data Audience of Problem.
Andrew Barnes February Why use charts and graphics? It gives a visual representation to numbers and statistics. It is simple to use and easy to.
Online Intelligence Solutions Data Visualization and Dashboard Design LIVE WEBINAR 04/04/2013.
Testing Tufte Applying Visual Design Principles to Student Test Results Dan Gilbert Mike Griffin
Assignment 7: Thinking about graphical excellence By: Sarah K. Brooks.
MIS5101: What is Analytics? Principles of Data Visualization.
MIS 420: Data Visualization, Representation, and Presentation Content adapted from Chapter 2 and 3 of
Copyright © 2013, 2010 and 2007 Pearson Education, Inc. Chapter Organizing and Summarizing Data 2.
Event: Perspective User Conference 2016 Topic:Business Intelligence Speaker: Jonathon Carrell LinkedIn: Blog:jonathoncarrell.net.
Presented to: By: Date: Federal Aviation Administration Effective Data Presentation Fernwood Avenue Middle School December 21, 2011 Ferne Friedman-Berg,
Data Visualization vs. Infographics
Graphics in Expository Writing: A Guide
Display of Quantitative Information
MIS2502: Data Analytics Principles of Data Visualization
Data Visualization Data visualization principles. Tell a story
Proposal: Preliminary Results and Discussion
Visualization Week 8.
MIS2502: Data Analytics Principles of Data Visualization
MIS2502: Data Analytics Principles of Data Visualization
What’s the problem? Goodson
Keller: Stats for Mgmt & Econ, 7th Ed
Garr Reynolds wrote a book in 2008, and he has a follow up book published this year (2010) called Presentation Zen Design – the library has both books.
Keller: Stats for Mgmt & Econ, 7th Ed
Presentation transcript:

Introduction to Data Analytics Part Four: Principles of Data Visualization David Schuff

What makes a good chart? Minard’s map of Napoleon’s campaign into Russia, 1869 Reprinted in Tufte (2009), p. 41

What makes a good chart? http://www.popvssoda.com/countystats/total-county.html

What makes a good chart? This is from an academic conference paper. What are the problems with this chart? The legend is for how often they find relevant information (given their “following” behavior”) Zhang et al. (2010), “A case study of micro-blogging in the enterprise: use, value, and related issues,” Proceedings of the 28th International Conference on Human Factors in Computing Systems.

Some basic principles (adapted from Tufte 2009) The chart should tell a story 1 The chart should have graphical integrity 2 The chart should minimize graphical complexity 3 Tufte’s fundamental principle: Above all else show the data

Principle 1: The chart should tell a story Graphics should be clear on their own The depictions should enable meaningful comparison The chart should yield insight beyond the text “If the statistics are boring, then you’ve got the wrong numbers.” (Tufte 2009)

Examples? http://www.evl.uic.edu/aej/491/week03.html http://flowingdata.com/2009/11/26/fox-news-makes-the-best-pie-chart-ever/

Telling a Story http://flowingdata.com/2011/01/19/states-with-the-most-and-least-firearms-murders/ http://economix.blogs.nytimes.com/2009/05/05/obesity-and-the-fastness-of-food/

Principle 2: The chart should have graphical integrity Basically, it shouldn’t “lie” (mislead the reader) Tufte’s “Lie Factor”: 𝐿𝑖𝑒 𝐹𝑎𝑐𝑡𝑜𝑟= 𝑠𝑖𝑧𝑒 𝑜𝑓 𝑒𝑓𝑓𝑒𝑐𝑡 𝑠ℎ𝑜𝑤𝑛 𝑖𝑛 𝑔𝑟𝑎𝑝ℎ𝑖𝑐 𝑠𝑖𝑧𝑒 𝑜𝑓 𝑒𝑓𝑓𝑒𝑐𝑡 𝑖𝑛 𝑑𝑎𝑡𝑎 Should be ~ 1 < 1 = understated effect > 1 = exaggerated effect

Examples of the “lie factor” 𝐿𝐹= 5.3/0.6 27.5/18 = 8.83 1.53 =5.77 𝐿𝐹= 4280% (𝑐ℎ𝑎𝑛𝑔𝑒 𝑖𝑛 𝑣𝑜𝑙𝑢𝑚𝑒) 454% (𝑐ℎ𝑎𝑛𝑔𝑒 𝑖𝑛 𝑝𝑟𝑖𝑐𝑒) =9.4 Reprinted from Tufte (2009), p. 57 & p. 62

A more recent, basic example The original graphic from Real Clear Politics, 2008. (Look at the y-axis) The adjusted graphic. http://20bits.com/articles/politics-and-tuftes-lie-factor/

Other tips to avoid “lying” Adjust for inflation Make sure the context is presented vs.

Principle 3: The chart should minimize graphical complexity Generally, the simpler the better… Key concepts Sometimes a table is better Data-ink Chartjunk

When a table is better than a chart For a few data points, a table can do just as well… Salesperson Total Sales Peacock $225,763.68 Leverling $201,196.27 Davolio $182,500.09 Fuller $162,503.78 Callahan $123,032.67 King $116,962.99 Dodsworth $75,048.04 Suyama $72,527.63 Buchanan $68,792.25 The table carries more information in less space and is more precise.

The Ultimate Table: The Box Score Large amount of information in a very small space So why does this work? Depends on the reader’s knowledge of the data

Sales Performance – March 2011 The Business Box Score? Sales Performance – March 2011 Salesperson TS WD BD NC DOR Peacock 225 3 40 20 28 Leverling 201 2 45 18 27 Davolio 182 5 38 22 Fuller 162 16 Callahan 123 1 15 14 King 116 0.5 12 Dodsworth 75 0.3 10 Suyama 72 8 Buchanan 68 Applying the same concept to our salesforce example. How does this help? How could it hurt? Key: TS – total sales WD – worst day BD – best day NC – number of customers DOR – days on the road

Data Ink Should be ~ 1 The amount of “ink” devoted to data in a chart Tufte’s Data-Ink ratio: 𝐷𝑎𝑡𝑎−𝑖𝑛𝑘 𝑟𝑎𝑡𝑖𝑜= 𝑑𝑎𝑡𝑎−𝑖𝑛𝑘 𝑡𝑜𝑡𝑎𝑙 𝑖𝑛𝑘 𝑢𝑠𝑒𝑑 𝑖𝑛 𝑔𝑟𝑎𝑝ℎ𝑖𝑐 Should be ~ 1 < 1 = more non-data related ink in graphic = 1 implies all ink devoted to data Tufte’s principle: Erase ink whenever possible

Being conscious of data ink Lower data-ink ratio (worse) Higher data-ink ratio (better)

What makes a good chart? Sometimes it’s really a matter of preference. These both minimize data ink. Why isn’t a table better here?

3-D Charts Evaluate this from a data-ink perspective. How does it affect the clarity of the chart?

Chartjunk: Data Ink “gone wild” Unnecessary visual clutter that doesn’t provide additional insight Distraction from the story the chart is supposed to convey When the data-ink ratio is low, chartjunk is likely to be high

Example: Moiré effects (Tufte 2009) Creates illusion of movement Stands out, in a bad way

Example: The Grid Why are these examples of chartjunk? What could you do to remedy it?

Data Ink Working Against Us Evaluate this chart in terms of Data Ink. Are there better visualizations?

Data Ink Working For Us Evaluate this chart in terms of Data Ink. Imagine this as a bar chart. As a table!!

Stacked Bar Charts are Often Trouble Original chart from the BBC website Why is this so difficult to read? What would be a better way to visualize it? http://j-walkblog.com/index.php?/weblog/posts/bad_charts/

Key Questions: Can you answer… What are three aspects of a good graphic? How can a chart “lie”? What is the Data Ink ratio and how does it relate to Chartjunk? When is a table better than a chart?