Visualizing Data and Communicating Information

Slides:



Advertisements
Similar presentations
Sociology 549, Lecture 3 Graphs by Paul von Hippel.
Advertisements

1 1 Slide © 2015 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Reading Graphs and Charts are more attractive and easy to understand than tables enable the reader to ‘see’ patterns in the data are easy to use for comparisons.
Types of Data Displays Based on the 2008 AZ State Mathematics Standard.
Chapter 2 Graphs, Charts, and Tables – Describing Your Data
Quantitative Data Analysis: Univariate (cont’d) & Bivariate Statistics
Frequency Distributions Chapter 3 Homework: 1, 2, 3, 12.
Guilford County SciVis V105.01
CS1100: Computer Science and Its Applications Creating Graphs and Charts in Excel.
Info Vis: Multi-Dimensional Data Chris North cs3724: HCI.
Charts and Graphs V
©The McGraw-Hill Companies, Inc. 2008McGraw-Hill/Irwin Describing Data: Frequency Tables, Frequency Distributions, and Graphic Presentation Chapter 2.
Quantitative Skills 1: Graphing
Examples of different formulas and their uses....
Chapter 03: Lecture Notes (CSIT 104) 11 Chapter 3 Charts: Delivering a Message Exploring Microsoft Office Excel 2007.
The Scientific Method Honors Biology Laboratory Skills.
 Frequency Distribution is a statistical technique to explore the underlying patterns of raw data.  Preparing frequency distribution tables, we can.
A Picture Is Worth A Thousand Words. DAY 7: EXCEL CHAPTER 4 Tazin Afrin September 10,
Graphing Data: Introduction to Basic Graphs Grade 8 M.Cacciotti.
Visual Display of Data Brad McMillen Evaluation and Research Department.
Applied Quantitative Analysis and Practices
Using Google Sheets To help with data. Sheets is a spreadsheet program that can interface with Docs, or Slides A spreadsheet program has cells (little.
© 2004 by the McGraw-Hill Companies, Inc. All rights reserved. Lecture 29 Enhancing Presentations with Graphics (2)
CS 235: User Interface Design November 19 Class Meeting Department of Computer Science San Jose State University Fall 2014 Instructor: Ron Mak
CS 235: User Interface Design April 30 Class Meeting Department of Computer Science San Jose State University Spring 2015 Instructor: Ron Mak
Chapter 3: Organizing Data. Raw data is useless to us unless we can meaningfully organize and summarize it (descriptive statistics). Organization techniques.
Effective Visuals Tables Graphs Charts Illustrations.
Statistical Fundamentals: Using Microsoft Excel for Univariate and Bivariate Analysis Alfred P. Rovai Charts Overview PowerPoint Prepared by Alfred P.
Chapter 5: Organizing and Displaying Data. Learning Objectives Demonstrate techniques for showing data in graphical presentation formats Choose the best.
Copyright © 2009 Pearson Education, Inc. 3.2 Picturing Distributions of Data LEARNING GOAL Be able to create and interpret basic bar graphs, dotplots,
CS 235: User Interface Design May 12 Class Meeting Department of Computer Science San Jose State University Spring 2015 Instructor: Ron Mak
© 2010 Pearson Education, Inc. All rights reserved Data Analysis/Statistics: An Introduction Chapter 10.
DATA VISUALIZATION BOB MARSHALL, MD MPH MISM FAAFP FACULTY, DOD CLINICAL INFORMATICS FELLOWSHIP.
Integrating Graphics, Illustrations, Figures, Charts.
CHART SELECTION TOOLKIT How To Choose The Right Chart For Your Audience.
Data organization and Presentation. Data Organization Making it easy for comparison and analysis of data Arranging data in an orderly sequence or into.
Organizing Qualitative Data
Visualizing your data effectively
Chapter 12 Visual Representation of Data
QM222 Nov. 9 Section D1 Visualizing Using Graphs More on your project Test returned QM222 Fall 2016 Section D1.
Charts & Graphs CTEC V
Exploring Microsoft Office Excel 2007
Chapter 15 : Communicating Evidence Visually
10 Chapter Data Analysis/Statistics: An Introduction
Guilford County SciVis V105.01
3.2 Picturing Distributions of Data
Unit 4 Statistical Analysis Data Representations
Tutorial 4: Enhancing a Workbook with Charts and Graphs
Charts and Graphs V
QM222 A1 Visualizing data using Excel graphs
How could data be used in an EPQ?
Ms jorgensen Unit 1: Statistics and Graphical Representations
Applied Statistical Analysis
Module 6: Presenting Data: Graphs and Charts
MASTERING CHART SELECTION
CSc4730/6730 Scientific Visualization
Graphical Techniques.
Presenting Data.
Lecture 3 part-2: Organization and Summarization of Data
CSc4730/6730 Scientific Visualization
Graphical Techniques.
Interpreting Data: Graphs & Charts (1)
Topic 7: Visualization Lesson 1 – Creating Charts in Excel
Organizing Qualitative Data
Statistical Reasoning
Purpose of Displaying Data
Essentials of Statistics for Business and Economics (8e)
Charts A chart is a graphic or visual representation of data
Fu Jen Catholic University
Presentation transcript:

Visualizing Data and Communicating Information Larry Jerome: Data Architect KIPP DC

Introductions Name Organization/Role Most commonly used platform (e.g. excel, R, Tableau, etc. ) What are you currently working on or what’s currently challenging you? What are you currently working on? Use this for activity 1

What is data visualization? Add picture

What is data visualization? A primary goal of data visualization is to communicate information clearly and efficiently to users graphics, plots, information graphics, tables, and charts Source: Jen Underwood Business Intelligence and Predictive Analytics at Impact Analytix, April 10, 2013 Source: Wikipedia; Jen Underwood: Business Intelligence and Predictive Analytics, April 10, 2013

What is data visualization? Two basic types of visualization Exploratory Explanatory Source: Jen Underwood Business Intelligence and Predictive Analytics at Impact Analytix, April 10, 2013 Exploratory for live reporting-attendance, also grades Explanatory for period reporting-End of window test scores, grades Source: Wikipedia; Jen Underwood: Business Intelligence and Predictive Analytics, April 10, 2013

Why We Visualize What insights can be easily drawn from this table?

Why We Visualize What insights can be easily drawn from this visualization?

Process Of Developing Visualizations Identify your purpose Quantify it Visualize it

Identifying Your Purpose What questions do you need to answer? Who is your audience? What questions do they have? What action will be taken? Audience: Role: what decisions do they make? What questions do they need answered? work flow: in what context will it be used? What information is needed regularly? How much time do they have? Comfort and skills: how sophisticated are they with using the data? Do they enjoy digging into the numbers? Business and data expertise: How familiar are they with the metrics? Do they understand where the data comes from? Are they familiar with the organization and terminology? The special ed team wants to close the achievement gap between special ed and gen ed students. How can you help inform their decision making and planning? How can we use blended learning platforms effectively? Source: Rebecca Vichniac: KIPP Data Summit, June 2014

Identifying Your Purpose: Who is your audience Role: what decisions do they make? What questions do they need answered? Work flow: in what context will it be used? What information is needed regularly? How much time do they have? Comfort and skills: how sophisticated are they with using the data? Do they enjoy digging into the numbers? Business and data expertise: How familiar are they with the metrics? Do they understand where the data comes from? Are they familiar with the organization and terminology? Audience: Role: what decisions do they make? What questions do they need answered? work flow: in what context will it be used? What information is needed regularly? How much time do they have? Comfort and skills: how sophisticated are they with using the data? Do they enjoy digging into the numbers? Business and data expertise: How familiar are they with the metrics? Do they understand where the data comes from? Are they familiar with the organization and terminology? The special ed team wants to close the achievement gap between special ed and gen ed students. How can you help inform their decision making and planning? How can we use blended learning platforms effectively? Source: Rebecca Vichniac: KIPP Data Summit, June 2014

Activity 1: Identifying Your Purpose

Choosing The Right Metrics What are some examples of metrics you use? Why do you use them?

Choosing The Right Metrics: What Is A Metric? What are some examples of metrics you use? Why do you use them?

Choosing The Right Metrics: What Is A Metric? What are some examples of metrics you use? Why do you use them?

Choosing the right metric The Perfect Metric Actionable Common Interpretation Transparent & Simple Calculation Accessible & Accurate Data Actionable Align to organizational mission or strategy Identify specific, quantifiable outputs/outcomes Establish targets against which results can be evaluated 3 bullet points focus on actionability

Choosing The Right Metric: Evaluating Metrics What are the limitations of this metric? Is there any analysis to show that these are the right metrics? Do these metrics change depending on the audience? Source: Rebecca Vichniac: KIPP Data Summit, June 2014

Activity 2: Quantify It

Choosing The Right Chart Type Comparison Distribution Composition Relationship

Choosing The Right Chart Type

Choosing The Right Chart Type: Comparison Among Items Among items: Bars and columns are the best: encodes quantitative values as length on the same baseline Two variables per item: use color gradient instead of width when possible-width can distort height comparison and people are good at distinguishing colors Bar charts, when ranking should be in order Do no overlap bars Over time Time should generally be on the X axis especially if there are many periods Cyclical data can sometimes be more effectively shown on a line chart Use line chart to show and compare trends of categories within a dimension Use a bar chart to compare in time periods http://www.tableausoftware.com/sites/default/files/media/whitepaper_visual-analysis-guidebook.pdf Dimensions= attributes of the data (term, grade, quartiles) Over Time

Choosing The Right Chart Type: Distribution Single Variable Show the distribution of values per interval Can also use a box plot to show the changes in distribution over time interval or dimension categories -highlights values not apparent on histograms Counts of the values per interval from lowest to highest Show the distribution of a single measure Box plot (box and whisker plot) Useful for showing multiple distributions Easily identify important data points Show how a distribution has changed over time Histogram Useful for showing data with unusual distributions Show distributions at a higher level of detail Show the distribution of multiple measures Scatterplot Similar to correlation chart Two (or more) Variables

Choosing The Right Chart Type: Composition Changing over time Values represent parts (proportions) of a whole. Many period=continuous data Few periods=discrete data Stacked bar charts highlight the pattern over time intervals Components of components: stacked bar male vs female sped and not sped Vertical bars only! Static

Choosing the right chart type: Relationship Comparison of two or more sets of values Running a simple correlation analysis can help identify relationships between measures, but remember that correlation doesn’t guarantee a relationship. Scatterplots Typically used to show the relationship between 2 variables Using color and size to show 3 and 4 dimensions Parallel line chart Compare the changes in slope and value of related measures Bar charts May be used when the audience is not familiar with scatterplots

Enhancing Visualizations Color Hue Color Saturation Size Emphasize the most important idea put the most important data on the X- or Y-axis and less important data on the color, size or shape. Physical position in the easiest to perceive and the most powerful visual property (start at the top left) Orient your views for legibility If you have a view that only has long labels that only fit vertically, try rotating the view. Make it easy to compare values, especially when differences in values are small -Encode quantitative data accurately Choose the proper axis keep them consistent across views axis of different lengths can exaggerate slope choose the right context (length of time) keep intervals consistent Using consistent color gradients Avoid overloading your views Too much information can obscure the point Instead of stacking many measures and dimensions into one condensed view, break them down into small multiples Use text sparingly Do not use 3-D views Avoid noisy fill patterns, line styles, or saturated/bright colors, and borders (except for highlighting values) Limit the number tick marks

Enhancing Visualizations Emphasize the most important idea put the most important data on the X- or Y-axis and less important data on the color, size or shape. Physical position in the easiest to perceive and the most powerful visual property (start at the top left) Orient your views for legibility If you have a view that only has long labels that only fit vertically, try rotating the view. Make it easy to compare values, especially when differences in values are small -Encode quantitative data accurately Choose the proper axis keep them consistent across views axis of different lengths can exaggerate slope choose the right context (length of time) keep intervals consistent Using consistent color gradients Avoid overloading your views Too much information can obscure the point Instead of stacking many measures and dimensions into one condensed view, break them down into small multiples Use text sparingly Do not use 3-D views Avoid noisy fill patterns, line styles, or saturated/bright colors, and borders (except for highlighting values) Limit the number tick marks

Enhancing Visualizations Use analogous colors for ordinal or interval values (continuous) Use complimentary colors for nominal values, and comparisons (discrete) Emphasize the most important idea put the most important data on the X- or Y-axis and less important data on the color, size or shape. Physical position in the easiest to perceive and the most powerful visual property (start at the top left) Orient your views for legibility If you have a view that only has long labels that only fit vertically, try rotating the view. Make it easy to compare values, especially when differences in values are small -Encode quantitative data accurately Choose the proper axis keep them consistent across views axis of different lengths can exaggerate slope choose the right context (length of time) keep intervals consistent Using consistent color gradients Avoid overloading your views Too much information can obscure the point Instead of stacking many measures and dimensions into one condensed view, break them down into small multiples Use text sparingly Do not use 3-D views Avoid noisy fill patterns, line styles, or saturated/bright colors, and borders (except for highlighting values) Limit the number tick marks

Enhancing Visualizations Use Complimentary Colors for Comparisons Emphasize the most important idea put the most important data on the X- or Y-axis and less important data on the color, size or shape. Physical position in the easiest to perceive and the most powerful visual property (start at the top left) Orient your views for legibility If you have a view that only has long labels that only fit vertically, try rotating the view. Make it easy to compare values, especially when differences in values are small -Encode quantitative data accurately Choose the proper axis keep them consistent across views axis of different lengths can exaggerate slope choose the right context (length of time) keep intervals consistent Using consistent color gradients Avoid overloading your views Too much information can obscure the point Instead of stacking many measures and dimensions into one condensed view, break them down into small multiples Use text sparingly Do not use 3-D views Avoid noisy fill patterns, line styles, or saturated/bright colors, and borders (except for highlighting values) Limit the number tick marks

Enhancing Visualizations Emphasize the most important idea put the most important data on the X- or Y-axis and less important data on the color, size or shape. Physical position in the easiest to perceive and the most powerful visual property (start at the top left) Orient your views for legibility If you have a view that only has long labels that only fit vertically, try rotating the view. Make it easy to compare values, especially when differences in values are small -Encode quantitative data accurately Choose the proper axis keep them consistent across views axis of different lengths can exaggerate slope choose the right context (length of time) keep intervals consistent Using consistent color gradients Avoid overloading your views Too much information can obscure the point Instead of stacking many measures and dimensions into one condensed view, break them down into small multiples Use text sparingly Do not use 3-D views Avoid noisy fill patterns, line styles, or saturated/bright colors, and borders (except for highlighting values) Limit the number tick marks Identify Groups of Attributes Use fewer than 6 colors

Enhancing Visualizations Emphasize the most important idea put the most important data on the X- or Y-axis and less important data on the color, size or shape. Physical position in the easiest to perceive and the most powerful visual property (start at the top left) Orient your views for legibility If you have a view that only has long labels that only fit vertically, try rotating the view. Make it easy to compare values, especially when differences in values are small -Encode quantitative data accurately Choose the proper axis keep them consistent across views axis of different lengths can exaggerate slope choose the right context (length of time) keep intervals consistent Using consistent color gradients Avoid overloading your views Too much information can obscure the point Instead of stacking many measures and dimensions into one condensed view, break them down into small multiples Use text sparingly Do not use 3-D views Avoid noisy fill patterns, line styles, or saturated/bright colors, and borders (except for highlighting values) Limit the number tick marks Highlight Metric Patterns Less saturation: small values More saturation: larger values

Enhancing Visualizations: Color Saturation Emphasize the most important idea put the most important data on the X- or Y-axis and less important data on the color, size or shape. Physical position in the easiest to perceive and the most powerful visual property (start at the top left) Orient your views for legibility If you have a view that only has long labels that only fit vertically, try rotating the view. Make it easy to compare values, especially when differences in values are small -Encode quantitative data accurately Choose the proper axis keep them consistent across views axis of different lengths can exaggerate slope choose the right context (length of time) keep intervals consistent Using consistent color gradients Avoid overloading your views Too much information can obscure the point Instead of stacking many measures and dimensions into one condensed view, break them down into small multiples Use text sparingly Do not use 3-D views Avoid noisy fill patterns, line styles, or saturated/bright colors, and borders (except for highlighting values) Limit the number tick marks Add Size to Emphasize Metric Trends

Enhancing Visualizations Emphasize the most important idea put the most important data on the X- or Y-axis and less important data on the color, size or shape. Physical position in the easiest to perceive and the most powerful visual property (start at the top left) Orient your views for legibility If you have a view that only has long labels that only fit vertically, try rotating the view. Make it easy to compare values, especially when differences in values are small -Encode quantitative data accurately Choose the proper axis keep them consistent across views axis of different lengths can exaggerate slope choose the right context (length of time) keep intervals consistent Using consistent color gradients Avoid overloading your views Too much information can obscure the point Instead of stacking many measures and dimensions into one condensed view, break them down into small multiples Use text sparingly Do not use 3-D views Avoid noisy fill patterns, line styles, or saturated/bright colors, and borders (except for highlighting values) Limit the number tick marks

Enhancing Visualizations Emphasize the most important idea put the most important data on the X- or Y-axis and less important data on the color, size or shape. Physical position in the easiest to perceive and the most powerful visual property (start at the top left) Orient your views for legibility If you have a view that only has long labels that only fit vertically, try rotating the view. Make it easy to compare values, especially when differences in values are small -Encode quantitative data accurately Choose the proper axis keep them consistent across views axis of different lengths can exaggerate slope choose the right context (length of time) keep intervals consistent Using consistent color gradients Avoid overloading your views Too much information can obscure the point Instead of stacking many measures and dimensions into one condensed view, break them down into small multiples Use text sparingly Do not use 3-D views Avoid noisy fill patterns, line styles, or saturated/bright colors, and borders (except for highlighting values) Limit the number tick marks

Enhancing Visualizations Emphasize the most important idea put the most important data on the X- or Y-axis and less important data on the color, size or shape. Physical position in the easiest to perceive and the most powerful visual property (start at the top left) Orient your views for legibility If you have a view that only has long labels that only fit vertically, try rotating the view. Make it easy to compare values, especially when differences in values are small -Encode quantitative data accurately Choose the proper axis keep them consistent across views axis of different lengths can exaggerate slope choose the right context (length of time) keep intervals consistent Using consistent color gradients Avoid overloading your views Too much information can obscure the point Instead of stacking many measures and dimensions into one condensed view, break them down into small multiples Use text sparingly Do not use 3-D views Avoid noisy fill patterns, line styles, or saturated/bright colors, and borders (except for highlighting values) Limit the number tick marks

Enhancing Visualizations Emphasize the most important idea put the most important data on the X- or Y-axis and less important data on the color, size or shape. Physical position in the easiest to perceive and the most powerful visual property (start at the top left) Orient your views for legibility If you have a view that only has long labels that only fit vertically, try rotating the view. Make it easy to compare values, especially when differences in values are small -Encode quantitative data accurately Choose the proper axis keep them consistent across views axis of different lengths can exaggerate slope choose the right context (length of time) keep intervals consistent Using consistent color gradients Avoid overloading your views Too much information can obscure the point Instead of stacking many measures and dimensions into one condensed view, break them down into small multiples Use text sparingly Do not use 3-D views Avoid noisy fill patterns, line styles, or saturated/bright colors, and borders (except for highlighting values) Limit the number tick marks

Activity 3: Visualize It

Best in Data Visualization https://public.tableau.com/s/gallery Filter to education Explore