Graphical Display 1 Pictures of Data. Edward Tufte Visual Display of Quantitative Information Using graphics for –Data –Concepts –Instruction –Finding.

Slides:



Advertisements
Similar presentations
Histograms Bins are the bars Counts are the heights Relative Frequency Histograms have percents on vertical axis.
Advertisements

Displaying Data Objectives: Students should know the typical graphical displays for the different types of variables. Students should understand how frequency.
So What Do We Know? Variables can be classified as qualitative/categorical or quantitative. The context of the data we work with is very important. Always.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
The visual display of quantitative data Joyce Chapman, Consultant for Communications & Data Analysis State Library of North Carolina,
Statistics 100 Lecture Set 6. Re-cap Last day, looked at a variety of plots For categorical variables, most useful plots were bar charts and pie charts.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Section 2-4 Statistical Graphics.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Lecture Slides Elementary Statistics Eleventh Edition and the Triola.
Descriptive Statistics Summarizing data using graphs.
Beginning the Visualization of Data
1.1 Displaying and Describing Categorical & Quantitative Data.
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Lecture Slides Elementary Statistics Tenth Edition and the.
Graphic representations in statistics (part II). Statistics graph Data recorded in surveys are displayed by a statistical graph. There are some specific.
Chapter 1: Looking at Data - Distributions:
Statistics: Unlocking the Power of Data Lock 5 STAT 101 Dr. Kari Lock Morgan 9/6/12 Describing Data: One Variable SECTIONS 2.1, 2.2, 2.3, 2.4 One categorical.
Types of Data Displays Based on the 2008 AZ State Mathematics Standard.
1 Determining Effective Data Display with Charts.
Graphical Displays of Data Section 2.2. Objectives Create and interpret the basic types of graphs used to display data.
Frequency Distributions and Graphs
Graphing Examples Categorical Variables
Examining Univariate Distributions Chapter 2 SHARON LAWNER WEINBERG SARAH KNAPP ABRAMOWITZ StatisticsSPSS An Integrative Approach SECOND EDITION Using.
Jeffrey Nichols Displaying Quantitative Information May 2, 2003 Slide 0 Displaying Quantitative Information An exploration of Edward R. Tufte’s The Visual.
Charts and Graphs V
© 2005 The McGraw-Hill Companies, Inc., All Rights Reserved. Chapter 12 Describing Data.
Programming in R Describing Univariate and Multivariate data.
Advantages and Disadvantages
Exploratory Data Analysis. Computing Science, University of Aberdeen2 Introduction Applying data mining (InfoVis as well) techniques requires gaining.
Let’s Review for… AP Statistics!!! Chapter 1 Review Frank Cerros Xinlei Du Claire Dubois Ryan Hoshi.
Univariate EDA (Exploratory Data Analysis). EDA John Tukey (1970s) data –two components: smooth + rough patterned behaviour + random variation resistant.
Chapter 2 Summarizing and Graphing Data
Slide Slide 1 Copyright © 2007 Pearson Education, Inc Publishing as Pearson Addison-Wesley. Created by Tom Wegleitner, Centreville, Virginia Edited by.
CHAPTER 2 Graphical Descriptions of Data. SECTION 2.1 Frequency Distributions.
ACOT Intro/Copyright Succeeding in Business with Microsoft Excel 2010: Chapter1.
ITEC6310 Research Methods in Information Technology Instructor: Prof. Z. Yang Course Website: c6310.htm Office:
Quantitative Skills 1: Graphing
The introduction to SPSS Ⅱ.Tables and Graphs for one variable ---Descriptive Statistics & Graphs.
Business Statistics: Communicating with Numbers By Sanjiv Jaggia and Alison Kelly McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc.
Univariate Data Chapters 1-6. UNIVARIATE DATA Categorical Data Percentages Frequency Distribution, Contingency Table, Relative Frequency Bar Charts (Always.
1 Laugh, and the world laughs with you. Weep and you weep alone.~Shakespeare~
StatisticsStatistics Graphic distributions. What is Statistics? Statistics is a collection of methods for planning experiments, obtaining data, and then.
Section 2.2: More Graphs and Displays. Objective: To be able to create and analyze a variety of graphical displays. 5. Stem and leaf plot: a way of viewing.
Tables and Graphing. Displaying Data Sometimes it is easier to read data in a visual format. This can come in the form of tables, graphs, charts, etc.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Section 2-2 Frequency Distributions.
Bellwork 1. If a distribution is skewed to the right, which of the following is true? a) the mean must be less than the.
Unit 4 Statistical Analysis Data Representations.
GrowingKnowing.com © Frequency distribution Given a 1000 rows of data, most people cannot see any useful information, just rows and rows of data.
Statistics Chapter 1: Exploring Data. 1.1 Displaying Distributions with Graphs Individuals Objects that are described by a set of data Variables Any characteristic.
Descriptive statistics Petter Mostad Goal: Reduce data amount, keep ”information” Two uses: Data exploration: What you do for yourself when.
1 CSE 2337 Chapter 3 Data Visualization With Excel.
Section 1-1 Day One Types of Data Bar Graphs, Pie Charts Dots Plots, Stem and leaf plots, Histograms.
Highline Class, BI 348 Basic Business Analytics using Excel Chapter 03: Data Visualization: Tables & Charts 1.
Copyright © Cengage Learning. All rights reserved. 2 Descriptive Analysis and Presentation of Single-Variable Data.
Slide Slide 1 Section 2-4 Statistical Graphics. Slide Slide 2 Key Concept This section presents other graphs beyond histograms commonly used in statistical.
MATH 2311 Section 1.5. Graphs and Describing Distributions Lets start with an example: Height measurements for a group of people were taken. The results.
CCGPS Advanced Algebra Day 1 UNIT QUESTION: How do we use data to draw conclusions about populations? Standard: MCC9-12.S.ID.1-3, 5-9, SP.5 Today’s Question:
Graphs with SPSS Aravinda Guntupalli. Bar charts  Bar Charts are used for graphical representation of Nominal and Ordinal data  Height of the bar is.
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
Descriptive Statistics
Exploring, Displaying, and Examining Data
Unit 4 Statistical Analysis Data Representations
Elementary Applied Statistics
Unit 6 Day 2 Vocabulary and Graphs Review
MATH 2311 Section 1.5.
Statistical Reasoning
Laugh, and the world laughs with you. Weep and you weep alone
Frequency Distributions
Simple plots using R Instructor: Li, Han
Biostatistics Lecture (2).
Dot Plot A _________ is a type of graphic display used to compare frequency counts within categories or groups using dots. __________ help you see how.
Presentation transcript:

Graphical Display 1 Pictures of Data

Edward Tufte Visual Display of Quantitative Information Using graphics for –Data –Concepts –Instruction –Finding patterns and answering questions

Simple Plots One variable or one variable plus a grouping variable Reveals shape of the distribution Distribution of cases over a categorical variable Evidence that metric variable follows a normal distribution

Tufte’s Rules Above all else show data Maximize the data/ink ratio Erase non-data-ink Erase redundant data-ink Revise and edit

Pie Charts One variable, nominal scale, percentages A few categories (3-6) Exploded for emphasis Best in groups (2-4) Best replaced by other graphic displays

Rcmdr pie diagram of language family in the CA Indians data set

3D Pies Strongly discouraged plotrix has them: –Install.packages(“plotrix”) –library(plotrix) –pie3D(table(CAIndians$Language), radius=.85, labels = levels(CAIndians$Language), explode=.1)

Bar Charts One variable, nominal scale, counts or percentages More categories than pie charts(4- 15) Color or shaded Can be stacked, set side-by-side or rotated Can compare 2-3 groups

barplot Rcmdr uses this command: –barplot(table(CAIndians$Language), xlab="Language", ylab="Frequency") Just insert the following before the left parenthesis to get it in color: –, col=rainbow_hcl(6)

Barplots with 2 groups Side by side or stacked bar plots make it possible to compare two categories at a time Load MissFauna and extract the Lilbourn samples: –Lilbourn <- as.matrix(MissFauna[5:7,]) –LilbournPct <- prop.table(Lilbourn, 1)*100

Lilbourn Barplots par(mfrow=c(2,2)) barplot(LilbournPct) barplot(t(LilbournPct)) barplot(LilbournPct, beside=TRUE, legend=TRUE) barplot(t(LilbournPct), beside=TRUE, legend=TRUE)

Dot Chart Dot charts plot the amounts in each group along a common scale so they are more easily comparable: –dotchart(as.vector(table(CAIndians$La nguage)), pch=16, labels = levels(CAIndians$Language), xlab = "Frequency")

Stem and Leaf Plot Tukey proposed as a way of looking at the distribution of a numeric variable With a small sample, can preserve the original data while showing its shape

> stem.leaf(DartPoints$Length, unit=1, na.rm=TRUE) 1 | 2: represents 12 leaf unit: 1 n: * | | * | (11) 4. | * | | * | | * | | 8 1 8* | 4

Histograms Like a bar plot but the x-axis is a continuous measurement. Shape of data distribution is shown, but number of bars can change the shape.

Boxplot Also called box and whiskers plots show the quartiles and outlier points Multiple boxplots let you compare groups

Stripchart A stripchart plots the actual values along the y-axis It hides less information, but is not as familiar to many people

Kernel Density Plot Use the data to approximate a smooth distribution – varies according to the bandwidth A normal (or other distribution) is placed on each point and then the distributions are summed plot(density(DartPoints$Length))

Example x <- c(53, 59, 62, 63, 65, 67, 69, 71, 72, 77) hist(x, col="blue", las=1, cex.axis=1.5, cex.lab=1.5)

plot(density(x), main="Kernel Density Plot") rug(x) xi <- seq(45, 90,.5) for (i in 1:10) lines(xi, dnorm(xi, mean=x[i], sd=3)/10) d <- dnorm(x[1], mean=x[1], sd=3)/10 matlines(rbind(x, x), rbind(rep(0,10), rep(d, 10)), lty=3, lwd=2, col="dark gray")

# Density plots # Load DartPoints.RData par(mfrow=c(2, 2)) # Vary bandwidth plot(density(DartPoints$Length), cex.main=.8) plot(density(DartPoints$Length, bw=2), cex.main=.8) plot(density(DartPoints$Length, bw=6), cex.main=.8) plot(density(DartPoints$Length, bw=8), cex.main=.8) # To compare two distributions par(mfrow=c(1,1)) a <- density(DartPoints$Length[DartPoints$Name=="Darl"]) b <- density(DartPoints$Length[DartPoints$Name=="Pedernales"]) plot(a, main="Dart Points", xlab="Length", ylab="Density", xlim=c(min(a$x, b$x), max(a$x, b$x)), ylim=c(0, max(a$y, b$y)), col="red") lines(b, col="blue") legend("topright", levels(DartPoints$Name), lty=1, col=c("red", "blue"))

Violin Plot Combine box plot and kernel density plot (package vioplot) with(DartPoints, vioplot(Length[Name=="Darl"], Length[Name=="Pedernales"], names=levels(Name)))

Beanplot Add strip plot to violin plot (package beanplot) with(DartPoints, beanplot(Length[Name=="Darl"], Length[Name=="Pedernales"], names=levels(Name)))