Elementary Statistics: Looking at the Big Picture

Slides:



Advertisements
Similar presentations
Describing Quantitative Variables
Advertisements

Understanding and Comparing Distributions 30 min.
CHAPTER 4 Displaying and Summarizing Quantitative Data Slice up the entire span of values in piles called bins (or classes) Then count the number of values.
Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 3 Introduction – Slide 1 of 3 Topic 16 Numerically Summarizing Data- Averages.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
AP Statistics Chapters 0 & 1 Review. Variables fall into two main categories: A categorical, or qualitative, variable places an individual into one of.
Descriptive Statistics  Summarizing, Simplifying  Useful for comprehending data, and thus making meaningful interpretations, particularly in medium to.
© 2008 Brooks/Cole, a division of Thomson Learning, Inc. 1 Chapter 4 Numerical Methods for Describing Data.
Have out your calculator and your notes! The four C’s: Clear, Concise, Complete, Context.
Categorical vs. Quantitative…
1 Chapter 4 Numerical Methods for Describing Data.
Chapter 5: Boxplots  Objective: To find the five-number summaries of data and create and analyze boxplots CHS Statistics.
Organizing Data AP Stats Chapter 1. Organizing Data Categorical Categorical Dotplot (also used for quantitative) Dotplot (also used for quantitative)
More Univariate Data Quantitative Graphs & Describing Distributions with Numbers.
©2011 Brooks/Cole, Cengage Learning Elementary Statistics: Looking at the Big Picture 1 Lecture 7: Chapter 4, Section 3 Quantitative Variables (Summaries,
Describing Data Week 1 The W’s (Where do the Numbers come from?) Who: Who was measured? By Whom: Who did the measuring What: What was measured? Where:
Chapter 4 Histograms Stem-and-Leaf Dot Plots Measures of Central Tendency Measures of Variation Measures of Position.
Descriptive Statistics ( )
UNIT ONE REVIEW Exploring Data.
Exploratory Data Analysis
Describing Distributions Numerically
Chapter 1: Exploring Data
ISE 261 PROBABILISTIC SYSTEMS
Chapter 3 Describing Data Using Numerical Measures
Chapter 5 : Describing Distributions Numerically I
Numerical Descriptive Measures
Unit 4 Statistical Analysis Data Representations
4. Interpreting sets of data
Averages and Variation
NUMERICAL DESCRIPTIVE MEASURES
Laugh, and the world laughs with you. Weep and you weep alone
Numerical Descriptive Measures
Box and Whisker Plots Algebra 2.
2.6: Boxplots CHS Statistics
DAY 3 Sections 1.2 and 1.3.
Topic 5: Exploring Quantitative data
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
STA 291 Spring 2008 Lecture 5 Dustin Lueker.
Quartile Measures DCOVA
Warmup Draw a stemplot Describe the distribution (SOCS)
Displaying Distributions with Graphs
Displaying and Summarizing Quantitative Data
POPULATION VS. SAMPLE Population: a collection of ALL outcomes, responses, measurements or counts that are of interest. Sample: a subset of a population.
Organizing Data AP Stats Chapter 1.
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Exploratory Data Analysis
Chapter 1: Exploring Data
Measures of Position Section 3.3.
Describing Distributions Numerically
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Lecture 8: Chapter 4, Section 4 Quantitative Variables (Normal)
Chapter 1: Exploring Data
Advanced Algebra Unit 1 Vocabulary
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Types of variables. Types of variables Categorical variables or qualitative identifies basic differentiating characteristics of the population.
Lesson Plan Day 1 Lesson Plan Day 2 Lesson Plan Day 3
Analyze Data: IQR and Outliers
Module 8: Two-Way Frequency Tables
Business and Economics 7th Edition
Chapter 1: Exploring Data
Presentation transcript:

Elementary Statistics: Looking at the Big Picture (C) 2007 Nancy Pfenning Lecture 6: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries) Summarize with Shape, Center, Spread Displays: Stemplots, Histograms Five Number Summary, Outliers, Boxplots ©2011 Brooks/Cole, Cengage Learning Elementary Statistics: Looking at the Big Picture Elementary Statistics: Looking at the Big Picture

Elementary Statistics: Looking at the Big Picture (C) 2007 Nancy Pfenning Looking Back: Review 4 Stages of Statistics Data Production (discussed in Lectures 1-4) Displaying and Summarizing Single variables: 1 cat. (Lecture 5), 1 quantitative Relationships between 2 variables Probability Statistical Inference ©2011 Brooks/Cole, Cengage Learning Elementary Statistics: Looking at the Big Picture Elementary Statistics: Looking at the Big Picture

Example: Issues to Consider (C) 2007 Nancy Pfenning Example: Issues to Consider Background: Intro stat students’ earnings (in $1000s) previous year: 12, 3, 7, 1, … [survey was anonymous]. Questions: What population do the data represent? Were responses unbiased? Responses: All students at that university, if sample was representative in terms of gender, age, and major. Probably unbiased because it was anonymous. Looking Back: These are data production issues. ©2011 Brooks/Cole, Cengage Learning Elementary Statistics: Looking at the Big Picture Practice: 4.17 p.92 Elementary Statistics: Looking at the Big Picture

Example: More Issues to Consider (C) 2007 Nancy Pfenning Example: More Issues to Consider Background: Intro stat students’ earnings (in $1000s) previous year: 12, 3, 7, 1, … [survey was anonymous]. Questions: How do we summarize the data? Sample average was $3776. Can we conclude population average was less than $5000? Responses: Mean and other summaries are the focus of this part. We’re not yet ready to draw broader conclusions. Looking Ahead: This is an inference question, to be addressed in Part Four. ©2011 Brooks/Cole, Cengage Learning Elementary Statistics: Looking at the Big Picture Practice: 4.29c p.104 Elementary Statistics: Looking at the Big Picture

Elementary Statistics: Looking at the Big Picture (C) 2007 Nancy Pfenning Definitions Distribution: tells all possible values of a variable and how frequently they occur Summarize distribution of a quantitative variable by telling shape, center, spread. Shape: tells which values tend to be more or less common Center: measure of what is typical in the distribution of a quantitative variable Spread: measure of how much the distribution’s values vary ©2011 Brooks/Cole, Cengage Learning Elementary Statistics: Looking at the Big Picture Elementary Statistics: Looking at the Big Picture

Elementary Statistics: Looking at the Big Picture (C) 2007 Nancy Pfenning Definitions Symmetric distribution: balanced on either side of center Skewed distribution: unbalanced (lopsided) Skewed left: has a few relatively low values Skewed right: has a few relatively high values Outliers: values noticeably far from the rest Unimodal: single-peaked Bimodal: two-peaked Uniform: all values equally common (flat shape) Normal: a particular symmetric bell-shape ©2011 Brooks/Cole, Cengage Learning Elementary Statistics: Looking at the Big Picture Elementary Statistics: Looking at the Big Picture

Displays of a Quantitative Variable (C) 2007 Nancy Pfenning Displays of a Quantitative Variable Displays help see the shape of the distribution. Stemplot Advantage: most detail Disadvantage: impractical for large data sets Histogram Advantage: works well for any size data set Disadvantage: some detail lost Boxplot Advantage: shows outliers, makes comparisons CQ Disadvantage: much detail lost ©2011 Brooks/Cole, Cengage Learning Elementary Statistics: Looking at the Big Picture Elementary Statistics: Looking at the Big Picture

Elementary Statistics: Looking at the Big Picture (C) 2007 Nancy Pfenning Definition Stemplot: vertical list of stems, each followed by horizontal list of one-digit leaves stems 1-digit leaves . . . ©2011 Brooks/Cole, Cengage Learning Elementary Statistics: Looking at the Big Picture Elementary Statistics: Looking at the Big Picture

Example: Constructing a Stemplot (C) 2007 Nancy Pfenning Example: Constructing a Stemplot Background: Masses (in 1000 kg) of 20 dinosaurs: 0.0 0.0 0.1 0.2 0.4 0.6 0.7 0.7 1.0 1.1 1.1 1.2 1.5 1.7 1.7 1.8 2.9 3.2 5.0 5.6 Question: Display with stemplot; what does it tell us about the shape? ©2011 Brooks/Cole, Cengage Learning Elementary Statistics: Looking at the Big Picture Practice: 4.38a p.107 Elementary Statistics: Looking at the Big Picture

Example: Constructing a Stemplot (C) 2007 Nancy Pfenning Example: Constructing a Stemplot Background: Masses (in 1000 kg) of 20 dinosaurs: 0.0 0.0 0.1 0.2 0.4 0.6 0.7 0.7 1.0 1.1 1.1 1.2 1.5 1.7 1.7 1.8 2.9 3.2 5.0 5.6 Response: Start with 0 to 5 (ones digits) as stems. 0 | 1 | 2 | 3 | 4 | 5 | ©2011 Brooks/Cole, Cengage Learning Elementary Statistics: Looking at the Big Picture Practice: 4.38a p.107 Elementary Statistics: Looking at the Big Picture

Example: Constructing a Stemplot (C) 2007 Nancy Pfenning Example: Constructing a Stemplot Background: Masses (in 1000 kg) of 20 dinosaurs: 0.0 0.0 0.1 0.2 0.4 0.6 0.7 0.7 1.0 1.1 1.1 1.2 1.5 1.7 1.7 1.8 2.9 3.2 5.0 5.6 Response: Add leaves (tenths) for each stem. 0 | 1 | 2 | 3 | 4 | 5 | 0 0 1 2 4 6 7 7 0 1 1 2 5 7 7 8 9 2 Do not skip the 4 stem: why? 0 6 ©2011 Brooks/Cole, Cengage Learning Elementary Statistics: Looking at the Big Picture Practice: 4.38a p.107 Elementary Statistics: Looking at the Big Picture

Example: Constructing a Stemplot (C) 2007 Nancy Pfenning Example: Constructing a Stemplot Background: Masses (in 1000 kg) of 20 dinosaurs: 0.0 0.0 0.1 0.2 0.4 0.6 0.7 0.7 1.0 1.1 1.1 1.2 1.5 1.7 1.7 1.8 2.9 3.2 5.0 5.6 Response: 0 | 0 0 1 2 4 6 7 7 1 | 0 1 1 2 5 7 7 8 2 | 9 3 | 2 4 | 5 | 0 6 Skewness? Rotate to horizontal, lowest numbers to left. Long right tailright-skewed. 5 | 0 6 0 | 0 0 1 2 4 6 7 7 4 | 1 | 0 1 1 2 5 7 7 8 2 | 9 3 | 2 ©2011 Brooks/Cole, Cengage Learning Elementary Statistics: Looking at the Big Picture Practice: 4.38a p.107 Elementary Statistics: Looking at the Big Picture

Example: Constructing a Stemplot (C) 2007 Nancy Pfenning Example: Constructing a Stemplot Background: Masses (in 1000 kg) of 20 dinosaurs: 0.0 0.0 0.1 0.2 0.4 0.6 0.7 0.7 1.0 1.1 1.1 1.2 1.5 1.7 1.7 1.8 2.9 3.2 5.0 5.6 Response: 0 | 0 0 1 2 4 6 7 7 1 | 0 1 1 2 5 7 7 8 2 | 9 3 | 2 4 | 5 | 0 6 Rotate to horizontal, lowest numbers to left. Long right tailright-skewed. 1 peakunimodal Most below 2000 kg, a few unusually heavy. ©2011 Brooks/Cole, Cengage Learning Elementary Statistics: Looking at the Big Picture Practice: 4.38a p.107 Elementary Statistics: Looking at the Big Picture

Modifications to Stemplots (C) 2007 Nancy Pfenning Modifications to Stemplots Too few stems? Split… Split in 2: 1st stem gets leaves 0-4, 2nd gets 5-9 Split in 5: 1st stem gets leaves 0-1, 2nd gets 2-3, etc. Split in 10: 1st gets 0, …, 10th gets 9. Too many stems? Truncate last digit(s). ©2011 Brooks/Cole, Cengage Learning Elementary Statistics: Looking at the Big Picture Elementary Statistics: Looking at the Big Picture

Example: Splitting Stems (C) 2007 Nancy Pfenning Example: Splitting Stems Background: Credits taken by 14 “other” students: 4 7 11 11 11 13 13 14 14 15 17 17 17 18 Questions: What shape do we guess for non-traditional (other) students? How to construct stemplot to make shape clear? Responses: Expect shape left-skewed due to part-timers Stemplot: 1st attempt has too few stems 0 | 4 7 1 | 1 1 1 3 3 4 4 5 7 7 7 8 0 | 4 0 | 7 1 | 1 1 1 3 3 4 4 1 | 5 7 7 7 8 so split 2 ways: Plot shows expected left skewness. ©2011 Brooks/Cole, Cengage Learning Elementary Statistics: Looking at the Big Picture Elementary Statistics: Looking at the Big Picture

Example: Truncating Digits (C) 2007 Nancy Pfenning Example: Truncating Digits Background: Minutes spent on computer day before 0 10 20 30 30 30 30 45 45 60 60 60 67 90 100 120 200 240 300 420 Question: How to construct stemplot to make shape clear? Response: Stems 0 to 42 too many: truncate last digit, work with 100’s (stems) and 10’s (leaves): 0 | 0 1 2 3 3 3 3 4 4 6 6 6 6 9 1 | 0 2 2 | 0 4 3 | 0 4 | 2 Skewed right: most times less than 100 minutes, but a few had unusually long times. ©2011 Brooks/Cole, Cengage Learning Elementary Statistics: Looking at the Big Picture Practice: 4.25b p.92 Elementary Statistics: Looking at the Big Picture

Displays of a Quantitative Variable (C) 2007 Nancy Pfenning Displays of a Quantitative Variable Stemplot Histogram Boxplot ©2011 Brooks/Cole, Cengage Learning Elementary Statistics: Looking at the Big Picture Elementary Statistics: Looking at the Big Picture

Elementary Statistics: Looking at the Big Picture (C) 2007 Nancy Pfenning Definition Histogram: to display quantitative values… Divide range of data into intervals of equal width. Find count or percent or proportion in each. Use horizontal axis for range of data values, vertical axis for count/percent/proportion in each. ©2011 Brooks/Cole, Cengage Learning Elementary Statistics: Looking at the Big Picture Elementary Statistics: Looking at the Big Picture

Example: Constructing a Histogram (C) 2007 Nancy Pfenning Example: Constructing a Histogram Background: Prices of 12 used upright pianos: 100 450 500 650 695 1100 1200 1200 1600 2100 2200 2300 Question: Construct a histogram for the data; what does it tell us about the shape? Response: Divide into intervals of width 500, find count in each interval. Shape is fairly flat (uniform). We opted to put 500 as left endpoint of 2nd interval; be consistent (a price of 1000 would go in 3rd interval, not 2nd). ©2011 Brooks/Cole, Cengage Learning Elementary Statistics: Looking at the Big Picture Practice: 4.27a p.103 Elementary Statistics: Looking at the Big Picture

Example: Constructing a Histogram (C) 2007 Nancy Pfenning Example: Constructing a Histogram Background: Prices of 12 used upright pianos: 100 450 500 650 695 1100 1200 1200 1600 2100 2200 2300 Question: Construct a histogram for the data; what does it tell us about the shape? Response: Divide into intervals of width 500, find count in each interval. Shape is fairly flat (uniform). Note: It is also possible to identify midpoints instead of cutpoints. ©2011 Brooks/Cole, Cengage Learning Elementary Statistics: Looking at the Big Picture Practice: 4.27a p.103 Elementary Statistics: Looking at the Big Picture

Elementary Statistics: Looking at the Big Picture (C) 2007 Nancy Pfenning Definitions (Review) Shape: tells which values tend to be more or less common Center: measure of what is typical in the distribution of a quantitative variable Spread: measure of how much the distribution’s values vary ©2011 Brooks/Cole, Cengage Learning Elementary Statistics: Looking at the Big Picture Elementary Statistics: Looking at the Big Picture

Elementary Statistics: Looking at the Big Picture (C) 2007 Nancy Pfenning Definitions Median: a measure of center: the middle for odd number of values average of middle two for even number of values Quartiles: measures of spread: 1st Quartile (Q1) has one-fourth of data values at or below it (middle of smaller half) 3rd Quartile (Q3) has three-fourths of data values at or below it (middle of larger half) (By hand, for odd number of values, omit median to find quartiles.) ©2011 Brooks/Cole, Cengage Learning Elementary Statistics: Looking at the Big Picture Elementary Statistics: Looking at the Big Picture

Elementary Statistics: Looking at the Big Picture (C) 2007 Nancy Pfenning Definitions Percentile: value at or below which a given percentage of a distribution’s values fall A Closer Look: Q1 is 25th percentile, Q3 is 75th percentile. Range: difference between maximum and minimum values Interquartile range: tells spread of middle half of data values, written IQR=Q3-Q1 ©2011 Brooks/Cole, Cengage Learning Elementary Statistics: Looking at the Big Picture Elementary Statistics: Looking at the Big Picture

Ways to Measure Center and Spread (C) 2007 Nancy Pfenning Ways to Measure Center and Spread Five Number Summary: Minimum Q1 Median Q3 Maximum Mean and Standard Deviation (more useful but less straightforward to find) ©2011 Brooks/Cole, Cengage Learning Elementary Statistics: Looking at the Big Picture Elementary Statistics: Looking at the Big Picture

Example: Finding 5 Number Summary and IQR (C) 2007 Nancy Pfenning Example: Finding 5 Number Summary and IQR Background: Credits taken by 14 non-traditional students: 4 7 11 11 11 13 13 14 14 15 17 17 17 18 Question: What are Five Number Summary, range, and IQR? Response: Minimum: 4 Q1: 11 Median: 13.5 Q3: 17 Maximum: 18 Range: 18-4=14 IQR: 17-11=6 ©2011 Brooks/Cole, Cengage Learning Elementary Statistics: Looking at the Big Picture Practice: 4.28b p.103 Elementary Statistics: Looking at the Big Picture

Elementary Statistics: Looking at the Big Picture (C) 2007 Nancy Pfenning Definition The 1.5-Times-IQR Rule identifies outliers: below Q1-1.5(IQR) considered low outlier above Q3+1.5(IQR) considered high outlier ©2011 Brooks/Cole, Cengage Learning Elementary Statistics: Looking at the Big Picture Elementary Statistics: Looking at the Big Picture

Displays of a Quantitative Variable (C) 2007 Nancy Pfenning Displays of a Quantitative Variable Stemplot Histogram Boxplot ©2011 Brooks/Cole, Cengage Learning Elementary Statistics: Looking at the Big Picture Elementary Statistics: Looking at the Big Picture

Elementary Statistics: Looking at the Big Picture (C) 2007 Nancy Pfenning Definition A boxplot displays median, quartiles, and extreme values, with special treatment for outliers: Bottom whisker to minimum non-outlier Bottom of box at Q1 Line through box at median Top of box at Q3 Top whisker to maximum non-outlier Outliers denoted “*”. ©2011 Brooks/Cole, Cengage Learning Elementary Statistics: Looking at the Big Picture Elementary Statistics: Looking at the Big Picture

Example: Identifying Outliers (C) 2007 Nancy Pfenning Example: Identifying Outliers Background: Credits taken by 14 non-traditional students had 5 No. Summary: 4, 11, 13.5, 17, 18 Question: Are there outliers? Response: Q1=11, Q3=17 IQR=17-11=6 1.5IQR=9 Q1-1.5(IQR)=11-1.5(6)=2: Low outliers? No. Q3+1.5(IQR)=17+1.5(6)=26: High outliers? No. 2 26 9 9 6 11 17 ©2011 Brooks/Cole, Cengage Learning Elementary Statistics: Looking at the Big Picture Practice: 4.28c p.103 Elementary Statistics: Looking at the Big Picture

Example: Constructing Boxplot (C) 2007 Nancy Pfenning Example: Constructing Boxplot Background: Credits taken by 14 non-traditional students had 5 No. Summary: 4, 11, 13.5, 17, 18 Question: How is the boxplot constructed? Response: Typical credits about 13.5, middle half between 11 and 17, shape is left-skewed Maximum=18 Q3=17 Median=13.5 Q1=11 Minimum 4 ©2011 Brooks/Cole, Cengage Learning Elementary Statistics: Looking at the Big Picture Practice: 4.28c p.103 Elementary Statistics: Looking at the Big Picture

Lecture Summary (Quantitative Displays, Begin Summaries) (C) 2007 Nancy Pfenning Lecture Summary (Quantitative Displays, Begin Summaries) Display: stemplot, histogram Shape: Symmetric or skewed? Unimodal? Normal? Center and Spread median and range, IQR identify outliers display with boxplot ©2011 Brooks/Cole, Cengage Learning Elementary Statistics: Looking at the Big Picture Elementary Statistics: Looking at the Big Picture