BPS - 5th Ed. Chapter 21 Describing Distributions with Numbers.

Slides:



Advertisements
Similar presentations
Analyzing and Interpreting Data To understand a set of data, you need to organize and summarize the values. A measure of central tendency is used to.
Advertisements

AP Stat Day Days until AP Exam
Copyright © 2014 by McGraw-Hill Higher Education. All rights reserved.
Copyright © 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 5- 1.
Describing Data: Measures of Dispersion
Outliers. Do Now Bill Gates makes $100 thousand a year. He s in a room with 9 teachers, 4 of whom make $40k, 3 make $45k, and 2 make $55k a year. What.
CS1512 Foundations of Computing Science 2 Lecture 20 Probability and statistics (2) © J R W Hunter,
1 1 Slide © 2003 South-Western/Thomson Learning TM Slides Prepared by JOHN S. LOUCKS St. Edwards University.
Data Distributions Warm Up Lesson Presentation Lesson Quiz
Introduction Our daily lives often involve a great deal of data, or numbers in context. It is important to understand how data is found, what it means,
Chapter 2 Tutorial 2nd & 3rd LAB.
Describing Quantitative Variables
DESCRIBING DISTRIBUTION NUMERICALLY
Lecture 4 Chapter 2. Numerical descriptors
BPS - 5th Ed. Chapter 21 Describing Distributions with Numbers.
Basic Practice of Statistics - 3rd Edition
Chapter 2 Describing distributions with numbers. Chapter Outline 1. Measuring center: the mean 2. Measuring center: the median 3. Comparing the mean and.
1 Stat 1510 Statistical Thinking & Concepts Describing Distributions with Numbers.
1.3: Describing Quantitative Data with Numbers
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Chapter 3 Looking at Data: Distributions Chapter Three
Essential Statistics Chapter 21 Describing Distributions with Numbers.
Chapter 2 Describing Distributions with Numbers. Numerical Summaries u Center of the data –mean –median u Variation –range –quartiles (interquartile range)
Chapter 5 Describing Distributions Numerically.
BPS - 5th Ed. Chapter 21 Describing Distributions with Numbers.
1 Never let time idle away aimlessly.. 2 Chapters 1, 2: Turning Data into Information Types of data Displaying distributions Describing distributions.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
BPS - 5th Ed.Chapter 21 Describing Distributions with Numbers.
Chapter 5 Describing Distributions Numerically Describing a Quantitative Variable using Percentiles Percentile –A given percent of the observations are.
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
Chapter 1: Exploring Data
CHAPTER 2: Describing Distributions with Numbers
CHAPTER 2: Describing Distributions with Numbers
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
1.3 Describing Quantitative Data with Numbers
Basic Practice of Statistics - 3rd Edition
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
CHAPTER 2: Describing Distributions with Numbers
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Essential Statistics Describing Distributions with Numbers
Basic Practice of Statistics - 3rd Edition
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
The Five-Number Summary
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Basic Practice of Statistics - 3rd Edition
Chapter 1: Exploring Data
Describing Distributions with Numbers
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Presentation transcript:

BPS - 5th Ed. Chapter 21 Describing Distributions with Numbers

BPS - 5th Ed. Chapter 22 Numerical Summaries u Center of the data –mean –median u Variation –range –quartiles (interquartile range) –variance –standard deviation

BPS - 5th Ed. Chapter 23 Mean or Average u Traditional measure of center u Sum the values and divide by the number of values

BPS - 5th Ed. Chapter 24 Median (M) u A resistant measure of the data’s center u At least half of the ordered values are less than or equal to the median value u At least half of the ordered values are greater than or equal to the median value u If n is odd, the median is the middle ordered value u If n is even, the median is the average of the two middle ordered values

BPS - 5th Ed. Chapter 25 Median (M) Location of the median: L(M) = (n+1)/2, where n = sample size. Example: If 25 data values are recorded, the Median would be the (25+1)/2 = 13 th ordered value.

BPS - 5th Ed. Chapter 26 Median u Example 1 data: Median (M) = 4 u Example 2 data: Median = 5 (ave. of 4 and 6) u Example 3 data: Median  2 (order the values: 2 4 6, so Median = 4)

BPS - 5th Ed. Chapter 27 Comparing the Mean & Median u The mean and median of data from a symmetric distribution should be close together. The actual (true) mean and median of a symmetric distribution are exactly the same. u In a skewed distribution, the mean is farther out in the long tail than is the median [the mean is ‘pulled’ in the direction of the possible outlier(s)].

BPS - 5th Ed. Chapter 28 Question A recent newspaper article in California said that the median price of single-family homes sold in the past year in the local area was $136,000 and the mean price was $149,160. Which do you think is more useful to someone considering the purchase of a home, the median or the mean?

BPS - 5th Ed. Chapter 29 Case Study Airline fares appeared in the New York Times on November 5, 1995 “...about 60% of airline passengers ‘pay less than the average fare’ for their specific flight.” u How can this be? 13% of passengers pay more than 1.5 times the average fare for their flight

BPS - 5th Ed. Chapter 210 Spread, or Variability u If all values are the same, then they all equal the mean. There is no variability. u Variability exists when some values are different from (above or below) the mean. u We will discuss the following measures of spread: range, quartiles, variance, and standard deviation

BPS - 5th Ed. Chapter 211 Range u One way to measure spread is to give the smallest (minimum) and largest (maximum) values in the data set; Range = max  min u The range is strongly affected by outliers

BPS - 5th Ed. Chapter 212 Quartiles u Three numbers which divide the ordered data into four equal sized groups. u Q 1 has 25% of the data below it. u Q 2 has 50% of the data below it. (Median) u Q 3 has 75% of the data below it.

BPS - 5th Ed. Chapter 213 Quartiles Uniform Distribution Q1Q1 Q2Q2 Q3Q3

BPS - 5th Ed. Chapter 214 Obtaining the Quartiles u Order the data. u For Q 2, just find the median. u For Q 1, look at the lower half of the data values, those to the left of the median location; find the median of this lower half. u For Q 3, look at the upper half of the data values, those to the right of the median location; find the median of this upper half.

BPS - 5th Ed. Chapter 215 Weight Data: Sorted L(M)=(53+1)/2=27 L(Q 1 )=(26+1)/2=13.5

BPS - 5th Ed. Chapter 216 Weight Data: Quartiles u Q 1 = u Q 2 = 165 (Median) u Q 3 = 185

BPS - 5th Ed. Chapter third quartile median or second quartile first quartile Weight Data: Quartiles

BPS - 5th Ed. Chapter 218 Five-Number Summary u minimum = 100 u Q 1 = u M = 165 u Q 3 = 185 u maximum = 260 Interquartile Range (IQR) = Q 3  Q 1 = 57.5 IQR gives spread of middle 50% of the data

BPS - 5th Ed. Chapter 219 Boxplot u Central box spans Q 1 and Q 3. u A line in the box marks the median M. u Lines extend from the box out to the minimum and maximum.

BPS - 5th Ed. Chapter 220 M Weight Data: Boxplot Q1Q1 Q3Q3 minmax Weight

BPS - 5th Ed. Chapter 221 Example from Text: Boxplots

BPS - 5th Ed. Chapter 222 Identifying Outliers u The central box of a boxplot spans Q 1 and Q 3 ; recall that this distance is the Interquartile Range (IQR). u We call an observation a suspected outlier if it falls more than 1.5  IQR above the third quartile or below the first quartile.

BPS - 5th Ed. Chapter 223 Variance and Standard Deviation u Recall that variability exists when some values are different from (above or below) the mean. u Each data value has an associated deviation from the mean:

BPS - 5th Ed. Chapter 224 Deviations u what is a typical deviation from the mean? (standard deviation) u small values of this typical deviation indicate small variability in the data u large values of this typical deviation indicate large variability in the data

BPS - 5th Ed. Chapter 225 Variance u Find the mean u Find the deviation of each value from the mean u Square the deviations u Sum the squared deviations u Divide the sum by n-1 (gives typical squared deviation from mean)

BPS - 5th Ed. Chapter 226 Variance Formula

BPS - 5th Ed. Chapter 227 Standard Deviation Formula typical deviation from the mean [ standard deviation = square root of the variance ]

BPS - 5th Ed. Chapter 228 Variance and Standard Deviation Example from Text Metabolic rates of 7 men (cal./24hr.) :

BPS - 5th Ed. Chapter 229 Variance and Standard Deviation Example from Text ObservationsDeviationsSquared deviations  1600 = 192 (192) 2 = 36,  1600 = 66 (66) 2 = 4,  1600 = -238 (-238) 2 = 56,  1600 = 14 (14) 2 =  1600 = -140 (-140) 2 = 19,  1600 = 267 (267) 2 = 71,  1600 = -161 (-161) 2 = 25,921 sum = 0sum = 214,870

BPS - 5th Ed. Chapter 230 Variance and Standard Deviation Example from Text

BPS - 5th Ed. Chapter 231 Choosing a Summary u Outliers affect the values of the mean and standard deviation. u The five-number summary should be used to describe center and spread for skewed distributions, or when outliers are present. u Use the mean and standard deviation for reasonably symmetric distributions that are free of outliers.

BPS - 5th Ed. Chapter 232 Number of Books Read for Pleasure: Sorted M L(M)=(52+1)/2=26.5

BPS - 5th Ed. Chapter 233 Five-Number Summary: Boxplot Median = 3 interquartile range (iqr) = = 4.5 range = 99-0 = 99 Mean = 7.06 s.d. = Number of books