Describing Distributions With Numbers Section 1.3 (mean, median, range, quartiles, IQR) Target Goal: I can analyze data using shape, center and spread.

Slides:



Advertisements
Similar presentations
Describing Quantitative Variables
Advertisements

Lesson Describing Distributions with Numbers parts from Mr. Molesky’s Statmonkey website.
Describing Distributions With Numbers
Descriptive Measures MARE 250 Dr. Jason Turner.
Describing Distributions with Numbers
Lecture 4 Chapter 2. Numerical descriptors
Looking at data: distributions - Describing distributions with numbers IPS chapter 1.2 © 2006 W.H. Freeman and Company.
Looking at data: distributions - Describing distributions with numbers
BPS - 5th Ed. Chapter 21 Describing Distributions with Numbers.
CHAPTER 2: Describing Distributions with Numbers
AP Statistics Chapters 0 & 1 Review. Variables fall into two main categories: A categorical, or qualitative, variable places an individual into one of.
Describing distributions with numbers
Objectives 1.2 Describing distributions with numbers
INTRODUCTORY STATISTICS Chapter 2 DESCRIPTIVE STATISTICS PowerPoint Image Slideshow.
1.3: Describing Quantitative Data with Numbers
Have out your calculator and your notes! The four C’s: Clear, Concise, Complete, Context.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
The Practice of Statistics Third Edition Chapter 1: Exploring Data 1.2 Describing Distributions with Numbers Copyright © 2008 by W. H. Freeman & Company.
Warm-up The number of deaths among persons aged 15 to 24 years in the United States in 1997 due to the seven leading causes of death for this age group.
INVESTIGATION 1.
Chapter 3 Looking at Data: Distributions Chapter Three
Describing Quantitative Data with Numbers Section 1.3.
Review BPS chapter 1 Picturing Distributions with Graphs What is Statistics ? Individuals and variables Two types of data: categorical and quantitative.
Chapter 2 Describing Distributions with Numbers. Numerical Summaries u Center of the data –mean –median u Variation –range –quartiles (interquartile range)
Chapter 5 Describing Distributions Numerically.
BPS - 5th Ed. Chapter 21 Describing Distributions with Numbers.
Notes Unit 1 Chapters 2-5 Univariate Data. Statistics is the science of data. A set of data includes information about individuals. This information is.
1.3 Describing Quantitative Data with Numbers Pages Objectives SWBAT: 1)Calculate measures of center (mean, median). 2)Calculate and interpret measures.
Numerical descriptions of distributions
+ Chapter 1: Exploring Data Section 1.3 Describing Quantitative Data with Numbers The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
More Univariate Data Quantitative Graphs & Describing Distributions with Numbers.
Chapter 1: Exploring Data, cont. 1.2 Describing Distributions with Numbers Measuring Center: The Mean Most common measure of center Arithmetic average,
BPS - 5th Ed.Chapter 21 Describing Distributions with Numbers.
IPS Chapter 1 © 2012 W.H. Freeman and Company  1.1: Displaying distributions with graphs  1.2: Describing distributions with numbers  1.3: Density Curves.
Chapter 5 Describing Distributions Numerically Describing a Quantitative Variable using Percentiles Percentile –A given percent of the observations are.
Describe Quantitative Data with Numbers. Mean The most common measure of center is the ordinary arithmetic average, or mean.
AP Statistics 5 Number Summary and Boxplots. Measures of Center and Distributions For a symmetrical distribution, the mean, median and the mode are the.
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
CHAPTER 2: Describing Distributions with Numbers
Measuring Variability
Analyzing One-Variable Data
CHAPTER 1 Exploring Data
CHAPTER 1 Exploring Data
Please take out Sec HW It is worth 20 points (2 pts
Describing Distributions with Numbers
Drill {A, B, B, C, C, E, C, C, C, B, A, A, E, E, D, D, A, B, B, C}
POPULATION VS. SAMPLE Population: a collection of ALL outcomes, responses, measurements or counts that are of interest. Sample: a subset of a population.
1.3 Describing Quantitative Data with Numbers
Advanced Placement Statistics Ch 1.2: Describing Distributions
Basic Practice of Statistics - 3rd Edition
Chapter 1: Exploring Data
Exploratory Data Analysis
Chapter 1: Exploring Data
Measures of Center.
Chapter 1: Exploring Data
Basic Practice of Statistics - 3rd Edition
Chapter 1: Exploring Data
Chapter 1: Exploring Data
CHAPTER 1 Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Basic Practice of Statistics - 3rd Edition
Chapter 1: Exploring Data
Describing Distributions with Numbers
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Chapter 1: Exploring Data
Presentation transcript:

Describing Distributions With Numbers Section 1.3 (mean, median, range, quartiles, IQR) Target Goal: I can analyze data using shape, center and spread. Hw: pg 70: 79, 80, 81, 84, 87, 89

Measuring Center Mean ( : x bar): The most common measure of center.

Median (M) The midpoint of a distribution, the # such that half the observations are smaller and the other half are larger.

Resistant Measure (of center) Resists influence of extreme observations. Mean (average) is not resistant Mean (average) is not resistant Median (midpoint) is resistant Median (midpoint) is resistant The mean and median would be exactly the same if the distribution is exactly symmetric. The mean and median would be exactly the same if the distribution is exactly symmetric. In a skewed distribution, the mean is farther out in the long tail then the median. In a skewed distribution, the mean is farther out in the long tail then the median.

Ex: Barry Bonds Find the median of home runs hit in first 16 seasons. n= n= M= M= 16 34

How do outliers affect the median? Are there any outliers? If so remove and find M. n= 15 n= 15 M new = 34 vs M old = 34 M new = 34 vs M old = 34 Median is resistant

How do outliers affect the mean? 16, 19, 24, 25, 25, 33, 33, 34, 34, 37, 37, 40, 42, 46, 49, 73 Find the mean of the original data. Enter data into L1 Enter data into L1 STAT:CALC:1 –Var Stats: L1 STAT:CALC:1 –Var Stats: L1, Mean = 35.44, Mean = Find the mean without outlier. Remove 73 from L1 Remove 73 from L1 STAT:CALC:1 –Var Stats: L1 STAT:CALC:1 –Var Stats: L1 Mean = Mean = We can find median also with 1-Var Stats. Scroll down the list.

Ex: SSHA Scores The Survey of Study Habits and Attitudes (SSHA) is a psychological test that evaluates college students’ motivation, study habits, and attitudes toward school. A private college gives the SSHA to a sample of 18 of its incoming first-year women students. Their scores are:

Make a stemplot of these data. Enter into L2 and sort, then use: Stems from Stems from Leaves: ones; so 154 looks like 15/4 Leaves: ones; so 154 looks like 15/4

Are there any potential outliers? About where is the center of the distribution ? Potential outliers: Center: 11 5 Median: mean of the 9 th and 10 th observ about n = 18

Shape Describe shape: The overall shape of the distribution is irregular, as often happens when only a few observations are available.

Spread What is the spread of the scores (ignoring any outliers)? 178 – 101 = 77 or from 101 to 178

Center b. : Find the mean score from the formula for the mean. By hand: Sum of the 18 observations/18 = 2539/18 = = 2539/18 = Calculator keystrokes : 2nd STAT(list):MATH:MEAN(L2) or STAT:CALC:1-Var Stat (L2) =

c. Find the median of these scores. Which is larger: the median or the mean? Median = average of the 9 th and 10 th scores = = vs. mean = Explain why? The mean is larger than the median because of the outlier at 200 which pulls the mean toward the long right tail of the distribution.

Describe the following distributions: Both distributions are roughly symmetric. The center for both distributions is 90 goals. Both distributions have different amounts of VARIABILITY.

Measuring Spread/Variability Range: The difference between the largest and smallest observations. Range: The difference between the largest and smallest observations. Quartiles: (uses median) The quartiles mark out the middle half and improve our description of spread. Quartiles: (uses median) The quartiles mark out the middle half and improve our description of spread.

Quartiles 1. Arrange observations in increasing order and locate M. 2. The first quartile (Q1) lies one-quarter of the way up list of ordered observations M of lower half larger than 25% of ordered observations.

3. The third quartile (Q3) lies three-quarters of the way up list of ordered observations larger than 75% of ordered observations. larger than 75% of ordered observations. M of upper half M of upper half 4. The “second quartile” median (M) median (M) Note: is not larger than 50% of ordered observations Note: is not larger than 50% of ordered observations is at 50% mark is at 50% mark

Barry Bonds cont. Locate Q1 and Q3: Q1 = 25 Q3 = 41

Interquartile Range (IQR) : The distance between the first and third quartiles. IQR = Q3 – Q1 IQR = 41 – 25 = 16

Note: If an observation falls in the IQR it is not unusually high or low. We use IQR to identify suspect outliers.

Outliers Unusually high or unusually low Unusually high or unusually low Basic “rule of thumb” for identifying is if the observation falls more than 1.5 x IQR above the third quartile or below the first quartile. Basic “rule of thumb” for identifying is if the observation falls more than 1.5 x IQR above the third quartile or below the first quartile.

Outliers 1. Find IQR 2. Q x IQR (upper cutoff) 3. Q1 – 1.5 x IQR (lower cutoff)

Is Barry bonds 73 an outlier? IQR = IQR = 41 – 25 = x IQR = 1.5 x IQR = = 24 = 24 Q = Q = = 65 (upper cutoff) = 65 (upper cutoff) Yes or no? Yes or no? Yes Bonds record setting year of 73 is an outlier.

Ex: McDonald’s Chicken Sandwiches Problem: Determine whether the Premium Crispy Chicken Club Sandwich with 28 grams of fat is an outlier. (2 min) Solution: Here are the 14 amounts of fat in order:

Which is resistant range or IQR? Range: Range: IQR: IQR: