Download presentation
Presentation is loading. Please wait.
1
Describing Location in a Distribution
Chapter 2: Modeling Distributions of Data Section 2.1 Describing Location in a Distribution
2
Here are scores of 25 students on their first AP statistics test:
Make a dotplot: Fayrene scored an 86 on the test . How did she perform on this test RELATIVE to her classmates? We can describe Fayrene’s performance on her first statistics test using percentiles. Looking at the distribution, we can see that Fayrene’s 86 is the 22nd highest score on this test, counting up from the lowest score. Since 21 of the 25 observations (84%) are at or below her score, Fayrene scored at the 84th percentile.
3
“Percent correct” does not measure the same thing as a “Percentile”
Definition: The pth percentile of a distribution is the value with p percent of the observations less than it. Joe Fred Billy Bob got a 72 on the test. At what percentile does his test fall? “Percent correct” does not measure the same thing as a “Percentile” Percentiles are always whole numbers – round to nearest interger Gloria Jean made the 93 on the test. At what percentile does her test fall?
4
Cumulative relative frequency
Make a frequency distribution table for the data: Grades on AP Stats Test Grade Frequency Relative frequency Cumulative frequency Cumulative relative frequency 65-69 70-74 75-79 80-84 85-89 90-94 95-99
5
Cumulative relative frequency
Grades on AP Stats Test Grade Frequency Relative frequency Cumulative frequency Cumulative relative frequency 65-69 1 4% 70-74 4 16% 5 20% 75-79 7 28% 12 48% 80-84 8 32% 20 80% 85-89 3 12% 23 92% 90-94 2 8% 25 100% 95-99 0% Cumulative Relative Frequency Graphs A cumulative relative frequency graph (or ogive) displays the cumulative relative frequency of each class of a frequency distribution.
6
Describing Location in a Distribution
Cumulative Relative Frequency Graphs A cumulative relative frequency graph (or ogive) displays the cumulative relative frequency of each class of a frequency distribution. Describing Location in a Distribution Age of First 44 Presidents When They Were Inaugurated Age Frequency Relative frequency Cumulative frequency Cumulative relative frequency 40-44 2 2/44 = 4.5% 2/44 = 4.5% 45-49 7 7/44 = 15.9% 9 9/44 = 20.5% 50-54 13 13/44 = 29.5% 22 22/44 = 50.0% 55-59 12 12/44 = 34% 34 34/44 = 77.3% 60-64 41 41/44 = 93.2% 65-69 3 3/44 = 6.8% 44 44/44 = 100%
7
Describing Location in a Distribution
Interpreting Cumulative Relative Frequency Graphs Use the graph from page 88 to answer the following questions. Was Barack Obama, who was inaugurated at age 47, unusually young? Estimate and interpret the 65th percentile of the distribution Describing Location in a Distribution 65 11 58 47
8
Here is a table showing the distribution of median household incomes for the 50 states and the District of Columbia. Make an ogive. Median Income ($1000s) Frequency Relative Cumulative 35 to < 40 1 40 to < 45 10 45 to < 50 14 50 to < 55 12 55 to < 60 5 60 to < 65 6 65 to < 70 3
9
Here is a table showing the distribution of median household incomes for the 50 states and the District of Columbia. Make an ogive. Median Income ($1000s) Frequency Relative Cumulative 35 to < 40 1 1/51 = 0.020 40 to < 45 10 10/51 = 0.196 11 11/51 = 0.216 45 to < 50 14 14/51 = 0.275 25 25/51 = 0.490 50 to < 55 12 12/51 = 0.236 37 37/51 = 0.725 55 to < 60 5 5/51 = 0.098 42 42/51 = 0.824 60 to < 65 6 6/51 = 0.118 48 48/51 = 0.941 65 to < 70 3 3/51 = 0.059 51 51/51 = 1.000
10
The point at (50,0.49) means 49% of the states had median household incomes less than $50,000. The point at (55, 0.725) means that 72.5% of the states had median household incomes less than $55,000. Thus, 72.5% - 49% = 23.5% of the states had median household incomes between $50,000 and $55, Due to rounding error, this value is slightly different than the relative frequency for the 50 to <55 category.
11
(a) At what percentile is California, with a median income of $57,445?
(b) Estimate and interpret the first quartile of this solution.
12
Back to the scores of 25 students on their first AP statistics test:
Another way to describe Fayrene’s position within the distribution of test scores is to tell how many standard deviations above or below the mean her score is. Converting scores from original values to standard deviation units is known as standardizing.
13
Here are scores of 25 students on their first AP statistics test:
Definition: If x is an observation from a distribution that has known mean and standard deviation, the standardized value of x is: A standardized value is often called a z-score. A z-score tells us how many standard deviations away from the mean the original observation falls and in which direction. (+ is above; - is below)
14
So what does the z-score tell you?
Let’s explore . . . So what does the z-score tell you? Suppose the mean and standard deviation of a distribution are m = 50 & s = 5. If the x-value is 55, what is the z-score? If the x-value is 45, what is the z-score? If the x-value is 60, what is the z-score? 1 -1 2
15
Find the mean and standard deviation of the data.
Fayrene’s score on the test was 86. What is her standardized test score? What does that mean? Gloria Jean earned the highest score in the class, 93. What is her corresponding z-score? Joe Fred Billy Bob got a 72 on the test. What is his z-score? What does that mean?
16
The day after receiving her statistics test result of 86, Fayrene earned an 82 on her chemistry test. The mean of the test scores was 76 with a standard deviation of 4. What was Fayrene’s z-score for this test? Relative to her class, which test did Fayrene do better on?
17
Jonathan wants to work at Utopia Landfill
Jonathan wants to work at Utopia Landfill. He must take a test to see if he is qualified for the job. The test has a normal distribution with µ = 45 and σ = 3.6. In order to qualify for the job, a person can not score lower than 2.5 standard deviations below the mean. Jonathan scores 35 on this test. Does he get the job? No, he scored 2.78 SD below the mean
18
Sally is taking two different math achievement tests with different means and standard deviations. The mean score on test A was 56 with a standard deviation of 3.5, while the mean score on test B was 65 with a standard deviation of Sally scored a 62 on test A and a 69 on test B. On which test did Sally score the best? She did better on test A.
19
Your stats teacher told you that your last test scores were normally distributed with a mean of 85 and a standard deviation of 5. She gives your paper back with a grade of z-score = 2. What was your percentage? Your friend made a z-score of What was his percentage? You made a 95% on the test and your friend made a 77.5%.
20
In 2009, the mean number of wins in MLB was 81 with a standard deviation of 11.4 wins.
Problem: Find and interpret the z-scores for the following teams. (a) The New York Yankees, with 103 wins. (b) The New York Mets, with 70 wins Yankees: z = 1.93 Mets: z = -0.96
21
Why do we use µ and σ for the mean and standard deviation of salaries?
The REBEL STAT Company We have a company with 14 employees that earn the following monthly salaries: m = _______________ s = _______________ Why do we use µ and σ for the mean and standard deviation of salaries? $ $634.39
22
Business has been good, so every employee receives a raise of $500.
What will happen to the mean and standard deviation? = _______________ $ $634.39
23
What will happen to the mean and standard deviation?
Business has NOT been good. Suppose that we have to cut everyone’s pay by $500. What will happen to the mean and standard deviation? = _______________ $ $634.39
24
Mean increases that value Standard deviation stays the same
What happens to the mean and standard deviation if a number is added to each data value?
25
Business has been good. Suppose that we give everyone a 30% raise.
What will happen to the mean and standard deviation? = _______________ $ $824.71
26
Both measures increase
What happens to the mean and standard deviation if a number is multiplied to each data value?
28
Transforming Data Transforming converts the original observations from the original units of measurements to another scale. Transformations can affect the shape, center, and spread of a distribution. Effect of Adding (or Subtracting) a Constant Adding the same number a (either positive, zero, or negative) to each observation: adds a to measures of center and location (mean, median, quartiles, percentiles), but Does not change the shape of the distribution or measures of spread (range, IQR, standard deviation). n Mean sx Min Q1 M Q3 Max IQR Range Guess(m) 44 16.02 7.14 8 11 15 17 40 6 32 Error (m) 3.02 -5 -2 2 4 27
29
Transforming Data Effect of Multiplying (or Dividing) by a Constant
Multiplying (or dividing) each observation by the same number b (positive, negative, or zero): multiplies (divides) measures of center and location by b multiplies (divides) measures of spread by |b|, but does not change the shape of the distribution n Mean sx Min Q1 M Q3 Max IQR Range Error(ft) 44 9.91 23.43 -16.4 -6.56 6.56 13.12 88.56 19.68 104.96 Error (m) 3.02 7.14 -5 -2 2 4 27 6 32
30
Linear transformation rule
When adding a constant to a random variable, the mean changes but not the standard deviation. When multiplying a constant to a random variable, the mean and the standard deviation changes.
31
Here are a graph and table of summary statistics for a sample of 30 test scores. The maximum possible score on the test was 50 points. n Min Q1 M Q3 Max IQR Range Score 30 35.8 8.17 12 32 37 41 48 9 36 Suppose that the teacher was nice and added 5 points to each test score. How would this change the shape, center, and spread of the distribution? Here are graphs and summary statistics for the original scores and the +5 scores: n Min Q1 M Q3 Max IQR Range Score 30 35.8 8.17 12 32 37 41 48 9 36 Score + 5 40.8 17 42 46 53 From both the graph and summary statistics, we can see that the measures of center and measures of position all increased by 5. However the shape of the distribution did not change nor did the spread of the distribution.
32
Suppose that the teacher in the previous alternate example wanted to convert the original test scores to percents. Since the test was out of 50 points, he should multiply each score by 2 to make them out of Here are graphs and summary statistics for the original scores and the doubled scores. n Min Q1 M Q3 Max IQR Range Score 30 35.8 8.17 12 32 37 41 48 9 36 Score x 2 60 71.6 16.34 24 64 74 82 96 18 72 From the graphs and summary statistics we can see that the measures of center, location, and spread all have doubled, just like the individual observations. But even though the distribution is more spread out, the shape hasn’t changed. It is still skewed to the left with the same clusters
33
Rules:
34
An appliance repair shop charges a $30 service call to go to a home for a repair. It also charges $25 per hour for labor. From past history, the average length of repairs is 1 hour 15 minutes (1.25 hours) with standard deviation of 20 minutes (1/3 hour). Including the charge for the service call, what is the mean and standard deviation for the charges for labor?
35
A student was performing an experiment that compared a new protein food to the old food for goldfish. He found the mean weight gain for the new food to be 12.8 grams with a standard deviation of 3.5 grams. Later he realized that the scale was out of calibration by 1.5 grams (meaning that the scale weighed items 1.5 grams too much). What should the mean and standard deviation be for the new food? µ = 12.8 – 1.5 = 11.3 grams σ = 3.5 (not effected by + or -)
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.