2.4 - 2.5. The procedure for finding the variance and standard deviation for grouped data is similar to that for finding the mean for grouped data, and.

Presentation transcript:

The procedure for finding the variance and standard deviation for grouped data is similar to that for finding the mean for grouped data, and it uses the midpoints of each class.

Make a table with the following columns:

A: Class
B: Frequency ($f$)
C: Midpoint ($x_m$)
D: $f \cdot x_m$
E: $f \cdot x_m^2$

1. Multiply the frequency by the midpoint for each class, and place the products in column D.
2. Multiply the frequency by the square of the midpoint, and place the products in column E.
3. Find the sums of columns B, D, and E. The sum of column B is $n$, the sum of column D is $\sum f \cdot x_m$, and the sum of column E is $\sum f \cdot x_m^2$.
4. Substitute in the formula and solve to get the variance:

$$s^2 = \frac{n\left(\sum f \cdot x_m^2\right) - \left(\sum f \cdot x_m\right)^2}{n(n-1)}$$

5. Take the square root to get the standard deviation.
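The column procedure above can be sketched in Python. This is only a minimal sketch: the function name `grouped_variance` and the three-class table used to exercise it are hypothetical and do not come from the slides. It assumes the shortcut formula $s^2 = [n \sum f \cdot x_m^2 - (\sum f \cdot x_m)^2] / [n(n-1)]$ for a sample.

```python
import math

def grouped_variance(frequencies, midpoints):
    """Sample variance of grouped data via the shortcut formula."""
    n = sum(frequencies)                                              # sum of column B
    sum_fx = sum(f * x for f, x in zip(frequencies, midpoints))       # sum of column D
    sum_fx2 = sum(f * x * x for f, x in zip(frequencies, midpoints))  # sum of column E
    return (n * sum_fx2 - sum_fx ** 2) / (n * (n - 1))

# Hypothetical three-class table (illustration only, not the runners data)
freqs = [2, 5, 3]
mids = [10, 20, 30]

variance = grouped_variance(freqs, mids)
std_dev = math.sqrt(variance)
print(round(variance, 2), round(std_dev, 2))
```

Note that `n` is the sum of the frequencies, not the number of classes.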

Find the variance and the standard deviation for the frequency distribution of the data. The data represent the number of miles that 20 runners ran during one week.

[Table: Class, Frequency, Midpoint]

[Table: Class, Frequency, Midpoint, $f \cdot x_m$, $f \cdot x_m^2$]

1. Multiply the frequency by the midpoint for each class, and place the products in the 4th column.
2. Multiply the frequency by the square of the midpoint, and place the products in the 5th column.
3. Find the sums of the 2nd, 4th, and 5th columns.

Column sums: $n = 20$, $\sum f \cdot x_m = 490$, and $\sum f \cdot x_m^2 = 45{,}002$.

$$s^2 = \frac{20(45{,}002) - 490^2}{20(20-1)} = \frac{900{,}040 - 240{,}100}{20(19)} = \frac{659{,}940}{380} \approx 1736.68$$

Take the square root to get the standard deviation:

$$s = \sqrt{1736.68} \approx 41.67$$

Be sure to use the sum of the 2nd column for $n$. Do not use the number of classes.

The range can be used to approximate the standard deviation. The approximation is called the range rule of thumb: $s \approx \text{range}/4$.

Example: The data set 5, 8, 8, 9, 10, 12, 13 has a standard deviation of 2.7, and the range is 13 − 5 = 8. The range rule of thumb gives $s \approx 8/4 = 2$. In this example the range rule of thumb underestimates the standard deviation, but it is in the ballpark.
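The comparison in this example can be checked with a short Python sketch. It uses the standard library's `statistics.stdev` (the sample standard deviation); the variable names are illustrative.

```python
import statistics

data = [5, 8, 8, 9, 10, 12, 13]

s = statistics.stdev(data)           # sample standard deviation, about 2.69
data_range = max(data) - min(data)   # 13 - 5 = 8
estimate = data_range / 4            # range rule of thumb

print(round(s, 1), estimate)
```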

The range rule of thumb can be used to estimate the largest and smallest data values of a data set. The smallest value will be approximately 2 standard deviations below the mean, and the largest value will be approximately 2 standard deviations above the mean.

Example: The mean of the data set 5, 8, 8, 9, 10, 12, 13 is 9.3 and the standard deviation is 2.7. Hence,

Smallest data value $\approx \bar{X} - 2s = 9.3 - 2(2.7) = 3.9$
Largest data value $\approx \bar{X} + 2s = 9.3 + 2(2.7) = 14.7$

Now look back at the original data set. The smallest value was 5 and the largest was 13. Again, these are rough estimates. Better approximations can be obtained by using Chebyshev's theorem and the empirical rule.
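As a rough sketch, the same estimate can be computed in Python. The helper name `estimate_extremes` is hypothetical, and the mean and standard deviation are computed from the data and rounded to one decimal place (9.3 and 2.7) rather than taken from the text.

```python
import statistics

def estimate_extremes(mean, std_dev):
    """Rough smallest and largest values: mean minus/plus 2 standard deviations."""
    return mean - 2 * std_dev, mean + 2 * std_dev

data = [5, 8, 8, 9, 10, 12, 13]
mean = round(statistics.mean(data), 1)   # 9.3
s = round(statistics.stdev(data), 1)     # 2.7

low, high = estimate_extremes(mean, s)
print(round(low, 1), round(high, 1))
```

The estimated interval brackets the actual extremes of 5 and 13, which is all the rule of thumb promises.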

Chebyshev's theorem: The proportion of values from any data set lying within $z$ standard deviations ($z > 1$) of the mean is at least $1 - 1/z^2$.

$z = 2$: In any data set, at least $1 - 1/2^2 = 3/4$, or 75%, of the data lie within 2 standard deviations of the mean.
$z = 3$: In any data set, at least $1 - 1/3^2 = 8/9$, or 88.9%, of the data lie within 3 standard deviations of the mean.

The theorem applies to any distribution, regardless of its shape.
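The bound is simple enough to express as a one-line function. The sketch below uses a hypothetical helper name, `chebyshev_min_fraction`, and rejects $z \le 1$, where the theorem gives no useful information.

```python
def chebyshev_min_fraction(z):
    """Minimum fraction of data within z standard deviations of the mean (z > 1)."""
    if z <= 1:
        raise ValueError("Chebyshev's theorem requires z > 1")
    return 1 - 1 / z ** 2

for z in (2, 3):
    print(z, round(100 * chebyshev_min_fraction(z), 1))
```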

The age distributions for Alaska and Florida are shown in the histograms. Decide which is which. Apply Chebyshev's theorem to the data for Florida.

The mean price of houses in a certain neighborhood is $50,000, and the standard deviation is $10,000. Find the price range for which at least 75% of the houses will sell.
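One way to work this problem is to solve $1 - 1/z^2 = 0.75$ for $z$, which gives $z = 2$, and then take the mean plus and minus $z$ standard deviations. The sketch below does this in Python; the function name `chebyshev_interval` is hypothetical.

```python
import math

def chebyshev_interval(mean, std_dev, min_fraction):
    """Interval that holds at least min_fraction of the data, by Chebyshev's theorem."""
    z = math.sqrt(1 / (1 - min_fraction))   # solve 1 - 1/z^2 = min_fraction for z
    return mean - z * std_dev, mean + z * std_dev

low, high = chebyshev_interval(50_000, 10_000, 0.75)
print(low, high)
```

Under these assumptions, at least 75% of the houses sell for between $30,000 and $70,000.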

Chebyshev's theorem can be used to find the minimum percentage of data values that will fall between any two given values.

Example: A survey of local companies found that the mean amount of travel allowances for executives was $0.25 per mile. The standard deviation was $0.02. Using Chebyshev's theorem, find the minimum percentage of the data values that will fall between $0.20 and $0.30.
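A possible way to work this example: both bounds are $0.05 from the mean, so $z = 0.05/0.02 = 2.5$ and the minimum percentage is $1 - 1/2.5^2 = 0.84$, or 84%. The Python sketch below follows the same steps. The function name is hypothetical, and using the smaller of the two distances for an asymmetric interval is an assumption made here to keep the bound valid.

```python
def chebyshev_min_percent_between(low, high, mean, std_dev):
    """Minimum percent of data between low and high, by Chebyshev's theorem."""
    # Use the nearer bound so the symmetric interval fits inside [low, high].
    z = min(mean - low, high - mean) / std_dev
    if z <= 1:
        return 0.0   # the theorem gives no useful bound for z <= 1
    return 100 * (1 - 1 / z ** 2)

percent = chebyshev_min_percent_between(0.20, 0.30, 0.25, 0.02)
print(round(percent, 1))
```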

The empirical rule applies only to bell-shaped (normal) distributions:

Approximately 68% of the data values will fall within 1 standard deviation of the mean.
Approximately 95% of the data values will fall within 2 standard deviations of the mean.
Approximately 99.7% of the data values will fall within 3 standard deviations of the mean.

Data values that lie more than 2 standard deviations from the mean are considered unusual. Data values that lie more than 3 standard deviations from the mean are very unusual.

Many real-life data sets have distributions that are approximately symmetric and bell shaped. For such data, about 68% of the data lie within 1 standard deviation of the mean, about 95% lie within 2 standard deviations, and about 99.7% lie within 3 standard deviations.
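The "unusual" and "very unusual" cutoffs can be sketched as a small classifier. The function name `classify_value` and the numbers in the usage (mean 100, standard deviation 15) are hypothetical, and the labels are only meaningful for roughly bell-shaped data.

```python
def classify_value(value, mean, std_dev):
    """Label a value by how many standard deviations it lies from the mean."""
    z = abs(value - mean) / std_dev
    if z > 3:
        return "very unusual"
    if z > 2:
        return "unusual"
    return "usual"

# Hypothetical bell-shaped data with mean 100 and standard deviation 15
for value in (110, 135, 150):
    print(value, classify_value(value, 100, 15))
```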

In a survey conducted by the National Center for Health Statistics, the sample mean height of women in the U.S. (ages 20-29) was 64 inches, with a sample standard deviation of 2.75 inches. Estimate the percent of women whose heights are between 64 inches and 69.5 inches.

We know that 64 is the mean. To find the value 2 standard deviations above the mean, take the mean plus 2 standard deviations: 64 + 2(2.75) = 69.5.

Because the distribution is bell shaped, you can use the empirical rule. Because 69.5 is 2 standard deviations above the mean height, the percent of heights between 64 inches and 69.5 inches is 34% + 13.5% = 47.5%. So about 47.5% of women are between 64 inches and 69.5 inches tall.
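The estimate can be checked with a short Python sketch. By the empirical rule, about 95% of the data lie within 2 standard deviations of the mean, so by symmetry about half of that, 47.5%, lies between the mean and 2 standard deviations above it. The comparison against an exact normal model with `statistics.NormalDist` is an added assumption, since the heights are only approximately normal.

```python
from statistics import NormalDist

mean, s = 64, 2.75
upper = 69.5

z = (upper - mean) / s          # standard deviations above the mean
empirical = 95 / 2              # empirical rule: half of the 95% band

# Exact area under an assumed normal curve between the mean and upper
exact = (NormalDist(mean, s).cdf(upper) - 0.5) * 100

print(z, empirical, round(exact, 1))
```

The exact normal value is slightly above the rule's 47.5% because 95% is itself a rounded figure.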