Section 3.2 Measures of Variation Range Standard Deviation Variance.

Slides:



Advertisements
Similar presentations
Statistical Techniques I EXST7005 Start here Measures of Dispersion.
Advertisements

Chapter Three McGraw-Hill/Irwin © 2005 The McGraw-Hill Companies, Inc., All Rights Reserved
Lecture 2 Describing Data II ©. Summarizing and Describing Data Frequency distribution and the shape of the distribution Frequency distribution and the.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
B a c kn e x t h o m e Parameters and Statistics statistic A statistic is a descriptive measure computed from a sample of data. parameter A parameter is.
Business Statistics: A Decision-Making Approach, 7e © 2008 Prentice-Hall, Inc. Chap 3-1 Business Statistics: A Decision-Making Approach 7 th Edition Chapter.
Basic Business Statistics 10th Edition
2.3. Measures of Dispersion (Variation):
Chap 3-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 3 Describing Data: Numerical Statistics for Business and Economics.
Learning Objectives In this chapter you will learn about the importance of variation how to measure variation range variance standard deviation.
12.3 – Measures of Dispersion
Descriptive Statistics Healey Chapters 3 and 4 (1e) or Ch. 3 (2/3e)
Describing Data: Numerical
Measures of Central Tendency: Mode, Median, and Mean
Describing Data from One Variable
Frequency Distributions, Histograms, and Related Topics.
Chapter 3 – Descriptive Statistics
Unit 3 Section 3-3 – Day : Measures of Variation  Range – the highest value minus the lowest value.  The symbol R is used for range.  Variance.
Chapter 3 Averages and Variations
Measures of Central Tendency Section 2.3. Central Values There are 4 values that are considered measures of the center. 1.Mean 2.Median 3.Mode 4.Midrange.
© Copyright McGraw-Hill CHAPTER 3 Data Description.
Chapter 3 Descriptive Measures
Copyright (C) 2002 Houghton Mifflin Company. All rights reserved. 1 Understandable Statistics Seventh Edition By Brase and Brase Prepared by: Lynn Smith.
1.3: Describing Quantitative Data with Numbers
Descriptive Statistics: Numerical Methods
STAT 280: Elementary Applied Statistics Describing Data Using Numerical Measures.
Review of Measures of Central Tendency, Dispersion & Association
Statistics Numerical Representation of Data Part 2 – Measure of Variation.
Chapter 3 Numerically Summarizing Data 3.2 Measures of Dispersion.
Central Tendency and Variability Chapter 4. Variability In reality – all of statistics can be summed into one statement: – Variability matters. – (and.
Objectives The student will be able to: find the variance of a data set. find the standard deviation of a data set.
Section 3.2 Measures of Dispersion. Range = the difference between the largest value and smallest value range = (largest value) – (smallest value)
Section 3-3 Measures of Variation. WAITING TIMES AT DIFFERENT BANKS Jefferson Valley Bank (single waiting line) Bank of Providence.
SECTION 12-3 Measures of Dispersion Slide
1 CHAPTER 3 NUMERICAL DESCRIPTIVE MEASURES. 2 MEASURES OF CENTRAL TENDENCY FOR UNGROUPED DATA  In Chapter 2, we used tables and graphs to summarize a.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 3 Section 2 – Slide 1 of 27 Chapter 3 Section 2 Measures of Dispersion.
Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Three Averages and Variation.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 3-1 Chapter 3 Numerical Descriptive Measures Business Statistics, A First Course.
3 Averages and Variation
A way to organize data so that it has meaning!.  Descriptive - Allow us to make observations about the sample. Cannot make conclusions.  Inferential.
Refer to Ex 3-18 on page Record the info for Brand A in a column. Allow 3 adjacent other columns to be added. Do the same for Brand B.
1 Descriptive Statistics Descriptive Statistics Ernesto Diaz Faculty – Mathematics Redwood High School.
Chapter 5: Measures of Dispersion. Dispersion or variation in statistics is the degree to which the responses or values obtained from the respondents.
Measures of Variation Range Standard Deviation Variance.
Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall2(2)-1 Chapter 2: Displaying and Summarizing Data Part 2: Descriptive Statistics.
Lesson 25 Finding measures of central tendency and dispersion.
CHAPTER 2: Basic Summary Statistics
A way to organize data so that it has meaning!.  Descriptive - Allow us to make observations about the sample. Cannot make conclusions.  Inferential.
Copyright © 2016 Brooks/Cole Cengage Learning Intro to Statistics Part II Descriptive Statistics Intro to Statistics Part II Descriptive Statistics Ernesto.
CHAPTER 3 – Numerical Techniques for Describing Data 3.1 Measures of Central Tendency 3.2 Measures of Variability.
2.4 Measures of Variation The Range of a data set is simply: Range = (Max. entry) – (Min. entry)
One-Variable Statistics. Descriptive statistics that analyze one characteristic of one sample  Where’s the middle?  How spread out is it?  How do different.
Descriptive Statistics Ernesto Diaz Faculty – Mathematics
Measures of Central Tendency
Business and Economics 6th Edition
One-Variable Statistics
PROBABILITY AND STATISTICS
Intro to Statistics Part II Descriptive Statistics
Intro to Statistics Part II Descriptive Statistics
Chapter 3 Describing Data Using Numerical Measures
Section 3.2 Measures of Spread.
Measures of central tendency
BUS7010 Quant Prep Statistics in Business and Economics
Numerical Descriptive Measures
Descriptive Statistics Healey Chapters 3 and 4 (1e) or Ch. 3 (2/3e)
Measures of Center.
Refer to Ex 3-18 on page Record the info for Brand A in a column. Allow 3 adjacent other columns to be added. Do the same for Brand B.
CHAPTER 2: Basic Summary Statistics
Business and Economics 7th Edition
Presentation transcript:

Section 3.2 Measures of Variation Range Standard Deviation Variance

The range = largest minus smallest The range is the difference between the largest and smallest values of a distribution. Example: Find the range: 10, 13, 17, 17, 18 The range = largest minus smallest = 18 -10 = 8

The Standard Deviation The standard variation is a measure of the average variation of the data entries from the mean. Standard deviation of a sample mean of the sample n = sample size

To calculate standard deviation of a sample Calculate the mean of the sample. Find the difference between each entry (x) and the mean. These differences will add up to zero. Square the deviations from the mean. Sum the squares of the deviations from the mean. Divide the sum by (n - 1) to get the variance. Take the square root of the variance to get the standard deviation.

The Variance The variance is the square of the standard deviation Variance of a Sample

Example Find the standard deviation and variance 30 26 22 4 -4 16 ___ Sum = 0 32 78 Mean = 26 The variance The standard deviation s = = 32 ¸ 2 =16

Example Find the mean, the standard deviation and variance 4 5 7 -1 2 1 4 mean = 5 Σx =25

Example cont. Mean = 5

Computation Formulas for Sample Variance and Standard Deviation: To find Σx2 Square the x values, then add. To find ( Σx ) 2 Sum the x values, then square.

Use the computing formulas to find s and s2 x 4 5 7 x2 16 25 49 25 131

Population Mean Population Standard Deviation

Coefficient Of Variation The disadvantage of the standard deviation as a comparative measure of variation is that it depends on the units of measurement. This means that it is difficult to use the standard deviation to compare measurements from different populations. For this reason, statisticians have defined the coefficient of variation, which expresses the standard deviation as a percentage of the sample or population mean.

Coefficient Of Variation: The coefficient of variation is a measurement of the relative variability (or consistency) of data. Notice that the numerator and denominator in the definition of CV have the same units, so CV itself has no units of measurement. This give us the advantage of being able to directly compare the variability of two different populations using the coefficient of variation.

CV is used to compare variability or consistency A sample of newborn infants had a mean weight of 6.2 pounds with a standard deviation of 1 pound. A sample of three-month-old children had a mean weight of 10.5 pounds with a standard deviation of 1.5 pound. Which (newborns or 3-month-olds) are more variable in weight?

To compare variability, compare Coefficient of Variation For newborns: For 3-month-olds: Higher CV: more variable CV = 16% CV = 14% Lower CV: more consistent Use Coefficient of Variation You may wish to compare two groups of data, to answer: Which is more consistent? Which is more variable?

Example A local fishing store sells spinners (a type of fishing lure). The store has only 8 different types of spinners for sale. The prices (in dollars) are 2.10 1.95 2.60 2.00 1.85 2.25 2.15 2.25 Find the coefficient of variation Solution Compute the mean and standard deviation of the population μ = $2.14 and σ = $0.22

Example cont. Compare the CV of prices and comment on the meaning of the results. The CV can be though of as a measure of the spread of the data relative to the average of the data. Since the fishing store is very small, it carries a small selection of spinners that are all priced similarly. The CV tells us that the standard deviation of the spinner prices is only 10.28% from the mean.

Example A large fishing store in Nebraska has a broad selection of spinners. The prices of a random sample of 10 spinners are 1.69 1.49 3.09 1.79 1.39 2.89 1.49 1.39 1.49 1.99 Use the calculator to compute and s = $0.62 Compute the CV for the spinner prices

Example cont. Compare the mean, standard deviation, and CV for the spinner prices at the two fishing stores. Comment on the differences. The CV for Nebraska store is three times more than the CV from the previous example. First, because the fishing store in the previous example is small, and tends to have higher prices (larger μ). Second, it has limited selection of spinners with a smaller variation of price.

Shebyshev’s Theorem The spread of dispersion of a set of data about the mean will be small if the standard deviation is small, and it will be large if the standard deviation is large. If we are dealing with a symmetrical bell-shaped distribution, then we can make very definite statements about the proportion of the data that must lie within a certain number of standard deviations on either side of the mean. However, the concept of data spread about the mean can be expressed quite generally for all data distributions (skewed, symmetric, or other shape) by using the remarkable theorem of Chebyshev.

CHEBYSHEV'S THEOREM For any set of data and for any number k, greater than one, the proportion of the data that lies within k standard deviations of the mean is at least:

Results of Chebyshev’s theorem For k = 2: or at least 75% of the data fall in the interval from from to (between 2 St Deviations) For K = 3 at least 88.9% (between 3 St Deviations) For K = 4 at least 93.8% (between 4 St Deviations)

Using Chebyshev’s Theorem A mathematics class completes an examination and it is found that the class mean is 77 and the standard deviation is 6. According to Chebyshev's Theorem, between what two values would at least 75% of the grades be?

Mean = 77 Standard deviation = 6 At least 75% of the grades would be in the interval: 77 – 2(6) to 77 + 2(6) 77 – 12 to 77 + 12 65 to 89 Assignment 5

Entering Data (Calc.) Data is stored in Lists on the calculator.  Locate and press the STAT button on the calculator.  Choose EDIT.  The calculator will display the first three of six lists (columns) for entering data.  Simply type your data and press ENTER. Use your arrow keys to move between lists. Data can also be entered from the home screen using set notation -- {15, 22, 32, 31, 52, 41, 11} → L1 (where → is the STO key) Data can be entered in a second list based upon the information in a previous list.  In the example below, we will double all of our data values in L1 and store them in L2.  If you arrow up ONTO L2, you can enter a formula for generating L2.  The formula will appear at the bottom of the screen.  Press ENTER and the new list is created.

Clearing Data (Calc.) To clear all data from a list:  Press STAT.  From the EDIT menu, move the cursor up ONTO the name of the list (L1).  Press CLEAR.  Move the cursor down.  NOTE:  The list entries will not disappear until the cursor is moved down.  (Avoid pressing DEL as it will delete the entire column.  If this happens, you can reinstate the column by pressing STAT #5 SetUpEditor.) You may also clear a list by choosing option #4 under the EDIT menu, ClrList.   ClrList will appear on the home screen waiting for you to enter which list to clear.  Enter the name of a list by pressing the 2nd button and the yellow L1 (above the 1). To clear an individual entry:  Select the value and press DEL.

Sorting Data (Calc.) Sorting Data: (helpful when finding the mode) Locate and press the STAT button.  Choose option #2, SortA(. Specify the list you wish to sort by pressing the 2nd button and the yellow L1 list name.  Press ENTER and the list will be put in ascending order (lowest to highest).  SortD will put the list in descending order. One Variable Statistical Calculations: Press the STAT button. Choose CALC at the top.  Select 1-Var Stats.  Notice that you are now on the home screen.  Specify the list you wish to use by choosing the 2nd button and the list name:            Press ENTER and view the calculations.  Use the down arrow to view all of the information.

One Variable Statistical Calculations (Calc.) = mean = the sum of the data = the sum of the squares of the data = the sample standard deviation = the population standard deviation = the sample size (# of pieces of data) = the smallest data entry = data at the first quartile = data at the median (second quartile) = data at the third quartile = the largest data entry

Measures of Dispersion (Calc) Range, Standard Deviation, Variance, Mean Absolute Deviation   Problem:  For the data set {10, 12, 40, 35, 14, 24, 13, 21, 42, 30}, find the range, the standard deviation, the variance, and the mean absolute deviation to the nearest hundredth.   A quick reminder before we begin the solution: In statistics, the population form is used when the data being analyzed includes the entire set of possible data.  The sample form is used when the data is a random sample taken from the entire set of data.  You should use population form unless you know that you are working with a random sample of the data.

Measures of Dispersion cont. (Calc) To find the range: To find the range: Enter the data, as is, into L1.  You can enter the list on the home screen and "store" to L1, or you can go directly to L1 (2nd STAT, #1 Edit).   Sort the list to quickly retrieve the highest and lowest values for the range. (2nd STAT, #2 SortA).  You can choose ascending or descending. Read the high and low values from L1 for computing the range. Range = 42 - 10 = 32. OR:  To find the range:  Do not sort.  Simply type on the home screen using the min and max functions found under MATH → NUM #6 min and #7 max.                Range = 32

Measures of Dispersion cont. (Calc) To find standard deviation: To find standard deviation:  Since this question deals with the complete set, we will be using "population" form, not sample form. Go to one-variable stats for "population" standard deviation.    STAT → CALC  #1 1-Var Stats   NOTE! The standard deviations found in the CATALOG, stdDev, and also found by 2nd LIST → MATH #7 stdDev are both Sample standard deviations. Population Standard Deviation = 11.43

Measures of Dispersion cont. (Calc) To find variance: To find variance: The "population" variance is the square of the population standard deviation. The symbol is under VARS - #5 Statistics NOTE! The variance found in the CATALOG and also found by 2nd List → MATH #8 variance are both Sample variances. To find mean absolute deviation: To find mean absolute deviation: To calculate the mean absolute deviation you will have to enter the formula. Mean Absolute Deviation = 10.12

Measures of Dispersion cont. (Calc) NOTE! Be sure that you have run 1-Var Stats (under STAT - CALC #1) first, so that the calculator will have computed . Otherwise, you will get an error from this formula. and n are found under VARS #5 Statistics. Sum and abs are quickly found in CATALOG. Sum is also under 2nd LIST - MATH #5 sum. abs is also under MATH - NUM #1abs. OR: To find mean absolute deviation: A longer, but workable, solution can also be accomplished using the lists. As stated above, run 1-Var Stats so the calculator will compute . Now, go to L2 (STAT #1 EDIT) and move UP onto L2. Type, at the bottom of the window, the portion of the formula that finds the difference between each data entry and the mean, using absolute value to make these distances positive. Now, find the mean, , of L2 by using 1-Var Stats on L2, and read the answer of 10.12.

Measures of Dispersion on Grouped Data Problem: Data Entry Frequency 100 8 150 15 200 21 250 14 300 5 For the data set shown in this table, find the range, the standard deviation, and the variance to the nearest hundredth. Since this question deals with the complete set, we will be using "population" form, not sample form. For central tendency on grouped data, see Mean, Mode, Median with Grouped Data.

Measures of Dispersion on Grouped Data Solution: To find the range:  No need for calculator work for the range.  It is easily observed from the table. Range = 300 - 100 = 200. To find standard deviation:  Remember, we are looking for "population" form which will be found using 1-Var Stats. Enter the "Data Entry" into L1 and the "Frequency" into L2.  Go to one-variable stats to find "population" standard deviation.    STAT → CALC  #1 1-Var Stats Be sure to use parameters L1, L2 to indicate both the values AND their frequencies. NOTE! The standard deviation found in the CATALOG, stdDev, and also found by 2nd LIST → MATH #7 stdDev are both Sample standard deviations.   Population Standard Deviation = 56.42   Population Standard Deviation = 56.42

Measures of Dispersion on Grouped Data To find variance:   The "population" variance is the square of the population standard deviation. The symbol is under VARS - #5 Statistics     NOTE!  The variance found in the CATALOG and also found by 2nd List → MATH #8 variance are both Sample variances. Population Variance = 3183.42