Download presentation
Presentation is loading. Please wait.
1
Variance Variance: Standard deviation:
The average squared distance of observations from the mean. Its corresponding measure of central tendency is the mean. Standard deviation: The square root of the variance. Variance is in squared units, so can be hard to interpret. Standard deviation is in the original units of the measurements. These measure how much the data deviate from the mean. They use all the data in their calculations. Draw distributions with large and small variances on the board.
2
Variance Two versions depending on the context of data
Population: all subjects of interest Every single person in the United States Entire UCM student body Calculation: π 2 = π=1 π π₯ π βπ 2 π = ππ π Population standard deviation: π= π 2 Sample: subset of population 100 random people in the US 100 random students from UCM π 2 = π=1 π π₯ π β π₯ πβ1 = ππ πβ1 Sample standard deviation: π = π 2 Point out what a deviation is. Make note that SS stands for sum of squares. Make a slight allusion to why s^2 is calculated with n-1 instead of n.
3
Population Standard Deviation (Ο)
Measure of how spread out the values are in a data set consisting of the entire population of interest (i.e., how much do the population values deviate from the mean?) Step 1: Calculate the deviation (distance) of each value (π π ) from the population mean (Β΅) Β΅ = 3 Step 2: Square each deviation, then add them up (β2) 2 + (β1) 2 + (0) 2 +(1) 2 +(2) 2 =10 Step 3: Divide the sum of squared deviations by the number of observations (N) 10/5=2 (population variance = Ο 2 ) Step 4: Take the square root of the βaverage squared deviationβ (population variance) 2 =1.41=π (3-3) = 0 (2-3) = -1 (4-3) = 1 (1-3) = -2 (5-3) = 2 You square these values because otherwise they will add up to 0 BY DEFINITION. Show why that is.
4
π= (1β3) 2 + (2β3) 2 + (3β3) 2 + (4β3) 2 + (5β3) 2 5 =1.41
Formulas Population Standard Deviation π= π=1 π ( π₯ π βπ) 2 π π= (1β3) 2 + (2β3) 2 + (3β3) 2 + (4β3) 2 + (5β3) =1.41
5
Sample Standard Deviation (s)
Measure of how spread out the values are in a data set consisting of a sample of the population of interest (i.e., how much do the sample values deviate from the mean?) Step 1: Calculate the deviation (distance) of each value ( π π ) from the sample mean ( π ) π₯ =3 Step 2: Square each deviation, then add them up (β2) 2 + (β1) 2 + (0) 2 +(1) 2 +(2) 2 =10 Step 3: Divide the sum of squared deviations by number of observations minus 1 (n-1) 10/(5β1)=10/4=2.5 (sample variance = π 2 ) Step 4: Take the square root of the βcorrected average squared deviationβ (sample variance) 2.5 =1.58=π (3-3) = 0 (2-3) = -1 (4-3) = 1 (1-3) = -2 (5-3) = 2
6
π = (1β3) 2 + (2β3) 2 + (3β3) 2 + (4β3) 2 + (5β3) 2 5β1 =1.58
Formulas Sample Standard Deviation π = π=1 π ( π₯ π β π₯ ) 2 πβ1 π = (1β3) 2 + (2β3) 2 + (3β3) 2 + (4β3) 2 + (5β3) 2 5β1 =1.58 Write the computational formula/proof on the board.
7
Simplified Formula Using the rules for summations, a simpler formula for the standard deviation can be found. π = π=1 π ( π₯ π β π₯ ) 2 πβ1 = π=1 π ( π₯ π 2 β2 π₯ π π₯ + π₯ 2 ) πβ1 = π=1 π π₯ π 2 β π=1 π 2 π₯ π₯ π + π=1 π π₯ 2 πβ1 = π=1 π π₯ π 2 β2 π₯ π=1 π π₯ π +π π₯ πβ1 = π=1 π π₯ π 2 β2π π₯ 2 +π π₯ 2 πβ1 = π=1 π π₯ π 2 βπ π₯ 2 πβ1 π = ( )β5Γ β1 =1.58
8
Another Example Suppose we have a sample of 6 scores on a quiz:
5, 10, 3, 7, 2, 3 What is the mode for these data? 3 What is the median for these data? 4 What is the mean for these data? 5 What is the sample variance for these data? 46 5 =9.2 What is the sample standard deviation for these data? 9.2 β3.033
9
Population versus Sample SD
The sample SD seeks to estimate the population SD, in the same way that the sample mean seeks to estimate the population mean The distribution of the sample mean is such that it may incorrectly estimate the population mean, but itβs no more likely to overestimate than it is to underestimate It is unbiased The sample SD, on the other hand, will consistently underestimate the population SD, so we divide it by a smaller number (n-1 instead of n) to correct for this underestimation error. Without correction, the sample SD and sample variance would be biased estimators
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.