Presentation is loading. Please wait.

Presentation is loading. Please wait.

Estimating the Standard Deviation © Christine Crisp “Teach A Level Maths” Statistics 2.

Similar presentations


Presentation on theme: "Estimating the Standard Deviation © Christine Crisp “Teach A Level Maths” Statistics 2."— Presentation transcript:

1 Estimating the Standard Deviation © Christine Crisp “Teach A Level Maths” Statistics 2

2 The formula for the standard error ( the standard deviation of the sample means ) is where  is the population standard deviation and n is the sample size. standard error (s.e. ) = However, we may not know the population standard deviation so we must estimate this from our sample. The obvious quantity to use is the sample standard deviation but it can be shown that this is too small so we need to make an adjustment.

3 When we are estimating in Statistics, we talk about biased and unbiased estimators. An unbiased estimator is one that on average gives the value we are estimating. So, for example, if the value we wanted to estimate was equal to 2, and all possible samples gave us these values: 1.8, 1.9, 2, 2.1, 2.2 the statistic giving the values would be unbiased: it’s mean is 2. ( We mustn’t worry that 4 of the 5 values are wrong. That isn’t the point. We must be right on average. ) We want unbiased estimators. If we had 1.8, 2, 2, 2, 2.3 our estimator is biased since the average is not correct, even though more individual values are correct.

4 Population and 1000 sample means Population, 1 st sample and mean of 1 st sample Let’s look at the hens eggs again. We’ve met 3 different standard deviations (s.ds.) so we need to be clear which s.d. we are talking about. The 1 st s.d. is the Population standard deviation ( the one we want to estimate ) The 2 nd s.d. is the standard error or standard deviation of all sample means. It is also unknown as it depends on the unknown .

5 Population and 1000 sample means Population, 1 st sample and mean of 1 st sample Let’s look at the hens eggs again. We’ve met 3 different standard deviations (s.ds.) so we need to be clear which s.d. we are talking about. We are left with the 3 rd s.d., the standard deviation, s, of our one sample. It can shown ( although we don’t need to do it ) that this is a biased estimator. However, we can tweak it to change it into an unbiased estimator. s could be the standard deviation of this sample

6 Population and 1000 sample means Population, 1 st sample and mean of 1 st sample Let’s look at the hens eggs again. We’ve met 3 different standard deviations (s.ds.) so we need to be clear which s.d. we are talking about. The unbiased estimator of  is where s is the standard deviation of a sample.

7 The unbiased estimator of  2 In your formula book you will find the unbiased estimator of  2, the population variance, written as To use this, replace the capital X i by x i ( the sample data ) and by ( the sample mean ). However, you’ll probably be using calculator functions not a formula and your calculator gives the unbiased estimator of the population standard deviation as well as the sample standard deviation. Try the following: It gives the same result as ( For standard deviation, just square root. )

8 Enter the following data in your calculator: 1, 3, 5 Select the list of statistics and you should find the values The 1 st ( larger ) of these is the unbiased estimator of . The other is the standard deviation of the sample. If you are not sure which to use, think about whether you are making estimates from a sample. If so, use the 1 st ( larger ) value. The biased and unbiased estimators are nearly the same. One further point: if n is large, is very close to 1. Ignore the symbols used by the calculator. They are not the ones we use.

9 SUMMARY  To estimate the variance of a population we use the unbiased estimator, S 2, where and s 2 is the variance of a sample of size n.  Calculators give the values of both s, the sample standard deviation and S the unbiased estimator of population standard deviation but we must ignore the calculator notation.  An unbiased estimator is one where the average of all possible values equals the quantity being estimated.  S 2 can also be found from where x represents each data item and is the sample mean.

10 The calculator gives the sample s.d. as 10·0457... so we need to square to find the variance. I’ve written down 3 s.f. but will use the more exact calculator value. This is the unbiased estimate of the population mean,  e.g. 1. Six people in a factory were selected and asked how long they took to get to work. The results, in minutes, were as follows: 7, 12, 13, 20, 30, 35 Calculate the mean and variance of the times in the sample and hence find unbiased estimates of the mean and variance of the times for all the workers. The unbiased estimate of the variance,  2, is Sample mean, Sample variance, Solution: Although I’m showing the formulae in the solution, I’m using the calculator functions to find each answer.

11 e.g. 2. The following sets of data are from samples, each from a different Normal population. Find unbiased estimates of the mean, , and standard deviation, , of each of the populations. (a) 17, 24, 25, 31, 42 (b) (c) (a)Sample: Unbiased estimates of population parameters are: Solutions:

12 (b) Sample mean,  Unbiased estimate of mean,  is Unbiased estimate of population standard deviation,  is Sample variance, Estimate of population variance:

13 Sample mean,  Unbiased estimate of mean,  is Unbiased estimate of  is S where (c)

14 Exercise The following sets of data are from samples, each from a different Normal population. Find unbiased estimates of the mean, , and standard deviation, , of each of the populations. (a) 5·2, 7·9, 8·1, 9·3 (b) (c) (a)Sample: Unbiased population estimates: Solutions:

15 Sample mean,  Unbiased estimate of mean,  is Unbiased estimate of  is Sample variance, (b)

16 Sample mean,  Unbiased estimate of mean,  is Unbiased estimate of  is S where (c)

17 Estimating the Standard Deviation

18 The following slides contain repeats of information on earlier slides, shown without colour, so that they can be printed and photocopied. For most purposes the slides can be printed as “Handouts” with up to 6 slides per sheet.

19 Estimating the Standard Deviation The unbiased estimator of  2 The unbiased estimator of  2 ( the population variance ) is given by In your formula book you will find this written as You can use either, replacing the capital X i by x i and by ( data and mean ) from your sample. However, you’ll probably be using calculator functions not a formula and your calculator the unbiased estimator of population standard deviation as well as the sample standard deviation.

20 Estimating the Standard Deviation Enter the following data in your calculator: 1, 3, 5 Select the list of statistics and you should find the values The 1 st ( larger ) of these is the unbiased estimator of . The other is the standard deviation of the sample. If you aren’t sure which to use, think about whether you are making estimates from a sample. If so, use the 1 st ( larger ) value. The biased and unbiased estimators are nearly the same. One further point: if n is large, is very close to 1.

21 Estimating the Standard Deviation SUMMARY  To estimate the variance of a population we use the unbiased estimator, S 2, where and s 2 is the variance of a sample of size n.  An unbiased estimator is one where the mean of all possible values equals the quantity being estimated.  S 2 can also be found from where x represents each data item and is the sample mean.  Calculators give the values of both s, the sample standard deviation and S the unbiased estimator of population standard deviation but we must ignore the calculator notation.

22 Estimating the Standard Deviation e.g. The following sets of data are from samples, each from a different Normal population. Find unbiased estimates of the mean, , and standard deviation, , of each of the populations. (a) 17, 24, 25, 31, 42 (b) (c) (a)Using calculator functions, Solutions: Unbiased estimates of population parameters are: For the sample,

23 Estimating the Standard Deviation (b) Sample mean,  Unbiased estimate of mean,  is Unbiased estimate of population standard deviation,  is Sample variance, Estimate of population variance:

24 Estimating the Standard Deviation Sample mean,  Unbiased estimate of mean,  is Unbiased estimate of population standard deviation,  is S where (c)


Download ppt "Estimating the Standard Deviation © Christine Crisp “Teach A Level Maths” Statistics 2."

Similar presentations


Ads by Google