Download presentation
Presentation is loading. Please wait.
Published byCynthia Stevenson Modified over 9 years ago
1
The Practice of Statistics Chapter 9: 9.1 Sampling Distributions Copyright © 2008 by W. H. Freeman & Company Daniel S. Yates
2
Essential Questions for 9.1 What is a parameter? What is a statistics? What is sampling variability? What is a sampling distribution? How do you describe a sampling distribution? What is an unbiased statistic and an unbiased estimator?
3
Definitions parameter: –a number that describes the population –a parameter is a fixed number –in practice, we do not know its value because we cannot examine the entire population
4
Definitions statistic: –a number that describes a sample –the value of a statistic is known when we have taken a sample, but it can change from sample to sample –we often use a statistic to estimate an unknown parameter
6
Compare parameter –mean: μ –standard deviation: σ –proportion: p Sometimes we call the parameters “true”; true mean, true proportion, etc. statistic –mean: x-bar –standard deviation: s –proportion: (p-hat) Sometimes we call the statistics “sample”; sample mean, sample proportion, etc.
7
Sampling Variability Question: What would happen if we took many sample? To answer this question, we would do the following: –Take a large number of samples from the same population. –Calculate the sample means or the sample proportion for each sample. –Make a histogram. –Examine the distribution for shape, center, spread and outliers. In Practice it is too expensive to take many samples from a population. Simulation may be used instead of many samples to approximate the sampling distribution. Probability may be used to obtain an exact sampling distribution without simulation.
8
Sampling Distributions The sampling distribution of a statistic is the distribution of values taken by the statistic in all possible samples of the same size from the same population.
9
http://www.ruf.rice.edu/~lane/stat_sim/sam pling_dist/
11
The population used to construct the random number table (Table B) can be described by the probability function below. Example – Using an Exact Sampling Distribution Consider taking an SRS of size 2 from this population and computing the means for the sample.
12
Instead of doing a simulation we can construct the actual sampling distribution displayed below. Using this table we can construct a sampling distribution.
13
The sampling distribution of the means for sample size n = 2. μ = ?. Does this agree with E(x) of the distribution?
14
Example –Survivor Fan? Television executives and companies who advertise on TV are interested in how many viewers watch particular television shows. According to 2001 Nielsen ratings, Survivor II was one of the most watched television shows in the US during every week that is aired. Suppose that true proportion of US adults who watched Survivor II is p=.37. Suppose we did a survey with n=100. Suppose we did this survey 1000 times.
15
Sampling Distribution for SRSs of size n = 100
16
Sampling distribution for SRSs of size n = 1000.
17
Sampling distribution for SRSs of size n = 1000 with scale change.
18
Simulations Means (n=120) 4 3 2 Means (n=60) 432 Means (n=30) 432 To illustrate the general behavior of samples of fixed size n, 10000 samples each of size 30, 60 and 120 were generated from this uniform distribution and the means calculated. Probability histograms were created for each of these (simulated) sampling distributions. Notice all three of these look to be essentially normally distributed. Further, note that the variability decreases as the sample size increases.
19
Simulations Skewed distribution To further illustrate the general behavior of samples of fixed size n, 10000 samples each of size 4, 16 and 32 were generated from the positively skewed distribution pictured below. Notice that these sampling distributions all all skewed, but as n increased the sampling distributions became more symmetric and eventually appeared to be almost normally distributed.
20
Back to Survivor Fan Problem Notice both distributions are centered at p =.37
21
Survivor Fan Because the sampling distribution is centered at the true value, there is no systematic tendency to overestimate or underestimate the paramater p.
24
Examples of errors in bias and variability
28
Large bias and large variability. Label according to bias and variability.
29
Small bias and small variability.
30
Small bias and large variability.
31
Large bias and small variability.
32
Essential Questions for 9.1 What is a parameter? What is a statistics? What is sampling variability? What is a sampling distribution? How do you describe a sampling distribution? What is an unbiased statistic and an unbiased estimator?
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.