Chapter 7 Sample Variability. Those who jump off a bridge in Paris are in Seine. A backward poet writes inverse. A man's home is his castle, in a manor.

Slides:



Advertisements
Similar presentations
Chapter 18 Sampling distribution models
Advertisements

 These 100 seniors make up one possible sample. All seniors in Howard County make up the population.  The sample mean ( ) is and the sample standard.
Sampling: Final and Initial Sample Size Determination
Terminology A statistic is a number calculated from a sample of data. For each different sample, the value of the statistic is a uniquely determined number.
McGraw-Hill Ryerson Copyright © 2011 McGraw-Hill Ryerson Limited. Adapted by Peter Au, George Brown College.
What Is a Sampling Distribution?
Chapter 8 Estimating Single Population Parameters
Chapter 7 Sampling and Sampling Distributions
Sampling Distributions
Chapter 9 Hypothesis Testing.
Chapter 7 ~ Sample Variability
Sampling Theory Determining the distribution of Sample statistics.
Chapter 7 Sampling Distributions
Chapter 7: Sampling Distributions
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 7: Sampling Distributions Section 7.1 What is a Sampling Distribution?
Sampling Theory Determining the distribution of Sample statistics.
Sociology 5811: Lecture 7: Samples, Populations, The Sampling Distribution Copyright © 2005 by Evan Schofer Do not copy or distribute without permission.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 7 Sampling Distributions.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 6 Sampling Distributions.
Section 8.1 Estimating  When  is Known In this section, we develop techniques for estimating the population mean μ using sample data. We assume that.
Chapter 7: Sample Variability Empirical Distribution of Sample Means.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 7 Sampling Distributions.
Population and Sample The entire group of individuals that we want information about is called population. A sample is a part of the population that we.
8 Sampling Distribution of the Mean Chapter8 p Sampling Distributions Population mean and standard deviation,  and   unknown Maximal Likelihood.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Stat 1510: Sampling Distributions
Stats Lunch: Day 3 The Basis of Hypothesis Testing w/ Parametric Statistics.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 7 Sampling Distributions 7.1 What Is A Sampling.
1 ES Chapter 11: Goals Investigate the variability in sample statistics from sample to sample Find measures of central tendency for sample statistics Find.
Random Variables Numerical Quantities whose values are determine by the outcome of a random experiment.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 7: Sampling Distributions Section 7.1 What is a Sampling Distribution?
Unit 7: Sampling Distributions
Chapter 18 Sampling distribution models math2200.
Chapter 13 Sampling distributions
Introduction to Inference Sampling Distributions.
1 Chapter 8 Interval Estimation. 2 Chapter Outline  Population Mean: Known  Population Mean: Unknown  Population Proportion.
Sampling Distributions: Suppose I randomly select 100 seniors in Anne Arundel County and record each one’s GPA
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
SAMPLING DISTRIBUTION OF MEANS & PROPORTIONS. SAMPLING AND SAMPLING VARIATION Sample Knowledge of students No. of red blood cells in a person Length of.
Sampling Distributions Chapter 18. Sampling Distributions If we could take every possible sample of the same size (n) from a population, we would create.
Chapter 7: The Distribution of Sample Means
Sampling Theory Determining the distribution of Sample statistics.
This represents the most probable value of the measured variable. The more readings you take, the more accurate result you will get.
Sampling Distributions
Chapter 7: Sampling Distributions
Combining Random Variables
Determining the distribution of Sample statistics
Chapter 9: Sampling Distributions
AP Statistics: Chapter 18
Sampling Distributions
What Is a Sampling Distribution?
Chapter 7: Sampling Distributions
Chapter 7: Sampling Distributions
Test Drop Rules: If not:
Chapter 7: Sampling Distributions
Chapter 7: Sampling Distributions
Chapter 7: Sampling Distributions
Chapter 9: Sampling Distributions
Chapter 7: Sampling Distributions
Chapter 7: Sampling Distributions
Sampling Distributions (§ )
Chapter 7: Sampling Distributions
Essential Statistics Sampling Distributions
Chapter 7: Sampling Distributions
Chapter 7: Sampling Distributions
The Practice of Statistics – For AP* STARNES, YATES, MOORE
Chapter 7: Sampling Distributions
Chapter 7: Sampling Distributions
Chapter 7: Sampling Distributions
Chapter 7: Sampling Distributions
Presentation transcript:

Chapter 7 Sample Variability

Those who jump off a bridge in Paris are in Seine. A backward poet writes inverse. A man's home is his castle, in a manor of speaking.

Sampling  The Need Get information about a population without checking the entire population Get information about a population without checking the entire population  Advantages Cost Cost Time Time Accuracy (can be achieved with low cost) Accuracy (can be achieved with low cost) Destruction is sometimes involved; checking all is not possible. Destruction is sometimes involved; checking all is not possible.  [Insert Excel Simulation here]

Distribution of Means

Visual Mean of Means

Distribution of Sample Means  Many different sample means are possible  The sample means cluster closer to the population mean than the population values do.  The larger the sample, the closer they cluster around the population mean  Therefore the likelihood of a single sample mean being close to the true mean is high

Distribution of Sample Means  When trying to use a sample to estimate a population mean, we know we won’t get the exact value  We want some way of managing the error so as to be as close as we need to be  We can decide on a margin of error that we are willing to accept (polls typically 2% - 4%).  We cannot eliminate the possibility of getting a value outside that range, but we can keep it small by adjusting the sample size.

How Close Can We Get?  The variance of the sample mean is the population variance divided by n (sample size)  Thus larger n’s bring smaller variances  Let’s look at an example. In order to understand the process, we will assume we actually know the true mean and variance. Each of the following graphs is from a computer simulation of taking 100 samples from a normal population with μ=15 and σ=3, but with different sample sizes.

μ= 15, σ=3, Sample Size 1 Number observed in [14,16]: 30

μ = 15, σ=3, Sample Size 4 Number observed in [14,16]: 52

μ = 15, σ=3, Sample Size 9 Number observed in [14,16]: 74

μ = 15, σ=3, Sample Size 16 Number observed in [14,16]: 81

μ = 15, σ=3, Sample Size 25 Number observed in [14,16]: 90

μ = 15, σ=3, Sample Size 36 Number observed in [14,16]: 97

Number in [14,16] vs Sample Size

So What?  In Real Life, we don’t know the true mean and variance. We want to estimate them.  Furthermore, we will only take one sample, which represents just one data point from the distributions we have illustrated.  We will probably NEVER know where in the distribution that data point is coming from.  Under these conditions, how can we provide an estimate that is trustworthy?  Clearly, the sample size directly affects the likelihood that the sample mean will be close to the true mean.

Which one would you like to pick from?  The situation: You have 100 balls in an urn (left). Each has an odd number on it, which may be from 7-25, but you don’t know how many of each there are. You will draw one ball and record its number. If this number matches the mean of the distribution, your company will make lots of money and you will get a promotion. However, you have the opportunity, for a sizable fee, to trade in the urn for the one on the right. If you do so, and are wrong, you will be fired because of the excessive expense you incurred.

Does the name Pavlov ring a bell? Reading while sunbathing makes you well red. When two egotists meet, it's an I for an I.

Notes: 1. : the sample mean. 2. : the standard deviation of the sample means. 3.The theory involved with sampling distributions described in the remainder of this chapter requires random sampling. Random Sample: A sample obtained in such a way that each possible sample of a fixed size n has an equal probability of being selected. (Example: Every possible handful of size n has the same probability of being selected.)

The Central Limit Theorem  The most important idea in all of statistics.  Describes the sampling distribution of the sample mean.  Examples suggest: the sample mean (and sample total) tend to be normally distributed.

Distribution of Sample Means If all possible random samples of a particular size n are taken from any population with a mean  and a standard deviation , the distribution of sample means will: 1.have a mean equal to . 2.have a standard deviation equal to Further, if the sampled population has a normal distribution, then the sampling distribution of will also be normal for samples of all sizes. Central Limit Theorem The distribution of sample means will come closer to normal as the sample size increases.

Graphical Illustration of the Central Limit Theorem: Original Population Distribution of : n = 2 Distribution of : n = 10 Distribution of : n = 30

Example: Consider a normal population with  = 50 and  = 15. Suppose a sample of size 9 is selected at random. Find: 1.2. Solution: Since the original population is normal, the distribution of the sample mean is also (exactly) normal.

Example: A recent report stated that the day-care cost per week in Boston is $109. Suppose this figure is taken as the mean cost per week and that the standard deviation is known to be $20. 1.Find the probability that a sample of 50 day-care centers would show a mean cost of $105 or less per week. 2.Suppose the actual sample mean cost for the sample of 50 day-care centers is $120. Is there any evidence to refute the claim of $109 presented in the report? Solution: The shape of the original distribution is unknown, but the sample size, n, is large. The CLT applies. The distribution of is approximately normal.

 To investigate the claim, we need to examine how likely an observation is the sample mean of $120.  Consider how far out in the tail of the distribution of the sample mean is $120.  Compute the tail probability.  Since the tail probability is so small, this suggests the observation of $120 is very rare (if the mean cost is really $109).  There is evidence to suggest the claim of  = $109 is wrong.

In democracy your vote counts. In feudalism your count votes. She was engaged to a boyfriend with a wooden leg but broke it off. A chicken crossing the road is poultry in motion.