Topic 8: Sampling Distributions

Slides:



Advertisements
Similar presentations
Chapter 18 Sampling distribution models
Advertisements

AP Statistics Section 9.2 Sample Proportions
Sampling distributions. Example Take random sample of students. Ask “how many courses did you study for this past weekend?” Calculate a statistic, say,
Sampling distributions. Example Take random sample of 1 hour periods in an ER. Ask “how many patients arrived in that one hour period ?” Calculate statistic,
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Sampling Variability & Sampling Distributions.
Today Today: Chapter 8, start Chapter 9 Assignment: Recommended Questions: 9.1, 9.8, 9.20, 9.23, 9.25.
Sample Distribution Models for Means and Proportions
Chapter 5 Sampling Distributions
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
AP Statistics 9.3 Sample Means.
Chapter 18: Sampling Distribution Models
Chapter 10 – Sampling Distributions Math 22 Introductory Statistics.
Population and Sample The entire group of individuals that we want information about is called population. A sample is a part of the population that we.
Chapter 8 Sampling Variability and Sampling Distributions.
A.P. STATISTICS LESSON SAMPLE PROPORTIONS. ESSENTIAL QUESTION: What are the tests used in order to use normal calculations for a sample? Objectives:
Sampling distributions rule of thumb…. Some important points about sample distributions… If we obtain a sample that meets the rules of thumb, then…
Chapter 19 Confidence intervals for proportions
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a measure of the population. This value is typically unknown. (µ, σ, and now.
Sampling Theory and Some Important Sampling Distributions.
Chapter 18 Sampling Distribution Models *For Means.
Sampling Distributions: Suppose I randomly select 100 seniors in Anne Arundel County and record each one’s GPA
Sampling Distributions Chapter 18. Sampling Distributions If we could take every possible sample of the same size (n) from a population, we would create.
MATH Section 4.4.
Sampling Theory Determining the distribution of Sample statistics.
Sampling Distributions Chapter 18. Sampling Distributions A parameter is a number that describes the population. In statistical practice, the value of.
Sampling Distributions
Sampling Distributions Chapter 18
Chapter 6: Sampling Distributions
Ch5.4 Central Limit Theorem
Chapter 7 Review.
Sampling Distribution Estimation Hypothesis Testing
Chapter 7: Sampling Distributions
Sampling Variability & Sampling Distributions
Section 9.2 – Sample Proportions
Chapter 6: Sampling Distributions
Sampling Distributions for a Proportion
Chapter 5 Sampling Distributions
Chapter 18: Sampling Distribution Models
Week 10 Chapter 16. Confidence Intervals for Proportions
Chapter 5 Sampling Distributions
Basic Practice of Statistics - 3rd Edition Sampling Distributions
Basic Practice of Statistics - 3rd Edition Sampling Distributions
Sampling Distribution Models
Sampling Distribution of the Sample Mean
MATH 2311 Section 4.4.
Sampling Distribution
Sampling Distribution
AP Statistics: Chapter 18
Chapter 7: Sampling Distributions
Chapter 10: Estimating with Confidence
Sampling Distribution Models
Sampling Distribution of a Sample Proportion
Sampling Distributions
CHAPTER 15 SUMMARY Chapter Specifics
Sampling Distribution of the Mean
CHAPTER 12 More About Regression
Chapter 7: Sampling Distributions
From Samples to Populations
Sampling Distribution Models
Sampling Distribution of a Sample Proportion
Sampling Distributions (§ )
Chapter 7: Sampling Distributions
Chapter 7: Sampling Distributions
Chapter 7: Sampling Distributions
Quantitative Methods Varsha Varde.
Central Limit Theorem cHapter 18 part 2.
Chapter 7: Sampling Distributions
Cherish our mother earth; be kind to other beings
How Confident Are You?.
MATH 2311 Section 4.4.
Presentation transcript:

Topic 8: Sampling Distributions How sample proportions vary around the population proportion How sample means vary around the population mean

Populations and samples We are often interested in population parameters. Since complete populations are difficult (or impossible) to collect data on, we use sample statistics as point estimates for the unknown population parameters of interest. Sample statistics vary from sample to sample. Quantifying how sample statistics vary provides a way to estimate the margin of error associated with our points estimate. For this topic, let’s try to understand how point estimates vary from sample to sample. Suppose we randomly sample 1,000 adults from each state in the US. Would you expect the sample means of their heights to be the same, somewhat different, or very different?

Populations and samples We are often interested in population parameters. Since complete populations are difficult (or impossible) to collect data on, we use sample statistics as point estimates for the unknown population parameters of interest. Sample statistics vary from sample to sample. Quantifying how sample statistics vary provides a way to estimate the margin of error associated with our points estimate. For this topic, let’s try to understand how point estimates vary from sample to sample. Suppose we randomly sample 1,000 adults from each state in the US. Would you expect the sample means of their heights to be the same, somewhat different, or very different? Not the same, but only somewhat different.

How sample proportions vary around the population proportion

Example: do you smoke? 30,000 college students were asked if they smoke. Their answers are summarized below. 25.87% of the college students reported that they smoke.

Example: do you smoke? 30,000 college students were asked if they smoke. Their answers are summarized below. 25.87% of the college students reported that they smoke. Suppose this is our population, and we want to estimate this percentage by choosing a sample.

A random sample of 1000 students Our statistic: 27.60% (proportion of 1000 randomly selected students that smoke) Parameter: 25.87% (proportion of all 30,000 students that smoke).

A random sample of 1000 students Our statistic: 27.60% (proportion of 1000 randomly selected students that smoke) Parameter: 25.87% (proportion of all 30,000 students that smoke). Our statistic is quite close. Were we lucky, or is there a reason?

Sampling Distribution We repeatedly take a random sample of 1000 students and compute the statistic (proportion that smoke). Below is a histogram displaying the statistics computed for many samples.

Sampling Distribution We repeatedly take a random sample of 1000 students and compute the statistic (proportion that smoke). Below is a histogram displaying the statistics computed for many samples. Mean = .2583 Standard Deviation = 0.0140

The central limit theorem (proportions) The distribution of the sample proportion is well approximated by the normal model: Where p is the population proportion and n in the sample size. SE is the standard error, which is the standard deviation of the sampling distribution. It wasn’t coincidence that our sample proportion was “close” to the population proportion. It wasn’t coincidence that the sampling distribution was centered at the population proportion and approximately normal. Note: if the sample size increases, then the standard error decreases.

Central limit theorem (CLT): conditions for proportions Certain conditions must be met for the CLT to apply: We have a random sample from the population The sample is large enough so that we see at least 5 observations of both possible outcomes

How sample means vary around the population mean

Example: what do MLB players make? The salaries of 16,383 Major League Baseball players are displayed in the histogram below. Mean salary for these players is $1,265,466 Suppose this is our population, and we want to estimate this mean by choosing a small sample.

A random sample of 1000 players The salaries of 1,000 Major League Baseball players are displayed in the histogram below. Our statistic: $1,261,780.70 (mean salary of randomly selected 1,000 players) Parameter: $1,265,466 (mean salary for all 16,383 players). Our statistic is quite close. Again, were we lucky or is there a reason?

Sampling Distribution We repeatedly take a random sample of 1000 players and compute the statistic (mean salary). Below is a histogram displaying the statistics computed for many samples.

Sampling Distribution We repeatedly take a random sample of 1000 players and compute the statistic (mean salary). Below is a histogram displaying the statistics computed for many samples. Mean = $1,266,084.20 Standard deviation = $66,220.87

The central limit theorem (means) The distribution of the sample mean is well approximated by the normal model: Where μ is the population mean and n in the sample size. SE is the standard error, which is the standard deviation of the sampling distribution. It wasn’t coincidence that our sample mean was “close” to the population mean. It wasn’t coincidence that the sampling distribution was centered at the population mean and approximately normal. Note: if the sample size increases, then the standard error decreases.

Central limit theorem (CLT): conditions for means Certain conditions must be met for the CLT to apply: The more skewed the population distribution, the larger sample size we need for CLT to apply. For moderately skewed distributions n > 30 is a widely used rule of thumb.