Chapter 7: Sampling Distributions

Slides:



Advertisements
Similar presentations
Chapter 6 – Normal Probability Distributions
Advertisements

Sampling Distributions and Sample Proportions
Sampling Distributions
Terminology A statistic is a number calculated from a sample of data. For each different sample, the value of the statistic is a uniquely determined number.
1 Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Sampling Variability & Sampling Distributions.
Chapter 7 Introduction to Sampling Distributions
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 6-1 Introduction to Statistics Chapter 7 Sampling Distributions.
Sampling Distributions
Part III: Inference Topic 6 Sampling and Sampling Distributions
Sampling Distributions
CHAPTER 3: The Normal Distributions Lecture PowerPoint Slides The Basic Practice of Statistics 6 th Edition Moore / Notz / Fligner.
BCOR 1020 Business Statistics
UNIT FOUR/CHAPTER NINE “SAMPLING DISTRIBUTIONS”. (1) “Sampling Distribution of Sample Means” > When we take repeated samples and calculate from each one,
Chapter 5 Sampling Distributions
Overview 7.2 Central Limit Theorem for Means Objectives: By the end of this section, I will be able to… 1) Describe the sampling distribution of x for.
Chapter 6: Probability Distributions
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 7 Sampling Distributions.
STA Lecture 161 STA 291 Lecture 16 Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately)
Dan Piett STAT West Virginia University
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 6 Sampling Distributions.
AP Statistics Chapter 9 Notes.
Chapter 8: Confidence Intervals
16-1 Copyright  2010 McGraw-Hill Australia Pty Ltd PowerPoint slides to accompany Croucher, Introductory Mathematics and Statistics, 5e Chapter 16 The.
Introduction to Inferential Statistics. Introduction  Researchers most often have a population that is too large to test, so have to draw a sample from.
Chap 6-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 6 Introduction to Sampling.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 6-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
+ Chapter 2: Modeling Distributions of Data Section 2.2 Normal Distributions The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
CHAPTER 3: The Normal Distributions ESSENTIAL STATISTICS Second Edition David S. Moore, William I. Notz, and Michael A. Fligner Lecture Presentation.
Chapter 6 Lecture 3 Sections: 6.4 – 6.5.
Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 7 Sampling Distributions.
Measures of central tendency are statistics that express the most typical or average scores in a distribution These measures are: The Mode The Median.
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 7 Sampling Distributions.
Chapter 7 Probability and Samples: The Distribution of Sample Means
Distributions of the Sample Mean
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
1 Chapter 8 Sampling Distributions of a Sample Mean Section 2.
Chapter 8 Sampling Variability and Sampling Distributions.
The Normal Distribution
Sampling distributions rule of thumb…. Some important points about sample distributions… If we obtain a sample that meets the rules of thumb, then…
Chapter 7: Sampling Distributions Section 7.1 How Likely Are the Possible Values of a Statistic? The Sampling Distribution.
Chapter 6 Lecture 3 Sections: 6.4 – 6.5. Sampling Distributions and Estimators What we want to do is find out the sampling distribution of a statistic.
Lecture 5 Introduction to Sampling Distributions.
+ Chapter 2: Modeling Distributions of Data Section 2.2 Normal Distributions The Practice of Statistics, 4 th edition - For AP* STARNES, YATES, MOORE.
Chapter 7 Introduction to Sampling Distributions Business Statistics: QMIS 220, by Dr. M. Zainal.
MATH Section 4.4.
Sampling and Sampling Distributions Chapter 8.4 – 8.7.
THE NORMAL DISTRIBUTION
Sampling Distributions
Sampling Distributions
Chapter 2: Modeling Distributions of Data
Sampling Variability & Sampling Distributions
Chapter 7 Sampling Distributions.
Week 10 Chapter 16. Confidence Intervals for Proportions
Chapter 2: Modeling Distributions of Data
Chapter 7 Sampling Distributions.
MATH 2311 Section 4.4.
Chapter 2: Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Chapter 7 Sampling Distributions.
Chapter 2: Modeling Distributions of Data
The Practice of Statistics
Chapter 7 Sampling Distributions.
Chapter 2: Modeling Distributions of Data
Chapter 3 Modeling Distributions of Data
Chapter 2: Modeling Distributions of Data
Chapter 7 Sampling Distributions.
Mean and Median.
STA 291 Summer 2008 Lecture 12 Dustin Lueker.
Chapter 5: Sampling Distributions
MATH 2311 Section 4.4.
Presentation transcript:

Chapter 7: Sampling Distributions Basic Practice of Statistics - 3rd Edition Discovering Statistics 2nd Edition Daniel T. Larose Chapter 7: Sampling Distributions Lecture PowerPoint Slides Chapter 5

Chapter 7 Overview 7.1 Introduction to Sampling Distributions 7.2 Central Limit Theorem for Means 7.3 Central Limit Theorem for Proportions

The Big Picture Where we are coming from and where we are headed… In Chapters 1–4, we learned ways to describe data sets using numbers, tables, and graphs. In Chapters 5–6 we learned the tools of probability and probability distributions that allow us to quantify uncertainty. In Chapter 7, we will discover that seemingly random statistics have predictable behaviors. The special type of distribution we use to describe these behaviors is called the sampling distribution. We will also learn about the most important result in statistical inference, the Central Limit Theorem. The sampling distributions we learn in this chapter form the basis for the statistical inference we will perform in the rest of the book.

7.1: Introduction to Sampling Distributions Objectives: Explain the sampling distribution of the sample mean. Describe the sampling distribution of the sample mean when the population is normal. Find probabilities and percentiles for the sample mean when the population is normal.

Sampling Distribution of the Sample Mean In this chapter, we will develop methods that will allow us to quantify the behavior of statistics like the sample mean. The sampling distribution of the sample mean for a given sample size n consists of the collection of the means of all possible samples of size n from the population. Example 7.1 If we calculate the mean time for every possible sample of three individuals, we get the sampling distribution below.

Sampling Distribution of the Sample Mean When working with sampling distributions, it is important to know the mean and standard deviation. The mean of the sampling distribution of the sample mean is the value of the population mean µ. That is, . The standard deviation of the sampling distribution of the sample mean is called the standard error of the mean. It is equal to , where σ is the population standard deviation. Note, because the denominator of the standard error formula is √n, the larger the sample size, the tighter the resulting sampling distribution. Larger sample sizes lead to smaller variability, which results in more precise estimation.

Example According to CanEquity Mortgage company, the mean age of mortgage applicants in the City of Toronto is 37 years old. Assume that the standard deviation is 6 years. Find the mean and standard deviation for the sampling distribution of the sample mean for the following sample sizes: (a) 4, (b) 100, (c) 225 (a) (b) (c)

Sampling Distribution of the Sample Mean for a Normal Population Two important facts should be noted about sample means that are collected from a normal population. For a normal population, the sampling distribution of the sample mean is distributed as normal (µ, σ/√n), where µ is the population mean and σ is the population standard deviation. When the sampling distribution of the sample mean is normal, we may standardize to produce the standard normal random variable:

Probabilities and Percentiles Using a Sampling Distribution Since we know the sampling distribution of the sample mean is normal when the population is normally distributed, we can use the techniques of Section 6.5 to answer questions about the means of samples taken from normal populations. Example Suppose the quiz scores for a certain instructor are normal (70, 10). Find the probability that a randomly chosen student’s score will be above 80. Find the probability that a sample of 25 quiz scores will have a mean score greater than 80.

Probabilities and Percentiles Using a Sampling Distribution Example Suppose the quiz scores for a certain instructor are normal (70, 10). What two symmetric values contain the middle 90% of all sample means between them? Assume a class size of 25. The middle 90% will fall between the 5th percentile and the 95th percentile. These percentiles correspond to Z = –1.645 and Z = 1.645. 70 – 1.645(2) = 66.71 70 + 1.645(2) = 73.29

7.2: Central Limit Theorem for Means Objectives: Use normal probability plots to assess normality. Describe the sampling distribution of sample means for skewed and symmetric populations as the sample size increases. Apply the Central Limit Theorem for Means to solve probability questions about the sample mean.

Normal Probability Plots Much of our analysis requires that the sample data come from a population that is normally distributed. We can use histograms, dotplots, and stem-and-leaf displays to assess normality. But a more precise tool is the normal probability plot of the estimated cumulative normal probabilities against the corresponding data values. If the points in the normal probability plot either cluster around a straight line or nearly all fall within the curved bounds, then it is likely that the data set is normal. Systematic deviations off the straight line are evidence against the claim that the data set is normal.

Sampling Distribution of x-bar for Skewed Populations The sampling distribution of sample means for a normal population is also normal. What if the population is not normal?

Central Limit Theorem for Means Regardless of the population, the sampling distribution of the sample mean becomes approximately normal as the sample size gets larger. Central Limit Theorem for Means Given a population with mean µ and standard deviation σ, the sampling distribution of the sample mean becomes approximately normal (µ, σ/√n) as the sample size gets larger, regardless of the shape of the population. Rule of Thumb: We consider n ≥ 30 as large enough to apply the Central Limit Theorem for Means for any population.

Central Limit Theorem for Means If the Population is Normal The sampling distribution of sample means is normal. If the Population is Non-Normal or Unknown and the Sample Size is At Least 30 The sampling distribution of the sample mean is approximately normal. If the Population is Non-Normal or Unknown and the Sample Size is Less Than 30 We have insufficient information to conclude that the sampling distribution of the sample mean is either normal or approximately normal.

7.3: Central Limit Theorem for Proportions Objectives: Explain the sampling distribution of the sample proportion. Apply the Central Limit Theorem for Proportions to solve probability questions about the sample proportion.

Sampling Distribution of the Sample Proportion The sample mean is not the only statistic that can have a sampling distribution. Every statistic has a sampling distribution. One of the most important is the sampling distribution of the sample proportion. Suppose each individual in a population either has or does not have a particular characteristic. If we take a sample of size n from the population, the sample proportion (read “p-hat) is: where X represents the number of individuals in the sample that have the particular characteristic. The sampling distribution of the sample proportion for a given sample size n consists of the collection of the sample proportions of all possible samples of size n from the population.

Sampling Distribution of the Sample Proportion The mean of the sampling distribution of the sample proportion is the value of the population proportion p. This may be denoted as The standard deviation of the sampling distribution of the sample proportion is called the standard error of the proportion and is found by where p is the population proportion and n is the sample size. The sampling distribution of the sample proportion may be considered approximately normal only if both np ≥ 5 and n(1 – p) ≥ 5. The minimum sample size required to produce approximate normality is the larger of either n1 = 5/p or n2 = 5/(1 – p).

Sampling Distribution of the Sample Proportion The National Institutes of Health reported that color blindness linked to the X chromosome afflicts 8% of men. Suppose we take a random sample of 100 men and let p denote the proportion of men in the population who have color blindness linked to the X chromosome.

Applying the Central Limit Theorem for Proportions The sampling distribution of the sample proportion follows an approximately normal distribution with mean p and standard deviation when both np ≥ 5 and n(1 – p) ≥ 5. When the sampling distribution of the sample proportion is approximately normal, we can standardize to produce the standard normal Z:

Example The Texas Workforce Commission reported that the state unemployment rate in March 2007 was 4.3%. Let p = 0.043 represent the population proportion of unemployed workers in Texas. Find the probability that a sample of 117 Texas workers will have a proportion unemployed greater than 9%. Since 117(0.043) > 5 and 117(0.957) > 5, we can apply the Central Limit Theorem for Proportions. P(Z > 2.51) = 1 – 0.9940 = 0.0060

Example The Texas Workforce Commission reported that the state unemployment rate in March 2007 was 4.3%. Let p = 0.043 represent the population proportion of unemployed workers in Texas. Find the 99th percentile of sample proportions for n = 117. The Z-value associated with 0.9901 is 2.33.

Chapter 7 Overview 7.1 Introduction to Sampling Distributions 7.2 Central Limit Theorem for Means 7.3 Central Limit Theorem for Proportions