Chapter 9 Sampling Distributions Sir Naseer Shahzada.

Slides:



Advertisements
Similar presentations
Sampling Distributions
Advertisements

Copyright © 2005 Brooks/Cole, a division of Thomson Learning, Inc. 9.1 Chapter 9 Sampling Distributions.
Sampling Distributions Martina Litschmannová K210.
Chapter 8: Binomial and Geometric Distributions
CHAPTER 13: Binomial Distributions
THE CENTRAL LIMIT THEOREM The “World is Normal” Theorem.
Terminology A statistic is a number calculated from a sample of data. For each different sample, the value of the statistic is a uniquely determined number.
Ch 9 實習.
Copyright © 2009 Cengage Learning 9.1 Chapter 9 Sampling Distributions.
McGraw-Hill Ryerson Copyright © 2011 McGraw-Hill Ryerson Limited. Adapted by Peter Au, George Brown College.
Sampling Distributions
Sampling Distributions
2007 會計資訊系統計學 ( 一 ) 上課投影片 9-1 Sampling Distributions Chapter 9.
1 Sampling Distributions Chapter Introduction  In real life calculating parameters of populations is prohibitive because populations are very.
Chapter 7 Sampling and Sampling Distributions
QBM117 Business Statistics Statistical Inference Sampling Distribution of the Sample Mean 1.
Sampling Distributions
Part III: Inference Topic 6 Sampling and Sampling Distributions
Introduction to Probability and Statistics Chapter 7 Sampling Distributions.
AP Statistics Section 10.2 A CI for Population Mean When is Unknown.
Normal and Sampling Distributions A normal distribution is uniquely determined by its mean, , and variance,  2 The random variable Z = (X-  /  is.
Copyright © 2012 Pearson Education. All rights reserved Copyright © 2012 Pearson Education. All rights reserved. Chapter 10 Sampling Distributions.
SAMPLING DISTRIBUTION
Chapter 5 Sampling Distributions
McGraw-Hill/IrwinCopyright © 2009 by The McGraw-Hill Companies, Inc. All Rights Reserved. Chapter 7 Sampling Distributions.
AP Statistics Chapter 9 Notes.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Review and Preview This chapter combines the methods of descriptive statistics presented in.
Copyright © 2009 Cengage Learning Chapter 10 Introduction to Estimation ( 추 정 )
Copyright ©2011 Nelson Education Limited The Normal Probability Distribution CHAPTER 6.
Estimating Means and Proportions Using Sample Means and Proportions To Make Inferences About Population Parameters.
Chapter 7 Sampling Distributions
Chapter 10 Introduction to Estimation Sir Naseer Shahzada.
Copyright © 2010 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin Chapter 7 Sampling Distributions.
Sampling Distribution and the Central Limit Theorem.
Biostatistics Unit 5 – Samples. Sampling distributions Sampling distributions are important in the understanding of statistical inference. Probability.
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
Sampling Error SAMPLING ERROR-SINGLE MEAN The difference between a value (a statistic) computed from a sample and the corresponding value (a parameter)
Chapter 18: Sampling Distribution Models
Lecture 8: More on the Binomial Distribution and Sampling Distributions June 1, 2004 STAT 111 Introductory Statistics.
Chapter 7: Sampling Distributions Section 7.1 How Likely Are the Possible Values of a Statistic? The Sampling Distribution.
Sampling Distribution WELCOME to INFERENTIAL STATISTICS.
Copyright © 2009 Cengage Learning 9.1 Chapter 9 Sampling Distributions ( 표본분포 )‏
Inference: Probabilities and Distributions Feb , 2012.
SESSION 39 & 40 Last Update 11 th May 2011 Continuous Probability Distributions.
1 Sampling Distribution of the Mean. 2 The Sampling Distribution of the Sample Mean Let’s start at a familiar point: the sample mean, which we know as.
Sampling Distributions Sampling Distributions. Sampling Distribution Introduction In real life calculating parameters of populations is prohibitive because.
SAMPLING DISTRIBUTION. 2 Introduction In real life calculating parameters of populations is usually impossible because populations are very large. Rather.
A statistic from a random sample or randomized experiment is a random variable. The probability distribution of this random variable is called its sampling.
Sampling Distribution Models and the Central Limit Theorem Transition from Data Analysis and Probability to Statistics.
Sampling Distributions
Keller: Stats for Mgmt & Econ, 7th Ed Sampling Distributions
And distribution of sample means
Sampling Distributions
Keller: Stats for Mgmt & Econ, 7th Ed Sampling Distributions
Ch 9 實習.
Keller: Stats for Mgmt & Econ, 7th Ed Sampling Distributions
Virtual University of Pakistan
CHAPTER 6 Random Variables
Introduction to Sampling Distributions
Keller: Stats for Mgmt & Econ, 7th Ed Sampling Distributions
Chapter 7 ENGR 201: Statistics for Engineers
MATH 2311 Section 4.4.
Keller: Stats for Mgmt & Econ, 7th Ed Sampling Distributions
Introduction to Estimation
Continuous Probability Distributions
SAMPLING DISTRIBUTION
12/12/ A Binomial Random Variables.
2007 會計資訊系統計學 ( 一 ) 上課投影片 9-1 Sampling Distributions Chapter 9.
Ch 9 實習.
MATH 2311 Section 4.4.
Presentation transcript:

Chapter 9 Sampling Distributions Sir Naseer Shahzada

Sampling Distributions… A sampling distribution is created by, as the name suggests, sampling. The method we will employ on the rules of probability and the laws of expected value and variance to derive the sampling distribution. For example, consider the roll of one and two dice…

x P(x)1/6 Sampling Distribution of the Mean… A fair die is thrown infinitely many times, with the random variable X = # of spots on any throw. The probability distribution of X is: …and the mean and variance are calculated as well:

Sampling Distribution of Two Dice A sampling distribution is created by looking at all samples of size n=2 (i.e. two dice) and their means… While there are 36 possible samples of size 2, there are only 11 values for, and some (e.g. =3.5) occur more frequently than others (e.g. =1).

Sampling Distribution of Two Dice… The sampling distribution of is shown below: P( ) 6/36 5/36 4/36 3/36 2/36 1/36 P( )

Compare… Compare the distribution of X… …with the sampling distribution of. As well, note that:

Generalize… We can generalize the mean and variance of the sampling of two dice: …to n-dice: The standard deviation of the sampling distribution is called the standard error:

Central Limit Theorem… The sampling distribution of the mean of a random sample drawn from any population is approximately normal for a sufficiently large sample size. The larger the sample size, the more closely the sampling distribution of X will resemble a normal distribution.

Central Limit Theorem… If the population is normal, then X is normally distributed for all values of n. If the population is non-normal, then X is approximately normal only for larger values of n. In many practical situations, a sample size of 30 may be sufficiently large to allow us to use the normal distribution as an approximation for the sampling distribution of X.

Sampling Distribution of the Sample Mean If X is normal, X is normal. If X is nonnormal, X is approximately normal for sufficiently large sample sizes. Note: the definition of “sufficiently large” depends on the extent of nonnormality of x (e.g. heavily skewed; multimodal)

Finite Populations… Statisticians have shown that the mean of the sampling distribution is always equal to the mean of the population and that the standard error is equal to for infinitely large populations. However, if the population is finite the standard error is where N is the population size and is called the finite population correction factor. If the population size is large relative to the sample size the finite population correction factor is close to 1 and can be ignored.

Finite Populations… As a rule of thumb we will treat any population that is at least 20 times larger than the sample size as large. In practice most applications involve populations that qualify as large. As a consequence the finite population correction factor is usually omitted. There are several applications that deal with small populations. Section 12.5 introduces one of these applications.

Example 9.1(a)… The foreman of a bottling plant has observed that the amount of soda in each “32-ounce” bottle is actually a normally distributed random variable, with a mean of 32.2 ounces and a standard deviation of.3 ounce. If a customer buys one bottle, what is the probability that the bottle will contain more than 32 ounces?

Example 9.1(a)… We want to find P(X > 32), where X is normally distributed and =32.2 and =.3 “there is about a 75% chance that a single bottle of soda contains more than 32oz.”

Example 9.1(b)… The foreman of a bottling plant has observed that the amount of soda in each “32-ounce” bottle is actually a normally distributed random variable, with a mean of 32.2 ounces and a standard deviation of.3 ounce. If a customer buys a carton of four bottles, what is the probability that the mean amount of the four bottles will be greater than 32 ounces?

Example 9.1(b)… We want to find P(X > 32), where X is normally distributed with =32.2 and =.3 Things we know: 1)X is normally distributed, therefore so will X. 2) = 32.2 oz. 3)

Example 9.1(b)… If a customer buys a carton of four bottles, what is the probability that the mean amount of the four bottles will be greater than 32 ounces? “There is about a 91% chance the mean of the four bottles will exceed 32oz.”

Graphically Speaking… what is the probability that one bottle will contain more than 32 ounces? what is the probability that the mean of four bottles will exceed 32 oz? mean=32.2

Chapter-Opening Example… The dean of the School of Business claims that the average salary of the school’s graduates one year after graduation is $800 per week with a standard deviation of $100. A second-year student would like to check whether the claim about the mean is correct. He does a survey of 25 people who graduated one year ago and determines their weekly salary. He discovers the sample mean to be $750. To interpret his finding he needs to calculate the probability that a sample of 25 graduates would have a mean of $750 or less when the population mean is $800 and the standard deviation is $100.

Chapter-Opening Example… We want to compute Although X is likely skewed it is likely that is normally distributed. The mean of is The standard deviation is

Chapter-Opening Example… The probability of observing a sample mean as low as $750 when the population mean is $800 is extremely small. Because the event is quite unlikely, we would conclude that the dean’s claim is not justified.

Standardizing the Sample Mean… The sampling distribution can be used to make inferences about population parameters. In order to do so, the sample mean can be standardized to the standard normal distribution using the following formulation:

Another Way to State the Probability… In Chapter 8 we saw that P(-1.96 < Z < 1.96) =.95 From the sampling distribution of the mean we have Substituting this definition of Z in the probability statement we produce

Another Way to State the Probability… With a little algebra we rewrite the probability statement as Similarly In general All are probability statements about, which we’ll use in statistical inference

Return to the Chapter-Opening Example… Substituting = 800, = 100, n = 25, and =.05, we get This is another way of checking the dean’s claim. The probability that falls between and is 95%. It is unlikely that we would observe a sample mean as low as $750 when the population mean is $800.

Sampling Distribution of a Proportion… The estimator of a population proportion of successes is the sample proportion. That is, we count the number of successes in a sample and compute: (read this as “p-hat”). X is the number of successes, n is the sample size.

Normal Approximation to Binomial… Binomial distribution with n=20 and p=.5 with a normal approximation superimposed ( =10 and =2.24)

Normal Approximation to Binomial… Binomial distribution with n=20 and p=.5 with a normal approximation superimposed ( =10 and =2.24) where did these values come from?! From §7.6 we saw that: Hence: and

Normal Approximation to Binomial… Normal approximation to the binomial works best when the number of experiments, n, (sample size) is large, and the probability of success, p, is close to 0.5 For the approximation to provide good results two conditions should be met: 1) np ≥ 5 2) n(1–p) ≥ 5

Normal Approximation to Binomial… To calculate P(X=10) using the normal distribution, we can find the area under the normal curve between 9.5 & 10.5 P(X = 10) ≈ P(9.5 < Y < 10.5) where Y is a normal random variable approximating the binomial random variable X

Normal Approximation to Binomial… P(X = 10) ≈ P(9.5 < Y < 10.5) where Y is a normal random variable approximating the binomial random variable X In fact: P(X = 10) =.176 while P(9.5 < Y < 10.5) =.1742 the approximation is quite good.

Sampling Distribution of a Sample Proportion… Using the laws of expected value and variance, we can determine the mean, variance, and standard deviation of. (The standard deviation of is called the standard error of the proportion.) Sample proportions can be standardized to a standard normal distribution using this formulation:

Sampling Distribution: Difference of two means The final sampling distribution introduced is that of the difference between two sample means. This requires:  independent random samples be drawn from each of two normal populations If this condition is met, then the sampling distribution of the difference between the two sample means, i.e. will be normally distributed. (note: if the two populations are not both normally distributed, but the sample sizes are “large” (>30), the distribution of is approximately normal)

Sampling Distribution: Difference of two means The expected value and variance of the sampling distribution of are given by: mean: standard deviation: (also called the standard error if the difference between two means)

Example 9.3… Since the distribution of is normal and has a mean of and a standard deviation of We can compute Z (standard normal random variable) in this way:

Example 9.3… Starting salaries for MBA grads at two universities are normally distributed with the following means and standard deviations. Samples from each school are taken… University 1University 2 Mean62,000 $/yr60,000 $/yr Std. Dev.14,500 $/yr18,300 $/yr sample size n 5060 What is the probability that the sample mean starting salary of University #1 graduates will exceed that of the #2 grads?

Example 9.3… “What is the probability that the sample mean starting salary of University #1 graduates will exceed that of the #2 grads?” We are interested in determining P(X 1 > X 2 ). Converting this to a difference of means, what is: P(X 1 – X 2 > 0) ? “there is about a 74% chance that the sample mean starting salary of U. #1 grads will exceed that of U. #2” Z