Sampling The sampling errors are: for sample mean

Slides:



Advertisements
Similar presentations
Chapter 6 Confidence Intervals.
Advertisements

University of Minnesota-Duluth, Econ-2030 (Dr. Tadesse) Inferential Statistics.
Sampling: Final and Initial Sample Size Determination
Sampling Distributions (§ )
Sampling and Sampling Distribution
1 1 Slide 2009 University of Minnesota-Duluth, Econ-2030(Dr. Tadesse) Chapter 7, Part B Sampling and Sampling Distributions Other Sampling Methods Other.
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Sampling and Sampling Distributions: Part 2 Sample size and the sampling distribution of Sampling distribution of Sampling methods.
Chapter 8 Estimation: Single Population
Sampling and Sampling Distributions Simple Random Sampling Point Estimation Sampling Distribution.
Chapter 7 Estimation: Single Population
1 1 Slide © 2015 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
QMS 6351 Statistics and Research Methods Chapter 7 Sampling and Sampling Distributions Prof. Vera Adamchik.
Chapter 7 Sampling and Sampling Distributions Sampling Distribution of Sampling Distribution of Introduction to Sampling Distributions Introduction to.
DATA Exploration: Statistics (One Variable) 1.Basic EXCELL/MATLAB functions for data exploration 2.Measures of central tendency, Distributions 1.Mean 2.Median.
Chapter 7 Sampling and Sampling Distributions n Simple Random Sampling n Point Estimation n Introduction to Sampling Distributions n Sampling Distribution.
1 1 Slide © 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 1 Slide © 2005 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
Econ 3790: Business and Economics Statistics Instructor: Yogesh Uppal
Confidence Intervals Chapter 6. § 6.1 Confidence Intervals for the Mean (Large Samples)
Chapter 6 Confidence Intervals.
Chapter 6 Confidence Intervals 1 Larson/Farber 4th ed.
1 1 Slide Slides Prepared by JOHN S. LOUCKS St. Edward’s University © 2002 South-Western/Thomson Learning.
1 1 Slide © 2003 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2004 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide © 2001 South-Western/Thomson Learning  Anderson  Sweeney  Williams Anderson  Sweeney  Williams  Slides Prepared by JOHN LOUCKS  CONTEMPORARYBUSINESSSTATISTICS.
Chapter 7 Sampling and Sampling Distributions Sampling Distribution of Sampling Distribution of Introduction to Sampling Distributions Introduction to.
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 1 Slide © 2009 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 1 Slide © 2005 Thomson/South-Western Chapter 7, Part A Sampling and Sampling Distributions Sampling Distribution of Sampling Distribution of Introduction.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 1 Slide Chapter 7 (b) – Point Estimation and Sampling Distributions Point estimation is a form of statistical inference. Point estimation is a form of.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
6 Chapter Confidence Intervals © 2012 Pearson Education, Inc.
Confidence Intervals Chapter 6. § 6.1 Confidence Intervals for the Mean (Large Samples)
Confidence Intervals 1 Chapter 6. Chapter Outline Confidence Intervals for the Mean (Large Samples) 6.2 Confidence Intervals for the Mean (Small.
PROBABILITY (6MTCOAE205) Chapter 6 Estimation. Confidence Intervals Contents of this chapter: Confidence Intervals for the Population Mean, μ when Population.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved OPIM 303-Lecture #5 Jose M. Cruz Assistant Professor.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved Chapter 7 Sampling and Sampling Distributions Sampling Distribution of Sampling Distribution.
1 1 Slide Sampling and Sampling Distributions Sampling Distribution of Sampling Distribution of Introduction to Sampling Distributions Introduction to.
1 Chapter 7 Sampling and Sampling Distributions Simple Random Sampling Point Estimation Introduction to Sampling Distributions Sampling Distribution of.
Econ 3790: Business and Economics Statistics Instructor: Yogesh Uppal
Normal Probability Distributions Chapter 5. § 5.1 Introduction to Normal Distributions and the Standard Distribution.
Chapter 7 Sampling and Sampling Distributions ©. Simple Random Sample simple random sample Suppose that we want to select a sample of n objects from a.
1 Chapter 7 Sampling Distributions. 2 Chapter Outline  Selecting A Sample  Point Estimation  Introduction to Sampling Distributions  Sampling Distribution.
1 1 Slide STATISTICS FOR BUSINESS AND ECONOMICS Seventh Edition AndersonSweeneyWilliams Slides Prepared by John Loucks © 1999 ITP/South-Western College.
Confidence Interval Estimation For statistical inference in decision making: Chapter 9.
1 Chapter 8 Interval Estimation. 2 Chapter Outline  Population Mean: Known  Population Mean: Unknown  Population Proportion.
Copyright © 2015, 2012, and 2009 Pearson Education, Inc. 1 Chapter Confidence Intervals 6.
Statistics for Business and Economics 8 th Edition Chapter 7 Estimation: Single Population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice.
Fundamentals of Business Statistics chapter7 Sampling and Sampling Distributions 上海金融学院统计系 Statistics Dept., Shanghai Finance University.
Sampling Distributions
Chapter 7 (b) – Point Estimation and Sampling Distributions
St. Edward’s University
Sampling and Sampling Distribution
Chapter 7 Sampling Distributions.
Chapter 5, Part A [SBE 7/e, Ch. 7] Sampling and Sampling Distributions
Slides by JOHN LOUCKS St. Edward’s University.
Chapter 7 Sampling Distributions.
Econ 3790: Business and Economics Statistics
Chapter 6 Confidence Intervals.
Chapter 7 Sampling Distributions.
Chapter 7 Sampling Distributions.
Chapter 7 Sampling and Sampling Distributions
Chapter 6 Confidence Intervals.
Chapter 7 Sampling Distributions.
Presentation transcript:

Sampling The sampling errors are: for sample mean for sample standard deviation for sample proportion

Sampling Example: St. Andrew’s St. Andrew’s College receives 900 applications annually from prospective students. The application form contains a variety of information including the individual’s scholastic aptitude test (SAT) score and whether or not the individual desires on-campus housing. The director of admissions would like to know the following information: Applicants’ average SAT score over the past 10 years the proportion of applicants who live on campus.

Sampling Example: St. Andrew’s We will now look at two alternatives for obtaining the desired information. Conducting a census of all applicants over the last ten years (N = 9000) allows us to compute population parameters. Selecting a sample of 30 from the 9000 current applicants allows us to compute the sample statistics. If the relevant data for the entire 9000 applicants were in the college’s database, the population parameters of interest could be calculated using the formulas presented in Chapter 3.

Wants on-campus housing Conducting a Census Applicant Number SAT score Wants on-campus housing Sqrd. dev. from SAT mean 1 1004 Yes 112 2 942 2643 3 890 10694 4 1032 no 1489 5 857 18608 6 1015 466 7 1063 4843 8999 1090 9329 9000 1094 10118 Total 8,940,700 6,480 57,642,979

Conducting a Census Population Mean SAT Score Population Proportion Wanting On-Campus Housing Population Standard Deviation for SAT Score

Wants on-campus housing Conducting a Census m = 993 Applicant Number SAT score Wants on-campus housing Sqrd. dev. from SAT mean 1 1004 Yes 121 2 942 2601 3 890 10609 4 1032 no 1521 5 857 18496 6 1015 484 7 1063 4900 8999 1090 9409 9000 1094 10201 Total 8,940,700 6,480 57,642,979

Conducting a Census Population Mean SAT Score Population Proportion Wanting On-Campus Housing Population Standard Deviation for SAT Score data_sat_pop.xls

Simple Random Sampling Suppose the data is stored in boxes off campus. The Director of Admissions needs estimates of the population parameters for a meeting taking place in an hour. She decides a sample of 30 applicants will be used. The number of random samples (without replacement) of size 30 that can be drawn from a population of size 9000 is huge. For just this year, it is

Simple Random Sampling Taking a Sample of 30 Applicants Step 1: Assign a random number to each of the 9000 current applicants. Excel’s RAND function generates random numbers between 0 and 1 Step 2: Select the 30 applicants corresponding to the 30 smallest random numbers.

Sort rows by the random numbers Simple Random Sampling Sort rows by the random numbers Applicant Number random 1 .987 2 .567 3 .867 4 .124 5 .345 6 .103 7 .698 8999 .432 9000 .211

smallest random numbers. Wants on-campus housing Simple Random Sampling 30 applicant numbers with smallest random numbers. Applicant Number random SAT score Wants on-campus housing 675 .001 985 Yes 34 1002 768 .002 913 1823 .003 987 No 8897 .008 1123 7837 .009 989 231 912 701 .012 5065 .015 998 no Total 30,299 20

Simple Random Sampling Sample Mean SAT Score Sample Proportion Wanting On-Campus Housing Sample Standard Deviation for SAT Score

Wants on-campus housing Simple Random Sampling x = 1009.97 Applicant Number SAT score Wants on-campus housing Sqrd. dev. from SAT mean 675 985 Yes 623.5 34 1002 63.52 768 913 9403.18 1823 987 no 527.62 8897 1123 12,775.78 7837 989 439.74 231 912 9598.12 701 5065 998 143.28 Total 30,299 20 211,746.97

Simple Random Sampling Sample Mean SAT Score Sample Proportion Wanting On-Campus Housing Sample Standard Deviation for SAT Score Note: Different random numbers would have identified a different sample which would have resulted in different point estimates. data_sampling.xls

Sampling Distribution of The sampling distribution of is the probability distribution of all possible values of the sample mean. Expected Value of E( ) =  where  = the population mean Standard Deviation is also known as the standard error of the mean Standard Deviation of from an infinite population is

Sampling Distribution of Under repeated sampling using random samples of size n, the sample means are normally distributed with mean m and variance s 2/n when either The data is heavily skewed, n > 50, and s is known. OR The data is symmetric, n > 30, and s is known. OR The data is normally distributed and s is known.

Sampling Distribution of

Sampling Distribution of What is the probability that a simple random sample of 30 applicants will provide an estimate of the population mean SAT score that is within 10 points of the actual population mean  ? In other words, what is the probability that will be between 983 and 1003? Step 1: Calculate the z-value at the upper endpoint of the interval. z = (1003 - 993)/14.6 = .68

Sampling Distribution of Step 2: Find the area under the curve to the left of the upper endpoint. z = .6 8 P(z < .68) = .7517 P(x < 1003) = .7517

Sampling Distribution of Area = .2483 Area = .7517 993 1003

Sampling Distribution of Step 3: Calculate the z-value at the lower endpoint of the interval. z = (983 - 993)/14.6 = - .68 Step 4: Find the area under the curve to the left of the lower endpoint. P(z < -.68) = .2483 P(x < 983) = .2483

Sampling Distribution of Step 5: Calculate the area under the curve between the lower and upper endpoints of the interval. P(983 < < 1003) = .5034 The probability that the sample mean SAT score will be between 983 and 1003 is 0.5034 With n = 30, .5034 .2483 .2483 983 993 1003

Sampling Distribution of If the simple had included 100 applicants instead of 30, , but the standard error falls. E( ) remains equal to 993 The standard error was 14.6 when n = 30 The probability that the sample mean SAT score is between 983 and 1003 equals 0.5034 when n = 30, but equals 0.7888 with n = 100. This is because the sampling distribution with n = 100 has a smaller standard error So, the values of the sample mean have less variability and tend to hover closer to the population mean when n = 30. With n = 30, .5034 .2483 .2483 983 993 1003

Sampling Distribution of If the simple had included 100 applicants instead of 30, E( ) remains equal to 993 , but the standard error falls. The probability that the sample mean SAT score is between 983 and 1003 equals 0.5034 when n = 30, but equals 0.7888 with n = 100. This is because the sampling distribution with n = 100 has a smaller standard error So, the values of the sample mean have less variability and tend to hover closer to the population mean when n = 30. With n = 100, .7888 With n = 30, .5034 .2483 .2483 983 993 1003

Sampling Distribution of The Expected value of Standard deviation of from an infinite population is 𝜎 𝑝 = 𝜎 𝐷 𝑛 sD = standard deviation of D Even though the sample proportion wanting on-campus housing is 0.667, we expected it to be equal to the population proportion of p = .72 Most of the time, the sample proportion will be slightly above or slightly below the population proportion. Rarely will the sample proportion be way above or way below the population proportion. The sampling distribution of is approximately normal when np > 5 and n(1 – p) > 5

Sampling Distribution of The sample proportion can be computed in the same way as the sample mean when a dummy variable is coded from a nominal scaled binomial variable. Vote for Obama D Yes 1  No  Yes

We should have divided by n – 1 because the data came from a sample. Sampling Distribution of The sampling distribution of is the probability distribution of all possible values of the sample proportion. We should have divided by n – 1 because the data came from a sample. Since there are six 1s and four 0s In most cases involving sample proportions, n is very large. Hence, dividing by n or n – 1 yields roughly the same value 𝜎 𝑝 = 𝜎 𝐷 𝑛

Sampling Distribution of Example: St. Andrew’s College Recall that 72% of the prospective students applying to St. Andrew’s College desire on-campus housing. What is the probability that a simple random sample of 30 applicants will provide an estimate of the population proportion of applicants desiring on-campus housing that is within .05 of the actual population proportion? P(0.67 < < 0.77) = ? Step 1: Convert the upper endpoint of the interval to z. z1 = (.77 - .72)/.082 = .61

Sampling Distribution of For this example, with n = 30 and p = .72, the normal distribution is an acceptable approximation because: np = 30(.72) = 21.6 > 5 and n(1 - p) = 30(.28) = 8.4 > 5 ? .67 .72 .77

Sampling Distribution of Step 2: Find the area under the curve to the right of the upper endpoint. z1 = .6 1 P(z1 < .61) = .7291 P(p < .77) = .7291

Sampling Distribution of Area = .2709 Area = .7291 .72 .77

Sampling Distribution of Step 3: Calculate the z-value of the lower endpoint of the interval. z0 = (.67 - .72)/.082 = -.61 Step 4: Find the area under the curve to the left of the lower endpoint. P(z0 < -.61) = .2709 P(p < .67) = .2709

Sampling Distribution of Step 5: Calculate the area under the curve between the lower and upper endpoints of the interval. Area = .2709 Area = .2709 .4582 The probability that the sample proportion of applicants wanting on-campus housing within .05 of the actual population proportion is .4582 .67 .72 .77

Simple Random Sampling Population Parameter Parameter Value Point Estimator Point Estimate m = Population mean SAT score = Sample mean SAT score 993 1009.97 s = Population std. deviation for SAT score 80 s = Sample std. deviation for SAT score 85.45 The red values vary from sample to sample. Hence, the sampling distribution of the mean. p = Population pro- portion wanting campus housing .72 = Sample pro- portion wanting campus housing .667 data_sampling_dist.xls