Introduction to Probability and Statistics Thirteenth Edition Chapter 8 Large and Small-Sample Estimation.

Slides:



Advertisements
Similar presentations
Estimation of Means and Proportions
Advertisements

Chapter 6 Confidence Intervals.
Chap 8-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 8 Estimation: Single Population Statistics for Business and Economics.
Statistics and Quantitative Analysis U4320
Note 10 of 5E Statistics with Economics and Business Applications Chapter 7 Estimation of Means and Proportions Large-Sample Estimation.
Copyright ©2006 Brooks/Cole A division of Thomson Learning, Inc. Introduction to Probability and Statistics Twelfth Edition Robert J. Beaver Barbara M.
Chapter 10 Statistical Inference About Means and Proportions With Two Populations Estimation of the Difference between the Means of Two Populations:
1 1 Slide © 2003 South-Western/Thomson Learning™ Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
1 1 Slide STATISTICS FOR BUSINESS AND ECONOMICS Seventh Edition AndersonSweeneyWilliams Slides Prepared by John Loucks © 1999 ITP/South-Western College.
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
1 Chapter 10 Comparisons Involving Means  1 =  2 ? ANOVA Estimation of the Difference between the Means of Two Populations: Independent Samples Hypothesis.
1 1 Slide Slides Prepared by JOHN S. LOUCKS St. Edward’s University © 2002 South-Western /Thomson Learning.
Chapter 10 Comparisons Involving Means Part A Estimation of the Difference between the Means of Two Populations: Independent Samples Hypothesis Tests about.
Chapter 10 Comparisons Involving Means
9-1 Hypothesis Testing Statistical Hypotheses Statistical hypothesis testing and confidence interval estimation of parameters are the fundamental.
Chapter 8 Estimation: Single Population
OMS 201 Review. Range The range of a data set is the difference between the largest and smallest data values. It is the simplest measure of dispersion.
Chapter 7 Estimation: Single Population
Inferences About Process Quality
Chapter 9 Hypothesis Testing.
BCOR 1020 Business Statistics
Note 9 of 5E Statistics with Economics and Business Applications Chapter 7 Estimation of Means and Proportions Point Estimation, Interval Estimation/Confidence.
© 2011 Pearson Education, Inc. Statistics for Business and Economics Chapter 7 Inferences Based on Two Samples: Confidence Intervals & Tests of Hypotheses.
Chapter 8 Large-Sample Estimation
Chapter 9 Title and Outline 1 9 Tests of Hypotheses for a Single Sample 9-1 Hypothesis Testing Statistical Hypotheses Tests of Statistical.
Review of Basic Statistics. Definitions Population - The set of all items of interest in a statistical problem e.g. - Houses in Sacramento Parameter -
Copyright ©2011 Nelson Education Limited Large-Sample Estimation CHAPTER 8.
Introduction to Probability and Statistics Thirteenth Edition
Confidence Intervals Chapter 6. § 6.1 Confidence Intervals for the Mean (Large Samples)
Chapter 6 Confidence Intervals.
Section Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series.
Introduction to Probability and Statistics Chapter 8 Large-Sample Estimation.
1 Chapter 6. Section 6-1 and 6-2. Triola, Elementary Statistics, Eighth Edition. Copyright Addison Wesley Longman M ARIO F. T RIOLA E IGHTH E DITION.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
CS433 Modeling and Simulation Lecture 12 Output Analysis Large-Sample Estimation Theory Dr. Anis Koubâa 10 Jan 2009.
Estimates and Sample Sizes Lecture – 7.4
Chapter 7 Estimates and Sample Sizes
PROBABILITY (6MTCOAE205) Chapter 6 Estimation. Confidence Intervals Contents of this chapter: Confidence Intervals for the Population Mean, μ when Population.
Copyright ©2011 Nelson Education Limited Large-Sample Tests of Hypotheses CHAPTER 9.
CS433 Modeling and Simulation Lecture 16 Output Analysis Large-Sample Estimation Theory Dr. Anis Koubâa 30 May 2009 Al-Imam Mohammad Ibn Saud University.
9-1 Hypothesis Testing Statistical Hypotheses Definition Statistical hypothesis testing and confidence interval estimation of parameters are.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 8-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
Introduction to Probability and Statistics Thirteenth Edition Chapter 9 Large-Sample Tests of Hypotheses.
1 Estimation From Sample Data Chapter 08. Chapter 8 - Learning Objectives Explain the difference between a point and an interval estimate. Construct and.
Copyright ©2011 Nelson Education Limited Large-Sample Estimation CHAPTER 8.
Introduction  Populations are described by their probability distributions and parameters. For quantitative populations, the location and shape are described.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 8 Hypothesis Testing.
Introduction to Statistical Inference Jianan Hui 10/22/2014.
Estimating a Population Mean:  Known
Introduction Suppose that a pharmaceutical company is concerned that the mean potency  of an antibiotic meet the minimum government potency standards.
MTH3003 PJJ SEM II 2014/2015 F2F II 12/4/2015.  ASSIGNMENT :25% Assignment 1 (10%) Assignment 2 (15%)  Mid exam :30% Part A (Objective) Part B (Subjective)
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example: In a recent poll, 70% of 1501 randomly selected adults said they believed.
Copyright ©2006 Brooks/Cole A division of Thomson Learning, Inc. Introduction to Probability and Statistics Twelfth Edition Robert J. Beaver Barbara M.
© Copyright McGraw-Hill 2004
Chapters 6 & 7 Overview Created by Erin Hodgess, Houston, Texas.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Chapter 7 Estimates and Sample Sizes 7-1 Overview 7-2 Estimating a Population Proportion 7-3 Estimating a Population Mean: σ Known 7-4 Estimating a Population.
Chapter 8 Estimation ©. Estimator and Estimate estimator estimate An estimator of a population parameter is a random variable that depends on the sample.
6-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Statistics for Business and Economics 8 th Edition Chapter 7 Estimation: Single Population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Inferences Concerning Means.
Copyright ©2003 Brooks/Cole A division of Thomson Learning, Inc. Sampling Distributions statistics Numerical descriptive measures calculated from the sample.
Chapter 6 Inferences Based on a Single Sample: Estimation with Confidence Intervals Slides for Optional Sections Section 7.5 Finite Population Correction.
Introduction to Probability and Statistics Twelfth Edition
Al-Imam Mohammad Ibn Saud University Large-Sample Estimation Theory
CONCEPTS OF ESTIMATION
Introduction to Probability and Statistics
Chapter 8 Confidence Intervals.
Estimates and Sample Sizes Lecture – 7.4
Presentation transcript:

Introduction to Probability and Statistics Thirteenth Edition Chapter 8 Large and Small-Sample Estimation

Large sample estimation

Population (Size of population = N) Sample number 1 Sample number 2 Sample number 3 Sample number N C n Each sample size = n

 Populations are described by their probability distributions and parameters.  For quantitative populations, the location and shape are described by  and .  For a binomial populations, the location and shape are determined by p.  If the values of parameters are unknown, we make inferences about them using sample information.

Types of Inference Estimation:Estimation: –Estimating or predicting the value of the parameter – – “What is (are) the most likely values of m or p?” Hypothesis Testing:Hypothesis Testing: –Deciding about the value of a parameter based on some preconceived idea. –“Did the sample come from a population with  = 5 or p = 0.2?”

Examples:Examples: –A consumer wants to estimate the average price of similar homes in her city before putting her home on the market. Estimation: Estimation: Estimate , the average home price. Hypothesis test Hypothesis test: Is the new average resistance,   equal to the old average resistance,    –A manufacturer wants to know if a new type of steel is more resistant to high temperatures than an old type was. Types of Inference

Whether you are estimating parameters or testing hypotheses, statistical methods are important because they provide: –Methods for making the inference –A numerical measure of the goodness or reliability of the inference Types of Inference

 An unknown population proportion p  An unknown population mean   ? p?

estimator  An estimator is a rule, usually a formula, that tells you how to calculate the estimate based on the sample.  Point estimation:  Point estimation: A single number is calculated to estimate the parameter.  Interval estimation:  Interval estimation: Two numbers are calculated to create an interval within which the parameter is expected to lie.

A. Point Estimators sampling distribution.  Since an estimator is calculated from sample values, it varies from sample to sample according to its sampling distribution. estimatorunbiased  An estimator is unbiased if the mean of its sampling distribution equals the parameter of interest.  It does not systematically overestimate or underestimate the target parameter. Properties

unbiased smallest spreadvariability  Of all the unbiased estimators, we prefer the estimator whose sampling distribution has the smallest spread or variability. Properties

Measuring the Goodness of an Estimator error of estimation. o The distance between an estimate and the true value of the parameter is the error of estimation. The distance between the bullet and the bull’s-eye. unbiased normal o When the sample sizes are large, our unbiased estimators will have normal distributions. Because of the Central Limit Theorem.

The Margin of Error  Margin of error:  Margin of error: The maximum error of estimation, is the maximum likely difference observed between sample mean x and true population mean µ, calculated as :

Margin of Error is the maximum likely difference observed between sample mean x and true population mean µ. denoted by E µ x + E x - E x -E < µ < x +E lower limit Definition upper limit

Definition Margin of Error µ x + E x - E also called the maximum error of the estimate E = z  /2  n

Estimating Means and Proportions For a quantitative population, For a binomial population, SE

Example A homeowner randomly samples 64 homes similar to her own and finds that the average selling price is $252,000 with a standard deviation of $15,000. Estimate the average selling price for all similar homes in the city.

A quality control technician wants to estimate the proportion of soda cans that are underfilled. He randomly samples 200 cans of soda and finds 10 underfilled cans. Example

A random sample of n = 500 observations from a binomial population produced x = 450 successes. Estimate the binomial proportion p and calculate the 90% margin of error Example

Create an interval (a, b) so that you are fairly sure that the parameter lies between these two values. confidence coefficient, 1- .“Fairly sure” is means “with high probability”, measured using the confidence coefficient, 1- . Usually, 1-  Suppose 1-  =.95 and that the estimator has a normal distribution. Parameter  1.96SE

Since we don’t know the value of the parameter, consider which has a variable center. Only if the estimator falls in the tail areas will the interval fail to enclose the parameter. This happens only 5% of the time. Estimator  1.96 SE Worked Failed

TO CHANGE THE CONFIDENCE LEVEL To change to a general confidence level, 1- , pick a value of z that puts area 1-  in the center of the z distribution. 100(1-  )% Confidence Interval: Estimator  z  SE Tail areaz  /

1. CONFIDENCE INTERVALS FOR MEANS AND PROPORTIONS For a quantitative population, For a binomial population, 1.96

A random sample of n = 50 males showed a mean average daily intake of dairy products equal to 756 grams with a standard deviation of 35 grams. Find a 95% confidence interval for the population average . 1.96

Find a 99% confidence interval for , the population average daily intake of dairy products for men. The interval must be wider to provide for the increased confidence that is does indeed enclose the true value of 

Of a random sample of n = 150 college students, 104 of the students said that they had played on a soccer team during their K-12 years. Estimate the proportion of college students who played soccer in their youth with a 98% confidence interval. 2.33

2. E STIMATING THE D IFFERENCE BETWEEN T WO M EANS  Sometimes we are interested in comparing the means of two populations. The average growth of plants fed using two different nutrients. The average scores for students taught with two different teaching methods.  To make this comparison,

We compare the two averages by making inferences about  1 -  2, the difference in the two population averages. If the two population averages are the same, then  1 -  2 = 0. The best estimate of  1 -  2 is the difference in the two sample means, E STIMATING THE D IFFERENCE BETWEEN T WO M EANS (C ONT ’ D )

T HE S AMPLING D ISTRIBUTION OF Properties of the Sampling Distribution of Expected Value Standard Deviation/Standard Error where:  1 = standard deviation of population 1  2 = standard deviation of population 2 n 1 = sample size from population 1 n 2 = sample size from population 2

I NTERVAL E STIMATE OF  1 -  2 : L ARGE -S AMPLE C ASE ( n 1 > 30 AND n 2 > 30)  Interval Estimate with  1 and  2 Known where: 1 -  is the confidence coefficient  Interval Estimate with  1 and  2 Unknown where: SE

E XAMPLE Compare the average daily intake of dairy products of men and women using a 95% confidence interval. Avg Daily IntakesMenWomen Sample size50 Sample mean Sample Std Dev

Could you conclude, based on this confidence interval, that there is a difference in the average daily intake of dairy products for men and women?  1 -  2 = 0.  1 =  2.The confidence interval contains the value  1 -  2 = 0. Therefore, it is possible that  1 =  2. You would not want to conclude that there is a difference in average daily intake of dairy products for men and women. E XAMPLE ( CONT ’ D )

3. Estimating the Difference between Two Proportions  Sometimes we are interested in comparing the proportion of “successes” in two binomial populations. The germination rates of untreated seeds and seeds treated with a fungicide. The proportion of male and female voters who favor a particular candidate for governor.  To make this comparison,

We compare the two proportions by making inferences about p 1 -p 2, the difference in the two population proportions. If the two population proportions are the same, then p 1 -p 2 = 0. The best estimate of p 1 -p 2 is the difference in the two sample proportions, Estimating the Difference between Two Proportions (cont’d)

The Sampling Distribution of Expected Value/mean Standard Deviation/Standard Error Distribution Form If the sample sizes are large (n 1 p 1, n 1 q 1, n 2 p 2, n 2 q 2 ) are all greater than to 5), the sampling distribution of can be approximated by a normal probability distribution.

Interval Estimate of p 1 - p 2 : Large-Sample Case

Example Compare the proportion of male and female college students who said that they had played on a soccer team during their K-12 years using a 99% confidence interval. Youth SoccerMaleFemale Sample size8070 Played soccer

Could you conclude, based on this confidence interval, that there is a difference in the proportion of male and female college students who said that they had played on a soccer team during their K-12 years? p 1 -p 2 = 0. p 1 = p 2.The confidence interval does not contains the value p 1 -p 2 = 0. Therefore, it is not likely that p 1 = p 2. You would conclude that there is a difference in the proportions for males and females. A higher proportion of males than females played soccer in their youth. Example (cont’d)

two- sided  Confidence intervals are by their nature two- sided since they produce upper and lower bounds for the parameter.  One-sided bounds  One-sided bounds can be constructed simply by using a value of z that puts a rather than  /2 in the tail of the z distribution.

 The total amount of relevant information in a sample is controlled by two factors: sampling planexperimental design  The sampling plan or experimental design: the procedure for collecting the information sample size n  The sample size n: the amount of information you collect. margin of error width of the confidence interval.  In a statistical estimation problem, the accuracy of the estimation is measured by the margin of error or the width of the confidence interval.

1. Determine the size of the margin of error, E, that you are willing to tolerate. 2. Choose the sample size by solving for n or n  n 1  n 2 in the inequality: 1.96 SE  E, where SE is a function of the sample size n. s   Range / For quantitative populations, estimate the population standard deviation using a previously calculated value of s or the range approximation   Range / 4. p .5 4. For binomial populations, use the conservative approach and approximate p using the value p .5.

A producer of PVC pipe wants to survey wholesalers who buy his product in order to estimate the proportion who plan to increase their purchases next year. What sample size is required if he wants his estimate to be within 0.04 of the actual proportion with probability equal to 0.95? He should survey at least 600 wholesalers.

4. Estimating the Variance The sample variance is defined by

Analysis of Sample Variance If s 2 is the variance of a random sample size n from a normal population, a 100(1-  )% confidence interval for  2 is Where and are values with (n-1) degrees of freedom.

Small sample estimation

 Take sample of 15 patrons from our library sample  Mean  Standard deviation  Number of cases 15  Find 95 percent confidence interval  t value, from table, for 14 degrees of freedom, 2.145

Values of t 

Interval Estimate of  1 -  2 : Small -Sample Case ( n 1 < 30 and/or n 2 < 30) Interval Estimate with  2 Known where:

Interval Estimate of  1 -  2 : Small -Sample Case ( n 1 < 30 and/or n 2 < 30)  Interval Estimate with  2 Unknown

Example: Specific Motors Specific Motors of Detroit has developed a new automobile known as the M car. 12 M cars and 8 J cars (from Japan) were road tested to compare miles-per- gallon (mpg) performance. It is assumed that both populations have equal variances. The sample statistics are: Sample #1 Sample #2 M Cars J Cars Sample Size n 1 = 12 cars n 2 = 8 cars Mean = 29.8 mpg = 27.3 mpg Standard Deviation s 1 = 2.56 mpg s 2 = 1.81 mpg

 Point Estimate of the Difference Between Two Population Means  1 = mean miles-per-gallon for the population of M cars  2 = mean miles-per-gallon for the population of J cars Point estimate of  1 -  2 = = = 2.5 mpg. Example: Specific Motors

 95% Confidence Interval Estimate of the Difference Between Two Population Means: Small-Sample Case We will make the following assumptions:  The miles per gallon rating must be normally distributed for both the M car and the J car.  The variance in the miles per gallon rating must be the same for both the M car and the J car.  Using the t distribution with n 1 + n = 18 degrees of freedom, the appropriate t value is t.025 =  We will use a weighted average of the two sample variances as the pooled estimator of  2. Example: Specific Motors

 95% Confidence Interval Estimate of the Difference Between Two Population Means: Small-Sample Case = or.3 to 4.7 miles per gallon. We are 95% confident that the difference between the mean mpg ratings of the two car types is from.3 to 4.7 mpg (with the M car having the higher mpg). Example: Specific Motors

Key Concepts I. Types of Estimators 1. Point estimator: a single number is calculated to estimate the population parameter. Interval estimator 2. Interval estimator: two numbers are calculated to form an interval that contains the parameter. II. Properties of Good Estimators 1. Unbiased: the average value of the estimator equals the parameter to be estimated. 2. Minimum variance: of all the unbiased estimators, the best estimator has a sampling distribution with the smallest standard error. 3. The margin of error measures the maximum distance between the estimator and the true value of the parameter.

Key Concepts III. Large-Sample Point Estimators To estimate one of four population parameters when the sample sizes are large, use the following point estimators with the appropriate margins of error.

Key Concepts IV. Large-Sample Interval Estimators To estimate one of four population parameters when the sample sizes are large, use the following interval estimators.

Key Concepts 1.All values in the interval are possible values for the unknown population parameter. 2.Any values outside the interval are unlikely to be the value of the unknown parameter. 3.To compare two population means or proportions, look for the value 0 in the confidence interval. If 0 is in the interval, it is possible that the two population means or proportions are equal, and you should not declare a difference. If 0 is not in the interval, it is unlikely that the two means or proportions are equal, and you can confidently declare a difference. V. One-Sided Confidence Bounds Use either the upper (  ) or lower (  ) two-sided bound, with the critical value of z changed from z  / 2 to z .