Determination of Sample Size

Slides:



Advertisements
Similar presentations
“Students” t-test.
Advertisements

CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
9.1 confidence interval for the population mean when the population standard deviation is known
Sampling: Final and Initial Sample Size Determination
Chap 8-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 8 Estimation: Single Population Statistics for Business and Economics.
Estimation Procedures Point Estimation Confidence Interval Estimation.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 7-1 Chapter 7 Confidence Interval Estimation Statistics for Managers.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Basic Business Statistics 10 th Edition.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Introduction to Statistics: Chapter 8 Estimation.
Chapter 8 Estimation: Single Population
Chapter 7 Estimation: Single Population
Let sample from N(μ, σ), μ unknown, σ known.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Statistical Intervals Based on a Single Sample.
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Business Statistics, A First Course.
Confidence Intervals for the Mean (σ Unknown) (Small Samples)
© 2010 Pearson Prentice Hall. All rights reserved Chapter Estimating the Value of a Parameter Using Confidence Intervals 9.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 7-1 Chapter 7 Confidence Interval Estimation Statistics for Managers.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Confidence Interval Estimation
Statistical Intervals for a Single Sample
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Basic Business Statistics 11 th Edition.
Confidence Interval Estimation
PROBABILITY (6MTCOAE205) Chapter 6 Estimation. Confidence Intervals Contents of this chapter: Confidence Intervals for the Population Mean, μ when Population.
One Sample Inf-1 If sample came from a normal distribution, t has a t-distribution with n-1 degrees of freedom. 1)Symmetric about 0. 2)Looks like a standard.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 8-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
Chap 7-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 7 Estimating Population Values.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 8-1 Confidence Interval Estimation.
Chap 8-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 8 Estimation: Single Population Statistics for Business and Economics.
Confidence Intervals for the Mean (Small Samples) 1 Larson/Farber 4th ed.
Chap 7-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 7 Estimating Population Values.
Copyright © 1998, Triola, Elementary Statistics Addison Wesley Longman 1 Estimates and Sample Sizes Chapter 6 M A R I O F. T R I O L A Copyright © 1998,
Statistical Inference Making decisions regarding the population base on a sample.
 A Characteristic is a measurable description of an individual such as height, weight or a count meeting a certain requirement.  A Parameter is a numerical.
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Example: In a recent poll, 70% of 1501 randomly selected adults said they believed.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 8-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Statistical Inference Making decisions regarding the population base on a sample.
Point Estimates point estimate A point estimate is a single number determined from a sample that is used to estimate the corresponding population parameter.
Chapter 9: One- and Two-Sample Estimation Problems: 9.1 Introduction: · Suppose we have a population with some unknown parameter(s). Example: Normal( ,
8.1 Estimating µ with large samples Large sample: n > 30 Error of estimate – the magnitude of the difference between the point estimate and the true parameter.
Section 6.2 Confidence Intervals for the Mean (Small Samples) Larson/Farber 4th ed.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Business Statistics: A First Course 5 th Edition.
Statistical Inference Making decisions regarding the population base on a sample.
Statistics for Business and Economics 8 th Edition Chapter 7 Estimation: Single Population Copyright © 2013 Pearson Education, Inc. Publishing as Prentice.
Section 6.2 Confidence Intervals for the Mean (Small Samples) © 2012 Pearson Education, Inc. All rights reserved. 1 of 83.
Probability & Statistics Review I 1. Normal Distribution 2. Sampling Distribution 3. Inference - Confidence Interval.
Chapter 14 Single-Population Estimation. Population Statistics Population Statistics:  , usually unknown Using Sample Statistics to estimate population.
Chapter 8 Confidence Interval Estimation Statistics For Managers 5 th Edition.
Estimation and Confidence Intervals. Point Estimate A single-valued estimate. A single element chosen from a sampling distribution. Conveys little information.
Chapter 6 Confidence Intervals.
Statistical Intervals for a Single Sample
Chapter 6 Confidence Intervals.
Estimating the Value of a Parameter
Chapter 6 Inferences Based on a Single Sample: Estimation with Confidence Intervals Slides for Optional Sections Section 7.5 Finite Population Correction.
Chapter 6 Confidence Intervals.
CONCEPTS OF ESTIMATION
Chapter 7 Estimation: Single Population
Chapter 6 Confidence Intervals.
Confidence Interval Estimation
Chapter 6 Confidence Intervals.
Estimating the Value of a Parameter
Statistical Inference
Confidence Intervals.
Chapter 9: One- and Two-Sample Estimation Problems:
Chapter 8 Estimation: Single Population
Chapter 6 Confidence Intervals.
The z-test for the Mean of a Normal Population
Interval Estimation Download this presentation.
Chapter 7 Estimation: Single Population
STA 291 Summer 2008 Lecture 14 Dustin Lueker.
Presentation transcript:

Determination of Sample Size In almost all research situations the researcher is interested in the question: How large should the sample be?

Answer: Depends on: How accurate you want the answer. Accuracy is specified by: Specifying the magnitude of the error bound Level of confidence

Confidence Intervals for the mean of a Normal Population, m Let and Then t1 to t2 is a (1 – a)100% = P100% confidence interval for m = (1 – a)100% Error Bound for m The accuracy of a confidence interval is specified by setting: The magnitude of B (the Error bound), and The level of confidence (1 – a)100%

Error Bound: If we have specified the level of confidence then the value of za/2 will be known. If we have specified the magnitude of B, it will also be known Solving for n we get: s* is some estimate of s.

Summarizing: The sample size that will estimate m with an Error Bound B and level of confidence P = 1 – a is: where: B is the desired Error Bound za/2 is the a/2 critical value for the standard normal distribution s* is some preliminary estimate of s.

Notes: n increases as B, the desired Error Bound, decreases Larger sample size required for higher level of accuracy n increases as the level of confidence, (1 – a), increases za/2 increases as a/2 becomes closer to zero. Larger sample size required for higher level of confidence n increases as the standard deviation, s, of the population increases. If the population is more variable then a larger sample size required

Summary: The sample size n depends on: Desired level of accuracy Desired level of confidence Variability of the population

Example Suppose that one is interested in estimating the average number of grams of fat (m) in one kilogram of lean beef hamburger : This will be estimated by: randomly selecting one kilogram samples, then Measuring the fat content for each sample. Preliminary estimates of m and s indicate: that m and s are approximately 220 and 40 respectively. I want the study to estimate m with an error bound 5 and a level of confidence to be 95% (i.e. a = 0.05 and za/2 = z0.025 = 1.960)

Solution Hence n = 246 one kilogram samples are required to estimate m within B = 5 gms with a 95% level of confidence.

Confidence Intervals for the mean of a Bernoulli probability, p Let and Then t1 to t2 is a (1 – a)100% = P100% confidence interval for p = (1 – a)100% Error Bound for p The accuracy of a confidence interval is specified by setting: The magnitude of B (the Error bound), and The level of confidence (1 – a)100%

Error Bound: If we have specified the level of confidence then the value of za/2 will be known. If we have specified the magnitude of B, it will also be known Solving for n we get:

Summarizing: The sample size that will estimate p with an Error Bound B and level of confidence P = 1 – a is: where: B is the desired Error Bound za/2 is the a/2 critical value for the standard normal distribution p* is some preliminary estimate of p. If no estimate for p is available use p = 0.50. One can easily check that the maximum sample size required occurs when p = 0.50.

maximum sample size n occurs when p = 0.50.

Example Suppose that I want to conduct a survey and want to estimate p = proportion of voters who favour a downtown location for a casino: I know that the approximate value of p is p* = 0.50. This is also a good choice for p if one has no preliminary estimate of its value. I want the survey to estimate p with an error bound B = 0.01 (1 percentage point) I want the level of confidence to be 95% (i.e. a = 0.05 and za/2 = z0.025 = 1.960 Then

A general method for constructing confidence limits

Definition: A statistic t is called a pivotal statistic (for determining confidence limits for the parameter f) if: The distribution of t is completely known (not dependent on any unknown parameters.) The only unknown parameter that the statistic t depends on is the parameter f (the parameter being estimated.) The statistic t depends on the data x1, …, xn through the sufficient statistics, S1, …, Sq.

Examples of pivotal statistics Estimating m, the mean of a Normal population A pivotal statistic if s is known. Has a known distribution N(0,1). Only depends on the unknown parameter s. Depends on the data through the sufficient statistics

Estimating p, the bernoulli probability A pivotal statistic. Has a known distribution N(0,1). Only depends on the unknown parameter p. Depends on the data through the sufficient statistics

To construct confidence limits using a pivotal statistic Construct a probability statement regarding the pivotal statistic t. This is possible because the distribution of t is completely known. Translate this statement into a confidence statement about the parameter f (the parameter being estimated.)

Estimating m, the mean of a Normal population (s2 known) Pivotal Statistic Starting with after some manipulation we get

Estimating p, a Bernoulli probability Pivotal Statistic Starting with after some manipulation we get

Estimating m, the mean of a Normal population The t distribution Estimating m, the mean of a Normal population (s2 unknown) Let x1, … , xn denote a sample from the normal distribution with mean m and variance s2. Both m and s2 are unknown Recall Also

Recall also that if : then has a t-distribution with n degrees of freedom. Thus since then has a t-distribution with n – 1 degrees of freedom.

Thus we use as the pivotal statistic It satisfies the conditions of a pivotal statistic. has a known distribution, the t-distribution with n -1 df. only depends on the unknown parameter m. depends on the data through the sufficient statistics

Critical Values for the t–distribution with n df Definition The a-upper critical values for the t–distribution with n df is the quantity such that t–distribution with n df

Thus we use as the pivotal statistic to set up confidence limits for m. Starting with

Hence are (1 – a)100% confidence limits for m.

Example Let x1, x2, x3 , x4, x5, x6 denote weight loss from a new diet for n = 6 cases. The Data: The summary statistics:

95% Confidence Intervals (use a = 0.05) 95% Confidence Limits

Confidence Limits for s2 the variance of a Normal population Let x1, … , xn denote a sample from the normal distribution with mean m and variance s2. Both m and s2 are unknown. Recall The statistic U satisfies the conditions for a pivotal statistic for estimating s2.

U has a known distribution, the c2-distribution with n -1 df. only depends on the unknown parameter s2. depends on the data through the sufficient statistics

Critical Values for the c2–distribution with n df Definition The a-upper critical values for the c2–distribution with n df is the quantity such that c2–distribution with n df

Note: and

Confidence limits for s2 and s. thus

hence and

hence is a (1 – a)100 % confidence interval for s2. and is a (1 – a)100 % confidence interval for s.

Example Let x1, x2, x3 , x4, x5, x6 denote weight loss from a new diet for n = 6 cases. The Data: The summary statistics:

(1 – a)100 % confidence interval for s2. Using a = 0.05 Thus 95 % confidence interval for s2 are:

(1 – a)100 % confidence interval for s. Using a = 0.05 Thus 95 % confidence interval for s are: