Sample size computations Petter Mostad

Slides:



Advertisements
Similar presentations
Probability models- the Normal especially.
Advertisements

Tests of Hypotheses Based on a Single Sample
“Students” t-test.
Lecture (11,12) Parameter Estimation of PDF and Fitting a Distribution Function.
Sampling: Final and Initial Sample Size Determination
1 1 Slide STATISTICS FOR BUSINESS AND ECONOMICS Seventh Edition AndersonSweeneyWilliams Slides Prepared by John Loucks © 1999 ITP/South-Western College.
Chapter 9 Hypothesis Testing
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 9 Hypothesis Testing Developing Null and Alternative Hypotheses Developing Null and.
REVIEW OF BASICS PART II Probability Distributions Confidence Intervals Statistical Significance.
Chapter Seventeen HYPOTHESIS TESTING
PSY 307 – Statistics for the Behavioral Sciences
Hypothesis Testing Steps of a Statistical Significance Test. 1. Assumptions Type of data, form of population, method of sampling, sample size.
DATA ANALYSIS I MKT525. Plan of analysis What decision must be made? What are research objectives? What do you have to know to reach those objectives?
9-1 Hypothesis Testing Statistical Hypotheses Statistical hypothesis testing and confidence interval estimation of parameters are the fundamental.
Lec 6, Ch.5, pp90-105: Statistics (Objectives) Understand basic principles of statistics through reading these pages, especially… Know well about the normal.
Hypothesis Testing for Population Means and Proportions
4-1 Statistical Inference The field of statistical inference consists of those methods used to make decisions or draw conclusions about a population.
Chapter 3 Hypothesis Testing. Curriculum Object Specified the problem based the form of hypothesis Student can arrange for hypothesis step Analyze a problem.
Sample Size Determination In the Context of Hypothesis Testing
IENG 486 Statistical Quality & Process Control
Inferences About Process Quality
BCOR 1020 Business Statistics Lecture 20 – April 3, 2008.
© 1999 Prentice-Hall, Inc. Chap Chapter Topics Hypothesis Testing Methodology Z Test for the Mean (  Known) p-Value Approach to Hypothesis Testing.
5-3 Inference on the Means of Two Populations, Variances Unknown
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
Hypothesis Testing and T-Tests. Hypothesis Tests Related to Differences Copyright © 2009 Pearson Education, Inc. Chapter Tests of Differences One.
Chapter 9 Title and Outline 1 9 Tests of Hypotheses for a Single Sample 9-1 Hypothesis Testing Statistical Hypotheses Tests of Statistical.
Statistical Inference for Two Samples
AM Recitation 2/10/11.
1 © Lecture note 3 Hypothesis Testing MAKE HYPOTHESIS ©
Fundamentals of Hypothesis Testing: One-Sample Tests
Chapter 9.3 (323) A Test of the Mean of a Normal Distribution: Population Variance Unknown Given a random sample of n observations from a normal population.
More About Significance Tests
Week 8 Fundamentals of Hypothesis Testing: One-Sample Tests
Today’s lesson Confidence intervals for the expected value of a random variable. Determining the sample size needed to have a specified probability of.
1 Power and Sample Size in Testing One Mean. 2 Type I & Type II Error Type I Error: reject the null hypothesis when it is true. The probability of a Type.
1 CSI5388: Functional Elements of Statistics for Machine Learning Part I.
10-1 Introduction 10-2 Inference for a Difference in Means of Two Normal Distributions, Variances Known Figure 10-1 Two independent populations.
9-1 Hypothesis Testing Statistical Hypotheses Definition Statistical hypothesis testing and confidence interval estimation of parameters are.
Learning Objectives In this chapter you will learn about the t-test and its distribution t-test for related samples t-test for independent samples hypothesis.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 9 Inferences Based on Two Samples.
Confidence intervals and hypothesis testing Petter Mostad
4 Hypothesis & Testing. CHAPTER OUTLINE 4-1 STATISTICAL INFERENCE 4-2 POINT ESTIMATION 4-3 HYPOTHESIS TESTING Statistical Hypotheses Testing.
STA Lecture 251 STA 291 Lecture 25 Testing the hypothesis about Population Mean Inference about a Population Mean, or compare two population means.
Lecture 18 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
1 9 Tests of Hypotheses for a Single Sample. © John Wiley & Sons, Inc. Applied Statistics and Probability for Engineers, by Montgomery and Runger. 9-1.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
MeanVariance Sample Population Size n N IME 301. b = is a random value = is probability means For example: IME 301 Also: For example means Then from standard.
Ex St 801 Statistical Methods Inference about a Single Population Mean.
Chapter 9: Testing Hypotheses Overview Research and null hypotheses One and two-tailed tests Type I and II Errors Testing the difference between two means.
Descriptive Statistics Used to describe a data set –Mean, minimum, maximum Usually include information on data variability (error) –Standard deviation.
Various Topics of Interest to the Inquiring Orthopedist Richard Gerkin, MD, MS BGSMC GME Research.
Ch8.2 Ch8.2 Population Mean Test Case I: A Normal Population With Known Null hypothesis: Test statistic value: Alternative Hypothesis Rejection Region.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 3 – Slide 1 of 27 Chapter 11 Section 3 Inference about Two Population Proportions.
Sample Size Needed to Achieve High Confidence (Means)
Hypothesis Tests. An Hypothesis is a guess about a situation that can be tested, and the test outcome can be either true or false. –The Null Hypothesis.
Hypothesis Testing and Statistical Significance
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 7 Inferences Concerning Means.
Introduction For inference on the difference between the means of two populations, we need samples from both populations. The basic assumptions.
More on Inference.
Chapter 4. Inference about Process Quality
STAT 312 Chapter 7 - Statistical Intervals Based on a Single Sample
Hypothesis Tests: One Sample
Introduction to Inference
Chapter 9 Hypothesis Testing.
More on Inference.
9 Tests of Hypotheses for a Single Sample CHAPTER OUTLINE
Introduction to Inference
STA 291 Spring 2008 Lecture 21 Dustin Lueker.
Presentation transcript:

Sample size computations Petter Mostad

Choice of sample size In practice, one of the most important decisions to make in an experiment. The goal may be to be reasonably sure that you will “detect an effect” if there is an effect. Usually, an initial guess of the size of the effect is need to estimate necessary sample size. Your goal may be to detect effects of “relevant size”. A related goal: To estimate sample size necessary for a sufficiently small confidence interval.

The power of a test The power of a test is the probability of rejecting the null hypothesis, when the null hypothesis is wrong. Often denoted 1-β H0 is not rejected H0 is rejected H0 is true H0 is not true Given that: Probability that: α (Type I error) β (Type II error)

The power depends on the alternative hypothesis The power can only be computed when the alternative hypothesis is formulated so that we can compute the distribution of the test statistic. Thus, the power can be a function for example of the size of the effect that we are trying to detect.

Example: Sample size to detect an effect on a continuous variable Assumptions: –We compare two samples of equal size –Each sample is from a normal distribution with variance σ 2 (common for both distributions) –The actual difference between the means is d –We use a two-sided t-test to detect this difference Then the sample size n needed for power p (probability p to reject the null hypothesis that there is no difference between the means of the groups) is approximately (for the value of k, see next overhead)

Table for k Power Significance level

Example You are comparing a new production process with an old, to find if the new has a better yield. The standard deviation of the yield for such processes is 5 (you know from other data). You want to detect the yield difference if it is at least 5. How many repetitions do you need?

Why is this so? The actual distribution of the difference in means is The actual distribution of the test statistic is, approximately In the t-test, this is compared to, approximately, N(0,1) k, for power 1-β and significance level α, is defined such that if x~N(k 1/2,1), then Our equation can then be derived from

Sample size to detect an effect on a proportion Assume we compare the frequencies P 1 and P 2 of “successes” in two groups, of size n 1 and n 2 If we want to test the hypothesis that the population frequencies p 1 and p 2 in the two groups are equal, we can use the test statistic where P 3 is the average frequency of the two groups, and compare it with a standard normal distribution. The sample size needed for power p when testing two groups of size n is approximately

Sample size to limit the confidence interval for an effect The confidence interval for an effect eff is approximately on the form Thus the length of the confidence interval is approximately Thus the n that will give a confidence interval of length d is:

Sample size to limit the confidence interval for a proportion For a proportion, the above gives the formula The factor p(1-p) is always smaller than 0.25! Thus, replacing it with 0.25 gives an upper limit for the necessary sample size.

Computations in more complex situations For tests similar to those above, we can derive similar formulas In general, if we specify –the experiment (including sample size) –the exact alternative hypothesis –the test procedure we can always estimate the power of the test. Then we can work backwards to get a sample size.