Sample Size Determination


Sample Size Determination Janice Weinberg, ScD Professor of Biostatistics Boston University School of Public Health

Outline Why does this matter? Scientific and ethical implications Statistical definitions and notation Questions that need to be answered prior to determining sample size Study design issues affecting sample size Some basic sample size formulas

Scientific And Ethical Implications From a scientific perspective: Can’t be sure we’ve made right decision regarding the effect of the intervention However, we want enough subjects enrolled to adequately address study question to feel comfortable that we’ve reached correct conclusion

From an ethical perspective: Too few subjects: Cannot adequately address study question. The time, discomfort and risk to subjects have served no purpose. May conclude no effect of an intervention that is beneficial. Current and future subjects may not benefit from new intervention based on current (inconclusive) study.

Too many subjects: Too many subjects unnecessarily exposed to risk. Should enroll only enough patients to answer study question, to minimize the discomfort and risk subjects may be exposed to.

Definitions and Notation Null hypothesis (H0): No difference between groups H0: p1 = p2 H0: μ1 = μ2 Alternative hypothesis (HA): There is a difference between groups HA: p1 ≠ p2 HA: μ1 ≠ μ2 P-Value: Chance of obtaining observed result or one more extreme when groups are equal (under H0) Test of significance of H0 Based on distribution of a test statistic assuming H0 is true It is NOT the probability that H0 is true

Definitions and Notation Δ: Measure of true population difference; must be estimated. Difference of medical importance: Δ = |p1 - p2| or Δ = |μ1 - μ2| n: Sample size per arm N: Total sample size (N = 2n for 2 groups with equal allocation)

Type I error: Rejecting H0 when H0 is true α: The type I error rate. Maximum p-value considered statistically significant Type II error: Failing to reject H0 when H0 is false β: The type II error rate Power (1 - β): Probability of detecting group effect given the size of the effect (Δ) and the sample size of the trial (N)

Decision Based on the Data vs. Truth

                      Treatments are equal    Treatments differ
Decision              (H0 true)               (HA true)
Do Not Reject H0      O.K.                    Type II error (β)
Reject H0             Type I error (α)        O.K.

The quantities α, β, Δ and N are all interrelated. Holding all other values constant, what happens to the power of the study if Δ increases? Power ↑ α decreases? Power ↓ N increases? Power ↑ variability increases? Power ↓ Note: Typical error rates are α = .05 and β = .1 or .2 (90% or 80% power). Why is α often smaller than β?
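The directions listed above can be checked numerically. The following is a minimal sketch, assuming a two-sided, two-sample comparison of means with equal arms and a known common standard deviation (a normal approximation that is not spelled out on this slide). The function name `approx_power` and the baseline values (10 mg/dl difference, 50 mg/dl standard deviation, 300 subjects per arm) are illustrative assumptions only.

```python
from math import sqrt
from statistics import NormalDist


def approx_power(delta, sigma, n_per_arm, alpha=0.05):
    """Approximate power of a two-sided, two-sample z-test with equal arms.

    Hypothetical helper: assumes a known common standard deviation `sigma`.
    """
    std_normal = NormalDist()
    z_crit = std_normal.inv_cdf(1 - alpha / 2)
    # Standard error of the difference in means with n subjects per arm
    se = sigma * sqrt(2 / n_per_arm)
    # Ignores the negligible chance of rejecting H0 in the wrong direction
    return std_normal.cdf(abs(delta) / se - z_crit)


# Illustrative baseline, then one quantity changed at a time
base = approx_power(delta=10, sigma=50, n_per_arm=300)
larger_delta = approx_power(delta=15, sigma=50, n_per_arm=300)
smaller_alpha = approx_power(delta=10, sigma=50, n_per_arm=300, alpha=0.01)
larger_n = approx_power(delta=10, sigma=50, n_per_arm=600)
more_variability = approx_power(delta=10, sigma=70, n_per_arm=300)

print(f"baseline power:        {base:.3f}")
print(f"larger difference:     {larger_delta:.3f}")
print(f"smaller alpha:         {smaller_alpha:.3f}")
print(f"larger n:              {larger_n:.3f}")
print(f"more variability:      {more_variability:.3f}")
```

Each comparison against the baseline changes a single input, mirroring the "holding all other values constant" condition.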

SAMPLE SIZE: How many subjects are needed to assure a given probability of detecting a statistically significant effect of a given magnitude if one truly exists? POWER: If a limited pool of subjects is available, what is the likelihood of finding a statistically significant effect of a given magnitude if one truly exists?

Before We Can Determine Sample Size We Need To Answer The Following: 1. What is the main purpose of the study? 2. What is the primary outcome measure? Is it a continuous or dichotomous outcome? 3. How will the data be analyzed to detect a group difference? 4. How small a difference is clinically important to detect?

5. How much variability is in our population? 6. What are the desired α and β? 7. What is the sample size allocation ratio? 8. What is the anticipated drop out rate?

Example 1: Does the ingestion of large doses of vitamin A in tablet form prevent breast cancer? Suppose we know from Connecticut tumor-registry data that incidence rate of breast cancer over a 1-year period for women aged 45 – 49 is 150 cases per 100,000 Women randomized to Vitamin A vs. placebo

Example 1 continued Group 1: Control group given placebo pills by mail. Expected to have same disease rate as registry (150 cases per 100,000) Group 2: Intervention group given vitamin A tablets by mail. Expected to have 20% reduction in risk (120 cases per 100,000) Want to compare incidence of breast cancer over 1-year Planned statistical analysis: Chi-square test to compare two proportions from independent samples H0: p1 = p2 vs. HA: p1 ≠ p2

Example 2: Does a special diet help to reduce cholesterol levels? Suppose an investigator wishes to determine sample size to detect a 10 mg/dl difference in cholesterol level in a diet intervention group compared to a control (no diet) group Subjects with baseline total cholesterol of at least 300 mg/dl randomized

Example 2 continued Group 1: A six week diet intervention Group 2: No changes in diet Investigator wants to compare total cholesterol at the end of the six week study Planned statistical analysis: two sample t-test (for independent samples) H0: μ1 = μ2 vs. HA: μ1 ≠ μ2

Some Basic Sample Size Formulas To Compare Two Proportions From Independent Samples: H0: p1 = p2 1. α level 2. β level (1 - power) 3. Expected population proportions (p1, p2)

Some Basic Sample Size Formulas To Compare Two Means From Independent Samples: H0: μ1 = μ2 1. α level 2. β level (1 - power) 3. Expected population difference (Δ = |μ1 - μ2|) 4. Expected population standard deviation (σ1, σ2)

The Standard Normal Distribution N(0,1) refers to standard normal (mean 0 and variance 1)
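The sample size formulas need quantiles of N(0,1). As a small sketch, the critical values for the typical error rates can be obtained from Python's standard library; the variable names are illustrative.

```python
from statistics import NormalDist

# Standard normal: mean 0, variance 1
std_normal = NormalDist(mu=0, sigma=1)

# z_{1-alpha/2} for a two-sided test at alpha = 0.05
z_alpha = std_normal.inv_cdf(1 - 0.05 / 2)

# z_{1-beta} for 80% and 90% power (beta = 0.2 and 0.1)
z_beta_80 = std_normal.inv_cdf(0.80)
z_beta_90 = std_normal.inv_cdf(0.90)

print(round(z_alpha, 2), round(z_beta_80, 2), round(z_beta_90, 2))
# prints: 1.96 0.84 1.28
```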

Dichotomous Outcome (2 Independent Samples) Test H0: p1 = p2 vs. HA: p1 ≠ p2 Assuming two-sided alternative and equal allocation ***Always Round Up To Nearest Integer!

Dichotomous Outcome (2 Independent Samples) where Φ is the cumulative probability from a standard normal distribution
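The power formula for this slide did not survive in the transcript. The following is a minimal sketch, assuming the common normal-approximation form with a pooled variance under H0 and unpooled variances under HA; the function name `power_two_proportions` is hypothetical, and the formula should be checked against the original slide before use.

```python
from math import sqrt
from statistics import NormalDist


def power_two_proportions(p1, p2, n_per_arm, alpha=0.05):
    """Approximate power for a two-sided test of p1 = p2 with equal arms.

    Assumed form (not confirmed by the transcript):
    power = Phi((delta * sqrt(n) - z_{1-alpha/2} * sqrt(2 * pbar * qbar))
                / sqrt(p1 * q1 + p2 * q2))
    """
    std_normal = NormalDist()
    z_crit = std_normal.inv_cdf(1 - alpha / 2)
    p_bar = (p1 + p2) / 2
    delta = abs(p1 - p2)
    sd_null = sqrt(2 * p_bar * (1 - p_bar))          # variability under H0
    sd_alt = sqrt(p1 * (1 - p1) + p2 * (1 - p2))     # variability under HA
    return std_normal.cdf((delta * sqrt(n_per_arm) - z_crit * sd_null) / sd_alt)


# Breast cancer rates from Example 1; the per-arm sizes are illustrative
power_small = power_two_proportions(0.0015, 0.0012, n_per_arm=100_000)
power_large = power_two_proportions(0.0015, 0.0012, n_per_arm=234_882)
print(f"{power_small:.2f} {power_large:.2f}")
```

Under this assumed form, power is read directly off the standard normal cumulative distribution Φ.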

Continuous Outcome (2 Independent Samples) Test H0: μ1 = μ2 vs. HA: μ1 ≠ μ2 Two-sided alternative and equal allocation Assume outcome normally distributed with:

Continuous Outcome (2 Independent Samples) where Φ is the cumulative probability from a standard normal distribution

Example 1: Does ingestion of large doses of vitamin A prevent breast cancer? Test H0: p1 = p2 vs. HA: p1 ≠ p2 Assume 2-sided test with α = 0.05 and 80% power p1 = 150 per 100,000 = .0015 p2 = 120 per 100,000 = .0012 (20% rate reduction) Δ = p1 - p2 = .0003 z1-α/2 = 1.96 z1-β = .84 n per group = 234,882 Too many to recruit in one year!
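The formula behind this figure is missing from the transcript. As a sketch, the pooled-variance normal-approximation formula n = [z1-α/2 √(2·p̄·q̄) + z1-β √(p1·q1 + p2·q2)]² / Δ², with p̄ = (p1 + p2)/2 and q = 1 - p, reproduces the slide's result when the rounded z-values are used, so it is assumed here to be the intended one. The function name is hypothetical.

```python
from math import ceil, sqrt


def n_per_group_two_proportions(p1, p2, z_alpha, z_beta):
    """Per-group n for a two-sided comparison of two proportions (equal arms).

    Assumed formula: pooled variance under H0, unpooled under HA.
    """
    p_bar = (p1 + p2) / 2
    delta = abs(p1 - p2)
    numerator = (
        z_alpha * sqrt(2 * p_bar * (1 - p_bar))
        + z_beta * sqrt(p1 * (1 - p1) + p2 * (1 - p2))
    ) ** 2
    # Always round up to the next integer
    return ceil(numerator / delta**2)


# Example 1 inputs, using the rounded z-values quoted on the slide
n = n_per_group_two_proportions(0.0015, 0.0012, z_alpha=1.96, z_beta=0.84)
print(n)  # prints: 234882
```

Using unrounded quantiles instead of 1.96 and .84 would shift the result slightly.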

Example 2: Does a special diet help to reduce cholesterol levels? Test H0: μ1 = μ2 vs. HA: μ1 ≠ μ2 Assume 2-sided test with α = 0.05 and 90% power Δ = μ1 - μ2 = 10 mg/dl σ1 = σ2 = 50 mg/dl z1-α/2 = 1.96 z1-β = 1.28 n per group = 525 Suppose 10% loss to follow-up expected, adjust n = 525 / 0.9 ≈ 583.3, rounded up to 584 per group
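The arithmetic in Example 2 can be sketched as follows, assuming the usual equal-variance formula n = 2σ²(z1-α/2 + z1-β)² / Δ², which is not shown in the transcript but matches the slide's 525 per group. The function names are hypothetical.

```python
from math import ceil


def n_per_group_two_means(delta, sigma, z_alpha, z_beta):
    """Per-group n for a two-sided comparison of two means (equal arms).

    Assumes a common standard deviation `sigma` in both groups.
    """
    return ceil(2 * sigma**2 * (z_alpha + z_beta) ** 2 / delta**2)


def inflate_for_dropout(n, dropout_rate):
    """Enroll enough subjects that about n remain after the expected loss."""
    return ceil(n / (1 - dropout_rate))


# Example 2 inputs, using the rounded z-values quoted on the slide
n = n_per_group_two_means(delta=10, sigma=50, z_alpha=1.96, z_beta=1.28)
n_enrolled = inflate_for_dropout(n, dropout_rate=0.10)
print(n, n_enrolled)  # prints: 525 584
```

Dividing by (1 - dropout rate), rather than multiplying by (1 + dropout rate), keeps the expected number of completers at or above the required n.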

These two basic formulas address common settings but are often inappropriate Other types of outcomes/study designs require different approaches including: -Survival or time to event outcomes -Cross-over trials -Equivalency trials -Repeated measures designs -Clustered randomization

Sample Size Summary Sample size very sensitive to values of Δ Large N required for high power to detect small differences Consider current knowledge and feasibility Examine a range of values, i.e.: -for several Δ, power find required sample size -for several n, Δ find power Often increase sample size to account for loss to follow-up Note: Only the basics of sample size are covered here. It's always a good idea to consult a statistician
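Examining a range of values is easy to script. The sketch below tabulates the required per-group n for several differences and power levels in the continuous-outcome setting of Example 2. It assumes the equal-variance formula n = 2σ²(z1-α/2 + z1-β)² / Δ² and uses exact normal quantiles, so results may differ by a subject or so from figures computed with rounded z-values. The grid of Δ values is illustrative.

```python
from math import ceil
from statistics import NormalDist


def n_per_group_means(delta, sigma, alpha=0.05, power=0.80):
    """Per-group n for a two-sided, two-sample comparison of means (equal arms)."""
    std_normal = NormalDist()
    z_alpha = std_normal.inv_cdf(1 - alpha / 2)
    z_beta = std_normal.inv_cdf(power)
    return ceil(2 * sigma**2 * (z_alpha + z_beta) ** 2 / delta**2)


SIGMA = 50  # mg/dl, as in Example 2

# Required n for each (difference, power) combination
required_n = {
    (delta, power): n_per_group_means(delta, SIGMA, power=power)
    for delta in (5, 10, 20)
    for power in (0.80, 0.90)
}

for (delta, power), n in required_n.items():
    print(f"delta={delta:>2} mg/dl  power={power:.0%}  n per group={n}")
```

Because n scales with 1/Δ², halving the detectable difference roughly quadruples the required sample size, which is why the choice of Δ dominates the calculation.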