Sample Size Determination

Slides:



Advertisements
Similar presentations
Study Size Planning for Observational Comparative Effectiveness Research Prepared for: Agency for Healthcare Research and Quality (AHRQ)
Advertisements

Design of Experiments Lecture I
Sample size estimation
Sampling: Final and Initial Sample Size Determination
Unit 7 Section 6.1.
Estimation of Sample Size
Confidence Intervals © Scott Evans, Ph.D..
SAMPLE SIZE ESTIMATION
Research Curriculum Session III – Estimating Sample Size and Power Jim Quinn MD MS Research Director, Division of Emergency Medicine Stanford University.
Clustered or Multilevel Data
Sample size calculations
Sample Size Determination
Chapter 14 Inferential Data Analysis
Sample Size Determination Ziad Taib March 7, 2014.
Determining Sample Size
1 Bandit Thinkhamrop, PhD.(Statistics) Dept. of Biostatistics & Demography Faculty of Public Health Khon Kaen University Formulation of a research Using.
Power and Sample Size Determination Anwar Ahmad. Learning Objectives Provide examples demonstrating how the margin of error, effect size and variability.
Chap 20-1 Statistics for Business and Economics, 6e © 2007 Pearson Education, Inc. Chapter 20 Sampling: Additional Topics in Sampling Statistics for Business.
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
6.1 - One Sample One Sample  Mean μ, Variance σ 2, Proportion π Two Samples Two Samples  Means, Variances, Proportions μ 1 vs. μ 2.
Education Research 250:205 Writing Chapter 3. Objectives Subjects Instrumentation Procedures Experimental Design Statistical Analysis  Displaying data.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
통계적 추론 (Statistical Inference) 삼성생명과학연구소 통계지원팀 김선우 1.
Introduction to sample size and power calculations Afshin Ostovar Bushehr University of Medical Sciences.
Sections 7-1 and 7-2 Review and Preview and Estimating a Population Proportion.
ICCS 2009 IDB Workshop, 18 th February 2010, Madrid 1 Training Workshop on the ICCS 2009 database Weighting and Variance Estimation picture.
Statistical planning and Sample size determination.
Power & Sample Size Dr. Andrea Benedetti. Plan  Review of hypothesis testing  Power and sample size Basic concepts Formulae for common study designs.
Statistics 300: Elementary Statistics Sections 7-2, 7-3, 7-4, 7-5.
Overview and Common Pitfalls in Statistics and How to Avoid Them
Sample Size Determination
Various Topics of Interest to the Inquiring Orthopedist Richard Gerkin, MD, MS BGSMC GME Research.
Sampling and Statistical Analysis for Decision Making A. A. Elimam College of Business San Francisco State University.
Statistical inference Statistical inference Its application for health science research Bandit Thinkhamrop, Ph.D.(Statistics) Department of Biostatistics.
Instrument design Essential concept behind the design Bandit Thinkhamrop, Ph.D.(Statistics) Department of Biostatistics and Demography Faculty of Public.
1 Bandit Thinkhamrop, PhD.(Statistics) Dept. of Biostatistics & Demography Faculty of Public Health Khon Kaen University Overview and Common Pitfalls in.
9-1 ESTIMATION Session Factors Affecting Confidence Interval Estimates The factors that determine the width of a confidence interval are: 1.The.
Chapter 8 Estimation ©. Estimator and Estimate estimator estimate An estimator of a population parameter is a random variable that depends on the sample.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
Class Six Turn In: Chapter 15: 30, 32, 38, 44, 48, 50 Chapter 17: 28, 38, 44 For Class Seven: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 Read.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 15: Sample size and Power Marshall University Genomics.
CHAPTER 6: SAMPLING, SAMPLING DISTRIBUTIONS, AND ESTIMATION Leon-Guerrero and Frankfort-Nachmias, Essentials of Statistics for a Diverse Society.
Sample Size Considerations
Applied Biostatistics: Lecture 4
More on Inference.
Chapter 7 Confidence Interval Estimation
Confidence Intervals and Sample Size
What size of trial do I need?
Chapter 6 Inferences Based on a Single Sample: Estimation with Confidence Intervals Slides for Optional Sections Section 7.5 Finite Population Correction.
ESTIMATION.
Point and interval estimations of parameters of the normally up-diffused sign. Concept of statistical evaluation.
How many study subjects are required ? (Estimation of Sample size) By Dr.Shaik Shaffi Ahamed Associate Professor Dept. of Family & Community Medicine.
B&A ; and REGRESSION - ANCOVA B&A ; and
HLM with Educational Large-Scale Assessment Data: Restrictions on Inferences due to Limited Sample Sizes Sabine Meinck International Association.
Population, Samples, and Sampling Descriptions
Chapter 8: Inference for Proportions
Power, Sample Size, & Effect Size:
Calculating Sample Size: Cohen’s Tables and G. Power
Chapter 9 Hypothesis Testing.
More on Inference.
CONCEPTS OF ESTIMATION
Elementary Statistics
SAMPLE DESIGN: HOW MANY WILL BE IN THE SAMPLE—DESCRIPTIVE STUDIES ?
Estimating a Population Proportion
1. Homework #2 (not on posted slides) 2. Inferential Statistics 3
CHAPTER 16: Inference in Practice
How many study subjects are required ? (Estimation of Sample size) By Dr.Shaik Shaffi Ahamed Professor Dept. of Family & Community Medicine College.
Effect of Sample size on Research Outcomes
Presentation transcript:

Sample Size Determination Bandit Thinkhamrop, PhD (Statistics) Dept. of Biostatistics & Demography Khon Kaen University

Essential of sample size calculation No one accept any “magic number” Too large vs Too small To justify with the sponsor and the Ethics Committee To ensure: adequate power to test a hypothesis desired precision to obtain an estimate

Two main approaches Hypothesis-based sample size calculation Involve “power” or beta error Ensure a significant finding but may not be conclusive clinically Easy and widely available Confidence interval methods of sample size calculation Involve precision of the estimation Ensure a conclusive finding clinically as this method is directly estimate the magnitude of effect Difficult and not widely available

Overall steps Identify the primary outcome Identify and review the magnitude of effect and its variability that will be used as the basis of the conclusion of the research. Identify what statistical method that will be used to obtain the main magnitude of effect. Calculate the sample size Describe how the sample size is calculated with sufficient details that allow explicability.

Steps in the calculation Base sample size calculation Design effect (for correlated outcome) Contingency (increase to account for non-responses or dropout) Rounding up to a nearest (and comfortable) number Evaluate if this sample size would provide a precise and conclusive answer to the research question by analyze the data as if it is as expected.

Suggested approaches For unknown parameters in the formula, try to find existing evidences or use your best “GUESTIMATE”, a.k.a. educated guest. Do not use only one scenario or based on only one reference for the calculation. It is highly recommended that all key parameters should be varied to see how they effect on the sample size. Always evaluate its sufficiency by estimate the main magnitude of effect and its 95% CI and see if it provide a conclusive finding. Consult with the statistician early

Common pitfalls Unjustified sample size by specifying a “magic” number Based on a simplify formula or a sample size table without understanding its limitations "A previous study in this area recruited 50 subjects and found highly significant results (p=0.001), and therefore a similar sample size should be sufficient." – never do it like this Inconsistent with the protocol Too much rely on the previous findings in sample size calculation

Examples of common calculations Mean – one group Mean – two independent groups Proportion – one group Proportion – two independent groups Get some idea from those Practice with your own research

Mean – one group :Formula Where: n = The sample size Z/2 = The standard normal coefficient, typically 1.96 for 95% CI s =The standard deviation. d = The desired precision level expressed as half of the maximum acceptable confidence interval width.

Mean – one group :Calculations (fix  = 0.05) Expected Standard deviation Precision (half width) n 25 5 99 30 141 10 27 38

Mean – one group :Descriptions A sample size of 38 would be able to estimate a mean with a precision of 10 assuming a standard deviation of 30 according to a study by <Reference>. That is, based on the expected mean of 55 <Reference>, the 95% confidence interval of the estimated mean would be between 45 and 65.

Mean – two independent group :Formula Represents the desired power (typically .84 for 80% power). Sample size in each group (assumes equal sized groups) A measure of variability (This is a variance or a square of the standard deviation) Represents the desired level of statistical significance (typically 1.96 for  = 0.05). Minimum meaningful difference or Effect Size

Minimum and meaningful difference Mean – two independent groups :Calculations (fix  = 0.05) H0: M1-M2=0. H1: M1-M2=D1<>0. Test Statistic: Z test with pooled variance (SD1 = 20; SD2 = 25) Power Mean in Control grp. Minimum and meaningful difference n1 n2 90% 30 10 109 80% 82 20 28 22 35 5 432 322 15 49 37

Mean – two independent groups :Descriptions A total sample size of 37 in group one and 37 in group two would have a power of 80% to detect a difference between group of 15 assuming a mean of 35 in control group with estimated group standard deviations of 20 and 25, respectively, according to a study by <Reference>. The test statistic used is the two-sided two sample t-test. The significance level of the test was targeted at 0.05.

Proportion – one group :Formula Where: n = The sample size Z/2 = The standard normal coefficient, , typically 1.96 for 95% CI p = The value of the proportion as a decimal percent (e.g., 0.45). d = The desired precision level expressed as half of the maximum acceptable confidence interval width.

Proportion – one group :Calculations (fix  = 0.05) Expected Prevalence Precision (half width) n 15% 2% 1,225 20% 1,537 4% 307 385

Proportion – one group :Descriptions A sample size of 400 would have a 95% confidence interval of 16% to 24% assuming a prevalence of 20% according to a study by <Reference>.

Proportion – two independent group :Formula Represents the desired power (typically .84 for 80% power). Sample size in each group (assumes equal sized groups) A measure of variability (similar to standard deviation) Represents the desired level of statistical significance (typically 1.96 for  = 0.05). Minimum meaningful difference or Effect Size

Proportion – two independent groups :Calculations (fix  = 0 Proportion – two independent groups :Calculations (fix  = 0.05) H0: P1-P2=0. H1: P1-P2=D1<>0. Test Statistic: Z test with pooled variance Power Proportion in Control grp. Minimum and meaningful difference n1 n2 90% 40% 5% 2,053 80% 1,534 50% 2,095 1,565 10% 519 388

Proportion – two independent groups :Descriptions A total sample size of 388 in group one and 388 in group two would have a power of 80% to detect a difference between group of 10% assuming a prevalence of 50% in control group according to a study by <Reference>. The test statistic used is the two-sided Z test. The significance level of the test was targeted at 0.0500.

Other considerations Sampling design affects the calculation of sample size Simple random sampling / assignment Stratified random sampling / assignment Clustered random sampling / assignment Complex study designs affects the calculation of sample size Matching Multiple stages of sampling Repeated measures Usually the sample size calculation is based on method of analysis Correlation, Agreement, Diagnostic performance Z-test Regression – multiple linear, logistic Multivariate analyses such as principle component or factor analysis Survival analyses Multilevel models

Other considerations Demonstrate superiority Sample size sufficient to detect difference between treatments Require to specify “minimum meaningful” difference Demonstrate non-inferiority or equally effective Sample size required to demonstrate equivalence larger than required to demonstrate superiority Require to specify “non-inferiority margin or equivalence range”

Precision or Power Estimation Equivalence to sample size calculation – do it in the planning phase of the study Do it when the number of available sample is known Wrong: “There are around 50 patients per year, of whom 10% may refuse to take part in the study. Therefore over the 2 years of the study, the sample size will be 90 patients. “ Correct: “It is estimated that there will be 90 patients in the clinic. This will give a precision of the prevalence estimation of  20% assuming a prevalence of 65%.”

Suggested learning resources WWW: Statistics Guide for Research Grant Applicants at St George’s University of London (maintained by Martin Bland): http://www-users.york.ac.uk/~mb55/guide/size.htm Software: PASS2008, nQuery, EpiTable, SeqTrial, PS, etc.

Q & A