November 10, 2010DSTS meeting, Copenhagen1 Power and sample size calculations Michael Væth, University of Aarhus Introductory remarks Two-sample problem.

Slides:



Advertisements
Similar presentations
Chapter 12: Inference for Proportions BY: Lindsey Van Cleave.
Advertisements

Chapter 4 Inference About Process Quality
Study Size Planning for Observational Comparative Effectiveness Research Prepared for: Agency for Healthcare Research and Quality (AHRQ)
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Sampling: Final and Initial Sample Size Determination
Hypothesis Testing Developing Null and Alternative Hypotheses Developing Null and Alternative Hypotheses Type I and Type II Errors Type I and Type II Errors.
1 1 Slide STATISTICS FOR BUSINESS AND ECONOMICS Seventh Edition AndersonSweeneyWilliams Slides Prepared by John Loucks © 1999 ITP/South-Western College.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Chapter 9 Hypothesis Testing Developing Null and Alternative Hypotheses Developing Null and.
Estimation of Sample Size
10-1 Introduction 10-2 Inference for a Difference in Means of Two Normal Distributions, Variances Known Figure 10-1 Two independent populations.
Sample size computations Petter Mostad
9-1 Hypothesis Testing Statistical Hypotheses Statistical hypothesis testing and confidence interval estimation of parameters are the fundamental.
Basic Elements of Testing Hypothesis Dr. M. H. Rahbar Professor of Biostatistics Department of Epidemiology Director, Data Coordinating Center College.
Intro to Statistics for the Behavioral Sciences PSYC 1900 Lecture 10: Hypothesis Tests for Two Means: Related & Independent Samples.
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 8 Introduction to Hypothesis Testing.
EEM332 Design of Experiments En. Mohd Nazri Mahmud
Chapter 2 Simple Comparative Experiments
7-2 Estimating a Population Proportion
Inferences About Process Quality
Sample Size Determination Ziad Taib March 7, 2014.
Standard error of estimate & Confidence interval.
Fundamentals of Hypothesis Testing: One-Sample Tests
Chapter 9 Hypothesis Testing: Single Population
Inference for a Single Population Proportion (p).
CI - 1 Cure Rate Models and Adjuvant Trial Design for ECOG Melanoma Studies in the Past, Present, and Future Joseph Ibrahim, PhD Harvard School of Public.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 9 Section 1 – Slide 1 of 39 Chapter 9 Section 1 The Logic in Constructing Confidence Intervals.
Today’s lesson Confidence intervals for the expected value of a random variable. Determining the sample size needed to have a specified probability of.
Comparing Two Proportions
Population All members of a set which have a given characteristic. Population Data Data associated with a certain population. Population Parameter A measure.
Statistical Review We will be working with two types of probability distributions: Discrete distributions –If the random variable of interest can take.
9-1 Hypothesis Testing Statistical Hypotheses Definition Statistical hypothesis testing and confidence interval estimation of parameters are.
1 1 Slide © 2007 Thomson South-Western. All Rights Reserved Chapter 9 Hypothesis Testing Developing Null and Alternative Hypotheses Developing Null and.
Introduction to sample size and power calculations Afshin Ostovar Bushehr University of Medical Sciences.
Introduction to Inferece BPS chapter 14 © 2010 W.H. Freeman and Company.
1 Chapter 9 Hypothesis Testing. 2 Chapter Outline  Developing Null and Alternative Hypothesis  Type I and Type II Errors  Population Mean: Known 
RDPStatistical Methods in Scientific Research - Lecture 41 Lecture 4 Sample size determination 4.1 Criteria for sample size determination 4.2 Finding the.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests Statistics.
Copyright © 2010 Pearson Education, Inc. Chapter 22 Comparing Two Proportions.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 10 Comparing Two Populations or Groups 10.1.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 10 Comparing Two Populations or Groups 10.1.
1 9 Tests of Hypotheses for a Single Sample. © John Wiley & Sons, Inc. Applied Statistics and Probability for Engineers, by Montgomery and Runger. 9-1.
Chap 8-1 Fundamentals of Hypothesis Testing: One-Sample Tests.
Fall 2002Biostat Statistical Inference - Proportions One sample Confidence intervals Hypothesis tests Two Sample Confidence intervals Hypothesis.
Statistics for Decision Making Basic Inference QM Fall 2003 Instructor: John Seydel, Ph.D.
Mystery 1Mystery 2Mystery 3.
What is a Test of Significance?. Statistical hypotheses – statements about population parameters Examples Mean weight of adult males is greater than 160.
© Copyright McGraw-Hill 2004
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 11 Section 3 – Slide 1 of 27 Chapter 11 Section 3 Inference about Two Population Proportions.
Learning Objectives After this section, you should be able to: The Practice of Statistics, 5 th Edition1 DESCRIBE the shape, center, and spread of the.
Sample Size Needed to Achieve High Confidence (Means)
Evaluating Hypotheses. Outline Empirically evaluating the accuracy of hypotheses is fundamental to machine learning – How well does this estimate its.
Review Statistical inference and test of significance.
Class Seven Turn In: Chapter 18: 32, 34, 36 Chapter 19: 26, 34, 44 Quiz 3 For Class Eight: Chapter 20: 18, 20, 24 Chapter 22: 34, 36 Read Chapters 23 &
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 10 Comparing Two Populations or Groups 10.1.
Hypothesis Testing Concepts of Hypothesis Testing Statistical hypotheses – statements about population parameters Examples mean weight of adult.
Inference for a Single Population Proportion (p)
ESTIMATION.
CHAPTER 10 Comparing Two Populations or Groups
Chapter 4. Inference about Process Quality
Inference: Conclusion with Confidence
Chapter 8: Inference for Proportions
Chapter 9 Hypothesis Testing.
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups
Presentation transcript:

November 10, 2010DSTS meeting, Copenhagen1 Power and sample size calculations Michael Væth, University of Aarhus Introductory remarks Two-sample problem with normal data Comparison of two proportions Sample size and power calculations based on Wald’s test Two-sample problem with censored survival data Non-inferiority trials and equivalence trials Sample size and confidence intervals

November 10, 2010DSTS meeting, Copenhagen2 Power and sample size calculations “Investigators often ask statisticians how many observations they should make (fortunately, usually before the study begins). To be answerable, this question needs fuller formulation. There is resemblance to the question, How much money should I take when I go on vacation? Fuller information is needed there too. How long a vacation? Where? With whom?” Moses(NEJM,1985)

November 10, 2010DSTS meeting, Copenhagen3 A study should:  Allow conclusive answers to the questions being addressed  Provide estimates of relevant quantities with sufficient precision Standard approach Identify a maximal risk of wrong conclusions or Quantify the size of a sufficient precision Determine the minimum sample size for which the study achieves the design goals Power and sample size calculations

November 10, 2010DSTS meeting, Copenhagen4 Implementation of standard approach Use commercial special-purpose software Simulations Analytic methods Power and sample size calculations

November 10, 2010DSTS meeting, Copenhagen5 RCT, equal allocation probabilities Outcome follow a normal distribution. Means Common standard deviation, assumed known: Estimated treatment difference: Expected treatment difference Minimal relevant difference Hypothesis: Test statistic: Two-sample problem with continuous outcome

November 10, 2010DSTS meeting, Copenhagen6 If the test statistic has a standard normal distribution. In general, the test statistic is normal: mean and standard deviation 1 Two-sample problem with continuous outcome (2)

November 10, 2010DSTS meeting, Copenhagen7 A: Distribution of the test statistic when B: Distribution of the test statistic for an alternative value of Level of significance Power Two-sample problem with continuous outcome (3)

November 10, 2010DSTS meeting, Copenhagen8 Assume so only the upper term matters Two-sample problem with continuous outcome (4) Only contribution from one term unless power close to level of significance Basic relation

November 10, 2010DSTS meeting, Copenhagen9 Two-sample problem with continuous outcome (5) Sample size for given power Power for given sample size

November 10, 2010DSTS meeting, Copenhagen10 Two-sample problem with continuous outcome (6) Depends on the error probabilities Depend on the problem Level ofStatistical power significance50%80%90%95% 5% % % % for selected values ofTable of

November 10, 2010DSTS meeting, Copenhagen11 Comparison of two proportions Score test: Basic relation becomes with

November 10, 2010DSTS meeting, Copenhagen12 Comparison of two proportions (2) Wald’s test: Basic relation becomes The simple structure N = (model term)(error term) is recovered

November 10, 2010DSTS meeting, Copenhagen13 Comparison of two proportions (3) Example 1 N(Score) = 2894 N(Wald) = 2888 N(Score) = 4610 N(Wald) = 4278 N(Score) = 4422 N(Wald) = 4749 Other sample fractions

November 10, 2010DSTS meeting, Copenhagen14 Sample size and power calculations based on Wald’s test Data and Statistical model Question: Hypothesis about 1-dim. parameter Wald’s test with Sample size for given power Power for given sample size

November 10, 2010DSTS meeting, Copenhagen15 Sample size and power calculations based on Wald’s test (2) Example 1 (ctd.) Same problem, but now use Wald’s test based on ln(odds) N = 2906 N = 4778 N = 4304 Score Wald

November 10, 2010DSTS meeting, Copenhagen16 Sample size and power calculations based on Wald’s test (3) Use of simulations Computer generates a large number of independent sample of size from a scenario representing a relevant difference Power estimated as proportion of samples for which Wald’s test is statistically significant at level Sample size for power level

November 10, 2010DSTS meeting, Copenhagen17 Sample size and power calculations based on Wald’s test (4) Use of simulations Sample size multiplier

November 10, 2010DSTS meeting, Copenhagen18 Two-sample problem with censored survival data Time-to-event data Two sample, proportional hazards model Hazard rates Parameter of interest Wald’s test with the number of events in group i

November 10, 2010DSTS meeting, Copenhagen19 Wald’s test is approximately normal with sd = 1 and mean average probability of an event in group i Sample size Two-samples with censored data (2) Sample size depends primarily on number of events

November 10, 2010DSTS meeting, Copenhagen20 Two-samples with censored data (3) Example 2 Design of a RCT with survival endpoint Comparison of new and standard treatment Endpoint: All-cause mortality Requirements: max. 6 years; power = 80% for HR = A T = A + F Study start Accrual ends Study ends Accrual period Follow-up period No additional follow-upIn general

November 10, 2010DSTS meeting, Copenhagen21 Two-samples with censored data (3) Example 2 (ctd.) KM-estimate: standard treatment1 - KM Std. Treatment: Average event probability = AUC/baseline Average event probability with new treatment

November 10, 2010DSTS meeting, Copenhagen22 Two-samples with censored data (4) Example 2 (ctd.) AccrualFollow-upTotalAverage mortality probabilityNumber ofPatients years StandardNewOverallpatientsper year designs with the the same expected number of events (635) 635 events are needed to meet the design requirements This can be achieved in different ways Competing risk: Replace 1-KM with Cumulative Incidence

November 10, 2010DSTS meeting, Copenhagen23 Non-inferiority & equivalence trials Null hypothesis Minimal relevant difference  Maximal irrelevant difference

November 10, 2010DSTS meeting, Copenhagen24 Non-inferiority & equivalence trials Non-inferiority: a one-sided hypothesis Two-sample problem with normal data (Wald’s test approach for 1-parameter problem) Basic relation Sample size Note: If the power is assessed at a zero difference, then the sample size needed to achieve this power will be underestimated if the effect of the new product is less than that of the active control

November 10, 2010DSTS meeting, Copenhagen25 Non-inferiority & equivalence trials Equivalence: union-intersection test Two one-sided tests Basic relations Sample size: is specified for Note: If the power is assessed at a zero difference, then the sample size needed to achieve this power is underestimated if the true difference is not zero.

November 10, 2010DSTS meeting, Copenhagen26 Sample size and confidence intervals Design phase: Sample size considerations are traditionally phrased in the terminology of hypothesis testing Formulas are derived by controlling error probabilities Reporting and interpreting results Focus on estimates and confidence intervals Hypothesis tests are downplayed Why not use the same approach on both occasions?

November 10, 2010DSTS meeting, Copenhagen27 Sample size and confidence intervals Power calculations when reporting the results? Probability statements should utilize the collected data and not be based on anticipated values of the parameters. Interpretation: Probability of replication Some statistical packages provide calculation of ”post-hoc power” or ”observed power”, i.e. Power computed at the estimated parameter value. This does not make sense. The power becomes a (known) function of the significance level.

November 10, 2010DSTS meeting, Copenhagen28 Sample size and confidence intervals Sample size calculations based on confidence intervals? Two-sample problem with normal data (Wald’s test approach for 1-parameter problem) 95% confidence interval Choose smallest N such that a confidence interval centered at excludes 0 Corresponds to power = 0.50

November 10, 2010DSTS meeting, Copenhagen29 Sample size and confidence intervals Use the fundamental relation between hypothesis test and confidence intervals to formulate the sample size requirements in confidence interval terminology Greenland(AJE,1988), Daly(BMJ,1991) To compute a sample size specify 1.The confidence level 2.The minimum size parameter-value that we wish to estimate unambigously, i.e. with a confidence interval that excluded the null value 3.The probability of achieving this if the true value is the this minimum value

November 10, 2010DSTS meeting, Copenhagen30 "How not to collaborate with a biostatistician”How not to collaborate with a biostatistician Power and sample size calculations A ”commentary” on the world-wide-web: