BIOST 513 Discussion Section - Week 10

Slides:



Advertisements
Similar presentations
Introduction to Hypothesis Testing
Advertisements

COMPUTER INTENSIVE AND RE-RANDOMIZATION TESTS IN CLINICAL TRIALS Thomas Hammerstrom, Ph.D. USFDA, Division of Biometrics The opinions expressed are those.
Our goal is to assess the evidence provided by the data in favor of some claim about the population. Section 6.2Tests of Significance.
CHAPTER 21 Inferential Statistical Analysis. Understanding probability The idea of probability is central to inferential statistics. It means the chance.
Inference Sampling distributions Hypothesis testing.
Our goal is to assess the evidence provided by the data in favor of some claim about the population. Section 6.2Tests of Significance.
Statistical Significance What is Statistical Significance? What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant?
HYPOTHESIS TESTING Four Steps Statistical Significance Outcomes Sampling Distributions.
Statistics II: An Overview of Statistics. Outline for Statistics II Lecture: SPSS Syntax – Some examples. Normal Distribution Curve. Sampling Distribution.
Fundamentals of Hypothesis Testing. Identify the Population Assume the population mean TV sets is 3. (Null Hypothesis) REJECT Compute the Sample Mean.
Statistical Significance What is Statistical Significance? How Do We Know Whether a Result is Statistically Significant? How Do We Know Whether a Result.
Statistical Methods in Computer Science Hypothesis Testing I: Treatment experiment designs Ido Dagan.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests Statistics.
Today Concepts underlying inferential statistics
© 1999 Prentice-Hall, Inc. Chap Chapter Topics Hypothesis Testing Methodology Z Test for the Mean (  Known) p-Value Approach to Hypothesis Testing.
Sample Size Determination
Sample Size Determination Ziad Taib March 7, 2014.
Assessing Survival: Cox Proportional Hazards Model Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Chapter 12 Inferential Statistics Gay, Mills, and Airasian
Choosing Statistical Procedures
AM Recitation 2/10/11.
Confidence Intervals and Hypothesis Testing - II
Jeopardy Hypothesis Testing T-test Basics T for Indep. Samples Z-scores Probability $100 $200$200 $300 $500 $400 $300 $400 $300 $400 $500 $400.
Tests of significance & hypothesis testing Dr. Omar Al Jadaan Assistant Professor – Computer Science & Mathematics.
Essentials of survival analysis How to practice evidence based oncology European School of Oncology July 2004 Antwerp, Belgium Dr. Iztok Hozo Professor.
Week 8 Fundamentals of Hypothesis Testing: One-Sample Tests
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses.
Sample size determination Nick Barrowman, PhD Senior Statistician Clinical Research Unit, CHEO Research Institute March 29, 2010.
Hypothesis Testing: One Sample Cases. Outline: – The logic of hypothesis testing – The Five-Step Model – Hypothesis testing for single sample means (z.
1 Introduction to Hypothesis Testing. 2 What is a Hypothesis? A hypothesis is a claim A hypothesis is a claim (assumption) about a population parameter:
Introduction to Hypothesis Testing: One Population Value Chapter 8 Handout.
Assessing Survival: Cox Proportional Hazards Model
1 Lecture 19: Hypothesis Tests Devore, Ch Topics I.Statistical Hypotheses (pl!) –Null and Alternative Hypotheses –Testing statistics and rejection.
Lecture 20: Study Design and Sample Size Estimation in TTE Studies.
Lecture 16 Dustin Lueker.  Charlie claims that the average commute of his coworkers is 15 miles. Stu believes it is greater than that so he decides to.
Inference and Inferential Statistics Methods of Educational Research EDU 660.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
Statistical Inference Statistical Inference involves estimating a population parameter (mean) from a sample that is taken from the population. Inference.
Educational Research Chapter 13 Inferential Statistics Gay, Mills, and Airasian 10 th Edition.
Economics 173 Business Statistics Lecture 4 Fall, 2001 Professor J. Petry
Education 793 Class Notes Decisions, Error and Power Presentation 8.
Lecture 17 Dustin Lueker.  A way of statistically testing a hypothesis by comparing the data to values predicted by the hypothesis ◦ Data that fall far.
Hypothesis Testing. “Not Guilty” In criminal proceedings in U.S. courts the defendant is presumed innocent until proven guilty and the prosecutor must.
© Copyright McGraw-Hill 2004
1 Chapter 6 SAMPLE SIZE ISSUES Ref: Lachin, Controlled Clinical Trials 2:93-113, 1981.
Chapter 7: Hypothesis Testing. Learning Objectives Describe the process of hypothesis testing Correctly state hypotheses Distinguish between one-tailed.
Fundamentals of Data Analysis Lecture 4 Testing of statistical hypotheses pt.1.
Educational Research Inferential Statistics Chapter th Chapter 12- 8th Gay and Airasian.
Methods and Statistical analysis. A brief presentation. Markos Kashiouris, M.D.
Review of Central Tendency & Dispersion
Sample Size Considerations
Logic of Hypothesis Testing
Hypothesis Testing: One Sample Cases
Unit 3 Hypothesis.
Statistical Core Didactic
Hypothesis Testing.
Sample Size Estimation
Hypothesis Testing: Hypotheses
Statistics 103 Monday, July 10, 2017.
Statistical Tests - Power
9 Tests of Hypotheses for a Single Sample CHAPTER OUTLINE
F-tests Testing hypotheses.
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Statistics II: An Overview of Statistics
Power and Sample Size I HAVE THE POWER!!! Boulder 2006 Benjamin Neale.
Statistics for Business and Economics
Psych 231: Research Methods in Psychology
Sample Size and Power Part II
STA 291 Spring 2008 Lecture 17 Dustin Lueker.
Presentation transcript:

BIOST 513 Discussion Section - Week 10 Clara Domínguez-Islas

Outline Refresher for Power and Sample size Power and sample size calculations for time-to-event analyses Log-rank test Cox PH regression Homework 8 review

Power and Sample Size Considerations Statistical Power In scientific research, one of the roles of an investigator will likely be to formulate one or more null hypotheses, that is, “The hypothesis that the phenomenon to be demonstrated is in fact absent’’ [R.A. Fisher, 1949] Not the hypothesis she or he hopes to prove.

The Null Hypothesis, H0 The researcher's interest is aimed at disproving the null hypothesis in favor of the alternative hypothesis, Ha, E.g., In a criminal case, there is a trial by jury where H0: The accused is innocent (until proven guilty) Ha: The accused is guilty Rejecting (disproving) the null hypothesis in favor of the alternative hypothesis can be established. However, not rejecting the null hypothesis does not prove it.

The Null Hypothesis, H0 Problem Ideally, we would like to be able to make some statement as to the “proof” of the null hypothesis when it is not rejected. Solution We control for factors that reduce the chance of making a Type II Error (i.e., increase statistical power).

Definition Power (of -a- statistical test) The probability of correctly rejecting the null hypothesis (i.e., when it is false). For our jury example, it is the probability the jury gives a guilty verdict when the accused is, in fact, guilty. Power = 1 - Pr[Type II Error].

Power of a Statistical Test Statistical Power is conducted by the test If you have K primary aims for your investigation, there should be K power/sample size calculations Challenges arise when considering simultaneous sets of tests (K>1) We will focus on the case where we have K=1 test Statistical Power can often be shown to be a function of The probability of making a Type I Error (a) The information in the data (e.g., sample size (N), # events (L), etc.) Effect size (ES) – for the scientific summary you wish to investigate e.g., HR comparing two exposure groups for having some event e.g., OR comparing two exposure groups e.g., |m1-m0|/s standardized difference in means comparing two exposure groups

Power of -a- Statistical Test Armed with any three of the four factors that influence power, we can generally find the fourth. Specify: {a, Power, N}  to find ES Specify: {a, ES, N}  to find Power Specify: {a, Power, ES}  to find N a is usually easy (set to 0.01, 0.05). Power is usually easy, too (set to 0.80, 0.90), but sometimes we wish to know the Power for a given ES and N.

Main components of sample-size computation for Survival Analysis Method of analysis Log-rank test, Cox PH model, parametric test Probability of type I error (a) Usually 0.05 or 0.01 Probability of type II error (b) or Power (1-b) Usually 80% - 90% (or 20% - 10%) Effect size, expressed as The hazard ratio (D = h2(t)/h2 (t) ), or The log of the hazard ratio (log(D)) ,or The coefficient in a Cox regression model, b1 = ln { h( t, x1 +1 | x2 ,…, xp ) / h( t, x1 | x2 ,…, xp ) }

Main components of sample-size computation – stpower command Three subcommands stpower logrank stpower cox stpower exponential Main options alpha(), power(), beta(), n(), hratio(), onsided Syntax (examples) stpower logrank, hratio(0.5) power(0.8) stpower cox -1, sd(0.3)

Determination of number of events for the log-rank test - Freedman Objective: To obtain the number of events required to ensure pre-specified power 1-b for a two-sided log-rank test at level a to detect a hazard ratio of Da for the experimental group relative to the control group. Hypothesis: H0: S1(t) = S2(t) vs HA: S1(t) ≠ S1(t) Equivalent hypothesis: H0: D = 1 vs HA: D ≠ 1 Where: E = number of events required l = sample size ratio (N2/N1) z1-a/k, z1-b = percentiles of a standard normal distribution k = 1 (one-sided test) or 2 (two sided test) Assumptions: Proportional hazards; large-sample approximation to the test statistic, D

Determination of number of events for the log-rank test - Schoenfeld Objective: To obtain the number of events required to ensure pre-specified power 1-b for a two-sided log-rank test at level a to detect a hazard ratio of Da for the experimental group relative to the control group. Hypothesis: H0: S1(t) = S2(t) vs HA: S1(t) ≠ S1(t) Equivalent hypothesis: H0: ln(D) = 0 vs HA: ln(D) ≠ 0 Where: E = number of events required l = sample size ratio (N2/N1) p1 = proportion of subjects in group 1 (control) z1-a/k, z1-b = percentiles of a standard normal distribution k = 1 (one-sided test) or 2 (two sided test) Assumptions: Proportional hazards; large-sample approximation to the test statistic, ln(D)

Sample size determination for the log-rank test – Example 1 Estimate the required sample size to achieve 80% power to detect a 25% reduction in the hazard of the experimental group compared to control, by using a two-sided log-rank test with 5% significance level, in a study with balanced design (1:1 randomization) a = b = D = l =

Sample size determination for the log-rank test – Example 1 Estimate the required sample size to achieve 80% power to detect a 25% reduction in the hazard of the experimental group compared to control, by using a two-sided log-rank test with 5% significance level, in a study with balanced design (1:1 randomization) a = 0.05 b = 0.2 D = 0.75 l = 1 Assume: Complete follow-up (no censoring)

Sample size determination for the log-rank test – Example 1 . stpower logrank, alpha(0.05) beta(0.2) hratio(0.75) Estimated sample sizes for two-sample comparison of survivor functions Log-rank test, Freedman method Ho: S1(t) = S2(t) Input parameters: alpha = 0.0500 (two sided) hratio = 0.7500 power = 0.8000 p1 = 0.5000 Estimated number of events and sample sizes: E = 386 N = 386 N1 = 193 N2 = 193

Determination of sample size based on the number of events Under censoring, the required number of subjects for the study is given by Where: E = number of events required pE = probability of observing the event = 1 - survival probability at the end of the study If withdrawal (lost to follow-up) is anticipated, a conservative adjustment can be done:

Sample size determination for the log-rank test – Example 1 Estimate the required sample size to achieve 80% power to detect a 25% reduction in the hazard of the experimental group compared to control, by using a two-sided log-rank test with 5% significance level. a = 0.05 b = 0.2 D = 0.75 Now, consider the following: Unbalanced design, with experimental group twice as large as the control Administrative censoring (40% of subjects in control group survive by the end of the study) Lost to follow-up (10% withdrawal)

Determination of number of events for the Cox PH regression - Objective: To obtain the number of events required to ensure prespecified power 1-b for a two-sided Wald test to detect a change of b1A = ln(Da) in the log-hazard associated to one-unit change in a covariate of interest x1, adjusted for other factors x2,…, xp. Hypothesis: H0 : (b1 , b2 , ..., bp) = (0 , b2 ,..., bp) vs HA : (b1 , b2 , ..., bp) = (b1A , b2 ,..., bp) Where: s 2 = variance of the covariate of interest x1 R2 = proportion of the variability of x1 explained by other covariates Assumptions: Proportional hazards; based on large-sample approximation for the test statistic, b1

Sample size determination for the Cox PH regression – Example In a study with multiple-myeloma patients, the interest is to investigate the effect of the log blood urea nitrogen lBUN on patients’ survival. Estimate the required sample size to achieve 80% power to detect a change of 0.5 in the log-hazard associated to one-unit change in the lBUN using a one-sided Wald test with 5% significance level, after adjusting for other factors. From previous studies it has been estimated that the standard deviation of IBUN is 0.3126. Assume: lBUN independent of other covariates (randomized trial) lBUN correlated to other covariates (R2= 0.1837) Censoring (overall death rate = 0.8)