Statistical Issues in Interpreting Clinical Trials D. L. DeMets Journal of Internal Medicine 255: 529-537. 2004 “Lies, Damn Lies, and Clinical Statistics”

Slides:



Advertisements
Similar presentations
Appraisal of an RCT using a critical appraisal checklist
Advertisements

ISSUES THAT PLAGUE NON- INFERIORITY TRIALS PAST AND FUTURE RALPH B. DAGOSTINO, SR. BOSTON UNIVERSITY HARVARD CLINICAL RESEARCH INSTITUTE.
Phase II/III Design: Case Study
Study Size Planning for Observational Comparative Effectiveness Research Prepared for: Agency for Healthcare Research and Quality (AHRQ)
LSU-HSC School of Public Health Biostatistics 1 Statistical Core Didactic Introduction to Biostatistics Donald E. Mercante, PhD.
1 Health and Disease in Populations 2002 Week 9 – 2/5/02 Randomised controlled trials 2 Dr Jenny Kurinczuk.
天 津 医 科 大 学天 津 医 科 大 学 Clinical trail. 天 津 医 科 大 学天 津 医 科 大 学 1.Historical Background 1537: Treatment of battle wounds: 1741: Treatment of Scurvy 1948:
Some comments on the 3 papers Robert T. O’Neill Ph.D.
Purpose To determine whether metoprolol controlled/extended release
ODAC May 3, Subgroup Analyses in Clinical Trials Stephen L George, PhD Department of Biostatistics and Bioinformatics Duke University Medical Center.
Statistical Issues in Contraceptive Trials
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
Clinical Trials Hanyan Yang
By Dr. Ahmed Mostafa Assist. Prof. of anesthesia & I.C.U. Evidence-based medicine.
EVIDENCE BASED MEDICINE
RANDOMIZED CLINICAL TRIALS. What is a randomized clinical trial?  Scientific investigations: examine and evaluate the safety and efficacy of new drugs.
Sample Size Determination Ziad Taib March 7, 2014.
INCLUSION/EXCLUSION CRITERIA 1. The following are considerations when defining the cardiac arrest trial patient population (i.e., the inclusion/exclusion.
Copyright (c) 2004 Brooks/Cole, a division of Thomson Learning, Inc. Chapter 8 Tests of Hypotheses Based on a Single Sample.
Are the results valid? Was the validity of the included studies appraised?
1 Efficacy Results NDA (MTP-PE) Laura Lu Statistical Reviewer Office of Biostatistics FDA/CDER.
COURAGE: Clinical Outcomes Utilizing Revascularization and Aggressive Drug Evaluation Purpose To compare the efficacy of optimal medical therapy (OMT)
Multiple Choice Questions for discussion
Inference in practice BPS chapter 16 © 2006 W.H. Freeman and Company.
Intervention Studies Principles of Epidemiology Lecture 10 Dona Schneider, PhD, MPH, FACE.
Published in Circulation 2005 Percutaneous Coronary Intervention Versus Conservative Therapy in Nonacute Coronary Artery Disease: A Meta-Analysis Demosthenes.
BIOE 301 Lecture Seventeen. Guest Speaker Jay Brollier World Camp Malawi.
Research Skills Basic understanding of P values and Confidence limits CHE Level 5 March 2014 Sian Moss.
Study design P.Olliaro Nov04. Study designs: observational vs. experimental studies What happened?  Case-control study What’s happening?  Cross-sectional.
Clinical Trial Designs An Overview. Identify: condition(s) of interest, intended population, planned treatment protocols Recruitment of volunteers: volunteers.
Introduction to inference Use and abuse of tests; power and decision IPS chapters 6.3 and 6.4 © 2006 W.H. Freeman and Company.
Consumer behavior studies1 CONSUMER BEHAVIOR STUDIES STATISTICAL ISSUES Ralph B. D’Agostino, Sr. Boston University Harvard Clinical Research Institute.
1 Statistical Review Dr. Shan Sun-Mitchell. 2 ENT Primary endpoint: Time to treatment failure by day 50 Placebo BDP Patients randomized Number.
Dose Interruption/Reduction of Tyrosine Kinase Inhibitors in the First 3 Months of Treatment of CML Is Associated with Inferior Early Molecular Responses.
AIRE: Acute Infarction Ramipril Efficacy study Purpose To determine whether the ACE inhibitor ramipril reduces mortality in patients with evidence of heart.
1 OTC-TFM Monograph: Statistical Issues of Study Design and Analyses Thamban Valappil, Ph.D. Mathematical Statistician OPSS/OB/DBIII Nonprescription Drugs.
1 Statistics in Drug Development Mark Rothmann, Ph. D.* Division of Biometrics I Food and Drug Administration * The views expressed here are those of the.
What is a non-inferiority trial, and what particular challenges do such trials present? Andrew Nunn MRC Clinical Trials Unit 20th February 2012.
Evaluating Impacts of MSP Grants Hilary Rhodes, PhD Ellen Bobronnikov February 22, 2010 Common Issues and Recommendations.
Lecture 16 Section 8.1 Objectives: Testing Statistical Hypotheses − Stating hypotheses statements − Type I and II errors − Conducting a hypothesis test.
통계적 추론 (Statistical Inference) 삼성생명과학연구소 통계지원팀 김선우 1.
Introduction to sample size and power calculations Afshin Ostovar Bushehr University of Medical Sciences.
BIOE 301 Lecture Seventeen. Progression of Heart Disease High Blood Pressure High Cholesterol Levels Atherosclerosis Ischemia Heart Attack Heart Failure.
Federal Institute for Drugs and Medical Devices The BfArM is a Federal Institute within the portfolio of the Federal Ministry of Health (BMG) The use of.
Medical Statistics as a science
1 Updates on Regulatory Requirements for Missing Data Ferran Torres, MD, PhD Hospital Clinic Barcelona Universitat Autònoma de Barcelona.
August 20, 2003FDA Antiviral Drugs Advisory Committee Meeting 1 Statistical Considerations for Topical Microbicide Phase 2 and 3 Trial Designs: A Regulatory.
Evaluating Impacts of MSP Grants Ellen Bobronnikov Hilary Rhodes January 11, 2010 Common Issues and Recommendations.
1 Study Design Issues and Considerations in HUS Trials Yan Wang, Ph.D. Statistical Reviewer Division of Biometrics IV OB/OTS/CDER/FDA April 12, 2007.
EBM --- Journal Reading Presenter :呂宥達 Date : 2005/10/27.
© Copyright McGraw-Hill 2004
Compliance Original Study Design Randomised Surgical care Medical care.
1 BLA Sipuleucel-T (APC-8015) FDA Statistical Review and Findings Bo-Guang Zhen, PhD Statistical Reviewer, OBE, CBER March 29, 2007 Cellular, Tissue.
European Patients’ Academy on Therapeutic Innovation The Purpose and Fundamentals of Statistics in Clinical Trials.
1 Pulminiq™ Cyclosporine Inhalation Solution Pulmonary Drug Advisory Committee Meeting June 6, 2005 Statistical Evaluation Statistical Evaluation Jyoti.
Making Randomized Clinical Trials Seem Less Random Andrew P.J. Olson, MD Assistant Professor Departments of Medicine and Pediatrics University of Minnesota.
Introduction to Biostatistics, Harvard Extension School, Fall, 2005 © Scott Evans, Ph.D.1 Sample Size and Power Considerations.
1 Chapter 6 SAMPLE SIZE ISSUES Ref: Lachin, Controlled Clinical Trials 2:93-113, 1981.
Response, PFS or OS – what is the best endpoint in advanced colorectal cancer? Marc Buyse IDDI, Louvain-la-Neuve & Hasselt University
Biostatistics Case Studies 2006 Peter D. Christenson Biostatistician Session 1: Demonstrating Equivalence of Active Treatments:
CCEB STATISTICAL CONSIDERATIONS IN CLINICAL TRIALS Susan S. Ellenberg, Ph.D. University of Pennsylvania School of Medicine ASENT Clinical Trials Course.
Angela Aziz Donnelly April 5, 2016
The Importance of Adequately Powered Studies
CLINICAL PROTOCOL DEVELOPMENT
Donald E. Cutlip, MD Beth Israel Deaconess Medical Center
Randomized Trials: A Brief Overview
Critical Reading of Clinical Study Results
Medical Statistics Exam Technique and Coaching, Part 2 Richard Kay Statistical Consultant RK Statistics Ltd 22/09/2019.
Presentation transcript:

Statistical Issues in Interpreting Clinical Trials D. L. DeMets Journal of Internal Medicine 255: “Lies, Damn Lies, and Clinical Statistics” Justin L. Grobe September 1, 2004

Drug Development Paradigm  Medicinal Chemistry Targeted development of new compounds  Animal Testing Test efficacy, potency, safety  Human Clinical Trials Multiple phases to test efficacy, potency, safety, and to compare new intervention to standard

Clinical Trials – Design Paradigm  Randomization Assignment to treatment group Order effects  Placebo “Control” (Ethical considerations)

Input = Return  “No clever analysis can rescue a flawed design or poorly conducted trial.”  Compliance issues

Five Major Statistical Issues 1. Intention-to-treat principle 2. Surrogate outcome measures 3. Subgroup analyses 4. Missing data 5. Noninferiority trials

Statistical Issue 1: Intention-to-treat principle  “… all patients are accounted for in the primary analysis, and primary events observed during the follow- up period are to be accounted for as well.”  Results can be biased if either of these aspects are not adhered to

Myths and examples  Myth: Large trials are free of these concerns Increased numbers of patients decreases variability of response variable, thereby making detection of differences easier; EXCEPT, this amplifies biases in the outcome measurement WHICH MAY cause detection of differences which do not actually exist

To include or not to include…  Two common reasons to drop patient data Post hoc ineligibility assessment Lack of patient compliance

TABLE 1: Post-hoc ineligibility assessment Anturane Reinfarction Trial  1629 patients who had survived a heart attack 813 patients received Anturane 816 patients received placebo 71 patients deemed “ineligible” for analysis by protocol Table Anturane mortality results Anturane (%) Placebo (%) P-value Randomized 74/813 (9.1) 89/816 (10.9) 0.20 ‘Eligible’ 64/775 (8.3) 85/783 (10.9) 0.07 ‘Ineligible’ 10/38 (26.3) 4/33 (12.1) 0.12 P-values for eligible versus ineligible Striking statistical comparisons are made by including/excluding patients in each group: thus the results are biased by post hoc exclusions

TABLE 2: Patient compliance Coronary Drug Project  3885 post-heart attack men were given clofibrate or placebo 708 clofibrate and 1813 placebo patients were at least 80% compliant Table 2 Coronary drug project 5-year mortality Clofibrate Placebo n % Deaths n % Deaths Total (as reported) By compliance <80% >80% Compliance itself is considered an outcome: thus to base the interpretation of the ‘drug outcome’ on the ‘compliance outcome’ is confounding

Dealing with noncompliance  Larger sample sizes are required to compensate for the dilution effect of noncompliance 10% noncompliance requires 23% increase in sample size 20% noncompliance requires a 56% increase in sample size

Statistical Issue 2: Surrogate outcome measures  Outcome measures of primary question must be: Clinically relevant Sensitive to intervention Ascertainable in all patients Resistant to bias  Result: Large, time-consuming, costly studies  Alternative approach: surrogate outcome measures

Surrogate outcome measure: Assumption  If the intervention will modify surrogate outcome, it will modify the primary clinical outcome

Surrogate outcome measure: Requirements 1. Surrogate outcome must be predictive of clinical outcome 2. Surrogate outcome must fully capture the total effect of the intervention on the clinical outcome “Necessary and sufficient”

Surrogate outcome measures: Difficult to obtain and validate  Intervention may modify the surrogate and have no or only partial effect on the clinical outcome  Intervention may modify the clinical outcome without affecting the surrogate (Note: NOT surprisingly, track record for use of surrogate outcome measures is very bad)

Surrogate outcome measures: Example: Cardiac Arrhythmia Suppresion Trial (CAST)  Three drugs tested for suppression of cardiac arrhythmias All three drugs had been shown to suppress premature cardiac ventricular contractions (surrogate)  Two drugs terminated early (10-15% into study) because both drugs dramatically increased cause- specific sudden death and total mortality Table 3 Cardiac Arrhythmia Suppression Trial Early termination in two drug arms Drugs Placebo Sudden death339 Total mortality Clearly the interventions (drugs) had differential effects on the surrogate measure (premature cardiac ventricular contractions) and the clinical outcome (mortality)

Statistical Issue 3: Subgroup analyses  Clinical trials usually try to include as many (diverse) patients as possible for multiple reasons: Large sample size Reasonable recruitment time Assess internal consistency of results  Seemingly logical use of the large data set is to do many post hoc analyses on subgroups

Subgroup analysis: Mathematical problems  Introduction of subgroups increases probability of false positives 5 subgroups yields greater than 20% chance of at least one (p=0.05) statistically significant difference BY CHANCE

Subgroup analysis: MERIT trial  Beta-blocker (metoprolol) treatment for patients with congestive heart failure Showed a 34% reduction in mortality overall

Subgroup analysis: MERIT trial Consistency of mortality results across lots of subgroups found with subgroup analysis:

Subgroup analysis: MERIT trial In the USA, total mortality is not reduced, yet total mortality plus any hospitalization is…?

Subgroup analysis: MERIT trial  Two other similar heart failure trials evaluating other beta-blockers showed no regional difference;  THUS, it is likely that the MERIT finding is due to chance alone.

Subgroup analysis: PRAISE-I and PRAISE-II trials  PRAISE-I performed to evaluate amlodipine for the treatment of congestive heart failure Subgroups:  Ischemia  Nonischemia  Analysis of subgroups separately showed a significant (p<0.001) effect of amlodipine on heart failure in nonischemic patients, but no effect on ischemic patients  Researchers decided to perform PRAISE- II trial on nonischemic patients only

Subgroup analysis: PRAISE-I and PRAISE-II trials  PRAISE-II showed remarkably similar mortality results in the drug and placebo groups  PRAISE-II directly opposed the exciting results of PRAISE-I’s subgroup analysis

Statistical Issue 4: Missing data  Missing data is often simply “dropped”  This violates two rules: 1. Intention-to-treat rule  all patients must be accounted for in primary outcome analysis 2. Common sense rule  if patient is too sick to complete trial, this may be informative!

Missing data  In “time to event” trials (like mortality), data can be missing because the study ends before the event happens Patients are then “censored” (dropped)  This can introduce serious mathematical bias  (Mortality studies in USA have no excuse  death indices allow follow-up without help from patient)

Statistical Issue 5: Noninferiority trials  “New intervention is not worse than the standard” New intervention may be:  Easier to administer  Better tolerated  Less toxic  Less expensive  Any given study may be a superiority and/or noninferiority trial, depending on results

Noninferiority trials

 Three challenges must be met: 1. Noninferiority trial must be of highest quality to detect clinically meaningful differences 2. Noninferiority trial must have a strong, effective control intervention (state-of-the-art care) 3. Margin of indifference is arbitrary, depending on medical importance of treatment and risk-to-benefit tradeoffs

Noninferiority trials: OPTIMAAL Trial  Losartan (angiotensin II receptor blocker) vs captopril (ACE inhibitor) in heart failure patient population Losartan has fewer (and less severe) side effects than captopril  OPTIMAAL Designed to detect 20% reduction in relative risk, with 95% power Margin of indifference set at 1.1  Thus 95% confidence interval needed to exclude risk of 1.1 to declare losartan “noninferior” to captopril

Noninferiority trials: OPTIMAAL Trial  Mortality results for OPTIMAAL Relative risk of with 95% confidence interval of 1.28  NEITHER superiority nor noninferiority were achieved Researchers computed that captopril had (historical data) a relative risk of vs. placebo, and thus calculated that losartan must therefore have a relative risk of vs. placebo…  The statistically appropriate conclusion at this point is: NO ACCEPTABLE CONCLUSIONS POSSIBLE FROM THIS DATA

CONCLUSIONS  Statistics can not make up for bad design  Statistics can not make up for poor execution of design  Statistics is very limited in being able to compensate for Ineligible patients being enrolled Noncompliance Unreliable outcome measures Missing data Underpowered trials