Use of Piecewise Weighted Log-Rank Test for Trials with Delayed Effect

Slides:



Advertisements
Similar presentations
Industry Issues: Dataset Preparation for Time to Event Analysis Davis Gates Schering Plough Research Institute.
Advertisements

Surviving Survival Analysis
Shibing Deng Pfizer, Inc. Efficient Outlier Identification in Lung Cancer Study.
LSU-HSC School of Public Health Biostatistics 1 Statistical Core Didactic Introduction to Biostatistics Donald E. Mercante, PhD.
Systematic Review of Literature Part XIX Analyzing and Presenting Results.
Departments of Medicine and Biostatistics
天 津 医 科 大 学天 津 医 科 大 学 Clinical trail. 天 津 医 科 大 学天 津 医 科 大 学 1.Historical Background 1537: Treatment of battle wounds: 1741: Treatment of Scurvy 1948:
Clinical Trial Design Considerations for Therapeutic Cancer Vaccines Richard Simon, D.Sc. Chief, Biometric Research Branch, NCI
Some comments on the 3 papers Robert T. O’Neill Ph.D.
ODAC May 3, Subgroup Analyses in Clinical Trials Stephen L George, PhD Department of Biostatistics and Bioinformatics Duke University Medical Center.
Common Problems in Writing Statistical Plan of Clinical Trial Protocol Liying XU CCTER CUHK.
Adaptive Designs for Clinical Trials
Sample Size Determination Ziad Taib March 7, 2014.
Survival analysis Brian Healy, PhD. Previous classes Regression Regression –Linear regression –Multiple regression –Logistic regression.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 10: Survival Curves Marshall University Genomics Core.
1 Efficacy Results NDA (MTP-PE) Laura Lu Statistical Reviewer Office of Biostatistics FDA/CDER.
CI - 1 Cure Rate Models and Adjuvant Trial Design for ECOG Melanoma Studies in the Past, Present, and Future Joseph Ibrahim, PhD Harvard School of Public.
Background to Adaptive Design Nigel Stallard Professor of Medical Statistics Director of Health Sciences Research Institute Warwick Medical School
Challenges of Non-Inferiority Trial Designs R. Sridhara, Ph.D.
Consumer behavior studies1 CONSUMER BEHAVIOR STUDIES STATISTICAL ISSUES Ralph B. D’Agostino, Sr. Boston University Harvard Clinical Research Institute.
Successful Concepts Study Rationale Literature Review Study Design Rationale for Intervention Eligibility Criteria Endpoint Measurement Tools.
1 An Interim Monitoring Approach for a Small Sample Size Incidence Density Problem By: Shane Rosanbalm Co-author: Dennis Wallace.
1 Statistics in Drug Development Mark Rothmann, Ph. D.* Division of Biometrics I Food and Drug Administration * The views expressed here are those of the.
1 THE ROLE OF COVARIATES IN CLINICAL TRIALS ANALYSES Ralph B. D’Agostino, Sr., PhD Boston University FDA ODAC March 13, 2006.
What is a non-inferiority trial, and what particular challenges do such trials present? Andrew Nunn MRC Clinical Trials Unit 20th February 2012.
1 Updates on Regulatory Requirements for Missing Data Ferran Torres, MD, PhD Hospital Clinic Barcelona Universitat Autònoma de Barcelona.
Biostatistics Case Studies 2006 Peter D. Christenson Biostatistician Session 4: An Alternative to Last-Observation-Carried-Forward:
Some Design Issues in Microbicide Trials August 20, 2003 Thomas R. Fleming, Ph.D. Professor and Chair of Biostatistics University of Washington FDA Antiviral.
1 Study Design Issues and Considerations in HUS Trials Yan Wang, Ph.D. Statistical Reviewer Division of Biometrics IV OB/OTS/CDER/FDA April 12, 2007.
Early Molecular and Cytogenic Response Is Predictive for Long-Term Progression-Free and Overall Survival in Chronic Myeloid Leukemia (CML) Hanfstein B.
1 BLA Sipuleucel-T (APC-8015) FDA Statistical Review and Findings Bo-Guang Zhen, PhD Statistical Reviewer, OBE, CBER March 29, 2007 Cellular, Tissue.
Effect of Rosiglitazone on the Risk of Myocardial Infarction And Death from Cardiovascular Causes Alternative Interpretations of the Evidence George A.
1 Pulminiq™ Cyclosporine Inhalation Solution Pulmonary Drug Advisory Committee Meeting June 6, 2005 Statistical Evaluation Statistical Evaluation Jyoti.
HERA TRIAL: 2 Years versus 1 Year of Trastuzumab After Adjuvant Chemotherapy in Women with HER2-Positive Early Breast Cancer at 8 Years of Median Follow-Up.
Introduction to Biostatistics, Harvard Extension School, Fall, 2005 © Scott Evans, Ph.D.1 Sample Size and Power Considerations.
1 Chapter 6 SAMPLE SIZE ISSUES Ref: Lachin, Controlled Clinical Trials 2:93-113, 1981.
Response, PFS or OS – what is the best endpoint in advanced colorectal cancer? Marc Buyse IDDI, Louvain-la-Neuve & Hasselt University
Single Season Study Design. 2 Points for consideration Don’t forget; why, what and how. A well designed study will:  highlight gaps in current knowledge.
Methods and Statistical analysis. A brief presentation. Markos Kashiouris, M.D.
 Adaptive Enrichment Designs for Confirmatory Clinical Trials Specifying the Intended Use Population and Estimating the Treatment Effect Richard Simon,
Biostatistics Case Studies 2007
April 18 Intro to survival analysis Le 11.1 – 11.2
The Importance of Adequately Powered Studies
How many study subjects are required ? (Estimation of Sample size) By Dr.Shaik Shaffi Ahamed Associate Professor Dept. of Family & Community Medicine.
Adaptive non-inferiority margins under observable non-constancy
Impact of censoring on the statistical methods for handling non-proportional hazards in Immuno-Oncology studies Yifan Huang, Luping Zhao, Jiabu Ye, and.
Statistical Core Didactic
Donald E. Cutlip, MD Beth Israel Deaconess Medical Center
Deputy Director, Division of Biostatistics No Conflict of Interest
Randomized Trials: A Brief Overview
Heterogeneity and sources of bias
Being an effective consumer of preclinical research
S1316 analysis details Garnet Anderson Katie Arnold
Regulatory Industry Statistics Workshop 2018
Aiying Chen, Scott Patterson, Fabrice Bailleux and Ehab Bassily
Common Problems in Writing Statistical Plan of Clinical Trial Protocol
Risk ratios 12/6/ : Risk Ratios 12/6/2018 Risk ratios StatPrimer.
Evaluation challenges - Reporting
Dr. Matthew Keough August 8th, 2018 Summer School
Chapter 12 Power Analysis.
Advanced Statistical Methods for Translational Research
Biostatistics Primer: What a Clinician Ought to Know: Hazard Ratios
Single Pivotal vs Two Replicated Studies. Zijiang Yang
Optimal Basket Designs for Efficacy Screening with Cherry-Picking
Biomarkers as Endpoints
Design and Analysis of Survival Trials with Treatment Crossover, delayed treatment effect and treatment dilution Presenter: Xiaodong Luo– R&D-SANOFI US.
Björn Bornkamp, Georgina Bermann
Gregory Levin, FDA/CDER/OTS/OB/DBIII
How Should We Select and Define Trial Estimands
Aparna Raychaudhuri, Ph. D
Presentation transcript:

Use of Piecewise Weighted Log-Rank Test for Trials with Delayed Effect Boguang Zhen, Ph.D. Therapeutics Evaluation Branch Division of Biostatistics CBER/FDA Disclaimer: My contributions are an informal communication and represent my own best judgment. These comments do not bind or obligate FDA

Outline Background Concerns Evaluations/Discussions Concluding Remark

Delayed Effect (DE) In chemotherapy, study drugs act directly on cancer In therapeutic cancer vaccine therapy considerable time might be needed for cancer vaccines to induce immunity similar to many other immunotherapies, The impact of the so delayed effect (or lag time) should be considered

Sipuleucel-T case: 6 months~ DE NEJM: 363 (5): 418, 2010

Issues on Delayed Effect For time-to-event endpoint Regular log-rank test commonly used Under non-proportional hazards setting Ignore it or increase sample size Lose power or increase cost/duration Piecewise weighted log-rank test (PWLRT) Allocate different weight to a subset of events Reduce sample size Increase power time-to-event endpoint – motivated by therapeutic vaccine cancer trials with OS endpoint, but the concern could be in any trial that deals with time-to-endpoint with delayed effect, , e.g. cardio-vascular trials. the regular log-rank test with equal weight across all time points has been commonly used to compare the difference of the endpoint between the experimental group and the control group

PWLRT not widely used Challenging and Opportunity! Why not free lunch? Lack of awareness Issues Allocate different weight to a subset of events Could specify zero weight for some events - implies that some subjects would not contribute ‘meaningful’ information to the test result May not be straightforward to interpret the analysis results based on the weighted methods … Challenging and Opportunity!

Concerns Type I probability Intention-to-treat principle Study results interpretation Changing point specification Crossover effect Others? -- evaluate and address the above 5 concerns

Evaluations/Discussions Use the PWLRT method in Xu’s paper (Stat in Med 2017; 36(4):592-605) for evaluation for simplicity t* denote the hazard ratio changing point For simplicity, I only use the PWLRT method as proposed by Xu et al. for the evaluation, but the results can also be generalized to other weighted methods that deal with delayed effect. Inject the product to tumor, melanoma, and break tumor cells so there is an effect. After breaking the tumor cells, they release a lot of antigens that will take some time to stimulate the immune system response, and then you will have a full effect. Extreme case that hazard ratio=1 or w1 = 0. IF we don’t have issue for w1=0, not effect at all before t*, it should not have the problem for the case that w1>0.

Type I probability Inflated? Evaluation Some subsets may behave differently from others By allocating zero weight for other subsets, it may inflate Type I probability Evaluation No DE under the null of no difference between the two groups – use of different subset should not inflate Type I probability Simulation

Simulation study shows no Type I probability inflation   Sample Size Regular log-rank PWLRT t*= 3 months t*= 6 months t*= 9 months 300 600 900 0.0502 0.0503 0.0514 0.0510 0.0503 0.0520 0.0513 0.0511 0.0495 0.0496 0.0501 -- 100,000 replications testing at the alpha level of 0.05

Intention-to-treat (ITT) principle Violated? less weight for some events could mean they contribute less information to the test result 0 weight for some events may imply excluding some subjects so violating ITT principle Evaluation What is behind the ITT principle? Any potential bias?

Behind ITT principle ITT principle? Behind ITT principle? Conduct analysis according to the assigned treatment instead of the actual treatment Include all randomized subjects Behind ITT principle? Preserve the initial randomization in order to prevent bias and provide a secure foundation for statistical tests Bias with PWLRT? Should conduct analysis according to the assigned treatment instead of the actual treatment – same as for other methods Include all randomized subjects with different weights It’s not difficult to show the initial randomization is preserved even if zero weight is used (in a manuscript to be submitted - not show here in the interest of time) we have showed that the proposed PWLRT can still preserve the initial randomization with w1=0 as long as subjects are analyzed according to the treatment assignments they received in a manuscript to be submitted.

Study results interpretation -- a hypothetical example Consider a new product: no survival benefit until 7 months after treatment Regular log-rank test vs. PWLRT? Regular log-rank test (p = 0.04) KM plot for median survival: 25-month (new), 20-month (control) Show 5-month survival benefit PWLRT (p = 0.004) w*1 =0 for subjects died before 7-month, w*2 =1 for others KM plot for survival - same as above KM plot for subset of subjects who live > 7-month Median survival: 35-month (new) vs. 27-month (control) Show 8-month survival benefit KM shows the observed median survival 25 vs 20 month….

K-M plot with all subjects   K-M plot with subjects who live after 7 months

Study results interpretation (2) -- a hypothetical example Both regular log-rank test and PWLRT can be used Regular log-rank test: less power and may fail to detect a statistically significant Interpretations with PWLRT: 5-month survival benefit unconditional on the time point when the new product starts to work – same as for regular log-rank test No survival benefit for the new product before 7-month 8-month survival benefit instead of 5 35-month median survival instead of 25 for the treated group -- if live to the time when the new product kicks in its effect Which one is more informative or we need both? Study results based on PWLRT should be interpretable.

Changing point specification Challenging in estimating the changing point (t*) in practice If miss the changing point Under powered study? Worse than using the conventional method?

Estimate the changing point in practice Obtain a close estimate for the changing point Mechanism of action for the product Animal/other pre-clinical study data Literature Early phase trial data -- Join adventure from all disciplines

Missing the changing point Xu et al.: Designing therapeutic cancer vaccine trials with delayed treatment effect (Stat in Med 2017; 36(4):592-605) Assuming fixed changing point Address the robustness to the mis-specification of t* under various scenarios (Evaluation #4) Still better than the regular log-rank

Xu et al.: Evaluation #4 Table 1. The impact of mis-specifying t* on power calculation -- assume t*=6-month with 80% power for sample size (see paper for details)

Missing the changing point (2) “Designing cancer immunotherapy trials with random treatment time-log effect” -- Xu et al. accepted by SIM Assuming a subject-specific lag duration: t* ~ Dist(T1, T2) Incorporating individual heterogeneity in lag time to response to treatment Further reducing risk of mis-specification

Crossover effect Survival curves for a hypothetical crossover effect trial

Crossover effect Problematic for using PWLRT in trials with crossover effect Same problem by using other conventional methods such as the regular log-rank test Conventionally, a clinical trial should be designed with the assumption that there will be no crossover effect Review issues and sensitivity analyses

Summary Can the concerns be addressed? Type I probability Intention-to-treat principle Study results interpretation Changing point specification Crossover effect

Concluding Remark With careful consideration of all kind of potential biases and appropriate planning, the results based on PWLRT should be valid and interpretable

Thank you! My email: boguang.zhen@fda.hhs.gov