Using Predictive Biomarkers in the Design of Adaptive Phase III Clinical Trials Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute.

Slides:

Advertisements

Similar presentations

Patient Selection Markers in Drug Development Programs

Advertisements

A New Paradigm for the Utilization of Genomic Classifiers for Patient Selection in the Critical Path of Medical Product Development Richard Simon, D.Sc.

New Paradigms for Clinical Drug Development in the Genomic Era Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute

New Designs for Phase III Clinical Trials

It is difficult to have the right single completely defined predictive biomarker identified and analytically validated by the time the pivotal trial of.

Breakout Session 4: Personalized Medicine and Subgroup Selection Christopher Jennison, University of Bath Robert A. Beckman, Daiichi Sankyo Pharmaceutical.

Federal Institute for Drugs and Medical Devices | The Farm is a Federal Institute within the portfolio of the Federal Ministry of Health (Germany) How.

Transforming Correlative Science to Predictive Personalized Medicine Richard Simon, D.Sc. National Cancer Institute

Statistical Issues in Incorporating and Testing Biomarkers in Phase III Clinical Trials FDA/Industry Workshop; September 29, 2006 Daniel Sargent, PhD Sumithra.

Clinical Trial Designs for the Evaluation of Prognostic & Predictive Classifiers Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer.

Use of Archived Tissue in Evaluating the Medical Utility of Prognostic & Predictive Biomarkers Richard Simon, D.Sc. Chief, Biometric Research Branch National.

Targeted (Enrichment) Design. Prospective Co-Development of Drugs and Companion Diagnostics 1. Develop a completely specified genomic classifier of the.

Clinical Trial Design Considerations for Therapeutic Cancer Vaccines Richard Simon, D.Sc. Chief, Biometric Research Branch, NCI

ODAC May 3, Subgroup Analyses in Clinical Trials Stephen L George, PhD Department of Biostatistics and Bioinformatics Duke University Medical Center.

Statistical Issues in the Evaluation of Predictive Biomarkers Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute

Moving from Correlative Science to Predictive Medicine Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute

New designs and paradigms for science- based oncology clinical trials Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute

Use of Prognostic & Predictive Biomarkers in Clinical Trial Design Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute

Predictive Classifiers Based on High Dimensional Data Development & Use in Clinical Trial Design Richard Simon, D.Sc. Chief, Biometric Research Branch.

Richard Simon, D.Sc. Chief, Biometric Research Branch

Moving from Correlative Studies to Predictive Medicine Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute brb.nci.nih.gov.

Statistical Challenges for Predictive Onclogy Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute

Predictive Analysis of Clinical Trials Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute

Guidelines on Statistical Analysis and Reporting of DNA Microarray Studies of Clinical Outcome Richard Simon, D.Sc. Chief, Biometric Research Branch National.

Phase II Design Strategies Sally Hunsberger Ovarian Cancer Clinical Trials Planning Meeting May 29, 2009.

Re-Examination of the Design of Early Clinical Trials for Molecularly Targeted Drugs Richard Simon, D.Sc. National Cancer Institute linus.nci.nih.gov/brb.

Use of Genomics in Clinical Trial Design and How to Critically Evaluate Claims for Prognostic & Predictive Biomarkers Richard Simon, D.Sc. Chief, Biometric.

Thoughts on Biomarker Discovery and Validation Karla Ballman, Ph.D. Division of Biostatistics October 29, 2007.

Predictive Biomarkers and Their Use in Clinical Trial Design Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute

Novel Clinical Trial Designs for Oncology

Predictive Analysis of Clinical Trials Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute

Phase II Trials in Oncology S. Gail Eckhardt, MD Lillian Siu, MD Brian I. Rini, M.D.

Prospective Subset Analysis in Therapeutic Vaccine Studies Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute

Personalized Predictive Medicine and Genomic Clinical Trials Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute

Use of Prognostic & Predictive Biomarkers in Clinical Trial Design Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute

Some Statistical Aspects of Predictive Medicine

Cancer Clinical Trials in the Genomic Era Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute

Validation of Predictive Classifiers Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute

Development and Use of Predictive Biomarkers Dr. Richard Simon.

Use of Prognostic & Predictive Genomic Biomarkers in Clinical Trial Design Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute.

EDRN Approaches to Biomarker Validation DMCC Statisticians Fred Hutchinson Cancer Research Center Margaret Pepe Ziding Feng, Mark Thornquist, Yingye Zheng,

Personalized Predictive Medicine and Genomic Clinical Trials Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute

Experimental Design and Statistical Considerations in Translational Cancer Research (in 15 minutes) Elizabeth Garrett-Mayer, PhD Associate Professor of.

Steps on the Road to Predictive Oncology Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute

Moving from Correlative Studies to Predictive Medicine Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute

Therapeutic Equivalence & Active Control Clinical Trials Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute.

1 Statistics in Drug Development Mark Rothmann, Ph. D.* Division of Biometrics I Food and Drug Administration * The views expressed here are those of the.

Use of Candidate Predictive Biomarkers in the Design of Phase III Clinical Trials Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer.

Survival Analysis, Type I and Type II Error, Sample Size and Positive Predictive Value Larry Rubinstein, PhD Biometric Research Branch, NCI International.

Regulatory Affairs and Adaptive Designs Greg Enas, PhD, RAC Director, Endocrinology/Metabolism US Regulatory Affairs Eli Lilly and Company.

Efficient Designs for Phase II and Phase III Trials Jim Paul CRUK Clinical Trials Unit Glasgow.

The Use of Predictive Biomarkers in Clinical Trial Design Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute

Steps on the Road to Predictive Medicine Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute

Integration of Diagnostic Markers into the Development Process of Targeted Agents Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer.

Adaptive Designs for Using Predictive Biomarkers in Phase III Clinical Trials Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute.

Using Predictive Classifiers in the Design of Phase III Clinical Trials Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute.

Personalized Predictive Medicine and Genomic Clinical Trials Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute

New Approaches to Clinical Trial Design Development of New Drugs & Predictive Biomarkers Richard Simon, D.Sc. Chief, Biometric Research Branch National.

Introduction to Design of Genomic Clinical Trials Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute

Steps on the Road to Predictive Medicine Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute

Advanced Clinical Trial Educational Session Richard Simon, D.Sc. Biometric Research Branch National Cancer Institute

Design & Analysis of Phase III Trials for Predictive Oncology Richard Simon Chief, Biometric Research Branch National Cancer Institute

Moving From Correlative Science to Predictive Medicine Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute

Drug Development at CINJ Evolving challenges. Phase 1 Studies at CINJ Early drug trials– Fits easily in scope for single or limited number of institutions.

Adaptive Enrichment Designs for Confirmatory Clinical Trials Specifying the Intended Use Population and Estimating the Treatment Effect Richard Simon,

Overview of Standard Phase II Design Issues

Critical Reading of Clinical Study Results

Aiying Chen, Scott Patterson, Fabrice Bailleux and Ehab Bassily

Coiffier B et al. Proc ASH 2011;Abstract 265.

Statistics for Clinical Trials in Cancer Research

Presentation transcript:

Using Predictive Biomarkers in the Design of Adaptive Phase III Clinical Trials Richard Simon, D.Sc. Chief, Biometric Research Branch National Cancer Institute

Biometric Research Branch Website Powerpoint presentations and audio files Reprints & Technical Reports BRB-ArrayTools software BRB-ArrayTools Data Archive Sample Size Planning for Targeted Clinical Trials

Good Pivotal Clinical Trials Ask important questions Get reliable answers –Convincing to consumers of the trial results Regulators Practicing physicians Reimbursers

Important Question Does a new treatment administered in a specified manner to a specified population of patients result in improved patient benefit compared to treating the patient with a standard of care treatment

Reliable Answer Randomized control group Unbiased assessment of outcome Test pre-specified hypothesis Control for multiple analyses –Interim analyses –Patient subsets –Endpoints

Phase I and II Trials Develop information needed to plan effective pivotal trials Inference in phase I/II is local –It need only satisfy the study planners

Adaptive Trials Adaptiveness in phase I and II trials can help optimize the dose/schedule, regimen, patient population in order to develop the right pivotal trial Most drugs fail because –They are toxic –They are ineffective –They are not tested in the right dose/schedule/regimen for the right population of patients Rushing to do the wrong pivotal trial is a mistake Adaptiveness in phase III must be carefully structured to not interfere with the reliability and convincingness of the pivotal trial –Data used to frame a hypothesis should be distinct form the data used to frame the hypothesis

Many cancer treatments benefit only a small proportion of the patients to which they are administered Targeting treatment to the right patients can greatly improve the therapeutic ratio of benefit to adverse effects –Treated patients benefit –Treatment more cost-effective for society

“Biomarkers” Surrogate endpoints –A measurement made before and after treatment to determine whether the treatment is working –Surrogate for clinical benefit Predictive classifiers –A measurement made before treatment to select good patient candidates for the treatment

Surrogate Endpoints It is very difficult to properly validate a biomarker as a surrogate for clinical benefit. It requires a series of randomized trials with both the candidate biomarker and clinical outcome measured –Must demonstrate that treatment vs control differences for the candidate surrogate are concordant with the treatment vs control differences for clinical outcome –It is not sufficient to demonstrate that the biomarker responders survive longer than the biomarker non- responders

It is often more difficult and time consuming to properly “validate” an endpoint as a surrogate than to use the clinical endpoint in phase III trials

Biomarkers for use as endpoints in phase I or II studies need not be validated as surrogates for clinical outcome Unvalidated biomarkers can also be used for early “futility analyses” in phase III trials

Validation=Fit for Purpose FDA terminology of “valid biomarker” and “probable valid biomarker” are not applicable to predictive classifiers “Validation” has meaning only as fitness for purpose and the purpose of predictive classifiers are completely different than for surrogate endpoints Criteria for validation of surrogate endpoints should not be applied to biomarkers used for treatment selection

Medicine Needs Predictive Markers not Prognostic Factors Most prognostic factors are not used because they are not therapeutically relevant Most prognostic factor studies are not focused on a clear objective – they use a convenience sample of patients for whom tissue is available –often the patients are too heterogeneous to support therapeutically relevant conclusions

In new drug development –The focus should be on evaluating the new drug in a population defined by a predictive classifier, not on “validating” the classifier In developing a predictive classifier for restricting a widely used treatment –The focus should be on evaluating the clinical utility of the classifier; Is clinical outcome better if the classifier is used than if it is not used?

New Drug Developmental Strategy (I) Develop a diagnostic classifier that identifies the patients likely to benefit from the new drug Develop a reproducible assay for the classifier Use the diagnostic to restrict eligibility to a prospectively planned evaluation of the new drug Demonstrate that the new drug is effective in the prospectively defined set of patients determined by the diagnostic

Using phase II data, develop predictor of response to new drug Develop Predictor of Response to New Drug Patient Predicted Responsive New Drug Control Patient Predicted Non-Responsive Off Study

Applicability of Design I Primarily for settings where the classifier is based on a single gene whose protein product is the target of the drug With substantial biological basis for the classifier, it will often be unacceptable ethically to expose classifier negative patients to the new drug

Evaluating the Efficiency of Strategy (I) Simon R and Maitnourim A. Evaluating the efficiency of targeted designs for randomized clinical trials. Clinical Cancer Research 10: , 2004; Correction 12:3229,2006 Maitnourim A and Simon R. On the efficiency of targeted clinical trials. Statistics in Medicine 24: , 2005.

Two Clinical Trial Designs Compared Un-targeted design –Randomized comparison of T to C without screening for expression of molecular target Targeted design –Assay patients for expression of target –Randomize only patients expressing target

Separating Assay Performance from Treatment Specificity  1 = treatment effect for Target + patients  0 = treatment effect for Target – patients 1-  Proportion of patients target positive Sensitivity = Prob{Assay+ | Target +} Specificity = Prob{Assay- | Target -}

Randomized Ratio

Randomized Ratio sensitivity=specificity=0.9 Express target  0 =0  0 =  1 /

Screened Ratio

Screened Ratio sensitivity=specificity=0.9 Express target  0 =0  0 =  1 /

Trastuzumab Metastatic breast cancer 234 randomized patients per arm 90% power for 13.5% improvement in 1-year survival over 67% baseline at 2-sided.05 level If benefit were limited to the 25% assay + patients, overall improvement in survival would have been 3.375% –4025 patients/arm would have been required If assay – patients benefited half as much, 627 patients per arm would have been required

Gefitinib Two negative untargeted randomized trials first line advanced NSCLC –2130 patients 10% have EGFR mutations If only mutation + patients benefit by 20% increase of 1-year survival, then 12,806 patients/arm are needed For trial targeted to patients with mutations, 138 are needed

Treatment Hazard Ratio for Marker Positive Patients Number of Events for Targeted Design Number of Events for Traditional Design Percent of Patients Marker Positive 20%33%50% Comparison of Targeted to Untargeted Design Disease-Free Survival Endpoint Simon R, Development and Validation of Biomarker Classifiers for Treatment Selection, JSPI

Web Based Software for Comparing Sample Size Requirements

Developmental Strategy (II) Develop Predictor of Response to New Rx Predicted Non- responsive to New Rx Predicted Responsive To New Rx Control New RXControl New RX

Developmental Strategy (II) Do not use the diagnostic to restrict eligibility, but to structure a prospective analysis plan. Compare the new drug to the control overall for all patients ignoring the classifier. –If p overall  0.04 claim effectiveness for the eligible population as a whole Otherwise perform a single subset analysis evaluating the new drug in the classifier + patients –If p subset  0.01 claim effectiveness for the classifier + patients.

This analysis strategy is designed to not penalize sponsors for having developed a classifier It provides sponsors with an incentive to develop genomic classifiers

Key Features of Design (II) The purpose of the RCT is to evaluate treatment T vs C overall and for the pre- defined subset; not to modify or refine the classifier or to re-evaluate the components of the classifier. This design assumes that the classifier is a binary classifier, not a “risk index”

The Roadmap 1.Develop a completely specified genomic classifier of the patients likely to benefit from a new drug 2.Establish reproducibility of measurement of the classifier 3.Use the completely specified classifier to design and analyze a new clinical trial to evaluate effectiveness of the new treatment with a pre-defined analysis plan.

Guiding Principle The data used to develop the classifier must be distinct from the data used to test hypotheses about treatment effect in subsets determined by the classifier –Developmental studies are exploratory And not closely regulated by FDA –Studies on which treatment effectiveness claims are to be based should be definitive studies that test a treatment hypothesis in a patient population completely pre-specified by the classifier

Development of Genomic Classifiers Single gene or protein based on knowledge of therapeutic target Empirically determined based on evaluation of a set of candidate genes –e.g. EGFR assays Empirically determined based on genome- wide correlating gene expression or genotype to patient outcome after treatment

Development of Genomic Classifiers During phase II development or After failed phase III trial using archived specimens. Adaptively during early portion of phase III trial.

Adaptive Signature Design An adaptive design for generating and prospectively testing a gene expression signature for sensitive patients Boris Freidlin and Richard Simon Clinical Cancer Research 11:7872-8, 2005

Adaptive Signature Design End of Trial Analysis Compare E to C for all patients at significance level 0.04 –If overall H 0 is rejected, then claim effectiveness of E for eligible patients –Otherwise

Otherwise: –Using only the first half of patients accrued during the trial, develop a binary classifier that predicts the subset of patients most likely to benefit from the new treatment E compared to control C –Compare E to C for patients accrued in second stage who are predicted responsive to E based on classifier Perform test at significance level 0.01 If H 0 is rejected, claim effectiveness of E for subset defined by classifier

Classifier Development Using data from stage 1 patients, fit all single gene logistic models (j=1,…,M) Select genes with interaction significant t i = treatment indicator for patient i (0,1) x ij = log expression for gene j of patient i

Classification of Stage 2 Patients For i’th stage 2 patient, selected gene j votes to classify patient as preferentially sensitive to T if

Classification of Stage 2 Patients Classify i’th stage 2 patient as differentially sensitive to T relative to C if at least G selected genes vote for differential sensitivity of that patient

Treatment effect restricted to subset. 10% of patients sensitive, 10 sensitivity genes, 10,000 genes, 400 patients. TestPower Overall.05 level test46.7 Overall.04 level test43.1 Sensitive subset.01 level test (performed only when overall.04 level test is negative) 42.2 Overall adaptive signature design85.3

Overall treatment effect, no subset effect. 10,000 genes, 400 patients. TestPower Overall.05 level test74.2 Overall.04 level test70.9 Sensitive subset.01 level test1.0 Overall adaptive signature design70.9

Biomarker Adaptive Threshold Design Wenyu Jiang, Boris Freidlin & Richard Simon (Submitted for publication)

Biomarker Adaptive Threshold Design Randomized pivotal trial comparing new treatment E to control C Quantitative predictive biomarker B Survival or DFS endpoint

Biomarker Adaptive Threshold Design Have identified a univariate biomarker index B thought to be predictive of patients likely to benefit from E relative to C Eligibility not restricted by biomarker No threshold for biomarker determined Biomarker value scaled to range (0,1)

Procedure B S(b)=log likelihood ratio statistic for treatment effect in subset of patients with B  b T=max{S(0)+R, max{S(b)}} Compute null distribution of T by permuting treatment labels If the data value of T is significant at 0.05 level, then reject null hypothesis that E is ineffective Compute point and interval estimates of the threshold b

Estimation of Threshold

Traditional Phase II Trials Estimate the proportion of tumors that shrink by 50% or more when the drug is administered singly or in combination to patients with advanced stage tumors of a specific primary site

Problem With Traditional Approach Phase II single agent responses do not predict well for phase III success of combination regimens –Drugs active as single agents may not contribute to activity of combinations Some drugs that have effectiveness in phase III did not produce many responses in single agent phase II trials –Some molecularly targeted drugs are thought to be cytostatic rather than cytotoxic

Preferred Phase II Evaluation Standard regimen ± new drug Time to progression endpoint

Design Approaches Single arm trial of standard + new drug using historical controls –Often uninterpretable Randomized phase 2 design –Standard +- new drug –Requires larger sample size than traditional oncology phase II Randomized discontinuation design –All patients started on new drug –Randomized pts with stable disease at specified time –Requires large total sample size –Not a phase III design

Seamless Phase II/III Trial Randomized comparison of standard treatment ± new drug Size trial using phase III (e.g. survival) endpoint Perform interim futility analysis using phase II endpoint (e.g.biomarker or PFS) –If treatment vs control results are not significant for phase II endpoint, terminate accrual and do not claim any benefit of new treatment –If results are significant for intermediate endpoint, continue accrual and follow-up and do analysis of phase III endpoint at end of trial Interim analysis does not “consume” any of the significance level for the trial

Developmental Strategies Compared Perform phase III survival trial without phase II trial Perform phase III survival trial using futility analysis on survival, without phase II trial Perform randomized phase II trial and if significant for PFS then perform phase III survival trial Seamless II/III –Two-stage (i.e. suspend accrual during f/u for pfs) –Interim analysis (do not suspend accrual)

No Treatment Effect on Survival or PFS

Treatment Effect on Survival and PFS

No Treatment Effect on Survival or PFS Varying Significance Threshold for Interim Analysis Seamless Interim Analysis Design

Are Bayesian Methods More Adaptive than Frequentist Methods? With frequentist methods the algorithm for adaptivness must be specified in advance –To limit the type I error to 5% With Bayesian methods all prior distributions must be specified in advance –Subjective prior distributions are acceptable for phase I/II trials but not for pivotal trials Who cares about your prior? Bayesian inference usually does not control type I error Bayesian methods can control the type I error only if the algorithm for adaptiveness is specified in advance

Are Bayesian Methods More Adaptive than Frequentist Methods? Frequentist designs can adaptively vary the sample size Frequentist designs can incorporate multi-arm trials Frequentist designs do not usually adaptively modify the randomization weight but could if analysis were based on permutational significance tests Bayesian designs that modify the randomization weight are generally not robust to time trends in prognostic factors

Are Bayesian Methods More Adaptive than Frequentist Methods? Adaptive Bayesian designs are most likely to be useful and acceptable in pre-pivotal trials

Collaborators Biometric Research Branch, NCI –Boris Freidlin –Sally Hunsberger –Wenyu Jiang –Aboubakar Maitournam –Yingdong Zhao FDA –Sue Jane Wang

Dupuy A and Simon R. Critical review of published microarray studies for clinical outcome and guidelines for statistical analysis and reporting, Journal of the National Cancer Institute 99:147-57, Dobbin K and Simon R. Sample size planning for developing classifiers using high dimensional DNA microarray data. Biostatistics 8: , Simon R. Development and validation of therapeutically relevant predictive classifiers using gene expression profiling, Journal of the National Cancer Institute 98: , Simon R. Validation of pharmacogenomic biomarker classifiers for treatment selection. Cancer Biomarkers 2:89-96, Simon R. A checklist for evaluating reports of expression profiling for treatment selection. Clinical Advances in Hematology and Oncology 4: , Simon R, Lam A, Li MC, et al. Analysis of gene expression data using BRB-ArrayTools, Cancer Informatics 2:1-7, 2006 Using Genomic Classifiers In Clinical Trials

. Simon R and Maitnourim A. Evaluating the efficiency of targeted designs for randomized clinical trials. Clinical Cancer Research 10: , 2004; Correction 12:3229, Maitnourim A and Simon R. On the efficiency of targeted clinical trials. Statistics in Medicine 24: , Simon R. When is a genomic classifier ready for prime time? Nature Clinical Practice – Oncology 1:4-5, Simon R. An agenda for Clinical Trials: clinical trials in the genomic era. Clinical Trials 1: , Simon R. Development and validation of therapeutically relevant multi-gene biomarker classifiers. Journal of the National Cancer Institute 97: , Simon R. A roadmap for developing and validating therapeutically relevant genomic classifiers. Journal of Clinical Oncology 23: ,2005. Freidlin B and Simon R. Adaptive signature design. Clinical Cancer Research 11: , Simon R. and Wang SJ. Use of genomic signatures in therapeutics development in oncology and other diseases, The Pharmacogenomics Journal 6:166-73, Using Genomic Classifiers In Clinical Trials