Presentation is loading. Please wait.

Presentation is loading. Please wait.

1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

Similar presentations


Presentation on theme: "1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin."— Presentation transcript:

1 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin Madison chappell@stat.wisc.edu 2010 Joint Statistical Meetings; Vancouver

2 My thanks to The International Indian Statistical Association, Biometrics Section and Professor Debajyoti Sinha, Florida State University for inviting me to speak; An unnamed CMO of an unnamed pharmaceutical company; The audience. 2

3 3 Outline I.Goal of Phase I Trials in Cancer & Other Serious Diseases II.How we need Bayesian Phase I Trials III.Options for Dose Escalation IV.Example, with Design and Simulations V.Conclusions

4 4 I. Goal of Phase I Trials in Cancer & Other Serious Diseases Goal first delineated by Schneiderman (1967) as estimation of dose yielding a fixed percent (often 33%) of dose-limiting toxicities (DLTs) but not lower. This is called the Maximum Tolerable Dose (MTD). Phase I Trials are Usually Outcome-adaptive

5 5 Phases of a Clinical Trial Preclinical: Biochemical and animal studies Phase I, Cancer - estimate toxicity rates using few typically very sick subjects. Phase II - determines a therapy’s potential efficacy usually using a few very sick patients. Phase III – a large randomized controlled, possibly blinded, possibly multicenter, experiment Phase IV - a controlled trial of an approved treatment with long-term followup of safety and efficacy.

6 6 Schematic of Phase I Trial Dose % Toxicity 0 33 100 d1d1 d2d2... mtd

7 7 Allowable DLT rate can be lower than 33% but rarely higher Dose % Toxicity 0 20 100 d1d1 d2d2.. mtd

8 8 How We Need Bayesian Phase I Trials Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin Madison 2010 Joint Statistical Meetings; Vancouver

9 9 II. How we need Bayesian Phase I Trials A.In the Design - How will we set the doses? Guess at a series of levels starting at a low toxicity rate and moving to higher rates. - Isn’t this a prior? - Formal prior elicitation methods can be very helpful. B.In Dose Escalation C.In the Analysis - The data from a single phase I trial usually provide only a little information. - Information is usually available from other sources. - Model-based smoothing (e.g., for monotonicity) often useful. - Computation using conjugate priors is simple. - Multiple analyses can (should) be done. - Storer (1989) recommended that analysis be separated from design.

10 10 III. Options for Dose Escalation A.Algorithmic designs (Shih & Lin, 2006, in Chevret’s text). - These could be defined as designs where (de-)escalation depends only on outcomes in the most recent groups of patients. - However, some “permanent” changes, such as adjustments in cohort sizes, have been included in “algorithms”. - Most intuitive definition is “A (de-)escalation system which a human being can keep in his head.” B.Bayesian and other model-based designs.

11 11 A. Simple Algorithmic “Up-and-Down” Designs –Usually small cohorts of patients (1 – 5); –Dose levels increased and/or decreased until a predetermined sample size is reached; –Designed to estimate the MTD as 33%-ile with cohorts of size 3, 20%-ile with cohorts of size 5, etc.; –Not flexible (can also spend a lot of patients at low- toxicity doses), but can easily be modified to speed up via adaptive cohort sizes; –Other sources of information can be used.

12 12 Design A (“Traditional”): Groups of three patients are treated. Escalation occurs if no toxicity is observed in all three: otherwise, an additional three patients are treated at the same dose level. If only one of six has toxicity, escalation again continues; otherwise, the trial stops. Design B: Single patients are treated. The next patient is treated at the next lower dose level if a toxic response is observed, otherwise at the next higher dose level until sample size is reached. Storer’s (1989) phase I up-and-down classification

13 Design D: Groups of three patients are treated. Escalation if no toxicity is seen and de-escalation if more than one patient has toxicity. If one patient has toxicity, next group of three is treated at same dose level; repeat until sample size is reached. Similar to traditional except can go down. ( Design C: Similar to design D, except that the rule applies to the preceding three patients at any point, instead of using discrete batches of three. ) - Larger cohorts can be used to estimate the MTD as the dose with a 1/4 = 25%, 1/5 = 20% or 1/6 = 17% rates. - Designs can be combined, such as “BD”.

14 14 1.Prior guess is made as to the dose-response (toxicity) curve; 2.First patient is assigned to the prior MTD; 3.After the patient is fully followed, his or her outcome is used to update the prior curve; 4.Next patient is assigned to the new “posterior” MTD; 5.Repeated until sample size is reached. 6.Final posterior MTD is estimate. B. Bayesian Designs - The Continual Reassessment Method (CRM; O’Quigley, et al., 1990)

15 15 C. Operating Characteristics are Paramount 1.“Operating Characteristics”: How a design escalates the dose and changes cohort size, and how it stops. 2.Clinicians want to know how treatment decisions are made – they are fearful of the black box. 3.Operating Characteristics of Bayesian / other model-based designs can be described by example and simulation. 4.One cannot rely on clinicians' priors to produce [Bayesian] designs with acceptable operating characteristics. In practice, the statistician must at least specify prior uncertainty.

16 16 “4.One cannot rely on clinicians' priors to produce [Bayesian] designs with acceptable operating characteristics. In practice, the statistician must at least specify prior uncertainty.” Sometimes this requirement leads to absurdities. Ji, Li, and Bekele (2007) proposed a prior for a Bayesian method “that can be easily understood and implemented by nonstatisticians.” It is simple, and leads to a practical and sensible algorithm.

17 17 “4.One cannot rely on clinicians' priors to produce [Bayesian] designs with acceptable operating characteristics. In practice, the statistician must at least specify prior uncertainty.” Sometimes this requirement leads to absurdities. Ji, Li, and Bekele (2007) proposed a prior for a Bayesian method “that can be easily understood and implemented by nonstatisticians.” It is simple, and leads to a practical and sensible algorithm. But the recommended prior was Beta(.005,.005):

18 18 D. How ethical operating characteristics subvert current Bayesian designs 1.The “Continual Reassessment Method” [CRM; O’Quigley et al., 1990] attempts to assign subjects to the MTD and so efficiently estimate it. 2.The CRM does not directly address the following ethical/logistical issues in Phase I trials: a)The need for a starting dose lower than the prior MTD; b)The need for a maximum increase in dose per cohort; c)A desire to let cohort sizes vary; d)A desire to allow data on sub-dose-limiting toxicities to inform escalation & cohort size choices. 3.The CRM can be modified; Or one can incorporate elements of Escalation with Overdose Control (EWOC), which constrains predicted overdose rates; Or we can combine the two.

19 19 3.The CRM can be modified (cont.); The most common way to modify the CRM is to overlay algorithms (for initial dose, maximum increase, etc.) on top of it. But this also has disadvantages. Cheung (2005) defined coherence as “An escalation for a new patient is said to be coherent only when the previous patient does not show sign of toxicity. Likewise, a de-escalation is coherent only when a toxic outcome has just been seen.” He shows the CRM, EWOC, & all unmodified likelihood-based methods to be coherent. But many of the simple common modifications are not.

20 20 A Practical, Easy, Understandable, and Scientifically Sound Algorithmic Design for a Phase I Trials Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin Madison chappell@stat.wisc.edu 2010 Joint Statistical Meetings; Vancouver

21 21 IV. Example, with Design and Simulations A. Design Considerations 1.Agent is a radiopharmaceutical available in doses of 6.25; 2.Concern centers on toxicity to bone marrow and also kidney, bladder, and liver; 3.FDA requires that starting dose be 12.5 (units unspecified); 4.Investigators consider this to be very low and want rapid initial escalation; 5.Information is available on “sub-dose-limiting” toxicities – e.g., 999 neutrophils/mm 3 is grade 3 neutropenia but 1,001 is grade 2.

22 22 4.Investigators consider a 12.5 starting dose to be very low and want rapid initial escalation; 5.Information is available on “sub-dose-limiting” toxicities – e.g., 999 neutrophils/mm 3 is grade 3 neutropenia but 1,001 is grade 2. How to address these requirements? -Escalate if 0 DLTs in a cohort; -De-escalate if ≥ 2 DLTs in a cohort; -Start with small cohorts (size 2), escalate dose in increments of 12.5, maintain until higher risk suspected; -Permanently expand cohorts to size 5 when excessive risk seen, defined as ≥ 1 DLT and/or 2 sub-DLTs; -Permanently halve dose increment to 6.25 when ≥ 2 DLTs and/or ≥ 3 sub-DLTs seen; -Maintain dose if 1 DLT in a cohort; -Stop when ≥ 20 subjects reached (max 24); -“Highest dose with ≤ 20% DLTs recommended for phase II”.

23 23 B. Designs under investigation Design Type Cohort Size # DLTs # Sub- DLTs Action Sample Size Phase 2 Dose Design 1 Traditional 3+3 3 or 6 0/3 1/3  1/6 >1/6 >1/3 Any Escalate to next cohort Expand cohort to 6 pts at same dose Escalate to next cohort Stop escalation – MTD defined Random One below the stopping dose

24 24 B. Designs under investigation Design Type Cohort Size # DLTs # Sub- DLTs Action Sample Size Phase 2 Dose Design 2 “Traditional” 5+5 5 or 10 0/5 1/5  1/10 >1/10 >1/5 Any Escalate to next cohort Expand cohort to 10 pts at same dose Escalate to next cohort Stop escalation – MTD defined Random Highest dose with  20% DLTs

25 Design Type Cohort Size # DLTs # Sub- DLTs Action Sample Size Phase 2 Dose Design 3 Up-and-down, cohort size adaptive to two sub-DLTs; increment size adaptive 2 then 5 0/2 1/2 2/2 0/5 1/5 >1/5 0, 1/2 2/2 Any  2 > 2  2 > 2 Any Escalate to next cohort of 2 subjects Expand cohort, 5 subjects at same dose; all future cohort sizes = 5 De-escalate; all future cohort sizes = 5 and increments = 6.25 Escalate to next cohort Escalate to next cohort; all future increments = 6.25 Expand cohort, 10 subjects at same dose Expand cohort, 10 subjects at same dose; all future increments = 6.25 De-escalate; all future increments = 6.25 ~20-24 Highest dose with  20% DLTs

26 26 B. (cont.) Other designs considered 1.3+3, up-and-down until ≥ 20 patients reached: Slow, focuses on too-high doses, ignores sub-DLTs. 2.5+5, up-and-down until ≥ 20 patients reached: Very slow, ignores sub-DLTs. 3.Like Design 3 but cohort size expands 2 → 5 with a single DLT instead of two: Too slow, stays at low doses, doesn't identify phase 2 dose well. 4.Like Design 3 but cohort size expands with a single DLT instead of 2 and continues to accrue subjects until ≥ 2 toxicities are observed in a single cohort: Can have very large sample size and long duration.

27 27 C. Simulations 1.Simulations compared: Design 1: Traditional 3+3 escalation only; Design 2: “Traditional 5+5” escalation only; Design 3: Up-and-down, cohort size adaptive, dose increment size adaptive. 2.Two scenarios used, High Toxicity and Low Toxicity. 3.Comparisons made in terms of Accuracy, Toxicity, Sample size, and Duration. 4.10,000 simulations in each case, all figures accurate to # of digits reported.5+5, up-and-down until ≥ 20 patients reached: Very slow, ignores sub-DLTs.

28 28 Simulation: High-Toxicity Scenario

29 29 Simulation Results: High-Toxicity Scenario Precision

30 30 Simulation Results: High-Toxicity Scenario Toxicities “Sub-therapeutic”: at least 12.5 below true phase II dose

31 31 Simulation Results: High-Toxicity Scenario Sample Size / Duration

32 32 Simulation: Low-Toxicity Scenario

33 33 Simulation Results: Low-Toxicity Scenario Precision

34 34 Simulation Results: Low-Toxicity Scenario Toxicities “Sub-therapeutic”: at least 12.5 below true phase II dose

35 35 Simulation Results: Low-Toxicity Scenario Sample Size / Duration

36 36 V. Conclusions: A.A very simple algorithmic phase I design was presented which had good operating characteristics and produced good estimates of the phase II dose. B.The protocol of which this design formed a part was recently approved by the FDA with no comments whatsoever. C.The Chief Medical Officer with authority over the trial was satisfied: “We'd be delighted for you to present the statistics publically. I hope that the title is something along the line of " An incredibly brilliant and clinically perfect clinical trial design".....!

37 37 An Incredibly Brilliant and Clinically Perfect Clinical Trial Design Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin Madison chappell@stat.wisc.edu 2010 Joint Statistical Meetings; Vancouver


Download ppt "1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin."

Similar presentations


Ads by Google