Using Clinical Trial Data to Construct Policies for Guiding Clinical Decision Making S. Murphy & J. Pineau American Control Conference Special Session.

Slides:



Advertisements
Similar presentations
Assessing the Effects of Time-varying Predictors or Treatments: A Conceptual Discussion Daniel Almirall VA Medical Center, HSRD Duke Medical Center, Dept.
Advertisements

Piloting and Sizing Sequential Multiple Assignment Randomized Trials in Dynamic Treatment Regime Development 2012 Atlantic Causal Inference Conference.
Extraction and Transfer of Knowledge in Reinforcement Learning A.LAZARIC Inria “30 minutes de Science” Seminars SequeL Inria Lille – Nord Europe December.
CmpE 104 SOFTWARE STATISTICAL TOOLS & METHODS MEASURING & ESTIMATING SOFTWARE SIZE AND RESOURCE & SCHEDULE ESTIMATING.
Treatment Effect Heterogeneity & Dynamic Treatment Regime Development S.A. Murphy.
11 Confidence Intervals, Q-Learning and Dynamic Treatment Regimes S.A. Murphy Time for Causality – Bristol April, 2012 TexPoint fonts used in EMF. Read.
Experimental Trials. Oslin ExTENd Late Trigger for Nonresponse 8 wks Response TDM + Naltrexone CBI Random assignment: CBI +Naltrexone Nonresponse Early.
1 Meeting the Future in Managing Chronic Disorders: Individually Tailored Strategies S.A. Murphy Univ. of Michigan Oberlin College, Feb. 20, 2006.
Inference for Clinical Decision Making Policies D. Lizotte, L. Gunter, S. Murphy INFORMS October 2008.
Experimenting to Improve Clinical Practice S.A. Murphy AAAS, 02/15/13 TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.:
1 Dynamic Treatment Regimes Advances and Open Problems S.A. Murphy ICSPRAR-2008.
Methodology for Adaptive Treatment Strategies for Chronic Disorders: Focus on Pain S.A. Murphy NIH Pain Consortium 5 th Annual Symposium on Advances in.
Experiments and Dynamic Treatment Regimes S.A. Murphy Univ. of Michigan JSM: August, 2005.
SMART Designs for Constructing Adaptive Treatment Strategies S.A. Murphy 15th Annual Duke Nicotine Research Conference September, 2009.
Dynamic Treatment Regimes, STAR*D & Voting D. Lizotte, E. Laber & S. Murphy LSU ---- Geaux Tigers! April 2009.
Substance Abuse, Multi-Stage Decisions, Generalization Error How are they connected?! S.A. Murphy Univ. of Michigan CMU, Nov., 2004.
An Experimental Paradigm for Developing Dynamic Treatment Regimes S.A. Murphy Univ. of Michigan March, 2004.
Constructing Dynamic Treatment Regimes & STAR*D S.A. Murphy ICSA June 2008.
1 Dynamic Treatment Regimens S.A. Murphy PolMeth XXV July 10, 2008.
SMART Designs for Developing Adaptive Treatment Strategies S.A. Murphy K. Lynch, J. McKay, D. Oslin & T.Ten Have CPDD June, 2005.
Dynamic Treatment Regimes: Challenges in Data Analysis S.A. Murphy Survey Research Center January, 2009.
Q-Learning and Dynamic Treatment Regimes S.A. Murphy Univ. of Michigan IMS/Bernoulli: July, 2004.
1 A Prediction Interval for the Misclassification Rate E.B. Laber & S.A. Murphy.
Sizing a Trial for the Development of Adaptive Treatment Strategies Alena I. Oetting The Society for Clinical Trials, 29th Annual Meeting St. Louis, MO.
Experiments and Dynamic Treatment Regimes S.A. Murphy Univ. of Michigan Florida: January, 2006.
SMART Experimental Designs for Developing Adaptive Treatment Strategies S.A. Murphy NIDA DESPR February, 2007.
Hypothesis Testing and Dynamic Treatment Regimes S.A. Murphy Schering-Plough Workshop May 2007 TexPoint fonts used in EMF. Read the TexPoint manual before.
An Experimental Paradigm for Developing Adaptive Treatment Strategies S.A. Murphy Univ. of Michigan UNC: November, 2003.
1 A Confidence Interval for the Misclassification Rate S.A. Murphy & E.B. Laber.
Experiments and Dynamic Treatment Regimes S.A. Murphy Univ. of Michigan PSU, October, 2005 In Honor of Clifford C. Clogg.
Planning Survival Analysis Studies of Dynamic Treatment Regimes Z. Li & S.A. Murphy UNC October, 2009.
Statistical Issues in Developing Adaptive Treatment Strategies for Chronic Disorders S.A. Murphy Univ. of Michigan CDC/ATSDR: March, 2005.
SMART Experimental Designs for Developing Adaptive Treatment Strategies S.A. Murphy RWJ Clinical Scholars Program, UMich April, 2007.
Hypothesis Testing and Dynamic Treatment Regimes S.A. Murphy, L. Gunter & B. Chakraborty ENAR March 2007.
1 Meeting the Future in Managing Chronic Disorders: Individually Tailored Strategies S.A. Murphy Herbert E. Robbins Collegiate Professorship in Statistics.
1 SMART Designs for Developing Adaptive Treatment Strategies S.A. Murphy K. Lynch, J. McKay, D. Oslin & T.Ten Have UMichSpline February, 2006.
Dynamic Treatment Regimes, STAR*D & Voting D. Lizotte, E. Laber & S. Murphy ENAR March 2009.
1 Hybrid Agent-Based Modeling: Architectures,Analyses and Applications (Stage One) Li, Hailin.
A Finite Sample Upper Bound on the Generalization Error for Q-Learning S.A. Murphy Univ. of Michigan CALD: February, 2005.
Methodology for Adaptive Treatment Strategies R21 DA S.A. Murphy For MCATS Oct. 8, 2009.
Dynamic Treatment Regimes, STAR*D & Voting D. Lizotte, E. Laber & S. Murphy Psychiatric Biostatistics Symposium May 2009.
Experiments and Dynamic Treatment Regimes S.A. Murphy Univ. of Michigan Yale: November, 2005.
Methods for Estimating the Decision Rules in Dynamic Treatment Regimes S.A. Murphy Univ. of Michigan IBC/ASC: July, 2004.
1 Possible Roles for Reinforcement Learning in Clinical Research S.A. Murphy November 14, 2007.
Experiments and Dynamic Treatment Regimes S.A. Murphy Univ. of Michigan April, 2006.
SMART Designs for Developing Dynamic Treatment Regimes S.A. Murphy MD Anderson December 2006.
Exploratory Analyses Aimed at Generating Proposals for Individualizing and Adapting Treatment S.A. Murphy BPRU, Hopkins September 22, 2009.
SMART Experimental Designs for Developing Adaptive Treatment Strategies S.A. Murphy ISCTM, 2007.
1 A Prediction Interval for the Misclassification Rate E.B. Laber & S.A. Murphy.
Experiments and Adaptive Treatment Strategies S.A. Murphy Univ. of Michigan Chicago: May, 2005.
Susan Murphy, PI University of Michigan Acknowledgements: MCAT network and NIH The Goal To facilitate methodological collaborations necessary for producing.
SMART Designs for Developing Dynamic Treatment Regimes S.A. Murphy Symposium on Causal Inference Johns Hopkins, January, 2006.
Experiments and Dynamic Treatment Regimes S.A. Murphy At NIAID, BRB December, 2007.
1 Machine/Reinforcement Learning in Clinical Research S.A. Murphy May 19, 2008.
Adaptive Treatment Strategies S.A. Murphy CCNIA Proposal Meeting 2008.
Adaptive Treatment Strategies S.A. Murphy Workshop on Adaptive Treatment Strategies Convergence, 2008.
Practical Application of Adaptive Treatment Strategies in Trial Design and Analysis S.A. Murphy Center for Clinical Trials Network Classroom Series April.
Experiments and Dynamic Treatment Regimes S.A. Murphy Univ. of Michigan January, 2006.
Revisiting an Old Topic: Probability of Replication D. Lizotte, E. Laber & S. Murphy Johns Hopkins Biostatistics September 23, 2009.
1 Variable Selection for Tailoring Treatment S.A. Murphy, L. Gunter & J. Zhu May 29, 2008.
Hypothesis Testing and Adaptive Treatment Strategies S.A. Murphy SCT May 2007.
Adaptive Treatment Strategies: Challenges in Data Analysis S.A. Murphy NY State Psychiatric Institute February, 2009.
Sequential, Multiple Assignment, Randomized Trials and Treatment Policies S.A. Murphy UAlberta, 09/28/12 TexPoint fonts used in EMF. Read the TexPoint.
Overview of Adaptive Treatment Regimes Sachiko Miyahara Dr. Abdus Wahed.
SMART Case Studies Module 3—Day 1 Getting SMART About Developing Individualized Adaptive Health Interventions Methods Work, Chicago, Illinois, June
Sequential, Multiple Assignment, Randomized Trials and Treatment Policies S.A. Murphy MUCMD, 08/10/12 TexPoint fonts used in EMF. Read the TexPoint manual.
STAR*D Objectives Compare relative efficacy of different treatment options –Goal is REMISSION, not just “response” –Less than half of patients with depression.
1 SMART Designs for Developing Adaptive Treatment Strategies S.A. Murphy K. Lynch, J. McKay, D. Oslin & T.Ten Have NDRI April, 2006.
SMART Trials for Developing Adaptive Treatment Strategies S.A. Murphy Workshop on Adaptive Treatment Designs NCDEU, 2006.
System Control based Renewable Energy Resources in Smart Grid Consumer
Presentation transcript:

Using Clinical Trial Data to Construct Policies for Guiding Clinical Decision Making S. Murphy & J. Pineau American Control Conference Special Session June, 2009

2 Outline –Sequential Clinical Decision Making –Clinical Trials –Challenges Incomplete, primitive, mechanistic models Measures of Confidence –Illustration Long Term Goal: Improving Clinical Decision Making Using Data

3

4 Critical Decisions Which treatments should be offered first? How long should we wait for these treatments to work? How long should we wait before offering a transition to a maintenance stage? Which treatments should be offered next? All of these questions relate to the formulation of a policy.

5 Examples of Clinical Trials Sequenced RBT: Goal is to improve neonatal outcomes STAR*D: Goal is to achieve depression remission.

6 Jones’ Study for Drug-Addicted Pregnant Women rRBT 2 wks Response rRBT tRBT Random assignment: rRBT Nonresponse tRBT Random assignment: aRBT 2 wks Response Random assignment: eRBT tRBT rRBT Nonresponse

7 STAR*D

8 Challenges Incomplete Mechanistic Models –non-causal “associations” in data occur due to the unknown causes of the observations Small, Expensive, Data Sets with High Noise to Signal Ratio –Measures of confidence are essential

9 Conceptual Structure in the Behavioral Sciences (clinical trial data)

10 Unknown, Unobserved Causes (Incomplete Mechanistic Models)

11 The problem: Even when treatments are randomized, non-causal associations occur in the data. Solutions: – Recognize that parts of the transition probabilities (“system dynamics”) can not be informed by domain expertise as these parts reflect non-causal associations –Or use methods for constructing policies that “average” over the non-causal associations between action and cost or reward. Unknown, Unobserved Causes (Incomplete Mechanistic Models)

12 Measures of Confidence We would like measures of confidence for the following: –To assess if there is sufficient evidence that a particular observation (e.g. output of a biological test) should be part of the policy. –To assess if there is sufficient evidence that a subset of the actions lead to lower cost than the remaining actions. (reward=-cost)

13 Measures of Confidence Traditional methods for constructing measures of confidence require differentiability (if frequentist properties are desired). Optimal policies are constructed via non- differentiable operations (e.g. minimization/maximization).

14 STAR*D

15 STAR*D Stage 1 Observation: QIDS: low score is desirable Preference for type of Stage 1 treatment: Switch or Augment Stage 1Treatment Action: If Stage 1 preference is Switch then randomize switch to either Ser, Bup or Ven; if Stage 1 preference is Augment then randomize to augment with Bup or Bus. Stage 2 Observation: QIDS: low score is desirable Preference for type of Stage 2 treatment: Switch or Augment Stage 2 Treatment Action: If Stage 2 preference is Switch then randomize switch to either Mirt or Ntp: if Stage 2 preference is Augment then randomize to augment with Li or Thy Patients exit to follow-up if remission is achieved (QIDS ≤ 5).

16 Construct the policy to minimize cost (or maximize reward) Cost: minimum of time to remission and 30 weeks. Construct policy so as to minimize average cost

17 Algorithm Fitted Q-iteration with linear function approximation. One estimates the “state-action cost” function at stages 1,2 via a linear model. Use voting across bootstrap samples (approximate double bootstrap) to assess confidence that a particular action is best. (cost=-value=-benefit-to-go)

18

19 Conclusion for Stage 1(level 2) If QIDS is >13 then both Ven and Bup are best treatment actions If QIDS is <9 then Ser is best treatment action. If QIDS is around then no real winner(s).

20 Discussion If modern control methods are to be used with clinical trial data then these methods must accommodate the existence of unknown, unobserved variables influencing observations at multiple stages, should provide measures of confidence and must be combined with modern missing data methods.

21 This seminar can be found at: seminars/ACC06.09.ppt me with questions or if you would like a copy!

22 The Problem Many patients dropout of the study. Stage 1Stage 2 Remit38336 Move to next stage Dropout Sum