Dynamic Treatment Regimes: Challenges in Data Analysis S.A. Murphy Survey Research Center January, 2009.

2 Outline: What are Dynamic Treatment Regimes?; Myopic Decision Making; Constructing Regimes; Q-Learning; Example using CATIE

3 Dynamic Treatment Regimes operationalize multi-stage decision making. These are individually tailored sequences of interventions, with intervention type and dosage adapted to the individual. They generalize a one-time decision to a sequence of decisions concerning interventions, and they operationalize clinical practice. Each decision corresponds to a stage of intervention.

4 Dynamic Treatment Regime: “Jobs First” Welfare Program. At each stage of intervention: –Use individual characteristics (assets, income, age, health, employment) and characteristics of the environment (domestic violence, incapacitated family member, # children, living arrangements…), –to select actions/interventions such as child care, job search skills training, amount of cash benefit, medical assistance, education, –in order to maximize long-term rewards (maximize employment/independence over the longer term).

6 Why use a Dynamic Treatment Regime? –High heterogeneity in response to any one intervention: what works for one person may not work for another, and what works now for a person may not work later. –Improvement is often marred by relapse: being remitted or having few current symptoms is not the same as being cured. –Co-occurring disorders and adherence problems are common.

7 Outline: What are Dynamic Treatment Regimes?; Myopic Decision Making; Constructing Regimes; Q-Learning; Example using CATIE

8 Myopic Decision Making: In myopic decision making, decision makers use regimes that seek to maximize immediate rewards. Problems: –Ignores longer-term consequences of present actions. –Ignores the range of feasible future actions/interventions. –Ignores the fact that immediate responses to present actions may yield information that pinpoints the best future actions. (A dynamic treatment regime tells us how to use the observations to choose the actions/interventions.)
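
The gap between a myopic and a non-myopic choice can be illustrated with a toy two-stage problem. This is only a sketch: the action labels and reward points below are invented for illustration and do not come from any trial. The myopic rule looks only at the immediate reward of the first action, whereas the non-myopic rule also accounts for the best follow-up action that remains feasible afterwards.

```python
# Toy two-stage problem. All labels and reward points are hypothetical.
# IMMEDIATE[a1]: expected immediate reward of first-stage action a1.
# FUTURE[a1][a2]: expected later reward of second-stage action a2,
# given that a1 was used first (a1 can limit what later actions achieve).
IMMEDIATE = {"A": 6, "B": 5}
FUTURE = {
    "A": {"C": 1, "D": 2},
    "B": {"C": 5, "D": 3},
}


def total_reward(a1):
    """Immediate reward of a1 plus the best achievable later reward."""
    return IMMEDIATE[a1] + max(FUTURE[a1].values())


def myopic_first_action():
    """Choose the first action using the immediate reward only."""
    return max(IMMEDIATE, key=IMMEDIATE.get)


def nonmyopic_first_action():
    """Choose the first action by looking ahead to the second stage."""
    return max(IMMEDIATE, key=total_reward)


if __name__ == "__main__":
    for name, chooser in [("myopic", myopic_first_action),
                          ("non-myopic", nonmyopic_first_action)]:
        a1 = chooser()
        print(name, "chooses", a1, "with total reward", total_reward(a1))
```

In this made-up setting the myopic rule picks the action with the larger immediate reward, yet ends up with the smaller total because that action leaves only weak second-stage options.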

9 Treatment of Schizophrenia Myopic action: Offer patients a treatment that reduces schizophrenia symptoms for as many people as possible. The result: Some patients are not helped and/or experience abnormal movements of the voluntary muscles (TDs). The class of subsequent medications is greatly reduced. The mistake: We should have taken into account the variety of treatments available to those for whom the first treatment is ineffective. The message: Use an initial medication that may not have as large a success rate but that will be less likely to cause TDs.

10 Treatment of Opioid Dependence Myopic action: Choose an intensive multi-component treatment (methadone + counseling + behavioral contingencies) that immediately reduces opioid use for as many people as possible. The result: Behavioral contingencies are burdensome/expensive to implement and many people may not need the contingencies to improve. The mistake: We should allow the patient to exhibit poor adherence prior to implementing the behavioral contingencies. The message: Use an initial treatment that may not have as large an immediate success rate but carefully monitor patient adherence to ascertain if behavioral contingencies are required.

11 Outline: What are Dynamic Treatment Regimes?; Myopic Decision Making; Constructing Regimes; Q-Learning; Example using CATIE

12 Basic Idea for Constructing a Regime: Move Backwards Through Stages. (Pretend you are “All-Knowing”)

13 2 Stages for each individual: $O_j$, observations available at the $j$th stage; $A_j$, action at the $j$th stage.

14 2 Stages: History available at each stage: $H_1 = O_1$, $H_2 = (O_1, A_1, O_2)$. Primary Outcome/Reward: $Y$.

15 A dynamic treatment regime is the sequence of decision rules $(d_1, d_2)$. A simple decision rule is: given weights $\beta$, switch treatment at stage $j$ if $\beta^T S_j > 0$; otherwise maintain on current treatment. $S_j$ is a vector summary of the history, $H_j$.

16 Goal: Use data to construct decision rules that take the information in the history at each stage as input and output a recommended decision; these decision rules should lead to a maximal mean Y. In the future we employ the actions recommended by the decision rules.

17 Example of Decision Rules: Treatment of depression. The goal is to achieve and maintain remission. Provide Citalopram for up to 12 weeks, gradually increasing the dose as required. If either the maximum dose has been provided for two weeks or 12 weeks have elapsed, yet there is no remission, then: if there has been a 50% improvement in symptoms, augment with Mirtazapine; else switch treatment to Bupropion. Else (remission is achieved), maintain on Citalopram and provide web-based disease management.
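
The depression rule above can be written as a small function, which makes its branching explicit. This is a minimal sketch, not a clinical tool: the function name, argument names, and return strings are hypothetical, and reading "a 50% improvement" as "at least 50%" is an assumption.

```python
# Return labels are hypothetical names for the actions on the slide.
CONTINUE = "continue citalopram, increasing dose as required"
MAINTAIN = "maintain citalopram and provide web-based disease management"
AUGMENT = "augment with mirtazapine"
SWITCH = "switch to bupropion"


def next_step(remission, weeks_on_citalopram, weeks_at_max_dose,
              symptom_improvement):
    """Recommend the next action for one patient.

    symptom_improvement is a fraction in [0, 1]; treating "a 50%
    improvement" as ">= 0.5" is an assumption of this sketch.
    """
    if remission:
        return MAINTAIN
    # The first-stage trial ends after two weeks at the maximum dose
    # or after 12 weeks in total, whichever comes first.
    trial_over = weeks_at_max_dose >= 2 or weeks_on_citalopram >= 12
    if not trial_over:
        return CONTINUE
    if symptom_improvement >= 0.5:
        return AUGMENT
    return SWITCH


if __name__ == "__main__":
    print(next_step(False, 12, 1, 0.6))
    print(next_step(False, 9, 2, 0.2))
```

Writing the rule this way shows that it takes a summary of the patient's history as input and returns a recommended action, which is the form of decision rule the regime is built from.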

18 Idealized Data for Constructing the Dynamic Treatment Regime: Data from sequential, multiple assignment, randomized trials in which, at each stage, subjects are randomized among alternative options. That is, $A_j$ is a randomized action with known randomization probability. Binary actions with $P[A_j = 1] = P[A_j = -1] = 0.5$.

19 Outline: What are Dynamic Treatment Regimes?; Myopic Decision Making; Constructing Regimes; Q-Learning; Example using CATIE

20 Regression-based methods for constructing decision rules: –Q-Learning (Watkins, 1989), a popular method from computer science. –A-Learning, or the optimal nested structural mean model (Murphy, 2003; Robins, 2004). The first method is an inefficient version of the second when each stage's covariates include the prior stages' covariates and the actions are centered to have conditional mean zero.

21 Basic Idea for Constructing a Regime: Move Backwards Through Stages. (Pretend you are “All-Knowing”)

22 (k=2) Dynamic Programming
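
The equations for this slide did not survive in the transcript. As a hedged reconstruction, the standard two-stage backward induction can be written as follows, using the history $H_j$, action $A_j$ and outcome $Y$ from the earlier slides; the symbols $Q_j$ and $d_j$ are notation assumed here, not taken from the slide.

```latex
% Stage 2: best action given the full history (moving backwards, start here).
Q_2(h_2, a_2) = E\left[\, Y \mid H_2 = h_2,\ A_2 = a_2 \,\right],
\qquad
d_2(h_2) = \arg\max_{a_2} Q_2(h_2, a_2)

% Stage 1: best action, assuming the optimal stage-2 action will follow.
Q_1(h_1, a_1) = E\left[\, \max_{a_2} Q_2(H_2, a_2) \mid H_1 = h_1,\ A_1 = a_1 \,\right],
\qquad
d_1(h_1) = \arg\max_{a_1} Q_1(h_1, a_1)
```

The stage-1 target already contains the maximum over stage-2 actions, which is what distinguishes this construction from the myopic approach.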

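23 A Simple Version of Q-Learning (binary actions): Approximate $Q_j(H_j, A_j) \approx \alpha_j^T S_j' + \beta_j^T S_j A_j$, for $S_j'$, $S_j$ vector summaries of the history. Stage 2 regression: Use least squares with outcome $Y$ and covariates $(S_2', S_2 A_2)$ to obtain $(\hat\alpha_2, \hat\beta_2)$. Set $\tilde{Y} = \hat\alpha_2^T S_2' + \max_{a_2 \in \{-1, 1\}} \hat\beta_2^T S_2 a_2 = \hat\alpha_2^T S_2' + |\hat\beta_2^T S_2|$. Stage 1 regression: Use least squares with outcome $\tilde{Y}$ and covariates $(S_1', S_1 A_1)$ to obtain $(\hat\alpha_1, \hat\beta_1)$.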

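24 A Simple Version of Q-Learning (binary actions): Approximate $Q_j(H_j, A_j) \approx \alpha_j^T S_j' + \beta_j^T S_j A_j$, for $S_j'$, $S_j$ vector summaries of the history. Stage $j$ decision rule: Select treatment $= 1$ if $\hat\beta_j^T S_j > 0$; otherwise select treatment $= -1$.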
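
The two regressions and the resulting decision rule can be sketched in code. This is a minimal illustration under stated assumptions rather than the analysis used in the talk: the simulated trial, its coefficients, the choice of history summaries, and all function names are invented, and the linear working model (main-effect terms with weights alpha, action-interaction terms with weights beta) is the usual textbook form. The sketch is kept dependency-free; in practice a numerical library would handle the least squares step.

```python
import random


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def least_squares(rows, targets):
    """Solve the normal equations (X'X) b = X'y by Gaussian elimination."""
    p = len(rows[0])
    # Augmented matrix [X'X | X'y].
    m = [[sum(r[i] * r[j] for r in rows) for j in range(p)]
         + [sum(r[i] * t for r, t in zip(rows, targets))] for i in range(p)]
    for col in range(p):
        pivot = max(range(col, p), key=lambda k: abs(m[k][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for k in range(col + 1, p):
            f = m[k][col] / m[col][col]
            for j in range(col, p + 1):
                m[k][j] -= f * m[col][j]
    coef = [0.0] * p
    for i in reversed(range(p)):
        tail = sum(m[i][j] * coef[j] for j in range(i + 1, p))
        coef[i] = (m[i][p] - tail) / m[i][i]
    return coef


def simulate_smart(n, seed=0):
    """Toy two-stage randomized trial; the generative model is invented."""
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        o1 = rng.gauss(0, 1)
        a1 = rng.choice([-1, 1])   # randomized with probability 0.5
        o2 = 0.5 * o1 + 0.3 * a1 + rng.gauss(0, 1)
        a2 = rng.choice([-1, 1])   # randomized with probability 0.5
        y = o1 + 0.5 * a1 * o1 + a2 * (0.8 * o2 - 0.2) + rng.gauss(0, 1)
        data.append((o1, a1, o2, a2, y))
    return data


def q_learning(data):
    """Return (beta1, beta2), the action-interaction weights per stage."""
    # Stage 2: regress Y on main-effect terms and (tailoring terms) * A2.
    main2 = [[1.0, o1, a1, o1 * a1, o2] for o1, a1, o2, _, _ in data]
    tail2 = [[1.0, o2] for _, _, o2, _, _ in data]
    x2 = [m + [s * d[3] for s in t] for m, t, d in zip(main2, tail2, data)]
    coef2 = least_squares(x2, [d[4] for d in data])
    alpha2, beta2 = coef2[:5], coef2[5:]
    # Pseudo-outcome: predicted Y under the best stage-2 action in {-1, 1}.
    y_tilde = [dot(alpha2, m) + abs(dot(beta2, t))
               for m, t in zip(main2, tail2)]
    # Stage 1: regress the pseudo-outcome on stage-1 terms and terms * A1.
    main1 = [[1.0, d[0]] for d in data]
    x1 = [m + [s * d[1] for s in m] for m, d in zip(main1, data)]
    coef1 = least_squares(x1, y_tilde)
    return coef1[2:], beta2


def recommend(beta, summary):
    """Stage-j rule: treatment 1 if beta' S_j > 0, otherwise -1."""
    return 1 if dot(beta, summary) > 0 else -1


if __name__ == "__main__":
    beta1, beta2 = q_learning(simulate_smart(3000))
    print("stage 2 weights:", beta2)
    print("stage 2 action when o2 = 2:", recommend(beta2, [1.0, 2.0]))
```

Because the simulated stage-2 model is contained in the working model, the estimated stage-2 weights should land near the values used to generate the data; with real data the working model is only an approximation.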

25 Outline: What are Dynamic Treatment Regimes?; Myopic Decision Making; Constructing Regimes; Q-Learning; Example using CATIE

26 Clinical Antipsychotic Trials of Intervention Effectiveness (CATIE; Schizophrenia): –Multi-stage trial of 18 months' duration. –Relaxed entry criteria. –A large number of sites representing a broad array of clinical settings (state mental health, academic, Veterans’ Affairs, HMOs, managed care). –Approximately 1500 patients.

27 CATIE Randomizations (simplified): –Phase 1: randomized treatments OLAN, QUET, RISP, ZIPR, PERP. –Phase 2: split by treatment preference into Efficacy (randomized treatments CLOZ, OLAN, QUET, RISP) and Tolerability (randomized treatments OLAN, QUET, RISP, ZIPR). –Phase 3: treatments selected by preference from many options.

28 Constructing Dynamic Treatment Regimes using CATIE: Preliminary Analyses. Reward: Time to Treatment Dropout. Phase 1 analysis: –Controls: TD, recent exacerbation, site. –Tailoring variable: pretreatment PANSS. Phase 2 analysis: –Controls: TD, recent exacerbation, site. –Tailoring variables: “treatment preference,” phase 1 treatment, end of phase 1 PANSS.

31 Preliminary Analyses: Myopic versus Non-myopic Analyses. Reward: Integrated Quality of Life (QoL). Phase 1 analysis: –Controls: TD, recent exacerbation, site. –Tailoring variable: pretreatment QoL. Phase 2 analysis: –Controls: TD, recent exacerbation, site. –Tailoring variables: “treatment preference,” phase 1 treatment, end of phase 1 QoL.

34 Challenges: –It is extremely challenging to provide measures of confidence that possess “good frequentist properties.” –Clinical Decision Support Systems: we need to be able to construct dynamic treatment regimes that recommend a group of treatment actions when there is no evidence that a particular treatment action is best. –Even in this randomized trial setting, the most straightforward analyses are subject to confounding bias. Some methods to avoid confounding bias are available.

35 Acknowledgements: This presentation is based on work with many individuals including Eric Laber, Dan Lizotte, John Rush, Scott Stroup, Joelle Pineau, Daniel Almirall and Bibhas Chakraborty. address: Slides with notes at: Click on seminars > health science seminars

36 Causal Inference Challenges (Behavioral/Social/Medical Sciences): Incomplete mechanistic models –Unknown causes. Use data on individuals to combat the dearth of mechanistic models. –Drawback: non-causal “associations” occur due to the unknown causes of the observations.

37 Unknown, Unobserved Causes (Incomplete Mechanistic Models)

40 Unknown, Unobserved Causes (Incomplete Mechanistic Models): The problem: Even when treatments are randomized, non-causal associations occur in the data. The solution: Statistical methods should appropriately “average” over the non-causal associations between treatment and reward.

42 Unknown, Unobserved Causes: Problem: We recruit students via flyers posted in dormitories. Associations between observations and rewards are highly likely to be non-representative (due to the unknown causes). Solution: Sample a representative group of college students.

43 Summary of Solutions to Causal Problems: –If possible, randomize treatments (i.e., actions). –Develop methods that avoid being influenced by non-causal associations yet help you construct the policy. –Subjects in your data should be representative of the population of subjects.