Substance Abuse, Multi-Stage Decisions, Generalization Error How are they connected?! S.A. Murphy Univ. of Michigan CMU, Nov., 2004.

Slides:



Advertisements
Similar presentations
Treatment Effect Heterogeneity & Dynamic Treatment Regime Development S.A. Murphy.
Advertisements

1 Meeting the Future in Managing Chronic Disorders: Individually Tailored Strategies S.A. Murphy Univ. of Michigan Oberlin College, Feb. 20, 2006.
Using Clinical Trial Data to Construct Policies for Guiding Clinical Decision Making S. Murphy & J. Pineau American Control Conference Special Session.
Experimenting to Improve Clinical Practice S.A. Murphy AAAS, 02/15/13 TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.:
1 Developing Dynamic Treatment Regimes for Chronic Disorders S.A. Murphy Univ. of Michigan RAND: August, 2005.
1 Dynamic Treatment Regimes Advances and Open Problems S.A. Murphy ICSPRAR-2008.
1 Developing Adaptive Treatment Strategies using MOST Experimental Designs S.A. Murphy Univ. of Michigan Dallas: December, 2005.
Methodology for Adaptive Treatment Strategies for Chronic Disorders: Focus on Pain S.A. Murphy NIH Pain Consortium 5 th Annual Symposium on Advances in.
Experiments and Dynamic Treatment Regimes S.A. Murphy Univ. of Michigan JSM: August, 2005.
SMART Designs for Constructing Adaptive Treatment Strategies S.A. Murphy 15th Annual Duke Nicotine Research Conference September, 2009.
Dynamic Treatment Regimes, STAR*D & Voting D. Lizotte, E. Laber & S. Murphy LSU ---- Geaux Tigers! April 2009.
An Experimental Paradigm for Developing Dynamic Treatment Regimes S.A. Murphy Univ. of Michigan March, 2004.
Constructing Dynamic Treatment Regimes & STAR*D S.A. Murphy ICSA June 2008.
Screening Experiments for Developing Dynamic Treatment Regimes S.A. Murphy At ICSPRAR January, 2008.
SMART Designs for Developing Adaptive Treatment Strategies S.A. Murphy K. Lynch, J. McKay, D. Oslin & T.Ten Have CPDD June, 2005.
Dynamic Treatment Regimes: Challenges in Data Analysis S.A. Murphy Survey Research Center January, 2009.
Q-Learning and Dynamic Treatment Regimes S.A. Murphy Univ. of Michigan IMS/Bernoulli: July, 2004.
1 A Prediction Interval for the Misclassification Rate E.B. Laber & S.A. Murphy.
Sizing a Trial for the Development of Adaptive Treatment Strategies Alena I. Oetting The Society for Clinical Trials, 29th Annual Meeting St. Louis, MO.
Screening Experiments for Dynamic Treatment Regimes S.A. Murphy At ENAR March, 2008.
Experiments and Dynamic Treatment Regimes S.A. Murphy Univ. of Michigan Florida: January, 2006.
SMART Experimental Designs for Developing Adaptive Treatment Strategies S.A. Murphy NIDA DESPR February, 2007.
Hypothesis Testing and Dynamic Treatment Regimes S.A. Murphy Schering-Plough Workshop May 2007 TexPoint fonts used in EMF. Read the TexPoint manual before.
Michigan Team February, Amy Wagaman Bibhas Chakraborty Herle McGowan Susan Murphy Lacey Gunter Danny Almirall Anne Buu.
An Experimental Paradigm for Developing Adaptive Treatment Strategies S.A. Murphy Univ. of Michigan UNC: November, 2003.
1 A Confidence Interval for the Misclassification Rate S.A. Murphy & E.B. Laber.
Experiments and Dynamic Treatment Regimes S.A. Murphy Univ. of Michigan PSU, October, 2005 In Honor of Clifford C. Clogg.
Planning Survival Analysis Studies of Dynamic Treatment Regimes Z. Li & S.A. Murphy UNC October, 2009.
Statistical Issues in Developing Adaptive Treatment Strategies for Chronic Disorders S.A. Murphy Univ. of Michigan CDC/ATSDR: March, 2005.
SMART Experimental Designs for Developing Adaptive Treatment Strategies S.A. Murphy RWJ Clinical Scholars Program, UMich April, 2007.
Hypothesis Testing and Dynamic Treatment Regimes S.A. Murphy, L. Gunter & B. Chakraborty ENAR March 2007.
1 SMART Designs for Developing Adaptive Treatment Strategies S.A. Murphy K. Lynch, J. McKay, D. Oslin & T.Ten Have UMichSpline February, 2006.
Dynamic Treatment Regimes, STAR*D & Voting D. Lizotte, E. Laber & S. Murphy ENAR March 2009.
A Finite Sample Upper Bound on the Generalization Error for Q-Learning S.A. Murphy Univ. of Michigan CALD: February, 2005.
Methodology for Adaptive Treatment Strategies R21 DA S.A. Murphy For MCATS Oct. 8, 2009.
An Experimental Paradigm for Developing Adaptive Treatment Strategies S.A. Murphy Univ. of Michigan ACSIR, July, 2003.
Dynamic Treatment Regimes, STAR*D & Voting D. Lizotte, E. Laber & S. Murphy Psychiatric Biostatistics Symposium May 2009.
An Experimental Paradigm for Developing Adaptive Treatment Strategies S.A. Murphy Univ. of Michigan February, 2004.
Experiments and Dynamic Treatment Regimes S.A. Murphy Univ. of Michigan Yale: November, 2005.
Methods for Estimating the Decision Rules in Dynamic Treatment Regimes S.A. Murphy Univ. of Michigan IBC/ASC: July, 2004.
Discussion of Profs. Robins’ and M  ller’s Papers S.A. Murphy ENAR 2003.
1 Possible Roles for Reinforcement Learning in Clinical Research S.A. Murphy November 14, 2007.
Experiments and Dynamic Treatment Regimes S.A. Murphy Univ. of Michigan April, 2006.
SMART Designs for Developing Dynamic Treatment Regimes S.A. Murphy MD Anderson December 2006.
Exploratory Analyses Aimed at Generating Proposals for Individualizing and Adapting Treatment S.A. Murphy BPRU, Hopkins September 22, 2009.
SMART Experimental Designs for Developing Adaptive Treatment Strategies S.A. Murphy ISCTM, 2007.
1 A Prediction Interval for the Misclassification Rate E.B. Laber & S.A. Murphy.
1 Section IV Study Designs for Investigating Adaptive Treatment Strategies Murphy.
Experiments and Adaptive Treatment Strategies S.A. Murphy Univ. of Michigan Chicago: May, 2005.
Susan Murphy, PI University of Michigan Acknowledgements: MCAT network and NIH The Goal To facilitate methodological collaborations necessary for producing.
1 Dynamic Treatment Regimes: Interventions for Chronic Conditions (such as Poverty or Criminality?) S.A. Murphy Univ. of Michigan In Honor of Clifford.
SMART Designs for Developing Dynamic Treatment Regimes S.A. Murphy Symposium on Causal Inference Johns Hopkins, January, 2006.
Experiments and Dynamic Treatment Regimes S.A. Murphy At NIAID, BRB December, 2007.
1 Machine/Reinforcement Learning in Clinical Research S.A. Murphy May 19, 2008.
Adaptive Treatment Strategies S.A. Murphy CCNIA Proposal Meeting 2008.
Adaptive Treatment Strategies S.A. Murphy Workshop on Adaptive Treatment Strategies Convergence, 2008.
Practical Application of Adaptive Treatment Strategies in Trial Design and Analysis S.A. Murphy Center for Clinical Trials Network Classroom Series April.
Experiments and Dynamic Treatment Regimes S.A. Murphy Univ. of Michigan January, 2006.
Hypothesis Testing and Adaptive Treatment Strategies S.A. Murphy SCT May 2007.
Adaptive Treatment Design and Analysis S.A. Murphy TRC, UPenn April, 2007.
Adaptive Treatment Strategies: Challenges in Data Analysis S.A. Murphy NY State Psychiatric Institute February, 2009.
Sequential, Multiple Assignment, Randomized Trials and Treatment Policies S.A. Murphy UAlberta, 09/28/12 TexPoint fonts used in EMF. Read the TexPoint.
Overview of Adaptive Treatment Regimes Sachiko Miyahara Dr. Abdus Wahed.
Sequential, Multiple Assignment, Randomized Trials and Treatment Policies S.A. Murphy MUCMD, 08/10/12 TexPoint fonts used in EMF. Read the TexPoint manual.
Sequential, Multiple Assignment, Randomized Trials Module 2—Day 1 Getting SMART About Developing Individualized Adaptive Health Interventions Methods Work,
Adaptive randomization
1 SMART Designs for Developing Adaptive Treatment Strategies S.A. Murphy K. Lynch, J. McKay, D. Oslin & T.Ten Have NDRI April, 2006.
Motivation Using SMART research designs to improve individualized treatments Alena Scott 1, Janet Levy 3, and Susan Murphy 1,2 Institute for Social Research.
An Experimental Paradigm for Developing Adaptive Treatment Strategies S.A. Murphy NIDA Meeting on Treatment and Recovery Processes January, 2004.
SMART Trials for Developing Adaptive Treatment Strategies S.A. Murphy Workshop on Adaptive Treatment Designs NCDEU, 2006.
Presentation transcript:

Substance Abuse, Multi-Stage Decisions, Generalization Error How are they connected?! S.A. Murphy Univ. of Michigan CMU, Nov., 2004

Outline Chronic Disorders Multi-Stage Decisions & Reinforcement Learning Statistical Challenges Generalization Error Discussion

Chronic Disorders

Why does the management of a chronic disorder such as drug dependence require multi-stage decisions? High variability across patients in response to any one treatment No Cure Relapse is likely without either continuous or intermittent treatment for a large proportion of people. What works now may not work later Exacerbations in disorder may occur if there are no alterations in treatment

Additional Issues in Managing the Chronic Disorder Treatment is often burdensome, especially over time Patient adherence is a critical issue Co-occurring problems are common

High-Level Questions What is the best sequencing of therapies? What is the best timings of alterations in therapies? What information do we use to make these decisions?

Multi-Stage Decisions & Reinforcement Learning

k Decisions on one individual Observation made prior to j th decision point Decision at j th decision point Primary outcome Y is a specified summary of decisions and observations

Goal : Construct decision rules that input data at each decision point and output a recommended decision; these decision rules should lead to a maximal mean Y. where eachis a low dimensional summary of for j=1,…., k

An example of a decision rule is: alter treatment if otherwise maintain on current treatment.

Conceptual Formulation

Statistical Challenges

Construction of summaries useful for decision making High dimensional noisy information In many cases summary must be meaningful Construction of decision rules Variety of data sources In many cases decision rule must be meaningful Experimental designs Evaluation versus construction/refinement of decision rules.

Experimental Design Adaptive Treatment Strategies are multi-component treatments Multiple decision points through time Different kinds of decisions Decision options for improving patients are often different from decision options for non-improving patients Services Research Delivery Mechanisms Training of Staff…….

Conceptual Formulation

Goal : Provide experimental designs for developing or refining adaptive treatment strategies Advocating for a series of developmental, experimental trials --- Collins, Murphy, Nair & Strecher (2004) Designs similar to full factorial designs --- Lavori & Dawson (2004), Lavori (2001) --- Murphy (2004) Need designs that are similar to balanced fractional factorials

Examples of sequential multiple assignment randomized trials: CATIE (2001) Treatment of Psychosis in Alzheimer’s Patients CATIE (2001) Treatment of Psychosis in Schizophrenia STAR*D (2003) Treatment of Depression Oslin (ongoing) Treatment of Alcoholism

Construction of decision rules Data: n (patient) trajectories of observations and decisions---from one patient we see: Decisions are randomized among feasible options Y is known summary of all decisions and observations Estimate decision rules so as to maximize mean of Y

Data Structure (k=2)

Some Methods Q-Learning (Watkins, 1989) (one of many……many methods from reinforcement learning) ---regression A-Learning (Murphy, 2003; Robins, 2004; Blatt et al. 2004) ---regression on a mean zero space Weighting (Murphy, van der Laan & Robins, 2002) ---weighted mean

Generalization Error

Goal : Find the decision rules that are best (maximize mean Y) within a restricted class of decision rules: e.g.,

One decision only! Data: is randomized with probability

The estimand is or equivalently

A Direct Approach But not feasible, particularly if more than one decision!!

Q-Learning Approximate Minimize

Even if our sample is infinite, is not necessarily close to The difference is the generalization error.

The message: In adding information we changed our goal. Our estimator will not (even asymptotically) achieve equivalently

Can we add information without changing our goal? –Can we “guide” Q-learning, A-learning and other methods closer to our goal? –Do we need a theory that compares biased estimators? –Is there a way to use a sieve without changing our goal?

Discussion

Open Problems How might we form summaries of high dimensional noisy data so as to make good decisions? (Prediction is not the goal) How might we use structural models to inform the design of randomized experiments? How should we design experiments if our goal is building or refining an adaptive treatment strategy?

Open Problems How might we use observational data to estimate good adaptive treatment strategies (e.g. decision rules)? How might we use data in which an adaptive treatment strategy was implemented to improve the decision rules? How might we “guide” Q-Learning or A-Learning so as to more closely achieve our goal? How might Bayesian methods be used here?

This seminar can be found at: cmu1104.ppt Further information on adaptive treatment strategies can be found at:

Q-Learning (k=2) Data:

Q-Learning Minimize