Todd Wagner, PhD October 2013 Propensity Scores. Outline 1. Background on assessing causation 2. Define propensity score (PS) 3. Calculate the PS 4. Use.

Slides:



Advertisements
Similar presentations
Non-randomized Medical Device Clinical Studies: A Regulatory Perspective Sep. 16, 2005 Lilly Yue, Ph.D.* CDRH, FDA, Rockville MD * No official support.
Advertisements

The Application of Propensity Score Analysis to Non-randomized Medical Device Clinical Studies: A Regulatory Perspective Lilly Yue, Ph.D.* CDRH, FDA,
When Should a Clinical Trial Design with Pre-Stratification be Used? Group 1.
REGRESSION, IV, MATCHING Treatment effect Boualem RABTA Center for World Food Studies (SOW-VU) Vrije Universiteit - Amsterdam.
Controlling for Time Dependent Confounding Using Marginal Structural Models in the Case of a Continuous Treatment O Wang 1, T McMullan 2 1 Amgen, Thousand.
A workshop introducing doubly robust estimation of treatment effects
1 Arlene Ash QMC - Third Tuesday September 21, 2010 (as amended, Sept 23) Analyzing Observational Data: Focus on Propensity Scores.
V.: 9/7/2007 AC Submit1 Statistical Review of the Observational Studies of Aprotinin Safety Part I: Methods, Mangano and Karkouti Studies CRDAC and DSaRM.
Copyright EM LYON Par accord du CFC Cession et reproduction interdites Research in Entrepreneurship- The problem of unobserved heterogeneity Frédéric Delmar.
Improving health worldwide George B. Ploubidis The role of sensitivity analysis in the estimation of causal pathways from observational.
Propensity Score Matching Lava Timsina Kristina Rabarison CPH Doctoral Seminar Fall 2012.
Observational Studies Based on Rosenbaum (2002) David Madigan Rosenbaum, P.R. (2002). Observational Studies (2 nd edition). Springer.
Method Reading Group September 22, 2008 Matching.
1 Arlene Ash QMC - Third Tuesday September 21, 2010 Analyzing Observational Data: Focus on Propensity Scores.
Clustered or Multilevel Data
Introduction to PSM: Practical Issues, Concerns, & Strategies Shenyang Guo, Ph.D. School of Social Work University of North Carolina at Chapel Hill January.
Experimental Group Designs
Comparing high-dimensional propensity score versus lasso variable selection for confounding adjustment in a novel simulation framework Jessica Franklin.
Advanced Statistics for Interventional Cardiologists.
EVAL 6970: Cost Analysis for Evaluation Dr. Chris L. S. Coryn Nick Saxton Fall 2014.
Propensity Score Matching and Variations on the Balancing Test Wang-Sheng Lee Melbourne Institute of Applied Economic and Social Research The University.
Biostatistics Case Studies 2007 Peter D. Christenson Biostatistician Session 3: Incomplete Data in Longitudinal Studies.
Introduction and Identification Todd Wagner Econometrics with Observational Data.
Article Review Cara Carty 09-Mar-06. “Confounding by indication in non-experimental evaluation of vaccine effectiveness: the example of prevention of.
Multilevel Data in Outcomes Research Types of multilevel data common in outcomes research Random versus fixed effects Statistical Model Choices “Shrinkage.
Estimating the Effects of Two Classes of Drugs on Hemoglobin with a Doubly Robust Method APHA conference, October 30, 2012 Brian P. Johnson, MPH, Charles.
© Nuffield Trust 22 June 2015 Matched Control Studies: Methods and case studies Cono Ariti
The Price of Violence Long term effects of assault on labor force participation and health Petra Ornstein, Uppsala university.
Matching Estimators Methods of Economic Investigation Lecture 11.
Estimating Causal Effects from Large Data Sets Using Propensity Scores Hal V. Barron, MD TICR 5/06.
ECON 3039 Labor Economics By Elliott Fan Economics, NTU Elliott Fan: Labor 2015 Fall Lecture 21.
Propensity Score Matching for Causal Inference: Possibilities, Limitations, and an Example sean f. reardon MAPSS colloquium March 6, 2007.
Public Policy Analysis ECON 3386 Anant Nyshadham.
5-4-1 Unit 4: Sampling approaches After completing this unit you should be able to: Outline the purpose of sampling Understand key theoretical.
AFRICA IMPACT EVALUATION INITIATIVE, AFTRL Africa Program for Education Impact Evaluation David Evans Impact Evaluation Cluster, AFTRL Slides by Paul J.
Generalizing Observational Study Results Applying Propensity Score Methods to Complex Surveys Megan Schuler Eva DuGoff Elizabeth Stuart National Conference.
Todd Wagner, PhD July 2012 Propensity Scores. Outline 1. Background on assessing causation –Randomized trials –Observational studies 2. Calculating a.
Non-experimental methods Markus Goldstein The World Bank DECRG & AFTPM.
Simulation Study for Longitudinal Data with Nonignorable Missing Data Rong Liu, PhD Candidate Dr. Ramakrishnan, Advisor Department of Biostatistics Virginia.
Patsi Sinnott, PT, PhD March 2011 Instrumental Variables Models.
IMPORTANCE OF STATISTICS MR.CHITHRAVEL.V ASST.PROFESSOR ACN.
Using Propensity Score Matching in Observational Services Research Neal Wallace, Ph.D. Portland State University February
Applying Causal Inference Methods to Improve Identification of Health and Healthcare Disparities, and the Underlying Mediators and Moderators of Disparities.
1 Hester van Eeren Erasmus Medical Centre, Rotterdam Halsteren, August 23, 2010.
Randomized Assignment Difference-in-Differences
Considering model structure of covariates to estimate propensity scores Qiu Wang.
Bilal Siddiqi Istanbul, May 12, 2015 Measuring Impact: Non-Experimental Methods.
Transparency in the Use of Propensity Score Methods
Todd Wagner, PhD February 2011 Propensity Scores.
Using Prior Scores to Evaluate Bias in Value-Added Models Raj Chetty, Stanford University and NBER John N. Friedman, Brown University and NBER Jonah Rockoff,
MATCHING Eva Hromádková, Applied Econometrics JEM007, IES Lecture 4.
(ARM 2004) 1 INNOVATIVE STATISTICAL APPROACHES IN HSR: BAYESIAN, MULTIPLE INFORMANTS, & PROPENSITY SCORES Thomas R. Belin, UCLA.
Alexander Spermann University of Freiburg, SS 2008 Matching and DiD 1 Overview of non- experimental approaches: Matching and Difference in Difference Estimators.
Experimental Evaluations Methods of Economic Investigation Lecture 4.
Copyright © 2015 Inter-American Development Bank. This work is licensed under a Creative Commons IGO 3.0 Attribution-Non Commercial-No Derivatives (CC-IGO.
Patricia Gonzalez, OSEP June 14, The purpose of annual performance reporting is to demonstrate that IDEA funds are being used to improve or benefit.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
Research and Evaluation Methodology Program College of Education A comparison of methods for imputation of missing covariate data prior to propensity score.
Matching methods for estimating causal effects Danilo Fusco Rome, October 15, 2012.
Analysis for Designs with Assignment of Both Clusters and Individuals
Constructing Propensity score weighted and matched Samples Stacey L
Sec 9C – Logistic Regression and Propensity scores
Impact evaluation: The quantitative methods with applications
Matching Methods & Propensity Scores
Matching Methods & Propensity Scores
Methods of Economic Investigation Lecture 12
Matching Methods & Propensity Scores
Impact Evaluation Methods: Difference in difference & Matching
Evaluating Impacts: An Overview of Quantitative Methods
Analysing RWE for HTA: Challenges, methods and critique
Presentation transcript:

Todd Wagner, PhD October 2013 Propensity Scores

Outline 1. Background on assessing causation 2. Define propensity score (PS) 3. Calculate the PS 4. Use the PS 5. Limitations of the PS 2

Causality Researchers are often interested in understanding causal relationships –Does drinking red wine affect health? –Does a new treatment improve mortality? Randomized trial provides a methodological approach for understanding causation 3

Randomization 4 Recruit Participants Random Sorting Treatment Group (A) Comparison Group (B) Outcome (Y) Note: random sorting can, by chance, lead to unbalanced groups. Most trials use checks and balances to preserve randomization

Trial analysis The expected effect of treatment is E(Y)=E(Y A )-E(Y B ) Expected effect on group A minus expected effect on group B (i.e., mean difference). 5

Trial Analysis (II) E(Y)=E(Y A )-E(Y B ) can be analyzed using the following general model y i = α + βx i + ε i Where –y is the outcome –α is the intercept –x is the mean difference in the outcome between treatment A relative to treatment B –ε is the error term –i denotes the unit of analysis (person) 6

Trial Analysis (III) The model can be expanded to control for baseline characteristics y i = α + βx i + δZ i + ε i Where –y is outcome –α is the intercept –x is the added value of the treatment A relative to treatment B –Z is a vector of baseline characteristics (predetermined prior to randomization) –ε is the error term –i denotes the unit of analysis (person) 7

Assumptions Right hand side variables are measured without noise (i.e., considered fixed in repeated samples) There is no correlation between the right hand side variables and the error term E(x i u i )=0 If these conditions hold, β is an unbiased estimate of the causal effect of the treatment on the outcome 8

Observational Studies Randomized trials may be –Unethical –Infeasible –Impractical –Not scientifically justified 9

Sorting without randomization 10 Patient characteristics Observed: health, income, age, gender. Provider characteristics Observed: staff, costs, congestion, Treatment group Comparison group Outcome Based on: Maciejewski and Pizer (2007) Propensity Scores and Selection Bias in Observational Studies. HERC Cyberseminar Sorting If everything is fully observed and correctly specified; results are not biased. Never happens in reality.

Sorting without randomization 11 Patient characteristics Provider Characteristics Treatment group Comparison group Outcome Sorting Unobserved factors affect outcome, but not sorting; treatment effect is biased. Fixed effects would be potential fix. Unobserved characteristics Teamwork, provider communication, patient education

Sorting without randomization 12 Patient characteristics Provider Characteristics Treatment group Comparison group Outcome Sorting Unobserved factors affect outcome and sorting. Treatment effect is biased. Causality isn’t identified. Unobserved characteristics Teamwork, provider communication, patient education

Sorting without randomization 13 Patient characteristics Provider Characteristics Treatment group Comparison group Outcome Sorting Unobserved factors affect outcome and sorting. Treatment effect is biased. Instrumental variables may offer insights on causal relationship, as related to exogenous factors. Unobserved characteristics Teamwork, provider communication, patient education Exogenous factors laws, programs, “prices”

Propensity Score Defined The PS uses observed information, which is multi-dimensional, to calculate a single variable (the score) The score is the predicted propensity to get sorted (usually thought of as propensity to get treatment). Expected treatment effect: E(Y)=E(Y A )-E(Y B ) Propensity Score is: Pr(Y=A | X i ) 14

Propensity Scores What it is: Another way to correct for observable characteristics What it is not: A way to adjust for unobserved characteristics The only way to make causal claims is to make huge assumptions. 15

Strong Ignorability To make statements about causation, you would need to make an assumption that treatment assignment is strongly ignorable. –Similar to assumptions of missing at random –Equivalent to stating that all variables of interest are observed 16

Calculating the PropensityScore 17 One group receives treatment and another group doesn’t Use logistic regression to estimate the probability that a person received treatment The predicted probability is the propensity score

Variables to Include Include variables that are unrelated to the exposure but related to the outcome This will decrease the variance of an estimated exposure effect without increasing bias 18 Brookhart MA, et al Am J Epidemiol Jun 15;163(12): Variable selection for propensity score models. Exposure Outcome

Variables to Include in PS 19 Patient characteristics Provider Characteristics Treatment group Comparison group Outcome Sorting Unobserved characteristics Teamwork, provider communication, patient education Exogenous factors laws, programs, “prices” NO YES

Variables to Exclude Exclude variables that are related to the exposure but not to the outcome These variables will increase the variance of the estimated exposure effect without decreasing bias Variable selection is particularly important in small studies (n<500) 20 Brookhart MA, et al Am J Epidemiol Jun 15;163(12): Variable selection for propensity score models.

Example: Resident Surgery Do cardiac bypass patients have better / worse outcomes when their surgery is conducted by a resident? CSP 474 –Randomized patients to radial artery or saphenous vein –Tracked primary surgeon 21

How do You Use a Propensity Score 22

Uses Understanding sorting and balance –Sorting is multidimensional –The PS provides a simple way of reducing this dimensionality to understand the similarity of the treatment groups Adjusting for covariance 23

Example Are surgical outcomes worse when the surgeon is a resident? Resident assignment may depend on –Patient risk –Availability of resident –Resident skill –Local culture 24

Resident Assignment 25 ORP value Age Canadian Functional Class Class Class Class Urgent priority Artery condition at site Calcified Sclerotic site site site site site site endo vascular harvest On pump surgery grafts grafts Bakaeen F, Sethi G, Wagner T, et al. Coronary Artery Bypass Graft Patency: Residents Versus Attending Surgeons. Annals of Thoracic Surgery. Assignment not associated with age or number of grafts Assignment associated with angina symptoms and planned harvesting technique

Propensity Score for Resident vs Attending Surgeon 26 Common support

Compare Three Scores Density kdensity m x kdensity m x B C

Poll Do any of these distributions concern you? Choose one A B C All of them None of them 28

RCTs and Propensity Scores What would happen if you used a propensity score with data from a RCT? 29

Share Common Support 30

Using the Propensity Score 1. Compare individuals based on similar PS scores (a matched analysis) 2. Conduct subgroup analyses on similar groups (stratification) 3. Include it as a covariate (quintiles of the PS) in the regression model 4. Use it to weight the regression (i.e., place more weight on similar cases) 5. Use both 3 and 4 together (doubly robust) 31

Matched Analyses The idea is to select controls that resemble the treatment group in all dimensions, except for treatment You can exclude cases and controls that don’t match, which can reduce the sample size/power. Different matching methods 32

Matching Methods Nearest Neighbor: rank the propensity score and choose control that is closest to case. Caliper: choose your common support and from within randomly draw controls 33

PS as a Covariate There seems to be little advantage to using PS over multivariate analyses in most cases. 1 PS provides flexibility in the functional form Propensity scores may be preferable if the sample size is small and the outcome of interest is rare Winkelmeyer. Nephrol. Dial. Transplant 2004; 19(7): Cepeda et al. Am J Epidemiol 2003; 158: 280–287

Doubly Robust Estimators Expected treatment effect: E(Y)=E(Y A )-E(Y B ) 1. Fit a logistic regression model for treatment conditional on the baseline variables. The predicted value from this regression gives the estimated propensity scores (PS i ) 2. Fit a regression model for outcome (Y i ) on the baseline variables for the treatment group only (Y i = A), and obtain the predicted values for the whole sample. 3. Fit the same regression model for outcome on the baseline variables for the control group only (Y i = B), and obtain the predicted values for the whole sample. 4. Plug the PS i, Pred (A), and Pred(B) into a formula for the double-robust estimator (essentially a PS weighted mean difference) and bootstrap the SE. 35 Emsley R, Lunt M, Pickles A, Dunn G Implementing double-robust estimators of causal effects The Stata Journal (2008) 8, Number 3, pp. 334–353,

Doubly Robust Estimators Have gained favor because DR provides some protection from mis-specification in either the regression or PS model. 36 Tsiatis, A. A Semiparametric Theory and Missing Data. New York: Springer. Leon, S., A. A. Tsiatis, and M. Davidian Semiparametric estimation of treatment effect in a pretest-posttest study. Biometrics 59: 1046–1055.

Limitations 37

Do the Unobservables Matter? Propensity scores focus only on observed characteristics, not on unobserved. Improbable that we fully observe the sorting process –Thus E(x i u i )≠0 –Multivariate (including propensity score) is biased and we need instrumental variables, fixed effects or RCT 38

Does Using PS Exacerbate Imbalance of Unobservables PS is based on observables. Brooks and Ohsfeldt, using simulated data, showed that PS models can create greater imbalance among unobserved variables. 39 Brooks and Ohsfeldt (2013): Squeezing the balloon: propensity scores and unmeasured covariate balance. HSR.

Summary 40

Overview Propensity scores offer another way to adjust for confound by observables Reducing the multidimensional nature of confounding can be helpful There are many ways to implement propensity scores and a growing interest in doubly robust estimators 41

Strengths Allow one to check for balance between control and treatment Without balance, average treatment effects can be very sensitive to the choice of the estimators Imbens and Wooldridge

Challenges Propensity scores are often misunderstood Not enough attention is placed on the PS model, itself Not enough attention is placed on robustness checks While a PS can help create balance on observables, PS models do not control for unobservables or selection bias 43

Further Reading Imbens and Wooldridge (2007) ww.nber.org/WNE/lect_1_match_fig.pdfww.nber.org/WNE/lect_1_match_fig.pdf Guo and Fraser (2010) Propensity Score Analysis. Sage. Tsiatis, A. A Semiparametric Theory and Missing Data. New York: Springer. Leon, S., A. A. Tsiatis, and M. Davidian Semiparametric estimation of treatment effect in a pretest-posttest study. Biometrics 59: 1046–1055. Rosenbaum, P. R., D. B. Rubin The central role of the propensity score in observational studies for causal effects. Biometrika 70: 41–55 Brookhart MA, et al Am J Epidemiol Jun 15;163(12): Variable selection for propensity score models. Brooks and Ohsfeldt (2013): Squeezing the balloon: propensity scores and unmeasured covariate balance. HSR. Emsley R, Lunt M, Pickles A, Dunn G Implementing double-robust estimators of causal effects The Stata Journal (2008) 8, Number 3, pp. 334– 353, 44