Propensity Score Matching in SPSS: How to turn an Audit into a RCT

Slides:



Advertisements
Similar presentations
Non-randomized Medical Device Clinical Studies: A Regulatory Perspective Sep. 16, 2005 Lilly Yue, Ph.D.* CDRH, FDA, Rockville MD * No official support.
Advertisements

The Application of Propensity Score Analysis to Non-randomized Medical Device Clinical Studies: A Regulatory Perspective Lilly Yue, Ph.D.* CDRH, FDA,
Comparing Two Means: One-sample & Paired-sample t-tests Lesson 12.
Controlling for Time Dependent Confounding Using Marginal Structural Models in the Case of a Continuous Treatment O Wang 1, T McMullan 2 1 Amgen, Thousand.
Research Support Center Chongming Yang
Discriminant Analysis
M2 Medical Epidemiology
1 Arlene Ash QMC - Third Tuesday September 21, 2010 (as amended, Sept 23) Analyzing Observational Data: Focus on Propensity Scores.
Introduction to Propensity Score Weighting Weimiao Fan 10/10/
Lecture 28 Categorical variables: –Review of slides from lecture 27 (reprint of lecture 27 categorical variables slides with typos corrected) –Practice.
Teaching Statistics Using Stata Software Susan Hailpern BSN MPH MS Department of Epidemiology and Population Health Albert Einstein College of Medicine.
Analysis of variance (ANOVA)-the General Linear Model (GLM)
HSRP 734: Advanced Statistical Methods July 24, 2008.
Estimation and Reporting of Heterogeneity of Treatment Effects in Observational Comparative Effectiveness Research Prepared for: Agency for Healthcare.
CHAPTER 14, QUANTITATIVE DATA ANALYSIS. Chapter Outline  Quantification of Data  Univariate Analysis  Subgroup Comparisons  Bivariate Analysis  Introduction.
© Copyright 2000, Julia Hartman 1 An Interactive Tutorial for SPSS 10.0 for Windows © Factorial Analysis of Variance by Julia Hartman Next.
Survey Experiments. Defined Uses a survey question as its measurement device Manipulates the content, order, format, or other characteristics of the survey.
Subclassification STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University.
© Copyright 2000, Julia Hartman 1 An Interactive Tutorial for SPSS 10.0 for Windows © Analysis of Covariance (GLM Approach) by Julia Hartman Next.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 12: Multiple and Logistic Regression Marshall University.
Repeated measures: Approaches to Analysis Peter T. Donnan Professor of Epidemiology and Biostatistics.
SAS Lecture 5 – Some regression procedures Aidan McDermott, April 25, 2005.
June 2, 2008Stat Lecture 18 - Review1 Final review Statistics Lecture 18.
Stratification and Adjustment
How to Analyze Data? Aravinda Guntupalli. SPSS windows process Data window Variable view window Output window Chart editor window.
Advanced Statistics for Interventional Cardiologists.
© Nuffield Trust Evaluation methods – where can predictive risk models help? Adam Steventon Nuffield Trust 8 July 2013.
Using SPSS for Windows Part II Jie Chen Ph.D. Phone: /6/20151.
Propensity Score Matching and Variations on the Balancing Test Wang-Sheng Lee Melbourne Institute of Applied Economic and Social Research The University.
Biostatistics Case Studies 2007 Peter D. Christenson Biostatistician Session 3: Incomplete Data in Longitudinal Studies.
Selecting and Recruiting Subjects One Independent Variable: Two Group Designs Two Independent Groups Two Matched Groups Multiple Groups.
Multilevel Linear Models Field, Chapter 19. Why use multilevel models? Meeting the assumptions of the linear model – Homogeneity of regression coefficients.
Article Review Cara Carty 09-Mar-06. “Confounding by indication in non-experimental evaluation of vaccine effectiveness: the example of prevention of.
The Practice of Social Research Chapter 14 – Quantitative Data Analysis.
Estimating Causal Effects from Large Data Sets Using Propensity Scores Hal V. Barron, MD TICR 5/06.
Using Weighted Data Donald Miller Population Research Institute 812 Oswald Tower, December 2008.
Imputation for Multi Care Data Naren Meadem. Introduction What is certain in life? –Death –Taxes What is certain in research? –Measurement error –Missing.
BIOST 536 Lecture 11 1 Lecture 11 – Additional topics in Logistic Regression C-statistic (“concordance statistic”)  Same as Area under the curve (AUC)
Statistically speaking…
Generalizing Observational Study Results Applying Propensity Score Methods to Complex Surveys Megan Schuler Eva DuGoff Elizabeth Stuart National Conference.
New Measures of Data Utility Mi-Ja Woo National Institute of Statistical Sciences.
Matching STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University.
1 Multivariable Modeling. 2 nAdjustment by statistical model for the relationships of predictors to the outcome. nRepresents the frequency or magnitude.
Using Propensity Score Matching in Observational Services Research Neal Wallace, Ph.D. Portland State University February
Social Science Research Design and Statistics, 2/e Alfred P. Rovai, Jason D. Baker, and Michael K. Ponton Between Subjects Analysis of Variance PowerPoint.
1 Hester van Eeren Erasmus Medical Centre, Rotterdam Halsteren, August 23, 2010.
Kin 304 Inferential Statistics Probability Level for Acceptance Type I and II Errors One and Two-Tailed tests Critical value of the test statistic “Statistics.
ANCOVA.
Chapter Seventeen Copyright © 2004 John Wiley & Sons, Inc. Multivariate Data Analysis.
Approaches to quantitative data analysis Lara Traeger, PhD Methods in Supportive Oncology Research.
Chapter 6: Descriptive Statistics. Learning Objectives Describe statistical measures used in descriptive statistics Compute measures of central tendency.
(ARM 2004) 1 INNOVATIVE STATISTICAL APPROACHES IN HSR: BAYESIAN, MULTIPLE INFORMANTS, & PROPENSITY SCORES Thomas R. Belin, UCLA.
Analysis of Variance (ANOVA) Scott Harris October 2009.
NURS 306, Nursing Research Lisa Broughton, MSN, RN, CCRN RESEARCH STATISTICS.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 13: Multiple, Logistic and Proportional Hazards Regression.
Research and Evaluation Methodology Program College of Education A comparison of methods for imputation of missing covariate data prior to propensity score.
Matching methods for estimating causal effects Danilo Fusco Rome, October 15, 2012.
Constructing Propensity score weighted and matched Samples Stacey L
Multiple Imputation using SOLAS for Missing Data Analysis
Applied Biostatistics: Lecture 2
Sec 9C – Logistic Regression and Propensity scores
An Interactive Tutorial for SPSS 10.0 for Windows©
Kin 304 Inferential Statistics
Presenter: Wen-Ching Lan Date: 2018/03/28
Annals of Internal Medicine • Vol. 167 No. 12 • 19 December 2017
Analysis of Observational Data
The Aga Khan University
The European Statistical Training Programme (ESTP)
Chapter: 9: Propensity scores
Emerging Analytical Techniques for Comparative Effectiveness Research
Presentation transcript:

Propensity Score Matching in SPSS: How to turn an Audit into a RCT Outline   What is Propensity score matching? Propensity Score Matching in SPSS Example: Comparing patients with both Gout & diabetes to those with diabetes only Dealing with missing data Mario D Hair Independent Statistics Consultant

What is Propensity score matching? Developed by Rosenbaum & Rubin (1983). Two aspects 1. Generate the propensity score 2. Apply it to balance the data. Search hits using ‘Propensity score matching’ by year. Slide provided by Beng So, ST6 Queen Elizabeth Hospital, Glasgow   Mario D Hair Independent Statistics Consultant

What is Propensity score matching? 1. Generate the propensity score The propensity score is the probability (from 0 to 1) of a case being in a particular group based on a given set of covariates. Generally calculated using logistic regression with group (Treatment /Control) as dependent , covariates as independent variables. Caveats & Limitations Can only be two groups. If more groups need to analyse them pairwise. The propensity score is only as good as the predictors used to generate it. Propensity score not generated for any case with any missing data. Not interested in any aspect of the logistic model other than the probabilities.   The propensity score is a balancing score: The differences between groups on the covariates condensed down into a single score so if two groups balanced on the propensity score then balanced on all the covariates. Mario D Hair Independent Statistics Consultant

Mario D Hair Independent Statistics Consultant Slide provided by Beng So, ST6 Queen Elizabeth Hospital, Glasgow Mario D Hair Independent Statistics Consultant

What is Propensity score matching? 2. Apply propensity score to balance the data. Four main applications. Propensity score matching : Match one or more control cases with a propensity score that is (nearly) equal to the propensity score for each treatment case Stratification: Divide sample into strata based on rank-ordered propensity scores. Comparisons between groups are then performed within each stratum. Regression adjustment: Include propensity scores as a covariate in a regression model used to estimate the treatment effect. Weighting: Inverse probability of treatment weighting (IPTW) weights cases by the inverse of propensity score. Similar to use of survey sampling weights used to ensure samples are representative of specific populations. Often used in survival analyses. Austin (2011) reports that propensity score matching is better than stratification or regression adjustment and is at least as good as IPTW. It is increasingly the most widely used method. Mario D Hair Independent Statistics Consultant

Propensity Score Matching in SPSS Available in SPSS V22 but Prior to that only as ‘PS matching’ an extension command that requires both r and the r plug-in. Developed by Felix Thoemmes at Cornell University. PS matching: http://sourceforge.net/projects/psmspss/ contains Latest version of the software, psmatching 3.04 June 2015. (this talk uses 3.03) Installation instructions (in a file called ‘readme.txt’) Thoemmes 2012 paper describing the software (called ‘arxiv preprint.pdf’). Comparison of PS matching & SPSS Propensity score matching   Mario D Hair Independent Statistics Consultant

Mario D Hair Independent Statistics Consultant Propensity score matching SPSS V22 PS Matching Mario D Hair Independent Statistics Consultant

Mario D Hair Independent Statistics Consultant Example: Comparing 1714 patients with BOTH Gout & diabetes to 15,224 patients with ONLY diabetes Covariates Mario D Hair Independent Statistics Consultant

Mario D Hair Independent Statistics Consultant Univariate stats: Comparing BOTH Gout & diabetes to those with ONLY diabetes * p < 0.05 †using t to d conversions d = 2t/sqrt(df) & d = ln(OR)*(√3/π) Mario D Hair Independent Statistics Consultant

PS Matching: Using a file with only the covariates Warning: PS Matching will not work if there are missing values on any variable Mario D Hair Independent Statistics Consultant

Propensity score matching SPSS V22 PS Matching However Propensity Score Matching does work if there are missing values on any variable Mario D Hair Independent Statistics Consultant

PS Matching has more options & diagnostics Mario D Hair Independent Statistics Consultant

PS Matching Outputs : Datasets Matched cases Paired cases wide format Mario D Hair Independent Statistics Consultant

Propensity score Matching SPSS V22 Output Mario D Hair Independent Statistics Consultant

PS Matching Outputs : Diagnostics Samples sizes of matched data Overall balance test (Hansen & Bowers, 2010) chisquare df p.value Overall .883 5.000 .971 Overall balance tests Relative multivariate imbalance L1 (Iacus, King, & Porro, 2010) Before matching After matching Multivariate imbalance measure L1 .290 .167 Covariates Means Treated Means Control SD Control Std. Mean Diff. Before After propensity .136 .097 .135 .058 .073 .524 .005 Age 69.928 65.511 69.938 12.416 11.049 .409 -.001 sex0 .667 .542 .661 .498 .473 .266 .014 sex1 .333 .458 .339 -.266 -.014 curr_smok1 .110 .189 .118 .392 .323 -.252 -.026 Thiazide11 .138 .165 .134 .371 .341 -.077 .012 Diuretic11 .460 .307 .461 .499 .306 -.002 Detailed balance Summary of any unbalanced covariate terms inc interactions Summary of unbalanced covariates (|d| > .25) No covariate exhibits a large imbalance (|d| > .25). Mario D Hair Independent Statistics Consultant

PS Matching Outputs : Diagnostic plots Histogram of propensity scores Jitter plot Mario D Hair Independent Statistics Consultant

PS Matching Outputs : Diagnostic plots Dotplot of standardized mean differences Graphical representation of data from detailed balance stats Covariates Std. Mean Diff. Before After propensity .524 .005 Age .409 -.001 sex0 .266 .014 sex1 -.266 -.014 curr_smok1 -.252 -.026 Thiazide11 -.077 .012 Diuretic11 .306 -.002 Mario D Hair Independent Statistics Consultant

Mario D Hair Independent Statistics Consultant Adding the lipid data to matched file using merge where original (non active) is the keyed file Mario D Hair Independent Statistics Consultant

Mario D Hair Independent Statistics Consultant Univariate stats: Comparing BOTH Gout & diabetes to those with ONLY diabetes Covariates after matching Age Be f ore Af t er Mario D Hair Independent Statistics Consultant

Mario D Hair Independent Statistics Consultant Univariate stats: Comparing BOTH Gout & diabetes to those with ONLY diabetes Lipids after matching Mario D Hair Independent Statistics Consultant

Dealing with missing data 1 Missing data in non covariate data: Use the paired data format Green is paired comparison, red is matched. There are no substantive changes Mario D Hair Independent Statistics Consultant

Dealing with missing data 2 Missing data in covariates: Use multiple imputation Separate creation of propensity scores from the matching Run logistic regression on imputed datasets Aggregate to get mean (median) propensity score Use the aggregate file to do the matching Load in the other variables Use imputation again if missing data in non-covariates Mario D Hair Independent Statistics Consultant

Mario D Hair Independent Statistics Consultant References Austin, P. C. (2011). An introduction to propensity score methods for reducing the effects of confounding in observational studies. Multivariate Behavioral Research, 46, 399-424. doi:10.1080/00273171.2011.568786 One of the foremost authors on the subject. Beal S J & Kupzyk K A, An Introduction to Propensity Scores What, When, and How. The Journal of Early Adolescence January 2014 vol. 34 no. 1 66-92 doi:10.1177/0272431613503215. Easy to read introduction   Iacus, S. M., King, G., & Porro, G. (2009). CEM: Coarsened exact matching software. Journal of Statistical Software, 30, 1-27. Reference for ‘relative multivariate imbalance test’ Mitra, R., & Reiter, J. P. (2012). A comparison of two methods of estimating propensity scores after multiple imputation. Statistical methods in medical research, 0962280212445945. Rosenbaum, P. R., & Rubin, D. B. (1983). The central role of the propensity score in observational studies for causal effects. Biometrika, 70, 41-55. doi:10.1093/biomet/70.1.41. Seminal paper. Rubin, D. B. (1997). Estimating causal effects from large data sets using propensity scores. Annals of internal medicine, 127(8_Part_2), 757-763. Example of stratification. Thoemmes, F. (2012). Propensity score matching in SPSS. arXiv preprint arXiv:1201.6385. Explains use of ‘ps matching’. Mario D Hair Independent Statistics Consultant

Propensity Score Matching in SPSS: How to turn an Audit into a RCT Outline  What is Propensity score matching? Propensity Score Matching in SPSS Example: Comparing patients with both Gout & diabetes to those with diabetes only Dealing with missing data Thank you: Questions? Mario D Hair Independent Statistics Consultant