SPSS Propensity Score Matching: An overview

Slides:



Advertisements
Similar presentations
Housekeeping: Variable labels, value labels, calculations and recoding
Advertisements

1 Arlene Ash QMC - Third Tuesday September 21, 2010 (as amended, Sept 23) Analyzing Observational Data: Focus on Propensity Scores.
Descriptive Statistics. Descriptive Statistics: Summarizing your data and getting an overview of the dataset  Why do you want to start with Descriptive.
Recap of basic SPSS and statistics 5 th - 9 th December 2011, Rome.
Ann Arbor ASA ‘Up and Running’ Series: SPSS Prepared by volunteers of the Ann Arbor Chapter of the American Statistical Association, in cooperation with.
A Simple Guide to Using SPSS© for Windows
N The Experimental procedure involves manipulating something called the Explanatory Variable and seeing the effect on something called the Outcome Variable.
Introduction to SPSS (For SPSS Version 16.0)
 Overview of SPSS  Interface  Getting Started  Managing Data  Descriptive Statistics  Basic Analysis  Additional Resources.
X 11 X 12 X 13 X 21 X 22 X 23 X 31 X 32 X 33. Research Question Are nursing homes dangerous for seniors? Does admittance to a nursing home increase risk.
Data Liberation Training 2001 Complex Files: Pasting and Cutting with SPSS Université de Montréal Wendy Watkins April 24, 2001.
Propensity Scores How to do it – Part 1. X 11 X 12 X 13 X 21 X 22 X 23 X 31 X 32 X 33 No matrices were harmed in this presentation.
Logistic Regression Demo: dmdata2 and dmdata3 Bankloan Assignment: subscribe_training and subscribe_validate.
Analyses using SPSS version 19
Basics of Biostatistics for Health Research Session 1 – February 7 th, 2013 Dr. Scott Patten, Professor of Epidemiology Department of Community Health.
Inference for Two Population Means Goal: Hypothesis Test or Confidence interval for the “average difference” between two populations There are two designs.
PSC 47410: Data Analysis Workshop  What’s the purpose of this exercise?  The workshop’s research questions:  Who supports war in America?  How consistent.
Using Propensity Score Matching in Observational Services Research Neal Wallace, Ph.D. Portland State University February
Gary L. Kamer Statistician OSB/DBS. 2 Statistical Issues at Time of PMA Review Clinical Study Design Excess All-cause Late Mortality (31 to 365 days)
Additional Regression techniques Scott Harris October 2009.
Lecture 3,4 Dr. Maha Saud Khalid Measurement of disease frequency Ratio Proportions Rates Ratio Proportions Rates BMS 244.
SAMPLING Purposes Representativeness “Sampling error”
Looking for statistical twins
Class 10 Jeff Driskell, MSW, PhD
Bootstrap and Model Validation
Categorical Variables in Regression
AP Statistics Exam Review Topic #4
Session 15 Merging Data in SPSS
Nonparametric Statistics
SPSS: Using statistical software — a primer
Introduction to SPSS July 28, :00-4:00 pm 112A Stright Hall
BINARY LOGISTIC REGRESSION
Advanced Quantitative Techniques
Psychology 202a Advanced Psychological Statistics
Constructing Propensity score weighted and matched Samples Stacey L
Analyze ICD-10 Diagnosis Codes with Stata
DESCRIPTIVE STATISTICS
Notes on Logistic Regression
Experiments Why would a double-blind experiment be used?
Applied Biostatistics: Lecture 2
Sec 9C – Logistic Regression and Propensity scores
Just the basics: Learning about the essential steps to do some simple things in SPSS Larkin Lamarche.
Sampling Population: The overall group to which the research findings are intended to apply Sampling frame: A list that contains every “element” or.
SAMPLING Purposes Representativeness “Sampling error”
Simulation-Based Approach for Comparing Two Means
Statistical Analysis – Part 3
ECONOMETRICS ii – spring 2018
Data Entry and Managment
Merve denizci nazlıgül, M.s.
SPSS STATISTICAL PACKAGE FOR SOCIAL SCIENCES
Descriptive Statistics
Calculating Sample Size: Cohen’s Tables and G. Power
Multiple logistic regression
Lab 2 Data Manipulation and Descriptive Stats in R
Propensity Score Matching Makes Program Evaluation Easy
Nonparametric Statistics
Problems With Assistance Module 1 – Problem 3
Matched Pairs Designs (Ch. 7)
ECE 2202 Circuit Analysis II
Computing A Variable Mean
Two-Sample Between-Subjects Experiments and Independent-Samples t-Tests So far, we’ve talked about experiments in which we needed to take only one sample.
Secondary Data Analysis Lec 10
Hank Childs, University of Oregon
Chapter 7 (Probability)
Improving Overlap Farrokh Alemi, Ph.D.
Applied Statistics Using SPSS
By A.Arul Xavier Department of mathematics
Density Curves Normal Distribution Area under the curve
Density Curves Normal Distribution Area under the curve
Standard Normal Table Area Under the Curve
Presentation transcript:

SPSS Propensity Score Matching: An overview AnnMaria De Mars The Julia Group Hi. Ph.d. Taught stats 27 years. Stat consultant 27 years . President. Pepperdine Prof. Blah blah blah

Let me tell you what we’re going to do … Overview of PSM Review logistic regression PSM with quintiles PSM with nearest neighbor FUZZY match I will probably go over everything twice, and yes, I know I am doing that. It’s like when you take statistics and read the textbook. It all seems to make sense until you try to do the problems. Or, you can do it in class when the professor is there and you go home and can’t because you forgot a step.

Why ? Non-equivalent groups are being compared, e.g, hospitals with & without specialized trauma centers Are differences due to patient characteristics or to hospital characteristics Propensity score matching is used when non-equivalent groups are being compared. For example, hospitals with specialized trauma centers have higher mortality than other hospitals, however, they also see patients with more severe conditions.

What ? Propensity score is the conditional probability of receiving a treatment given pre-treatment characteristics The propensity score match can simultaneously control for several different variables on which treatment groups differ. So, given the condition that you are, say, a burn victim, under age 10 and female what is your probability of being seen in a trauma unit? What’s the probability of being seen in the trauma unit given the conditions that the patient is 16-25, male and has a gun shot wound

When? Propensity scores are ideally used when there is substantial overlap in the groups. If those matched are the most extreme high scorers from one group and the most extreme low scorers from the second group then neither is representative and the results are suspect. In this it is similar to Analysis of Covariance. You can’t take a class of gifted children, another class of children with mental retardation, covary for IQ and compare them. The differences are just too great.

How? Step one: A logistic regression is performed with group as the dependent variable. A propensity score is created for each subject giving their likelihood of being in group one or group two.

Step Two There are two options for step two. Either subjects can be grouped by scores, usually into quintiles, or matched by propensity scores.

Step Three: Quintiles Perform a regular regression analysis with group (quintiles) as a covariate? -No, not the best way. People do it when they have a small sample and are worried about losing data. But it’s not advisable.

Step Three, Option 2: Match Match subjects on propensity scores & perform a regular regression analysis on this subsample 1,000 subjects from rural hospitals matched with 1,000 subjects from urban hospitals who have the same propensity score

So simple, a child can do it? Two out of the three methods could be done with no syntax at all, just pointing and clicking While there are a number of reasons you WANT to do the syntax, including saving it as a record, because it is easier to make changes, I am not one of those people who sneers at pointing and clicking your way through an analysis. I think that is a perfectly fine way to get used to new procedures. However, if you do that, I strongly recommend you copy and paste your syntax

Propensity Scores, Quintiles Step Task SPSS commands 1 Create propensity scores Logistic Regression (dependent = CITY) Analyze > Regression > Logistic 2 Create quintiles Frequencies Analyze > Descriptive >Frequencies Transform > Recode into different variables 3 Match quintiles Select If Use Complex Samples 4 Conduct analysis Logistic regression with OUTCOME as dependent

Propensity Scores, Matching Step Task SPSS commands 1 Create propensity scores Logistic Regression (dependent = CITY) Analyze > Regression > Logistic 2 Create Dataset1 Select and Save Dataset1 Records Data > Select Cases >If File > Save (select variables) 3 Create Dataset2

PSM Continued Step Task SPSS commands Data > Sort Cases 4 Sort dataset1 Data > Sort Cases 5 Sort dataset2 Select and Save Dataset1 Records Data > Select Cases >If File > Save (select variables) Merge files

And that … … is the general idea There are many options, not nearly all of which we will cover in the next few days, but now you have the general idea

It would help if you already knew this A little about regression Something about SPSS syntax But, if not, oh well .