Multiple Imputation.

Slides:



Advertisements
Similar presentations
Latent normal models for missing data Harvey Goldstein Centre for Multilevel Modelling University of Bristol.
Advertisements

Treatment of missing values
CountrySTAT Team-I November 2014, ECO Secretariat,Teheran.
Missing Data Analysis. Complete Data: n=100 Sample means of X and Y Sample variances and covariances of X Y
Some birds, a cool cat and a wolf
Adapting to missing data

Statistics Lecture 20. Last Day…completed 5.1 Today Parts of Section 5.3 and 5.4.
How to deal with missing data: INTRODUCTION
Modeling Achievement Trajectories When Attrition is Informative Betsy J. Feldman & Sophia Rabe- Hesketh.
Partially Missing At Random and Ignorable Inferences for Parameter Subsets with Missing Data Roderick Little Rennes
Missing Data.. What do we mean by missing data? Missing observations which were intended to be collected but: –Never collected –Lost accidently –Wrongly.
Statistical Methods for Missing Data Roberta Harnett MAR 550 October 30, 2007.
PEAS wprkshop 2 Non-response and what to do about it Gillian Raab Professor of Applied Statistics Napier University.
Multiple imputation using ICE: A simulation study on a binary response Jochen Hardt Kai Görgen 6 th German Stata Meeting, Berlin June, 27 th 2008 Göteborg.
1 Multiple Imputation : Handling Interactions Michael Spratt.
Sampling Distributions & Standard Error Lesson 7.
1 Introduction to Survey Data Analysis Linda K. Owens, PhD Assistant Director for Sampling & Analysis Survey Research Laboratory University of Illinois.
G Lecture 11 G Session 12 Analyses with missing data What should be reported?  Hoyle and Panter  McDonald and Moon-Ho (2002)
Imputation for Multi Care Data Naren Meadem. Introduction What is certain in life? –Death –Taxes What is certain in research? –Measurement error –Missing.
1 Multiple Regression A single numerical response variable, Y. Multiple numerical explanatory variables, X 1, X 2,…, X k.
Estimating  0 Estimating the proportion of true null hypotheses with the method of moments By Jose M Muino.
SW 983 Missing Data Treatment Most of the slides presented here are from the Modern Missing Data Methods, 2011, 5 day course presented by the KUCRMDA,
© John M. Abowd 2007, all rights reserved General Methods for Missing Data John M. Abowd March 2007.
1 G Lect 13W Imputation (data augmentation) of missing data Multiple imputation Examples G Multiple Regression Week 13 (Wednesday)
The Impact of Missing Data on the Detection of Nonuniform Differential Item Functioning W. Holmes Finch.
1 Bayesian Essentials Slides by Peter Rossi and David Madigan.
Evaluating the Quality of Editing and Imputation: the Simulation Approach M. Di Zio, U. Guarnera, O. Luzi, A. Manzari ISTAT – Italian Statistical Institute.
A REVIEW By Chi-Ming Kam Surajit Ray April 23, 2001 April 23, 2001.
Simulation Study for Longitudinal Data with Nonignorable Missing Data Rong Liu, PhD Candidate Dr. Ramakrishnan, Advisor Department of Biostatistics Virginia.
A shared random effects transition model for longitudinal count data with informative missingness Jinhui Li Joint work with Yingnian Wu, Xiaowei Yang.
Tutorial I: Missing Value Analysis
1 Probability and Statistics Confidence Intervals.
- 1 - Preliminaries Multivariate normal model (section 3.6, Gelman) –For a multi-parameter vector y, multivariate normal distribution is where  is covariance.
Pre-Processing & Item Analysis DeShon Pre-Processing Method of Pre-processing depends on the type of measurement instrument used Method of Pre-processing.
Statistics 350 Lecture 2. Today Last Day: Section Today: Section 1.6 Homework #1: Chapter 1 Problems (page 33-38): 2, 5, 6, 7, 22, 26, 33, 34,
Bias-Variance Analysis in Regression  True function is y = f(x) +  where  is normally distributed with zero mean and standard deviation .  Given a.
DATA STRUCTURES AND LONGITUDINAL DATA ANALYSIS Nidhi Kohli, Ph.D. Quantitative Methods in Education (QME) Department of Educational Psychology 1.
Research and Evaluation Methodology Program College of Education A comparison of methods for imputation of missing covariate data prior to propensity score.
HANDLING MISSING DATA.
Missing data: Why you should care about it and what to do about it
Multiple Imputation using SOLAS for Missing Data Analysis
MISSING DATA AND DROPOUT
CH 5: Multivariate Methods
The Centre for Longitudinal Studies Missing Data Strategy
Chapter Six Normal Curves and Sampling Probability Distributions
Maximum Likelihood & Missing data
Introduction to Survey Data Analysis
Multiple Imputation Using Stata
How to handle missing data values
Sampling Distribution
Sampling Distribution
The bane of data analysis
The European Statistical Training Programme (ESTP)
EM for Inference in MV Data
Task 6 Statistical Approaches
Bootstrapping Jackknifing
Missing Data Mechanisms
Non response and missing data in longitudinal surveys
Chapter 14 Monte Carlo Simulation
MANOVA Control of experimentwise error rate (problem of multiple tests). Detection of multivariate vs. univariate differences among groups (multivariate.
EM for Inference in MV Data
Chapter 4: Missing data mechanisms
The European Statistical Training Programme (ESTP)
Rachael Bedford Mplus: Longitudinal Analysis Workshop 23/06/2015
Clinical prediction models
Implementation of the Bayesian approach to imputation at SORS Zvone Klun and Rudi Seljak Statistical Office of the Republic of Slovenia Oslo, September.
Chapter 13: Item nonresponse
Classical regression review
Imputation Strategies When a Continuous Outcome is to be Dichotomized for Responder Analysis: A Simulation Study Lysbeth Floden, PhD1 Melanie Bell, PhD2.
Presentation transcript:

Multiple Imputation

Multiple Imputation Missing data method developed by Donald Rubin Simulate multiple samples of “complete” data, and compute estimates and standard errors from the complete data. Rubin distinguished multiple imputation from Different models Same model We will focus on same-model multiple imputation

Missing Data mechanism Missing data mechanisms MCAR (Missing completely at random)—missing data are a random subsample of complete data MAR (Missing at random)—missing data mechanism may depend on independent variables, but not the response

Missing Data mechanism Ignorable nonresponse MCAR Parameter for missing process different from data parameters Example for discussion Growth curve models for largemouth bass

Computer Example 5 Teachers, 3 methods, Y=relative improvement Method 10, 7 6 11 B 4 . 8.5 4,5 3 C 9 13 16 8

Multiple Imputation simulation Repeated draws i=1,…,M from the posterior predictive distribution of the missing data. The complete data sets have the same set of fully observed responses. In practice, there are numerous ways to generate complete data. Introductory methods rely on monotone missingness, and classic results for conditional distributions of jointly multivariate normal random variables.

Multiple Imputation simulation In a multivariate normal setting (some values of Y missing), we generate our draws from Y|X:

Multiple Imputation Estimation Combining results from imputation for parameters of interest is surprisingly straightforward. E.g., let q represent the PMM’s for Method. We can compute

Multiple Imputation Estimation Our estimate and its standard error can be computed as:

Multiple Imputation Estimation Combining estimates in SAS is non-standard. Our example with LSMeans is atypical, and more straightforward than most.