Multiple Imputation Multiple Regression. Input From SPSS *** Mult-Imput_M-Reg.sas ***; PROC IMPORT OUT= WORK.IntroQuest DATAFILE= "C:\Users\Vati\Documents\StatData\IntroQ\IntroQ.sav"

Slides:



Advertisements
Similar presentations
Technology Short Courses: Spring 2010 Kentaka Aruga
Advertisements

Cronbach’s Alpha & Maximized 4. SAS proc corr nosimple nocorr nomiss alpha; var q1-q10; run; Cronbach Coefficient Alpha VariablesAlpha Raw
Missing Data Analysis. Complete Data: n=100 Sample means of X and Y Sample variances and covariances of X Y
Path Analysis SAS/Calis. Read in the Data options formdlim='-' nodate pagno=min; TITLE 'Path Analysis, Ingram Data' ; data Ingram(type=corr); INPUT _TYPE_.
Topic 32: Two-Way Mixed Effects Model. Outline Two-way mixed models Three-way mixed models.
EPI 809/Spring Probability Distribution of Random Error.
Latent Growth Curve Modeling In Mplus:
Creating Graphs on Saturn GOPTIONS DEVICE = png HTITLE=2 HTEXT=1.5 GSFMODE = replace; PROC REG DATA=agebp; MODEL sbp = age; PLOT sbp*age; RUN; This will.
April 25 Exam April 27 (bring calculator with exp) Cox-Regression
Some Terms Y =  o +  1 X Regression of Y on X Regress Y on X X called independent variable or predictor variable or covariate or factor Which factors.
Psychology 202b Advanced Psychological Statistics, II February 1, 2011.
Multiple regression analysis
Matrix A matrix is a rectangular array of elements arranged in rows and columns Dimension of a matrix is r x c  r = c  square matrix  r = 1  (row)
Be humble in our attribute, be loving and varying in our attitude, that is the way to live in heaven.
How to deal with missing data: INTRODUCTION
Modeling Achievement Trajectories When Attrition is Informative Betsy J. Feldman & Sophia Rabe- Hesketh.
Psych 524 Andrew Ainsworth Data Screening 2. Transformation allows for the correction of non-normality caused by skewness, kurtosis, or other problems.
Multiple Regression continued… STAT E-150 Statistical Methods.
Inferential statistics Hypothesis testing. Questions statistics can help us answer Is the mean score (or variance) for a given population different from.
Screening the Data Tedious but essential!.
STAT 3130 Statistical Methods II Missing Data and Imputation.
Topic 16: Multicollinearity and Polynomial Regression.
Moderators. Definition Moderator - A third variable that conditions the relations of two other variables Example: SAT-Quant and math grades in school.
23-1 Analysis of Covariance (Chapter 16) A procedure for comparing treatment means that incorporates information on a quantitative explanatory variable,
Topic 7: Analysis of Variance. Outline Partitioning sums of squares Breakdown degrees of freedom Expected mean squares (EMS) F test ANOVA table General.
1 Experimental Statistics - week 10 Chapter 11: Linear Regression and Correlation Note: Homework Due Thursday.
6-3 Multiple Regression Estimation of Parameters in Multiple Regression.
Statistics for the Social Sciences Psychology 340 Fall 2013 Correlation and Regression.
Regression For the purposes of this class: –Does Y depend on X? –Does a change in X cause a change in Y? –Can Y be predicted from X? Y= mX + b Predicted.
Lecture 6 Correlation and Regression STAT 3120 Statistical Methods I.
Multiple regression - Inference for multiple regression - A case study IPS chapters 11.1 and 11.2 © 2006 W.H. Freeman and Company.
Multiple Regression The Basics. Multiple Regression (MR) Predicting one DV from a set of predictors, the DV should be interval/ratio or at least assumed.
Randomized Block Design (Kirk, chapter 7) BUSI 6480 Lecture 6.
Applied Epidemiologic Analysis - P8400 Fall 2002 Lab 10 Missing Data Henian Chen, M.D., Ph.D.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y is.
Introduction to Multiple Imputation CFDR Workshop Series Spring 2008.
Regression in SAS Caitlin Phelps. Importing Data  Proc Import:  Read in variables in data set  May need some options incase SAS doesn’t guess the format.
6-3 Multiple Regression Estimation of Parameters in Multiple Regression.
Topic 13: Multiple Linear Regression Example. Outline Description of example Descriptive summaries Investigation of various models Conclusions.
BUSI 6480 Lecture 8 Repeated Measures.
SW 983 Missing Data Treatment Most of the slides presented here are from the Modern Missing Data Methods, 2011, 5 day course presented by the KUCRMDA,
Lecture 3 Topic - Descriptive Procedures Programs 3-4 LSB 4:1-4.4; 4:9:4:11; 8:1-8:5; 5:1-5.2.
Analysis Overheads1 Analyzing Heterogeneous Distributions: Multiple Regression Analysis Analog to the ANOVA is restricted to a single categorical between.
1 G Lect 13W Imputation (data augmentation) of missing data Multiple imputation Examples G Multiple Regression Week 13 (Wednesday)
Simple Linear Regression. Data available : (X,Y) Goal : To predict the response Y. (i.e. to obtain the fitted response function f(X)) Least Squares Fitting.
1crmda.KU.edu Todd D. Little University of Kansas Director, Quantitative Training Program Director, Center for Research Methods and Data Analysis Director,
Applied Epidemiologic Analysis - P8400 Fall 2002 Labs 6 & 7 Case-Control Analysis ----Logistic Regression Henian Chen, M.D., Ph.D.
General Structural Equations (LISREL)
Tutorial I: Missing Value Analysis
Multiple Imputation using SAS Don Miller 812 Oswald Tower
1 Experimental Statistics - week 13 Multiple Regression Miscellaneous Topics.
Two-Group Discriminant Function Analysis. Overview You wish to predict group membership. There are only two groups. Your predictor variables are continuous.
Missing Data and Multiple Imputation By Jon Atwood Collaborator LISA.
1 SPSS MACROS FOR COMPUTING STANDARD ERRORS WITH PLAUSIBLE VALUES.
Pre-Processing & Item Analysis DeShon Pre-Processing Method of Pre-processing depends on the type of measurement instrument used Method of Pre-processing.
7/14/2003(c) 2003 Strategic Matching, Inc.1 29 th International Traffic Records Forum Using Multiple Imputation to Resolve Missing Data Issues.
1 Experimental Statistics - week 11 Chapter 11: Linear Regression and Correlation.
Chapter 11 Linear Regression and Correlation. Explanatory and Response Variables are Numeric Relationship between the mean of the response variable and.
Multiple Imputation using SAS Don Miller 812 Oswald Tower
Multiple Imputation using SOLAS for Missing Data Analysis
SPSS Examples from Our Homework
Linear Mixed Models in JMP Pro
6-1 Introduction To Empirical Models
Presenter: Ting-Ting Chung July 11, 2017
Type=Corr SAS.
Linear Regression and Correlation
Linear Regression and Correlation
Considerations for the use of multiple imputation in a noninferiority trial setting Kimberly Walters, Jie Zhou, Janet Wittes, Lisa Weissfeld Joint Statistical.
Presentation transcript:

Multiple Imputation Multiple Regression

Input From SPSS *** Mult-Imput_M-Reg.sas ***; PROC IMPORT OUT= WORK.IntroQuest DATAFILE= "C:\Users\Vati\Documents\StatData\IntroQ\IntroQ.sav" DBMS=SPSS REPLACE; Run; Use the Import Wizard to bring the data into SAS.

Create Missingness Variable Data Priapus; set IntroQuest; SATM_Miss = 0; If SATM =. then SATM_Miss = 1;

Check For Missing Data proc means n nmiss; run;

Check Correlates of Missingness on SATM proc corr nosimple; var SATM_Miss; with statoph gender ideal nucoph year; run;

Predictorr Ideal-.017 Statoph.084 * Nucoph.007 Year.082* Gender-.057 * p <.05

Oh Crap ! We have a lot of missing data on SATM Missingness on SATM is associated with statophobia and year. It is not missing completely at random. Need to employ multiple imputation.

Create Five Imputations Proc MI seed=69301 out=MIdata; var gender ideal nucoph SATM year; run;

Patterns of Missingness Most frequent pattern of missing data is missing on SATM only. GroupStatophGenderIdealNucophSATMYearFreq 1 XXXXXX540 2 XXXX.X139 3 XXX.XX1 4 XX.XXX2 5 XX.X.X3 6.XXXXX3 7.XXX.X5 8.XX.XX1

Means By Pattern of Missingness. GroupGroup Means StatophGenderIdealNucophSATMYear

Estimated Means & Covariances EM (Posterior Mode) Estimates _TYPE__NAME_StatophGenderIdealNucophSATMYear MEAN COVStatoph COVGender COVIdeal COVNucoph COVSATM COVYear

Analyze the Imputed Data Proc Reg outest = MRbyImput covout; Model Statoph = gender ideal nucoph SATM year / stb; By _Imputation_; run; Proc MIAnalyze; modeleffects intercept gender ideal nucoph SATM year; run; See the complete output herehere In every imputation, Gender, SATM, and Year have significant effects.

Proc MIAnalyze Output Pools the results from the five imputations. The variance in the scores is partitioned between that among imputations and that within imputations. Ideally, little of the variance is due to differences among imputations.

Variance Among/Within Imputations ParameterVariance BetweenWithinTotal intercept gender ideal nucoph SATM year

“Relative Increase in Variance” is the increase in variance due to having missing data imputed (relative to the condition where no data are missing). Low is good. “Fraction of Missing Information,” is an index of how much more precise the parameter estimate would have been if there had been no missing data. Low is good.

“Relative efficiency” tells you how much power you have for the number of imputations you have employed relative to what you would have if you used an uncountably large number of imputations. High is good.

RIV, FMI, & RE ParameterRelative Increase in Variance Fraction Missing Information Relative Efficiency intercept gender ideal nucoph SATM year

ParameterEstimate 95% Confidence Limits DFMinMaxtPr > |t| intercept gender ideal nucoph SATM <.0001 year

Conclusions Women report greater fear of the stats course than do men. Reported Math Aptitude is inversely correlated with fear of stats.