1 Modeling change Kristin Sainani Ph.D. Stanford University Department of Health Research and Policy

Slides:



Advertisements
Similar presentations
Multiple Regression and Model Building
Advertisements

G Lecture 10 SEM methods revisited Multilevel models revisited
A. The Basic Principle We consider the multivariate extension of multiple linear regression – modeling the relationship between m responses Y 1,…,Y m and.
Lesson 10: Linear Regression and Correlation
Logistic Regression I Outline Introduction to maximum likelihood estimation (MLE) Introduction to Generalized Linear Models The simplest logistic regression.
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Simple Logistic Regression
EPI 809/Spring Probability Distribution of Random Error.
GEE and Mixed Models for longitudinal data
Graphs in HLM. Model setup, Run the analysis before graphing Sector = 0 public school Sector = 1 private school.
Creating Graphs on Saturn GOPTIONS DEVICE = png HTITLE=2 HTEXT=1.5 GSFMODE = replace; PROC REG DATA=agebp; MODEL sbp = age; PLOT sbp*age; RUN; This will.
April 25 Exam April 27 (bring calculator with exp) Cox-Regression
Some Terms Y =  o +  1 X Regression of Y on X Regress Y on X X called independent variable or predictor variable or covariate or factor Which factors.
Modeling Continuous Longitudinal Data. Introduction to continuous longitudinal data: Examples.
Multiple regression analysis
Chapter 10 Simple Regression.
Regression Analysis. Unscheduled Maintenance Issue: l 36 flight squadrons l Each experiences unscheduled maintenance actions (UMAs) l UMAs costs $1000.
Chapter 11 Survival Analysis Part 3. 2 Considering Interactions Adapted from "Anderson" leukemia data as presented in Survival Analysis: A Self-Learning.
Analysis – Regression The ANOVA through regression approach is still the same, but expanded to include all IVs and the interaction The number of orthogonal.
Multivariate Data Analysis Chapter 4 – Multiple Regression.
Chapter 11 Survival Analysis Part 2. 2 Survival Analysis and Regression Combine lots of information Combine lots of information Look at several variables.
EPI 809/Spring Multiple Logistic Regression.
Incomplete Block Designs
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 15-1 Chapter 15 Multiple Regression Model Building Basic Business Statistics 11 th Edition.
Analysis of Individual Variables Descriptive – –Measures of Central Tendency Mean – Average score of distribution (1 st moment) Median – Middle score (50.
Week 14 Chapter 16 – Partial Correlation and Multiple Regression and Correlation.
Copyright ©2011 Pearson Education 15-1 Chapter 15 Multiple Regression Model Building Statistics for Managers using Microsoft Excel 6 th Global Edition.
Logistic Regression II Simple 2x2 Table (courtesy Hosmer and Lemeshow) Exposure=1Exposure=0 Disease = 1 Disease = 0.
Copyright ©2011 Pearson Education, Inc. publishing as Prentice Hall 15-1 Chapter 15 Multiple Regression Model Building Statistics for Managers using Microsoft.
Psyc 235: Introduction to Statistics DON’T FORGET TO SIGN IN FOR CREDIT!
1 Experimental Statistics - week 10 Chapter 11: Linear Regression and Correlation.
Topic 14: Inference in Multiple Regression. Outline Review multiple linear regression Inference of regression coefficients –Application to book example.
CHAPTER 14 MULTIPLE REGRESSION
1 Experimental Statistics - week 10 Chapter 11: Linear Regression and Correlation Note: Homework Due Thursday.
Regression Examples. Gas Mileage 1993 SOURCES: Consumer Reports: The 1993 Cars - Annual Auto Issue (April 1993), Yonkers, NY: Consumers Union. PACE New.
Regression For the purposes of this class: –Does Y depend on X? –Does a change in X cause a change in Y? –Can Y be predicted from X? Y= mX + b Predicted.
Correlation and Linear Regression. Evaluating Relations Between Interval Level Variables Up to now you have learned to evaluate differences between the.
Biostatistics Case Studies 2007 Peter D. Christenson Biostatistician Session 3: Incomplete Data in Longitudinal Studies.
Time-dependent covariates and further remarks on likelihood construction Presenter Li,Yin Nov. 24.
April 6 Logistic Regression –Estimating probability based on logistic model –Testing differences among multiple groups –Assumptions for model.
Multilevel Linear Models Field, Chapter 19. Why use multilevel models? Meeting the assumptions of the linear model – Homogeneity of regression coefficients.
Multivariate Analysis. One-way ANOVA Tests the difference in the means of 2 or more nominal groups Tests the difference in the means of 2 or more nominal.
April 4 Logistic Regression –Lee Chapter 9 –Cody and Smith 9:F.
6-1 Introduction To Empirical Models Based on the scatter diagram, it is probably reasonable to assume that the mean of the random variable Y is.
March 28, 30 Return exam Analyses of covariance 2-way ANOVA Analyses of binary outcomes.
Right Hand Side (Independent) Variables Ciaran S. Phibbs June 6, 2012.
HLM Models. General Analysis Strategy Baseline Model - No Predictors Model 1- Level 1 Predictors Model 2 – Level 2 Predictors of Group Mean Model 3 –
1 1 Slide Simple Linear Regression Estimation and Residuals Chapter 14 BA 303 – Spring 2011.
Right Hand Side (Independent) Variables Ciaran S. Phibbs.
Chapter 10 The t Test for Two Independent Samples
1 Experimental Statistics - week 12 Chapter 12: Multiple Regression Chapter 13: Variable Selection Model Checking.
LESSON 6: REGRESSION 2/21/12 EDUC 502: Introduction to Statistics.
Applied Epidemiologic Analysis - P8400 Fall 2002 Labs 6 & 7 Case-Control Analysis ----Logistic Regression Henian Chen, M.D., Ph.D.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 14-1 Chapter 14 Multiple Regression Model Building Statistics for Managers.
1 Statistics 262: Intermediate Biostatistics Regression Models for longitudinal data: Mixed Models.
Applied Epidemiologic Analysis - P8400 Fall 2002 Labs 6 & 7 Case-Control Analysis ----Logistic Regression Henian Chen, M.D., Ph.D.
ANCOVA.
Biostatistics Case Studies Peter D. Christenson Biostatistician Session 3: Missing Data in Longitudinal Studies.
G Lecture 71 Revisiting Hierarchical Mixed Models A General Version of the Model Variance/Covariances of Two Kinds of Random Effects Parameter Estimation.
1 Statistics 262: Intermediate Biostatistics Mixed models; Modeling change.
1 Experimental Statistics - week 12 Chapter 11: Linear Regression and Correlation Chapter 12: Multiple Regression.
1 Experimental Statistics - week 11 Chapter 11: Linear Regression and Correlation.
We will use the 2012 AP Grade Conversion Chart for Saturday’s Mock Exam.
Inferential Statistics Psych 231: Research Methods in Psychology.
Chapter 15 Multiple Regression Model Building
Applied Biostatistics: Lecture 2
Multiple Regression.
CHAPTER 29: Multiple Regression*
CHAPTER 12 More About Regression
Presentation transcript:

1 Modeling change Kristin Sainani Ph.D. Stanford University Department of Health Research and Policy

2 Within vs. Between subject effects… Solution for Fixed Effects Standard Effect Estimate Error DF t Value Pr > |t| Intercept time chem Example 1: last week’s example. CHEM: A significant chemical effect (time-dependent predictor) could either represent a between-subjects effect or a within-subjects effect. Because these data were made-up, we happened to know that there was primarily a within-subjects effect…

3 Example 1..from last time… id time1 time2 time3 time4 chem1 chem2 chem3 chem patients with depression are given a drug that increases levels of a “happy chemical” in the brain. At baseline, all 6 patients have similar levels of this happy chemical and scores >=14 on a depression scale. Researchers measure depression score and brain-chemical levels at three subsequent time points: at 2 months, 3 months, and 6 months post-baseline. Here are the data in broad form:

4 Example 2… Same as example 1, but made up to have more between-subjects effect than within-subjects effect. id time1 time2 time3 time4 chem1 chem2 chem3 chem

5 Example 3… Same as example 1, but made up to have ONLY between-subjects effect id time1 time2 time3 time4 chem1 chem2 chem3 chem

Example 2 (more between-subjects effects): showing two subjects… id=1: score id=2: chem id=1: chem id=2: score

Example 2:

Example 3:all between-subjects effects

Example 3:

19 Results, example 1… Solution for Fixed Effects Standard Effect Estimate Error DF t Value Pr > |t| Intercept chem time Example 1: significant chem effect. CHEM: proc mixed data=hrp262.long2; model score=chem time / solution; random int/subject=id; run;

20 Results, example 2… Solution for Fixed Effects Standard Effect Estimate Error DF t Value Pr > |t| Intercept <.0001 chem time Example 2: significant chem effect CHEM: proc mixed data=hrp262.long2; model score=chem time / solution; random int/subject=id; run;

21 Results, example 3… The Mixed Procedure Solution for Fixed Effects Standard Effect Estimate Error DF t Value Pr > |t| Intercept <.0001 chem <.0001 time Example 3: significant chem effect CHEM: proc mixed data=hrp262.long3; model score=chem time / solution; random int/subject=id; run;

22 All three examples give the same result… There’s no way to tell if change in chemical levels is causing change in depression score (which is what we probably care about). What to do?...

23 Options Examine graphs! Evaluate baseline relationship of chemical1 and score1 using regular linear regression Here, you will find a strong relationship only in examples 2 and 3, suggesting strong between- subjects effects. Drawback: you still can’t rule out within-subjects effects (could have both) Use only baseline value of chemical (chem1) as a predictor in GEE or Mixed and add a time*chem1 interaction to the model to evaluate change. Here, you will find that chem1 main effect is significant but chem1*time is not in examples 2 and 3 Drawback: A significant time*chem1 interaction would indicate that baseline chemical levels predict change in depression score over time, which is slightly different than saying that change in chemical level predicts change in depression score. Correlate change in time-dependent predictor with change in repeated-measures outcome… Calculate overall change or percent change in outcome and regress this on overall change or percent change in the predictor: see chapter 8 of Twisk (2 time points only) OR model all the changes together (vector of changes)…

24 The change model

25 SAS code to change data… data hrp262.change; set hrp262.broad; time=0; ctime=2; cscore=time2-time1; cchem=chem2-chem1; output; time=1; ctime=1; cscore=time3-time2; cchem=chem3-chem2; output; time=2; ctime=3; cscore=time4-time3; cchem=chem4-chem3; output; label cchem='change in chemical'; label cscore='change in depression score'; run;

id cscore cchem time id time1 time2 time3 time4 chem1 chem2 chem3 chem So, look at change in depression score as your outcome variable. All time- dependent predictors also get a change score.

27 Example 1: naïve linear regression cscore = *cchem

28 Example 2: naïve linear regression cscore = *cchem

29 Example 3: naïve linear regression cscore = *cchem

30 Modeling changes (mixed)… EXAMPLE 1 : Solution for Fixed Effects Standard Effect Estimate Error DF t Value Pr > |t| Intercept time cchem EXAMPLE 2: Solution for Fixed Effects Standard Effect Estimate Error DF t Value Pr > |t| Intercept time cchem EXAMPLE 3: Solution for Fixed Effects Standard Effect Estimate Error DF t Value Pr > |t| Intercept time cchem

31 Modeling changes (GEE)… EXAMPLE 1: Analysis Of GEE Parameter Estimates Standard 95% Confidence Parameter Estimate Error Limits Z Pr > |Z| Intercept time cchem <.0001 EXAMPLE 2: Analysis Of GEE Parameter Estimates Standard 95% Confidence Parameter Estimate Error Limits Z Pr > |Z| Intercept time cchem EXAMPLE 3: Analysis Of GEE Parameter Estimates Standard 95% Confidence Parameter Estimate Error Limits Z Pr > |Z| Intercept cchem time

32 The change model In fact, it often turns out that the changes are not correlated within subjects (we’ve essentially already corrected for between subject variability by using change scores). e.g., just regular old linear regression, where each person contributes three “independent” observations…

33 The change model If time intervals are unequal and unbalanced, you might choose to include the change in time as a predictor in the model. Otherwise, you could face confounding by the time between measurements (which might be correlated with the time-dependent predictor)…

34 References Jos W. R. Twisk. Applied Longitudinal Data Analysis for Epidemiology: A Practical Guide. Cambridge University Press, 2003.