Applications to AD with Sample SAS Codes

Slides:



Advertisements
Similar presentations
Longitudinal data analysis in HLM. Longitudinal vs cross-sectional HLM Similar things: Fixed effects Random effects Difference: Cross-sectional HLM: individual,
Advertisements

Longitudinal Relationship of Cognitive Functioning with Depressive Symptoms in Older Adults Archana Jajodia, Ph.D. University of Southern California.
Data: Crab mating patterns Data: Typists (Poisson with random effects) (Poisson Regression, ZIP model, Negative Binomial) Data: Challenger (Binomial with.
Department of Neurology, Mayo Clinic Arizona
Neuropathology and Cognitive Scores Workgroup The role of vascular and Alzheimer’s Disease pathology in differential cognitive impairment among older adults.
Comparison of Repeated Measures and Covariance Analysis for Pretest-Posttest Data -By Chunmei Zhou.
Analysis of Covariance Goals: 1)Reduce error variance. 2)Remove sources of bias from experiment. 3)Obtain adjusted estimates of population means.
Longitudinal Data Analysis: Why and How to Do it With Multi-Level Modeling (MLM)? Oi-man Kwok Texas A & M University.
Attrition and its effects – example from analysis of the MRC cognitive function and aging study Fiona Matthews MRC Biostatistics Unit.
Designing longitudinal studies in epidemiology Donna Spiegelman Professor of Epidemiologic Methods Departments of Epidemiology and Biostatistics
Modeling Menstrual Cycle Length in Pre- and Peri-Menopausal Women Michael Elliott Xiaobi Huang Sioban Harlow University of Michigan School of Public Health.
Standardization of Pedigree Collection. Genetics of Alzheimer’s Disease Alzheimer’s Disease Gene 1 Gene 2 Environmental Factor 1 Environmental Factor.
The best of both worlds Pharma R&D IT - Informatics Rudi Verbeeck Guided analytics in the hands of the SME.
 Combines linear regression and ANOVA  Can be used to compare g treatments, after controlling for quantitative factor believed to be related to response.
Biostatistics Case Studies 2007 Peter D. Christenson Biostatistician Session 3: Incomplete Data in Longitudinal Studies.
Multilevel Linear Models Field, Chapter 19. Why use multilevel models? Meeting the assumptions of the linear model – Homogeneity of regression coefficients.
Biostatistics Case Studies 2008 Peter D. Christenson Biostatistician Session 5: Choices for Longitudinal Data Analysis.
Multilevel Linear Modeling aka HLM. The Design We have data at two different levels In this case, 7,185 students (Level 1) Nested within 160 Schools (Level.
Statistical Models for the Analysis of Single-Case Intervention Data Introduction to:  Regression Models  Multilevel Models.
Osteoarthritis Initiative Analytic Strategies for the OAI Data December 6, 2007 Charles E. McCulloch, Division of Biostatistics, Dept of Epidemiology and.
©2012 MFMER | ADNI Clinical Core Paul Aisen Ron Petersen Michael Donohue Jennifer Salazar WW ADNI Washington, DC July 17, 2015.
BUSI 6480 Lecture 8 Repeated Measures.
Topic 26: Analysis of Covariance. Outline One-way analysis of covariance –Data –Model –Inference –Diagnostics and rememdies Multifactor analysis of covariance.
Developing a Mixed Effects Model Using SAS PROC MIXED
Introduction to SAS Essentials Mastering SAS for Data Analytics Alan Elliott and Wayne Woodward SAS ESSENTIALS -- Elliott & Woodward1.
Process Approach to Detecting Early Cognitive Impairment Rhoda Au, Ph.D., Friday Harbor 9/14/15 MBA.
©2012 MFMER | ADNI Clinical Core Paul Aisen Ron Petersen Michael Donohue Jennifer Salazar ADNI Steering Committee Meeting Washington, DC April.
1 Statistics 262: Intermediate Biostatistics Regression Models for longitudinal data: Mixed Models.
Biostatistics Case Studies Peter D. Christenson Biostatistician Session 3: Missing Data in Longitudinal Studies.
G Lecture 71 Revisiting Hierarchical Mixed Models A General Version of the Model Variance/Covariances of Two Kinds of Random Effects Parameter Estimation.
1 Statistics 262: Intermediate Biostatistics Mixed models; Modeling change.
Education 793 Class Notes ANCOVA Presentation 11.
DATA STRUCTURES AND LONGITUDINAL DATA ANALYSIS Nidhi Kohli, Ph.D. Quantitative Methods in Education (QME) Department of Educational Psychology 1.
Multilevel Modeling Programs David A. Kenny January 23, 2014.
Longitudinal Data & Mixed Effects Models Danielle J. Harvey UC Davis.
Association Study of Late-Onset
An Introduction to Latent Curve Models
Jefferson Davis Research Analytics
Evaluation of the Sleep Regularity Index (SRI) among First-Year College Students: Association with Alcohol Use, Caffeine Consumption, Academic Load, and.
Is “Dead” the same as “Missing”?
Designing longitudinal studies in epidemiology
Applied Biostatistics: Lecture 2
Sec 9C – Logistic Regression and Propensity scores
Epidemiology of Dementia: the MoVIES Project
Linear Mixed Models in JMP Pro
Statistical Reporting Format
Statistical Models for the Analysis of Single-Case Intervention Data
Metrics of time in longitudinal models
Statistics 262: Intermediate Biostatistics
Imaging AD Progression Amyloid Imaging Agents.
Panel & longitudinal data
Advanced Quantitative Analysis
Statistical Models for the Analysis of Single-Case Intervention Data
ADNI Clinical Core Paul Aisen Ron Petersen Michael Donohue
A Gentle Introduction to Linear Mixed Modeling and PROC MIXED
AAIC 2018 Biomarker Rates of Change in Autosomal Dominant Versus “Sporadic” Alzheimer Disease John C. Morris, MD On behalf of MW Weiner, L Beckett, T.
An Introductory Tutorial
JAMA Ophthalmology Journal Club Slides: Longitudinal Associations Between Visual Impairment and Cognitive Functioning Zheng DD, Swenor BK, Christ SL, West.
When the Mean isn’t Enough
Fixed, Random and Mixed effects
Introduction to SAS Essentials Mastering SAS for Data Analytics
Tharick A. Pascoal, Sulantha Mathotaarachchi, Monica Shin, Andrea L
Longitudinal Data & Mixed Effects Models
Study on Learning Effect in Visuomotor Tracking
European Prevention Alzheimer’s Dementia
Rachael Bedford Mplus: Longitudinal Analysis Workshop 23/06/2015
Autoregressive and Growth Curve Models
Introduction to SAS Essentials Mastering SAS for Data Analytics
Figure 1 NMF in ADD patients and classification of prodromal Alzheimer’s disease participants. Grey matter ... Figure 1 NMF in ADD patients and classification.
Statistical Models for the Analysis of Single-Case Intervention Data
Presentation transcript:

Applications to AD with Sample SAS Codes Chengjie Xiong Washington University October 15, 2016

Database Reality in ADCs Subjects have different number/length of follow-ups; The time interval between two adjacent visits differs among subjects; Longer cognitive follow-ups, much shorter biomarkers follow-ups; Missing data may NOT be random.

Shared Longitudinal Statistics To estimate the longitudinal rates of change (the slope); To compare the slopes across groups; To adjust for the effect of covariates in the estimation/comparison of slopes; To estimate/compare nonlinear pattern over time

Linear Mixed Models for Longitudinal Data (in English) Outcome at Time t = Fixed Effects+Random Effects+Errors SAS implements the model by specifying all three terms in PROC MIXED.

A Scientific Example To compare the rate of cognitive change in autosomal dominant AD from the Dominantly Inherited Alzheimer Network (DIAN) and late onset AD (LOAD)? DIAN mutation carriers (amyloid precursor protein, presenilin 1 and 2); LOAD: NACC pathologically confirmed AD.

DIAN Mutation Carriers Cohort Description NACC Participants N=239 DIAN Mutation Carriers N=129 Mean Age at Baseline (SD) 84.2 (8.9) 44.2 (8.0) Mean Age at Symptom Onset (SD) 86.6 (9.01) 45.1 (9.7) Mean Years of follow up (SD) 4.1 (1.8) 2.3 (1.2) Gender (% Female) 61.5 62.0 Mean Years Education (SD) 15.4 (2.7) 13.6 (2.7) Mutation (PSEN1:PSEN2:APP) 103 : 3 : 23 CDR at Baseline   CDR 0 (%) 239 (100) 56 (43.4) CDR 0.5 (%) 48 (37.2) CDR 1 (%) 16 (12.4) % ApoE4+ 1 E4 2 E4 28.4 31.8 0.5 5.4

An Example of Spaghetti Plot

An Assumed Linear Growth Model

SAS Data Set Format Data set format: (Time=EYO=expected years to onset, i.e., CDR>=0.5) ID Cohort Time Cognition NACC 0.6 20 1 NACC 1.8 18 2 DIAN 0.9 22 …

SAS Codes to ESTIMATE Slopes/Intercepts PROC MIXED DATA=A; CLASS Cohort ID; MODEL Cog=Cohort Cohort*Time/noint s ddfm=satterthwaite; RANDOM Int Time/sub=ID type=un; REPEATED /sub=ID type=AR(1); RUN;

SAS Outputs—2 Slopes and 2 Intercepts Solution for Fixed Effects Effect Cohort Est SE DF t-value Pr>|t| NACC DIAN Cohort*Time

SAS Codes to COMPARE Slopes/Intercepts PROC MIXED DATA=A; CLASS Cohort ID; MODEL Cog=Cohort Time Cohort*Time/ddfm=satterthwaite; RANDOM Int Time/sub=ID type=un; REPEATED /sub=ID type=AR(1); RUN;

Outputs—P-values for Testing Equality of Slopes (Intercepts) Type 3 Tests of Fixed Effects Effect Num DF Den DF F value Pr>F Cohort Time Cohort*

Adding APOE4: To ESTIMATE Slopes/Intercepts PROC MIXED DATA=a; CLASS Cohort ID E4; MODEL Cog=Cohort*E4 Cohort*E4*Time/noint s ddfm=; RANDOM Int Time/type=un sub=ID; REPEATED /type=AR(1) sub=ID; ESTIMATE ‘comparing slopes for E4-’ Cohort*E4*Time 1 -1 0 0/e; RUN;

Outputs—4 Slopes+4 Intercepts Solution for Fixed Effects Effect Cohort E4 Est SE DF t-value Pr>|t| Cohort*E4 NACC … 1 Cohort*E4*Time ….

Adding APOE4: To COMPARE Slopes/Intercepts PROC MIXED DATA=A; CLASS Cohort ID E4; MODEL Cog=Cohort E4 Cohort*E4 Time Cohort*Time E4*Time Cohort*E4*Time/ddfm=; RANDOM Int Time/type=un sub=ID; REPEATED /type=AR(1) sub=ID; RUN;

Type 3 Tests of Fixed Effects SAS Outputs Type 3 Tests of Fixed Effects Effect Num DF Den DF F value Pr>F Cohort E4 Cohort*E4 Time Cohort*Time E4*Time Cohort*E4*Time

Nonlinear Changes Subjects in the NACC data set progressed to a higher CDR, justifying the nonlinear growth; Subjects in the DIAN started at baseline with either EYO<0 or EYO>0.

One Possible Analytic Approach -8 -6 -4 -2 0 2 4 6 8 Estimated Years to Onset (EYO) Instead of comparing groups on age, we use Estimated Years to Onset (EYO) of symptoms EYO=0 at symptom onset

Nonlinear Changes—Preparing SAS Data Set Time1=EYO; and 0 if EYO>0; Time2=EYO; and 0 if EYO<0; Data set format: ID Cohort Time1 Time2 Cog ….

To ESTIMATE Slopes and Intercepts PROC MIXED DATA=A; CLASS Cohort ID; MODEL Cog=Cohort Cohort*Time1 Cohort*Time2/noint s ddfm=; RANDOM Int Time1 Time2/type=un sub=ID; REPEATED /TYPE=AR(1) SUB=ID; RUN;

Nonlinear Changes: To COMPARE Slopes/Intercepts PROC MIXED DATA=A; CLASS Cohort ID ; MODEL Cog=Cohort Time1 Cohort*TIME1 TIME2 Cohort*TIME2/ddfm=; RANDOM Int Time1 Time2/type=UN sub=ID; REPEATED /type=AR(1) sub=ID; RUN;

Results: DIAN-NACC Comparison To compare cognitive performance and rate of change between DIAN MC and LOAD from NACC with pathologically confirmed AD.

Slope Prior to EYO = 0 (Preclinical) Cognitive Test (Mean annual change & SE) NACC (n=239) DIAN MC (n=129) p-value Animal Fluency -1.07 (0.12) -0.64 (0.21) 0.0752 Boston Naming -0.49 (0.08) -0.04 (0.15) 0.0073 Digits Backward -0.24 (0.04) -0.19 (0.07) 0.5139 Digits Forward -0.22 (0.04) -0.13 (0.08) 0.2803 Logical Memory – Imm. -0.42 (0.09) -0.13 (0.16) 0.1017 Logical Memory – Del. -0.60 (0.10) -0.33 (0.17) 0.1737 MMSE -0.58 (0.09) -0.33 (0.15) 0.1466 Trails A 4.10 (0.62) 1.45 (0.96) 0.0212 Trails B 16.97 (1.50) 8.08 (2.56) 0.0030 WAIS-R Digit Symbol -2.30 (0.24) -1.78 (0.42) 0.2758 Composite (All above) -0.13 (0.01) -0.07 (0.02) 0.0050 Non-speed Composite * -0.09 (0.01) -0.06 (0.02) 0.0709 *Non-speed Composite includes: Boston Naming Digits F Digits B Logical Memory Immediate Logical Memory Delayed MMSE

Score at EYO = 0 (Symptom Onset) Cognitive Test (Mean & SE) NACC (n=239) DIAN MC (n=129) p-value Animal Fluency 14.60 (0.35) 17.22 (0.70) 0.0009 Boston Naming 24.41 (0.33) 26.14 (0.60) 0.0119 Digits Backward 5.67 (0.13) 5.91 (0.26) 0.4255 Digits Forward 7.70 (0.15) 7.72 (0.29) 0.9300 Logical Memory – Imm. 11.35 (0.30) 10.75 (0.58) 0.3689 Logical Memory – Del. 9.61 (0.33) 8.24 (0.63) 0.0539 MMSE 26.77 (0.27) 25.78 (0.52) 0.0928 Trails A 59.81 (2.17) 37.79 (4.02) <.0001 Trails B 186.40 (5.21) 117.54 (10.80) WAIS-R Digit Symbol 30.10 (0.96) 44.45 (1.72) Composite (All above) -0.24 (0.04) 0.13 (0.07) Non-speed Composite -0.05 (0.04) -0.09 (0.08) 0.6830

Slope After EYO = 0 (Progression) Cognitive test (mean annual change & SE) NACC (n=239) DIAN MC (n=129) P-value Animal fluency -1.56 (0.16) -1.07 (0.24) 0.0884 Boston naming -1.22 (0.15) -0.77 (0.24) 0.1114 Digits backward -0.33 (0.06) -0.43 (0.08) 0.3232 Digits forward -0.23 (0.07) -0.45 (0.10) 0.0726 Logical memory – imm. -1.40 (0.13) -1.24 (0.19) 0.4936 Logical memory – del. -1.46 (0.13) -0.81 (0.20) 0.0058 Mmse -1.84 (0.21) -1.64 (0.30) 0.5868 Trails A 14.86 (1.87) 7.89 (2.53) 0.0284 Trails B 25.47 (3.42) 18.98 (4.77) 0.2717 WAIS-R digit symbol -3.68 (0.56) - 4.00 (0.80) 0.7452 Composite (all above) -0.25 (0.02) -0.16 (0.03) 0.0279 Non-speed composite -0.26 (0.02) -0.23 (0.04) 0.4827

Bear In Mind LOAD:DIAN =Old:Young =Many:Few comorbidities =Without:With EYO =…

Thanks NACC investigators/staff; DIAN investigators/staff; Dr. Lena McCue from Biostatistics Core Washington University Knight ADRC