Lecture 5 - Latent Variables: Spirits in the Material World or Ghost in the Machine? Karen Bandeen-Roche Hurley-Dorrier Professor and Chair of Biostatistics.

Slides:



Advertisements
Similar presentations
Multilevel Models with Latent Variables Daniel J. Bauer Department of Psychology University of North Carolina 9/13/04 SAMSI Workshop.
Advertisements

Hierarchical Linear Modeling: An Introduction & Applications in Organizational Research Michael C. Rodriguez.
LOGO Older people’s driving habits, visual abilities, and subjective assessment of daily visual functioning Professor: Liu Student: Ruby.
A Conceptual Introduction to Multilevel Models as Structural Equations
MCUAAAR: Methods & Measurement Core Workshop: Structural Equation Models for Longitudinal Analysis of Health Disparities Data April 11th, :00 to.
Structural Equation Modeling Using Mplus Chongming Yang Research Support Center FHSS College.
Delineating the Possible Mechanisms Underlying Longitudinal Associations in Observational Studies on Aging Karen Bandeen-Roche 1, Luigi Ferrucci 2, Yi.
Random effects as latent variables: SEM for repeated measures data Dr Patrick Sturgis University of Surrey.
Item Response Theory in Health Measurement
Improving health worldwide George B. Ploubidis The role of sensitivity analysis in the estimation of causal pathways from observational.
Statistics 100 Lecture Set 7. Chapters 13 and 14 in this lecture set Please read these, you are responsible for all material Will be doing chapters
Sean D. Kristjansson Andrew C. Heath Andrey P. Anokhin Substance Use Among Older Adolescents: A Latent Class Analysis.
TigerStat ECOTS Understanding the population of rare and endangered Amur tigers in Siberia. [Gerow et al. (2006)] Estimating the Age distribution.
Latent Growth Curve Modeling In Mplus:
1 Graphical Diagnostic Tools for Evaluating Latent Class Models: An Application to Depression in the ECA Study Elizabeth S. Garrett Department of Biostatistics.
How to evaluate the cross-cultural equivalence of single items Melanie Revilla, Willem Saris RECSM, UPF Zurich – 15/16 July.
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
Linear Regression and Correlation Analysis
Statistical Analysis SC504/HS927 Spring Term 2008 Session 7: Week 23: 7 th March 2008 Complex independent variables and regression diagnostics.
Analysis of Individual Variables Descriptive – –Measures of Central Tendency Mean – Average score of distribution (1 st moment) Median – Middle score (50.
Statistical hypothesis testing – Inferential statistics II. Testing for associations.
T-tests and ANOVA Statistical analysis of group differences.
PY1PR2 lecture 3: Simple regression
June 2, 2008Stat Lecture 18 - Review1 Final review Statistics Lecture 18.
Regression and Correlation Methods Judy Zhong Ph.D.
Statistical Methods For Engineers ChE 477 (UO Lab) Larry Baxter & Stan Harding Brigham Young University.
Linear Regression Inference
Simple Linear Regression
Modeling errors in physical activity data Sarah Nusser Department of Statistics and Center for Survey Statistics and Methodology Iowa State University.
Psyc 235: Introduction to Statistics DON’T FORGET TO SIGN IN FOR CREDIT!
September In Chapter 14: 14.1 Data 14.2 Scatterplots 14.3 Correlation 14.4 Regression.
Biostatistics Case Studies 2015 Youngju Pak, PhD. Biostatistician Session 4: Regression Models and Multivariate Analyses.
Introduction Multilevel Analysis
Statistics for clinicians Biostatistics course by Kevin E. Kip, Ph.D., FAHA Professor and Executive Director, Research Center University of South Florida,
Functional Impairment and Depressive Symptoms: Mitigating Effects of Trait Hope Jameson K. Hirsch, Ph.D. 1,2, S. Kaye, B.S. 1, & Jeffrey M. Lyness, M.D.
Meredith Cook – PharmD Candidate Mercer University COPHS August, 2012 Cognitive Trajectories after Postoperative Delirium.
Lecture 2 Sampling design Analysis of data. 1. Sampling design, - the what, the where, and the how.. It is never possible to compile a complete data set,
SEM: Basics Byrne Chapter 1 Tabachnick SEM
Applied Quantitative Analysis and Practices LECTURE#23 By Dr. Osman Sadiq Paracha.
The Role of Glare in Driving Gary S. Rubin Institute of Ophthalmology London, UK.
Introduction to Multilevel Modeling Stephen R. Porter Associate Professor Dept. of Educational Leadership and Policy Studies Iowa State University Lagomarcino.
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Statistical Methods II&III: Confidence Intervals ChE 477 (UO Lab) Lecture 5 Larry Baxter, William Hecker, & Ron Terry Brigham Young University.
VALIDITY AND VALIDATION: AN INTRODUCTION Note: I have included explanatory notes for each slide. To access these, you will probably have to save the file.
Separation of Longitudinal Change from Re-Test Effect using a Multiple-Group Latent Growth Model Richard N. Jones, John N. Morris, Adrienne N. Rosenberg,
Introduction to logistic regression and Generalized Linear Models July 14, 2011 Introduction to Statistical Measurement and Modeling Karen Bandeen-Roche,
Constructs AKA... AKA... Latent variables Latent variables Unmeasured variables Unmeasured variables Factors Factors Unobserved variables Unobserved variables.
Assessing Estimability of Latent Class Models Using a Bayesian Estimation Approach Elizabeth S. Garrett Scott L. Zeger Johns Hopkins University Departments.
INTRODUCTION TO CLINICAL RESEARCH Introduction to Statistical Inference Karen Bandeen-Roche, Ph.D. July 12, 2010.
Analysis of Experiments
Ensuring Consistency in Assessment of Continuing Care Needs: An Application of Differential Item Functioning Analysis R. Prosser, M. Gelin, D. Papineau,
Lecture 10 Introduction to Linear Regression and Correlation Analysis.
Introduction to Multilevel Analysis Presented by Vijay Pillai.
Residuals. Why Do You Need to Look at the Residual Plot? Because a linear regression model is not always appropriate for the data Can I just look at the.
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
MATH 2311 Section 5.4. Residuals Examples: Interpreting the Plots of Residuals The plot of the residual values against the x values can tell us a lot.
Using Latent Variable Models in Survey Research Roger E. Millsap Arizona State University Contact: (480)
Week 2 Normal Distributions, Scatter Plots, Regression and Random.
Correlation & Simple Linear Regression Chung-Yi Li, PhD Dept. of Public Health, College of Med. NCKU 1.
ABSTRACT Purpose: Many experiences in late-life may contribute to depression, but some older adults appear more vulnerable than others. We investigated.
Robert E. Larzelere Isaac J. Washburn Mwarumba Mwavita Ronald B
Latent Class Regression
Latent Variable Modeling Summary / Final Thoughts
The Potential Mediating Role of Emotion Dysregulation
An Introduction to Latent Variable Modeling
Chapter 14: Correlation and Regression
Lecture 14 Review of Lecture 13 What we’ll talk about today?
Karen Bandeen-Roche Department of Biostatistics & COAH
CHAPTER 29: Multiple Regression*
Chapter 4, Regression Diagnostics Detection of Model Violation
Presentation transcript:

Lecture 5 - Latent Variables: Spirits in the Material World or Ghost in the Machine? Karen Bandeen-Roche Hurley-Dorrier Professor and Chair of Biostatistics Johns Hopkins University Introduction to Statistical Measurement and Modeling Nanjing University July 15, 2011

Johns Hopkins Claude D. Pepper Older Americans Independence Center The National Institutes of Health The Brookdale National Foundation

Objectives for you to leave here knowing… What is a latent variable? What are some common latent variable models? What is the role of assumptions in latent variable models? Why should I consider using—or decide against using—latent variable models?

Ordinary Linear Regression Residual as Latent Variable X Y. ε YX ε

Mixed effect / Multi-level models Random effects as Latent Variables time vital non- vital β 0 + β 1 β0β0 β2β2 β 2 + β 3

Mixed effect / Multi-level models Random effects as Latent Variables b 0i = random intercept b 2i = random slope (could define more) Population heterogeneity captured by spread in intercepts, slopes time vital non- vital β 0 + β 1 β0β0 β2β2 β 2 + β 3 + b 0i slope: - |b 2i |

Mixed effect / Multi-level models Random effects as Latent Variables time vital non- vital β 0 + β 1 β0β0 β2β2 β 2 + β 3 + b 0i slope: - |b 2i | YX ε t b

Frailty Latent Variable Illustration Frailty Adverse outcomes Y1Y1 YpYp … Determinants e1e1 epep

Why do people use latent variable models? The complexity of my problem demands it NIH wants me to be sophisticated Reveal underlying truth (e.g. “discover” latent types) Operationalize and test theory Sensitivity analyses Acknowledge, study issues with measurement; correct attenuation; etc.

Well-used latent variable models Latent variable scale Observed variable scale ContinuousDiscrete ContinuousFactor analysis LISREL Discrete FA IRT (item response) DiscreteLatent profile Growth mixture Latent class analysis, regression, transition General software: MPlus, Latent Gold, WinBugs (Bayesian), NLMIXED (SAS), glmm

Tailored software: AMOS, LISREL, CALIS (SAS)

Example: Theory Infusion Inflammation: central in cellular repair Hypothesis: dysregulation=key in accel. aging –Muscle wasting (Ferrucci et al., JAGS 50: ; Cappola et al, J Clin Endocrinol Metab 88: ) –Receptor inhibition: erythropoetin production / anemia (Ershler, JAGS 51:S18-21) Stimulus (e.g. muscle damage) IL-1 # TNF  IL-6CRP inhibition up-regulation # Difficult to measure. IL-1RA = proxy

Frailty Latent Variable Illustration Inflam. regulation Adverse outcomes Y1Y1 YpYp … Determinants e1e1 epep Theory informs relations (arrows) ς λ1λ1 λpλp Measurement Structural

Theory infusion InCHIANTI data (Ferrucci et al., JAGS, 48: ) LV method: factor analysis model –two independent underlying variables –down-regulation IL-1RA path=0 –conditional independence Inflammation 2 Down-reg. IL-6 TNFα CRP IL-1RA IL-18 Inflammation 1 Up-reg Bandeen-Roche et al., Rejuv Res, 2009

Interlude What is mind-body dualism? To whom does the idea owe? –Rene Descartes To whom does the title of my talk owe? –The Police –Ghost in the Machine: Gilbert Ryle

ANOTHER WELL-USED LATENT VARIABLE MODEL Motivation: Self-reported Visual functioning Questionnaires have proliferated –This talk: Activities of Daily Vision 5 (ADV) –“Far vision” subscale: How much difficulty with reading signs (night, day); seeing steps (day, dim light); watching TV = Y1,...,Y5 Question of interest: What aspects of vision determine “far vision” function One point of view on such “function”: Latent subpopulations

Analysis of underlying subpopulations Latent class analysis / regression POPULATION … P1P1 PJPJ CiCi Y1Y1 YMYM Y1Y1 YMYM …… ∏ 11 ∏ 1M ∏ J1 ∏ JM 19-Goodman, 1974; 27-McCutcheon, 1987 XiXi

Analysis of underlying subpopulations Method: Latent class analysis/ regression Seeks homogeneous subpopulations Features that characterize latent groups –Prevalence in overall population –Proportion reporting each symptom –Number of them – Assumption: reporting heterogeneity unrelated to measured or unmeasured characteristics – conditional independence, non differential measurement by covariates of responses within latent groups : partially determine features

no x

LCR: Self-reported Visual functioning Study: Salisbury Eye Evaluation (SEE; West et al ) –Representative of community-dwelling elders –n=2520; 1/4 African American –This talk: 1643 night drivers Analyses control for potential confounders: –Demographic: age (years), sex (1{female}), race (1{nonwhite}), education (years) –Cognition: Mini-Mental State Exam score (MMSE; 30-0 points) –Depression: GHQ subscale score (0-6) –Disease burden: # reported comorbidities

Aspects of vision Visual acuity:.3 logMAR (about 2 lines) Contrast sensitivity: 6 letters Glare sensitivity: 6 letters Stereoacuity:.3 log arc seconds Visual field: root-2 central points missed –Latter two: span approximately.60 IQR

MODEL CHECKING IS POSSIBLE!

Observed (solid) and Predicted (dash) Item Prevalence vs Acuity Plots

One last issue Identifiability Models can be too big / complex A model is non-identifiable if distinct parameterizations lead to identical data distributions –i.e. analysis not grounded in data Weak identifiability is common too: –Analysis only indirectly grounded in data (via the model)

Identifiability data (ground) model analysis strong

Identifiability data (ground) model analysis weak

Identifiability data (ground) model analysis non

Objectives for you to leave here knowing… What is a latent variable? What are some common latent variable models? What is the role of assumptions in latent variable models? Why should I consider using—or decide against using—latent variable models?

Postlude “ Where does the answer lie? … it’s something we can’t buy” Have a great career