1 Graphical Diagnostic Tools for Evaluating Latent Class Models: An Application to Depression in the ECA Study Elizabeth S. Garrett Department of Biostatistics.

Slides:



Advertisements
Similar presentations
Sinead Williamson, Chong Wang, Katherine A. Heller, David M. Blei
Advertisements

Methods for Evaluating the Performance of Diagnostic Tests in the Absence of a Gold Standard: A Latent Class Model Approach Elizabeth Garrett-Mayer Division.
© Department of Statistics 2012 STATS 330 Lecture 32: Slide 1 Stats 330: Lecture 32.
Previous Lecture: Distributions. Introduction to Biostatistics and Bioinformatics Estimation I This Lecture By Judy Zhong Assistant Professor Division.
INTRODUCTION TO MACHINE LEARNING Bayesian Estimation.
Chapter 10: Estimating with Confidence
Chapter 11- Confidence Intervals for Univariate Data Math 22 Introductory Statistics.
5.3 Psychological Disorders
Chapter Seventeen HYPOTHESIS TESTING
. PGM: Tirgul 8 Markov Chains. Stochastic Sampling  In previous class, we examined methods that use independent samples to estimate P(X = x |e ) Problem:
Mental Health from a Public Health Perspective Professor Carol S. Aneshensel Department of Community Health Sciences 10/12/09.
8 Statistical Intervals for a Single Sample CHAPTER OUTLINE
Visual Recognition Tutorial
Using ranking and DCE data to value health states on the QALY scale using conventional and Bayesian methods Theresa Cain.
8-1 Introduction In the previous chapter we illustrated how a parameter can be estimated from sample data. However, it is important to understand how.
Correlational Designs
Chapter 9 Title and Outline 1 9 Tests of Hypotheses for a Single Sample 9-1 Hypothesis Testing Statistical Hypotheses Tests of Statistical.
CHAPTER 4 Research in Psychology: Methods & Design
1 Bayesian methods for parameter estimation and data assimilation with crop models Part 2: Likelihood function and prior distribution David Makowski and.
Methods for Evaluating the Performance of Diagnostic Tests in the Absence of a “Gold Standard:” A Latent Class Model Approach Elizabeth S. Garrett Division.
Overview G. Jogesh Babu. Probability theory Probability is all about flip of a coin Conditional probability & Bayes theorem (Bayesian analysis) Expectation,
Simple Linear Regression
Bipolar Disorder Research by: Lisette Rodriguez & Selena Nuon.
Major Depressive Disorder Natalie Gomez Psychology Period 1.
Analysis and Visualization Approaches to Assess UDU Capability Presented at MBSW May 2015 Jeff Hofer, Adam Rauk 1.
Introduction to MCMC and BUGS. Computational problems More parameters -> even more parameter combinations Exact computation and grid approximation become.
Review of Chapters 1- 5 We review some important themes from the first 5 chapters 1.Introduction Statistics- Set of methods for collecting/analyzing data.
Random Regressors and Moment Based Estimation Prepared by Vera Tabakova, East Carolina University.
Finding Scientific topics August , Topic Modeling 1.A document as a probabilistic mixture of topics. 2.A topic as a probability distribution.
Mixture Models, Monte Carlo, Bayesian Updating and Dynamic Models Mike West Computing Science and Statistics, Vol. 24, pp , 1993.
LOGISTIC REGRESSION A statistical procedure to relate the probability of an event to explanatory variables Used in epidemiology to describe and evaluate.
Depression. DMS-IV Criteria (1) depressed mood most of the day, nearly every day, as indicated by either subjective report (e.g., feels sad or empty)
Inference for Regression Simple Linear Regression IPS Chapter 10.1 © 2009 W.H. Freeman and Company.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
An Introduction to Statistics and Research Design
September 18-19, 2006 – Denver, Colorado Sponsored by the U.S. Department of Housing and Urban Development Conducting and interpreting multivariate analyses.
Estimating a Population Mean
Sampling distributions rule of thumb…. Some important points about sample distributions… If we obtain a sample that meets the rules of thumb, then…
Prediction statistics Prediction generally True and false, positives and negatives Quality of a prediction Usefulness of a prediction Prediction goes Bayesian.
Latent Class Regression Model Graphical Diagnostics Using an MCMC Estimation Procedure Elizabeth S. Garrett Scott L. Zeger Johns Hopkins University
Confidence Interval & Unbiased Estimator Review and Foreword.
Instructor Resource Chapter 3 Copyright © Scott B. Patten, Permission granted for classroom use with Epidemiology for Canadian Students: Principles,
Assessing Estimability of Latent Class Models Using a Bayesian Estimation Approach Elizabeth S. Garrett Scott L. Zeger Johns Hopkins University Departments.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
Chapter Eight: Quantitative Methods
Understanding Basic Statistics Fourth Edition By Brase and Brase Prepared by: Lynn Smith Gloucester County College Chapter Nine Hypothesis Testing.
Week 7 Chapter 6 - Introduction to Inferential Statistics: Sampling and the Sampling Distribution & Chapter 7 – Estimation Procedures.
Beginning Statistics Table of Contents HAWKES LEARNING SYSTEMS math courseware specialists Copyright © 2008 by Hawkes Learning Systems/Quant Systems, Inc.
1 Chapter 8: Model Inference and Averaging Presented by Hui Fang.
10.1 – Estimating with Confidence. Recall: The Law of Large Numbers says the sample mean from a large SRS will be close to the unknown population mean.
Parameter Estimation. Statistics Probability specified inferred Steam engine pump “prediction” “estimation”
ESTIMATION OF THE MEAN. 2 INTRO :: ESTIMATION Definition The assignment of plausible value(s) to a population parameter based on a value of a sample statistic.
10.1 Estimating with Confidence Chapter 10 Introduction to Inference.
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 12 More About Regression 12.1 Inference for.
Ch 1. Introduction Pattern Recognition and Machine Learning, C. M. Bishop, Updated by J.-H. Eom (2 nd round revision) Summarized by K.-I.
+ The Practice of Statistics, 4 th edition – For AP* STARNES, YATES, MOORE Chapter 8: Estimating with Confidence Section 8.1 Confidence Intervals: The.
Generalization Performance of Exchange Monte Carlo Method for Normal Mixture Models Kenji Nagata, Sumio Watanabe Tokyo Institute of Technology.
Chapter Nine Hypothesis Testing.
CHAPTER 10 Comparing Two Populations or Groups
MCMC Output & Metropolis-Hastings Algorithm Part I
Statistical Quality Control, 7th Edition by Douglas C. Montgomery.
Manic Depression.
SA3202 Statistical Methods for Social Sciences
CHAPTER 29: Multiple Regression*
Introduction to Inference
9 Tests of Hypotheses for a Single Sample CHAPTER OUTLINE
Bayesian Network Reasoning with Gibbs Sampling
Clinical characteristics of Depression
Section 12.2 Comparing Two Proportions
ESTIMATION OF THE MEAN AND PROPORTION
Presentation transcript:

1 Graphical Diagnostic Tools for Evaluating Latent Class Models: An Application to Depression in the ECA Study Elizabeth S. Garrett Department of Biostatistics Johns Hopkins University

2 GOAL 1. Provide tools for choosing the most appropriate latent class model. 2. Interpret objective diagnostic methods in reference to the latent class model.

3 Table of Contents 1. Introduction 2. Previous Work 3. Model Estimation 4. Diagnostic Methods for Latent Class Models 5. Extensions to Latent Class Regression 6. Application to the ECA Study 7. Validating Diagnostic Criteria for Depression Using LCM 8. Discussion and Further Research

4 Outline I. Depression in relation to the LCM II. Approach to Estimation III. The ECA Study IV. Predicted Frequency Check Plot V. Latent Class Estimability Display VI. Interpretation of Findings VII. Revisions

5 Motivating Question How should we describe “major depression?” – not depressed, depressed – none, moderate, severe – none, mild, moderate, severe – none, mood symptoms, somatic symptoms, both

6 How we conceptualize “major depression”  We use indicators of symptoms such as self-reported presence of sadness, weight change, etc.  A combination of these indicators is thought to define depression.  Using these combinations, we commonly seek to categorize individuals into depression classes.  These classes represent the construct “depression.”  “Depression” is a latent variable.  The construct of “Depression” can then be used for classification, description, and prediction

7 Depression in the Diagnostic and Statistical Manual of Mental Disorders, 3rd Edition DSM-III Criteria (generally): A. Dysphoria for 2 or more weeks B. Reported symptoms in 4 or more of the following symptom groups: 1. loss of appetite, weight change 2. insomnia, hypersomnia 3. retarded movement, restlessness 4. disinterest in sex 5. fatigue 6. feelings of guilt or worthlessness 7. trouble concentrating, thoughts slow or mixed 8. morbid thoughts, suicidal thoughts/attempts

Latent Class Model: Main Ideas l There are M classes of depression (e.g. none, mild, severe).  m represents the proportion of individuals in the population in class m (m=1,…,M) l Each person is a member of one of the M classes, but we do not know which. The latent class of individual i is denoted by  i. l Symptom prevalences vary by class. The prevalence for symptom j in class m is denoted by p mj. l Given class membership, the symptoms are independent.

9 Latent Class Model  M : number of classes  p  i : vector of symptom probabilities given latent class  i  : probability of being in latent class m, m=1,…M.  : the true latent class of individual i.  : vector of individual i’s report of symptoms.

10 Estimation Approach Bayesian Approach : Quantify beliefs about p, , and  before and after observing data. Bayesian Terminology: Prior Probability: What we believe about unknown parameters before observing data. Posterior Probability: What we believe about the parameters after observing data.

11 Bayesian Estimation Approach We estimated the models using a Markov chain Monte Carlo (MCMC) algorithm: Specify prior probability distribution: P(p, ,  ) Combine prior with likelihood to obtain posterior distribution: P(p, ,  |Y)  P(p, ,  ) x L(Y| p, ,  ) Estimate posterior distribution for each parameter using iterative procedure. P(  1 |Y) = ∫ P(p, ,  |Y)

12 The Epidemiologic Catchment Area Study 3481 community-dwelling individuals in Baltimore were interviewed using the NIMH Diagnostic Interview Schedule. 8 self-reported symptom groups were completed for 2938 individuals*. 6 month prevalence of symptoms was assessed. * those with organic brain disorder were omitted as per DSM-III criterion

13 The Epidemiologic Catchment Area Study 

14 Predicted Frequency Check ( PFC ) Plot Compare observed symptom pattern frequencies to what the model predicts for a new sample of data from the same population. Symptom patterns: » no reported symptoms » report dysphoria only » report all symptoms 2 9 = 512 possible patterns

15 Example: Pattern : » restlessness/retarded movement » dysphoria We observed 24 individuals with this symptom pattern:

16 Example: 95% confidence interval for frequency? Non-parametric (saturated model) estimate:

17 Model Based Estimation Predicted frequency of pattern and prediction interval in the 3 class model: (x) 2.5% 97.5%

18 Model Based Estimation Comparison of model based prediction interval to empirical confidence interval: 97.5% 2.5% Observed

19 Predicted Frequency Check Plot

20 Predicted Frequency Check Plot

21 Latent Class Estimability Display ( LCED ) Is there enough data to estimate all of the parameters in the model? » 2 class model: 19 parameters » 3 class model: 29 parameters » 4 class model: 39 parameters Problems arise when: » small data set » small class size e.g. N=1000 and class size = individuals in class to estimate symptom prevalences » small data set and small class size

22 Weak “Identifiability” (Weak Estimability) Definition: A parameter in a (Bayesian) model is weakly identified if the posterior distribution of the parameter is approximately the same as the prior. P(  1 )  P(  1 |Y) If a model is weakly identified it is still “valid”, but we cannot make inferences from the data about the weakly identified parameters.

23 Examples

24 Latent Class Estimability Display

25 Interpretation  Depression appears to be ‘dimensional’ » none » mild » severe  2% of population is in severe class  14% in mild class: are they depressed or not?  How does this compare to the DSM-III definition? 

26 Work Not Included in Talk 1. MCMC Algorithm 2. Log Odds Ratio Check Plot 3. Predicted Class Assignment Display 4. Extensions to Regression

27 Revisions Already Implemented 1. New example for Chapter 5 (LCRR) 2. Background/justification of latent class model as “gold-standard” in validation 3. Splus programs: on website with a “user’s guide”