Statistical methods in longitudinal studies Jouko Miettunen, PhD Department of Psychiatry University of Oulu

Slides:



Advertisements
Similar presentations
The analysis of survival data in nephrology. Basic concepts and methods of Cox regression Paul C. van Dijk 1-2, Kitty J. Jager 1, Aeilko H. Zwinderman.
Advertisements

Surviving Survival Analysis
Latent Class Analysis in Mplus Version 3 Karen Nylund Social Research Methods Graduate School of Education & Information Studies Graduate School of Education.
If we use a logistic model, we do not have the problem of suggesting risks greater than 1 or less than 0 for some values of X: E[1{outcome = 1} ] = exp(a+bX)/
Introduction to Survival Analysis October 19, 2004 Brian F. Gage, MD, MSc with thanks to Bing Ho, MD, MPH Division of General Medical Sciences.
Departments of Medicine and Biostatistics
HIV Risk Behaviors and Alcohol Intoxication among Injection Drug Users in Puerto Rico Tomás D. Matos, MS Center for Addiction Studies Universidad Central.
TEMPERAMENT IN INDIVIDUALS WITH PSYCHOTIC DISORDERS BEFORE AND AFTER THE ONSET OF ILLNESS Jouko Miettunen Department of Psychiatry,
DECISION SUPPORT RESEARCH TEAM “Providing expertise to improve health & wellbeing of families” Retention in a Study of Prenatal Care: Implications of attrition.
1 Structural validity of psychiatric scales Jouko Miettunen, PhD Department of Psychiatry University of Oulu
LONGITUDINAL STUDY OF TEMPERAMENT IN INDIVIDUALS WITH PSYCHOTIC DISORDERS AND HEALTHY CONTROLS Jouko Miettunen (1), Eka Roivainen (2), Juha Veijola (1,3),
April 25 Exam April 27 (bring calculator with exp) Cox-Regression
Biostatistics ~ Types of Studies. Research classifications Observational vs. Experimental Observational – researcher collects info on attributes or measurements.
Journal Club Alcohol, Other Drugs, and Health: Current Evidence January–February 2009.
CHILDREN’S MENTAL HEALTH PROBLEMS IN RHODE ISLAND: THE PREVALENCE AND RISK FACTORS Hanna Kim, PhD and Samara Viner-Brown, MS Rhode Island Department of.
Measures of Disease Association Measuring occurrence of new outcome events can be an aim by itself, but usually we want to look at the relationship between.
PH6415 Review Questions. 2 Question 1 A journal article reports a 95%CI for the relative risk (RR) of an event (treatment versus control as (0.55, 0.97).
Clustered or Multilevel Data
Chapter 11 Survival Analysis Part 2. 2 Survival Analysis and Regression Combine lots of information Combine lots of information Look at several variables.
Multilevel Modeling Soc 543 Fall Presentation overview What is multilevel modeling? Problems with not using multilevel models Benefits of using.
A Longitudinal Study of Maternal Smoking During Pregnancy and Child Height Author 1 Author 2 Author 3.
Assessing Survival: Cox Proportional Hazards Model Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Analysis of Complex Survey Data
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 12: Multiple and Logistic Regression Marshall University.
Repeated measures: Approaches to Analysis Peter T. Donnan Professor of Epidemiology and Biostatistics.
Biostat Didactic Seminar Series Analyzing Binary Outcomes: Analyzing Binary Outcomes: An Introduction to Logistic Regression Robert Boudreau, PhD Co-Director.
Survival Analysis: From Square One to Square Two
Survival analysis Brian Healy, PhD. Previous classes Regression Regression –Linear regression –Multiple regression –Logistic regression.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 10: Survival Curves Marshall University Genomics Core.
報 告 者 王瓊琦. postpartum depression : identification of women at risk.
Jouko Miettunen University of Oulu, Finland
Multiple Choice Questions for discussion
Trauma, Posttraumatic Stress Disorder, and Substance Use Disorders Naomi Breslau, Ph.D. Department of Epidemiology.
Essentials of survival analysis How to practice evidence based oncology European School of Oncology July 2004 Antwerp, Belgium Dr. Iztok Hozo Professor.
Biostatistics Case Studies 2005 Peter D. Christenson Biostatistician Session 4: Taking Risks and Playing the Odds: OR vs.
1 Introduction to medical survival analysis John Pearson Biostatistics consultant University of Otago Canterbury 7 October 2008.
Assessing Survival: Cox Proportional Hazards Model
Dealing with bi-directionality in mental health research: The experience of the mental health project of the MRC/UVRI E. Kinyanda 1,2 1 MRC/UVRI Uganda.
POTH 612A Quantitative Analysis Dr. Nancy Mayo. © Nancy E. Mayo A Framework for Asking Questions Population Exposure (Level 1) Comparison Level 2 OutcomeTimePECOT.
Excepted from HSRP 734: Advanced Statistical Methods June 5, 2008.
1 Longitudinal associations between childhood and adolescent emotional problems, behavioral problems and substance use Jouko Miettunen, adjunct professor.
INTRODUCTION TO SURVIVAL ANALYSIS
HSRP 734: Advanced Statistical Methods July 17, 2008.
Copyright restrictions may apply Predictive Values of Psychiatric Symptoms for Internet Addiction in Adolescents: A 2-Year Prospective Study Ko C-H, Yen.
Introduction to Survival Analysis Utah State University January 28, 2008 Bill Welbourn.
LENGTH OF FIRST HOSPITALIZATION AND OTHER PREDICTORS FOR RE-HOSPITALIZATION IN PSYCHOSES Jouko Miettunen, PhD Department of Psychiatry University of Oulu,
Assessing Binary Outcomes: Logistic Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
Dr. Abednego Musau. School violence is widely held to have become a serious problem in recent decades in many countries. It includes violence between.
Osteoarthritis Initiative Analytic Strategies for the OAI Data December 6, 2007 Charles E. McCulloch, Division of Biostatistics, Dept of Epidemiology and.
1 Using National Hospital Ambulatory Medical Care Survey (NHAMCS) data for injury analysis Linda McCaig Ambulatory Care Statistics Branch Division of Health.
Lecture 9: Analysis of intervention studies Randomized trial - categorical outcome Measures of risk: –incidence rate of an adverse event (death, etc) It.
Statistical Issues in the Analysis of Patient Outcomes April 11, 2003 Elizabeth Garrett Oncology Biostatistics Acknowledgement: Thanks to Ron Brookmeyer.
1 G Lect 13W Imputation (data augmentation) of missing data Multiple imputation Examples G Multiple Regression Week 13 (Wednesday)
1 Lecture 6: Descriptive follow-up studies Natural history of disease and prognosis Survival analysis: Kaplan-Meier survival curves Cox proportional hazards.
Describing the risk of an event and identifying risk factors Caroline Sabin Professor of Medical Statistics and Epidemiology, Research Department of Infection.
1 Multivariable Modeling. 2 nAdjustment by statistical model for the relationships of predictors to the outcome. nRepresents the frequency or magnitude.
Gender difference in the effects of self- rated health on mortality among the oldest-old in China Jiajian Chen 1 Zheng Wu 2 1 East-West Center, Honolulu,
Master’s Essay in Epidemiology I P9419 Methods Luisa N. Borrell, DDS, PhD October 25, 2004.
A Threshold Effect in the Relation of Stressful Life Events and Preterm Delivery Nedra Whitehead, Ph.D.
Chapter Seventeen Copyright © 2004 John Wiley & Sons, Inc. Multivariate Data Analysis.
REGRESSION MODEL FITTING & IDENTIFICATION OF PROGNOSTIC FACTORS BISMA FAROOQI.
A Survival Comparison between HIV+ U.S.-born Latinos and Foreign-born Latinos in Houston, Texas Raouf Arafat, MD, MPH, Adebowale Awosika- Olumo.
Multi-state piecewise exponential model of hospital outcomes after injury DE Clark, LM Ryan, FL Lucas APHA 2007.
Transportation-related Injuries among US Immigrants: Findings from National Health Interview Survey.
Meta-analysis of observational studies Nicole Vogelzangs Department of Psychiatry & EMGO + institute.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 13: Multiple, Logistic and Proportional Hazards Regression.
Research on the relationship between childhood sleep problems and substance use in adolescents and young adults is limited. This knowledge gap has been.
Repeated measures: Approaches to Analysis
April 18 Intro to survival analysis Le 11.1 – 11.2
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Presentation transcript:

Statistical methods in longitudinal studies Jouko Miettunen, PhD Department of Psychiatry University of Oulu

2 Topics of this presentation n Logistic regression analysis n Survival analysis n Analysis of variance n Random regression analysis n Structural equation modeling n Latent class analysis n Imputing missing data

3 Logistic regression analysis (1) n Most common modeling method to analyze confounders in epidemiology, especially in longitudinal studies n Outcome variable should be dichotomized (no/yes, healthy/sick) n Exposure variables can be both dichotomized or continuous

4 Variables in logistic regression n Include sociodemographic variables, e.g. sex, social class n Include previous known risk factors u Especially if statistically significant in the model n Do not include too many variables u Depends on data size and distribution of variables n Do not include intercorrelating variables

5 Example data set Northern Finland 1966 Birth Cohort n Women who were living in the provinces of Oulu and Lapland and were due to deliver during 1966 n N = 12,058 live births n N = 10,934 living 1997 in Finland n Data on biological, socio-economic and health conditions collected prospectively from pregnancy up to the age of 35 years n Data from several registers and e.g. from large follow-ups at 14 and 31 years

6 Example question Northern Finland 1966 Birth Cohort n What predicts rehospitalization in psychoses? n N = 158 hospital treated cases n Exposure variables u sex u father’s social class (1980) u familial risk u onset age u length of first hospitalization u diagnosis (schz / other psychosis)

7 SPSS Output (1)

8 SPSS Output (2)

9 Survival analysis (1) n Examines time between two events, e.g. u from birth to illness onset u from illness onset to death u from end of treatment to rehospitalization n Kaplan-Meier model estimates probability of events in each time point

10 Survival analysis (2) n Required information uEvent (0,1) uTime to event (days, months,…) or to censoring uData is censored due to FEnd of follow-up time FLoss of contact FOr e.g. other dead than that of interest

11 Example question Northern Finland 1966 Birth Cohort n What predicts age of suicide? n People alive and living in Finland at 16 years (N=10,934) n Data till end of 2001 u 58 (0.5%) suicides u 140 (1.3%) other deaths u 10,736 (98.2%) alive n Predictor variable: u family type at birth (full, single)

12 log rank test, p=0.002 Test Statistics for Equality of Survival Distributions SPSS Output (1)

13 Survival analysis (3) n Difference or trend in difference between groups should be about the same across time, at least curves shouldn’t cross (if statistically tested) n Can be done also with small samples n Curve can be presented as survival or as hazard function n References, e.g. u Parmar & Machin: Survival analysis. A practical approach. John Wiley & Sons, 1995.

14 SPSS Output (2)

15 Example question (2) Age of suicide and family type n Possible confounding variables u sex u social class 1966 (I-II,III-IV,V) u average school mark at 14 u psychiatric diagnosis (no, yes) u crime (no, violent, non-violent) n Cox regression analysis

16 Cox regression analysis SPSS Output (3)

17 Cox regression analysis SPSS Output (4)

18 Analysis of variance n ANOVA u One continuous outcome (dependent) variable n MANOVA u Several continuous outcome variables n Repeated measurements ANOVA u Same measurements are made several times on each subject n ANOVA, MANOVA and rANOVA u Only categorical predictors n ANCOVA, MANCOVA, rANCOVA u Also continuous predictors

19 Example question Difference in size of hippocampus n Northern Finland 1966 Birth Cohort u Follow-up study n Schizophrenia patients (N=56) vs. healthy controls (N=104) n Repeated measurements ANCOVA u Measurements of right and left side were thought as repeated measurements

20 Example table Schizophrenia and Comparison subjects Hippocampus volumes F Sig. Model 1 Within effect: side20.3< Diagnosis Gender Model 2 Within effect: side Covariate: brain vol.35.0< Diagnosis< Gender Familial psychosis Perinatal risk Handedness Tanskanen et al. Schizophrenia Research (in press)

21 Random regression analysis n Random regression analysis = Random-effects (multilevel) models = … u Allow presence of missing data u Allow time-varying covariates u Allow subjects measured at different timepoints u Takes into account several levels of subjects (multilevel analysis)

22 Random regression analysis n Available software u SAS Proc Mixed u Stata (GLLAMM) u Specific multilevel modeling software F MLWin F HLM

23 Random regression analysis n References u Goldstein et al. Tutorial in biostatistics. Multilevel modelling of medical data. Stat Med, 21, , u Hedeker & Mermelstein. Application of random-effects regression models in relapse research, Addiction, 91, S211-30, u Sharma et al. A longitudinal study of plasma cortisol and depressive symptomatology by random regression analysis. Biol Psychiatry 31, , u Tilling et al. A new method for predicting recovery after stroke. Stroke 32, , u Homepage of Don Hedeker: F u Homepage of Sophia Rabe-Hesketh (GLLAMM) F

24 Structural Equation Modeling n Combination of factor analysis and regression n Continuous and discrete predictors and outcomes n Relationships among measured or latent variables

25 Caring orientation Expertise orientation Life orientation Catalytic- co-operational nursing Controlling nursing Confirming nursing male, p=.002 older, p<.0001 no children, p=.048 Swedish, p<.0001 older, p<.0001 no children, p=.036 Finnish, p=.020 younger, p=.0003 sairaanhoit, p=.020 no children, p<.0001 older, p=.034 Swedish, p<.0001 older, p0.002 older, p= (r=.64) + (r=.11) + (r=.27) + (r=.47) (r=.22) + (r=.44) + (r=.18) + (r=.19) Orientation to nursing Orientation to learning nursing Example: Nursing orientation Vanhanen-Nuutinen et al. (manuscript)

26 Structural Equation Modeling n References u Bentler & Stein. Structural equation models in medical research. Stat Methods Med Res 1: 159–181, u Bollen. Structural equations with latent variables. John Wiley & Sons, Inc, New York, u Finch & West. The investigation of personality structure: statistical models. J Res Pers 31: 439– 485, u MacCallum & Austin. Applications of structural equation modeling in psychological research. Annu Rev Psychol 51: 201–226, 2000.

27 Latent class analysis n Specific statistical method developed to group subjects according to selected characteristics u Classifies subjects to groups u Identifies characteristics that indicate groups

28 Example: Anti-Social Behavior n Damaged property n Fighting n Shoplifting n Stole <$50 n Stole >$50 n Use of force n Seriously threaten n Intent to injure n Use Marijuana n Use other drug n Sold Marijuana n Sold hard drugs n ‘Con’ somebody n Stole an Automobile n Broken into a building n Held stolen goods n Gambling Operation n National Longitudinal Survey of Youth (NLSY) n Respondent ages between 16 and 23 n Background information: age, gender and ethnicity n N=7, antisocial dichotomously scored behavior items: Reference:

29 Example: Anti-Social Behavior Damage Property FightingShopliftingStole <$50Gambling... Male Race Age C

30 Example: Anti-Social Behavior probabilities

31 Relationship between class probabilities and age by gender FemalesMales (age)

32 n Summary of four classes: u Property Offense Class (9.8%) u Substance Involvement Class (18.3%) u Person Offenses Class (27.9%) u Normative Class (44.1%) n Classification Table: Example: Anti-Social Behavior Rows: Average latent class probability for most likely latent class membership Columns: Latent class

33 Latent class analysis n References u Muthén & Muthén. Integrating person-centered and variable-centered analyses: Growth mixture modeling with latent trajectory classes. Alcohol Clin Exp Res, 24, , u default.htm n More references and examples u Homepage of Mplus software:

34 Missing data n Major problem in longitudinal studies n Usually data is not missing at random n One “solution” u Compare included and excluded cases F Not very good! F Smaller sample size give less power (change to get low p-values)

35 Imputing single missing data n With mean of sample (or subsample) u Gives less variability to data n Nearest neighbour imputation u Gives less variability to data n Use regression techniques to predict missing data n Mean of variables of same subject measuring appr. same thing u e.g. in psychological scales n Now “missing value analysis” also in SPSS

36 Multiple imputation n Requires special software u SAS/STAT (PROC MI & PROC MIANALYZE) u S-PLUS (MICE) u SOLAS for Missing Data Analysis 3.0 n References u Kmetic et al. Multiple imputation to account for missing data in a survey: estimating the prevalence of osteoporosis. Epidemiology, 13, , u McCleary. Using multiple imputation for analysis of incomplete data in clinical research. Nurs Res, 51, , u Streiner. The case of the missing data: methods of dealing with dropouts and other research vagaries. Can J Psychiatry, 47, 68-75, 2002.

37 General references in Finnish n Metsämuuronen. Tutkimuksen tekemisen perusteet ihmistieteissä (2003) n Nummenmaa et al. Tutkimusaineiston analyysi (1997) n Uhari & Nieminen. Epidemiologia & Biostatistiikka (2001) n SPSS, SAS, etc. manuals