John B. Willett and Judith D. Singer Harvard Graduate School of Education Examine our book, Applied Longitudinal Data Analysis (Oxford University Press,

Slides:



Advertisements
Similar presentations
What is Event History Analysis?
Advertisements

Judith D. Singer & John B. Willett Harvard Graduate School of Education Discrete-time survival analysis ALDA, Chapters 10, 11, and 12 Times change, and.
Multilevel Event History Modelling of Birth Intervals
What is Event History Analysis?
Cross Sectional Designs
1 Arlene Ash QMC - Third Tuesday September 21, 2010 (as amended, Sept 23) Analyzing Observational Data: Focus on Propensity Scores.
Multilevel survival models A paper presented to celebrate Murray Aitkin’s 70 th birthday Harvey Goldstein ( also 70 ) Centre for Multilevel Modelling University.
HSRP 734: Advanced Statistical Methods July 24, 2008.
Latino fathers’ childbearing intentions: The view from mother-proxy vs. father self-reports Lina Guzman, Jennifer Manlove, & Kerry Franzetta.
Carl E. Bentelspacher, Ph.D., Department of Social Work Lori Ann Campbell, Ph.D., Department of Sociology Michael Leber Department of Sociology Southern.
Chapter 6: Correlational Research Examine whether variables are related to one another (whether they vary together). Correlation coefficient: statistic.
SC968: Panel Data Methods for Sociologists
Multiple Logistic Regression RSQUARE, LACKFIT, SELECTION, and interactions.
Extending the multilevel model for change ALDA, Chapter Five
Judith D. Singer & John B. Willett Harvard Graduate School of Education Modeling discontinuous and nonlinear change ALDA, Chapter Six “Things have changed”
Judith D. Singer & John B. Willett Harvard Graduate School of Education Extending the discrete-time hazard model ALDA, Chapter Twelve “Some departure from.
John B. Willett & Judith D. Singer Harvard Graduate School of Education Introducing discrete-time survival analysis ALDA, Chapter Eleven “To exist is to.
Applied Longitudinal Data Analysis: Modeling Change and Event Occurrence “Time is the one immaterial object which we cannot influence—neither speed up.
From last time….. Basic Biostats Topics Summary Statistics –mean, median, mode –standard deviation, standard error Confidence Intervals Hypothesis Tests.
John B. Willett & Judith D. Singer Harvard Graduate School of Education Introducing the Multilevel Model for Change: ALDA, Chapter Three “When you’re finished.
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 13 Introduction to Linear Regression and Correlation Analysis.
Genetic Factors Predisposing to Homosexuality May Increase Mating Success in Heterosexuals Written by Zietsch et. al By Michael Berman and Lindsay Tooley.
Longitudinal Research: Present status and future prospects John B. Willett & Judith D. Singer Harvard University Graduate School of Education Contact us.
Unit 5c: Adding Predictors to the Discrete Time Hazard Model © Andrew Ho, Harvard Graduate School of EducationUnit 5c– Slide 1
Unit 5c: Adding Predictors to the Discrete Time Hazard Model © Andrew Ho, Harvard Graduate School of EducationUnit 5c– Slide 1
Unit 4c: Taxonomies of Logistic Regression Models © Andrew Ho, Harvard Graduate School of EducationUnit 4c – Slide 1
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 12: Multiple and Logistic Regression Marshall University.
Unit 4c: Taxonomies of Logistic Regression Models © Andrew Ho, Harvard Graduate School of EducationUnit 4c – Slide 1
Survival analysis Brian Healy, PhD. Previous classes Regression Regression –Linear regression –Multiple regression –Logistic regression.
© Willett & Singer, Harvard University Graduate School of EducationS077/Week #1– Slide 1 S077: Applied Longitudinal Data Analysis Week #1: What Are The.
Single and Multiple Spell Discrete Time Hazards Models with Parametric and Non-Parametric Corrections for Unobserved Heterogeneity David K. Guilkey.
Statistical Analyses & Threats to Validity
MODELING SELECTION WITH MULTINOMIAL TREATMENT MODELS: AN EXAMPLE USING PARENTAL ROLES KEVIN SHAFER SCHOOL OF SOCIAL WORK BRIGHAM YOUNG UNIVERSITY.
Stability of Resilience in Children of Adolescent Mothers Keri Weed University of South Carolina Aiken Deborah Keogh, and John Borkowski University of.
Factors that Influence Retention in Greek Therapeutic Communities Erianna Daliani MSc (Gerasimos Papanastasatos) KETHEA Research Dept. 11th European Conference.
Longitudinal Research methods in personality psychology.
G Lecture 121 Analysis of Time to Event Survival Analysis Language Example of time to high anxiety Discrete survival analysis through logistic regression.
Epidemiology The Basics Only… Adapted with permission from a class presentation developed by Dr. Charles Lynch – University of Iowa, Iowa City.
© Willett & Singer, Harvard University Graduate School of Education S077/Week #4– Slide 1 S077: Applied Longitudinal Data Analysis Week #4: What Are The.
Lecture 3-3 Summarizing r relationships among variables © 1.
Introduction ► College-student drinking remains a significant problem on campuses across the nation. ► It is estimated that 38-44% of college students.
Unit 5b: The Logistic Regression Approach to Life Table Analysis © Andrew Ho, Harvard Graduate School of EducationUnit 5b– Slide 1
Sep 2005:LDA - ONS1 Event history data structures and data management Paul Lambert Stirling University Prepared for “Longitudinal Data Analysis for Social.
Tobacco Control Research Conference July 2014 Determinants of smoking initiation in South Africa Determinants of smoking initiation in South Africa.
Has Public Health Insurance for Older Children Reduced Disparities in Access to Care and Health Outcomes? Janet Currie, Sandra Decker, and Wanchuan Lin.
Statistics and Quantitative Analysis U4320 Segment 12: Extension of Multiple Regression Analysis Prof. Sharyn O’Halloran.
Abecedarian Project. Problems With Prior Research few early childhood programs have been sufficiently well controlled to permit scientists to evaluate.
Observation & Analysis. Observation Field Research In the fields of social science, psychology and medicine, amongst others, observational study is an.
What is Development? Systematic changes and continuities –In the individual –Between conception and death “Womb to Tomb” Three broad domains –Physical,
The What and the Why of Statistics The Research Process Asking a Research Question The Role of Theory Formulating the Hypotheses –Independent & Dependent.
Between groups designs (2) – outline 1.Block randomization 2.Natural groups designs 3.Subject loss 4.Some unsatisfactory alternatives to true experiments.
© Willett & Singer, Harvard University Graduate School of EducationS077/Week #3– Slide 1 S077: Applied Longitudinal Data Analysis Week #3: What Topics.
The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Data structure for a discrete-time event history analysis Jane E. Miller, PhD.
Mixed-Design ANOVA 5 Nov 2010 CPSY501 Dr. Sean Ho Trinity Western University Please download: treatment5.sav.
G Class 11 Statistical Methods for the Analysis of Change Administrative Issues Why study change? Overview of methodological issues Overview of.
© Willett & Singer, Harvard University Graduate School of Education S077/Week #5– Slide 1 S077: Applied Longitudinal Data Analysis Week #5: What Are The.
Mixed ANOVA Models combining between and within. Mixed ANOVA models We have examined One-way and Factorial designs that use: We have examined One-way.
Today’s Schedule – 10/30 Ch. 11 & 12.2 Quiz Finish Daily Show Clip
01/20151 EPI 5344: Survival Analysis in Epidemiology Cox regression: Introduction March 17, 2015 Dr. N. Birkett, School of Epidemiology, Public Health.
Personally Important Posttraumatic Growth as a Predictor of Self-Esteem in Adolescents Leah McDiarmid, Kanako Taku Ph.D., & Aundreah Walenski Presented.
Personal Control over Development: Effects on the Perception and Emotional Evaluation of Personal Development in Adulthood.
1 Kuo-hsien Su, National Taiwan University Nan Lin, Academia Sinica and Duke University Measurement of Social Capital: Recall Errors and Bias Estimations.
What is Development? Systematic changes and continuities –In the individual –Between conception and death “Womb to Tomb” Three broad domains –Physical,
Transitions in Conjoint Alcohol and Tobacco Use among Adolescents Kristina M. Jackson University of Missouri, Columbia & Missouri Alcoholism Research Center.
Sources of Increasing Differential Mortality among the Aged by Socioeconomic Status Barry Bosworth, Gary Burtless and Kan Zhang T HE B ROOKINGS I NSTITUTION.
Mixed-Design ANOVA 13 Nov 2009 CPSY501 Dr. Sean Ho Trinity Western University Please download: treatment5.sav.
Stephen Nkansah-Amankra, PhD, MPH, MA 1, Abdoulaye Diedhiou, MD, PHD, H.L.K. Agbanu, MPhil, Curtis Harrod, MPH, Ashish Dhawan, MD, MSPH 1 University of.
© Willett, Harvard University Graduate School of Education, 6/13/2016S052/II.2(a3) – Slide 1 S052/II.2(a3): Applied Data Analysis Roadmap of the Course.
Trends in Access to Substance Abuse Treatment for Women and Men: Jeanne C. Marsh, PhD, Hee-Choon Shin, PhD, Dingcai Cao, PhD University of Chicago.
Types of questions TVEM can answer
Presentation transcript:

John B. Willett and Judith D. Singer Harvard Graduate School of Education Examine our book, Applied Longitudinal Data Analysis (Oxford University Press, 2003) at: gseacademic.harvard.edu/~alda Longitudinal Research: Present Status and Future Prospects “Time is the one immaterial object which we cannot influence— neither speed up nor slow down, add to nor diminish.” Maya Angelou

Rothman, KJ., (1996) Lessons from John Graunt, Lancet, Vol. 347, Issue 8993 Graunt’s accomplishments Analyzed mortality statistics in London and concluded correctly that more female than male babies were born and that women lived longer than men. Created the first life table assessing out of every 100 babies born in London, how many survived until ages 6, 16, 26, etc  AgeDied Survived Unfortunately, the table did not give a realistic representation of true survival rates because the figures for ages after 6 were all guesses. The first recorded longitudinal study of event occurrence: Graunt’s Notes on the Bills of Mortality (1662)

Recorded his son’s height every six months from his birth in 1759 until his 18 th birthday Buffon (1777) Histoire Naturelle & Scammon, RE (1927) The first seriation study of human growth, Am J of Physical Anthropology, 10, / Adolescent growth spurt The first longitudinal study of growth: Filibert Gueneau de Montbeillard ( )

Does a galloping horse ever have all four feet off the ground at once? Making continuous TIME amenable to study: Eadweard Muybridge (1887) Animal Locomotion

Annual searches for keyword 'longitudinal' in 6 OVID databases, between 1982 and 2002 Medicine (451%) Psychology (365%) Education (down 8%) Economics (361%) Sociology (245%) Agriculture/ Forestry (326%) What about now?: How much longitudinal research is being conducted?

What’s the “quality” of today’s longitudinal studies? First, the good news: More longitudinal studies are being published, and an increasing %age of these are “truly” longitudinal First, the good news: More longitudinal studies are being published, and an increasing %age of these are “truly” longitudinal ’03‘99 47%33% longitudinal 26%36% 2 waves 45%38% 4 or more waves 29%26% 3 waves Now, the bad news: Very few of these longitudinal studies are using “modern” analytic methods Now, the bad news: Very few of these longitudinal studies are using “modern” analytic methods 15%7%Growth modeling 5%2%Survival analysis 9%6%Ignoring age-heterogeneity 7% 8% 6% “Simplifying” analyses by…. Setting aside waves Combining waves 17%8%Separate but parallel analyses 32%38%Wave-to-wave regression 29%40%Repeated measures ANOVA Read 150 articles published in 10 APA journals in 1999 and 2003

Comments received this year from two reviewers for Developmental Psychology of a paper that fit individual growth models to 3 waves of data on vocabulary size among young children: Reviewer B: “The analyses fail to live up to the promise…of the clear and cogent introduction. I will note as a caveat that I entered the field before the advent of sophisticated growth-modeling techniques, and they have always aroused my suspicion to some extent. I have tried to keep up and to maintain an open mind, but parts of my review may be naïve, if not inaccurate.” Reviewer A: “I do not understand the statistics used in this study deeply enough to evaluate their appropriateness. I imagine this is also true of 99% of the readers of Developmental Psychology. … Previous studies in this area have used simple correlation or regression which provide easily interpretable values for the relationships among variables. … In all, while the authors are to be applauded for a detailed longitudinal study, … the statistics are difficult. … I thus think Developmental Psychology is not really the place for this paper.” Part of the problem may well be reviewers’ ignorance

1.Within-person descriptive: How does an infant’s neurofunction change over time? 2Within-person summary: What is each child’s rate of development? 3Between-person comparison: How do these rates vary by child characteristics? 1.Within-person descriptive: Does each married couple eventually divorce? 2.Within-person summary: If so, when are couples most at risk of divorce? 3.Between-person comparison: How does this risk vary by couple characteristics? Individual Growth Model/ Multilevel Model for Change Discrete- and Continuous-Time Survival Analysis Espy et al. (2000) studied infant neurofunction 40 infants observed daily for 2 weeks; 20 had been exposed to cocaine, 20 had not. Infants exposed to cocaine had lower rates of change in neurodevelopment. South (2001) studied marriage duration. 3,523 couples followed for 23 years, until divorce or until the study ended. Couples in which the wife was employed tended to divorce earlier. Questions about systematic change over timeQuestions about whether and when events occur What kinds of research questions require longitudinal methods?

residuals for person i, one for each occasion j Level-2 model for level-1 intercepts Level-2 model for level-1 slopes intercept for person i (“initial status”) slope for person i (“growth rate”) 1 Modeling change over time: An overview Postulating statistical models at each of two levels in a natural hierarchy At level-1 (within person): Model the individual change trajectory,which describes how each person’s status depends on time At level-2 (between persons): Model inter-individual differences in change, which describe how the features of the change trajectories vary across people Example: Changes in delinquent behavior among teens (ID & 12 person sample from full sample of 124)

Example: Grade of first heterosexual intercourse as a function of early parental transition status (PT) logit(hazard) PT=1 PT=0 Grade logit(hazard) PT=1 PT=0 Grade The Censoring Dilemma What do you do with people who don’t experience the event during data collection? (Non-occurrence tells you a lot about event occurrence, but they don’t have known event times.) The Survival Analysis Solution Model the hazard function, the temporal profile of the conditional risk of event occurrence among those still “at risk” ( those who haven’t yet experienced the event ) Discrete-time: Time is measured in intervals Hazard is a probability & we model its logit Continuous-time : Time is measured precisely Hazard is a rate & we model its logarithm “shift in risk” corresponding to unit differences in PT “baseline” (logit) hazard function Modeling event occurrence over time: An overview

1.You have much more flexibility in research design Not everyone needs the same rigid data collection schedule— cadence can be person specific Not everyone needs the same number of waves—can use all cases, even those with just one wave! 2.You can identify temporal patterns in the data Does the outcome increase, decrease, or remain stable over time? Is the general pattern linear or non-linear? Are there abrupt shifts at substantively interesting moments? 3.You can include time varying predictors (those whose values vary over time) Participation in an intervention Family composition, employment Stress, self-esteem 4.You can include interactions with time (to test whether a predictor’s effect varies over time) Some effects dissipate—they wear off Some effects increase—they become more important Some effects are especially pronounced at particular times. Four important advantages of modern longitudinal methods

Murnane, Boudett & Willett (1999): Used NLSY data to track the wages of 888 HS dropouts Number and spacing of waves varies tremendously across people 40% earned a GED: RQ: Does earning a GED affect the wage trajectory, and if so how? Empirical growth plots for 2 dropouts GED Three plausible alternative discontinuous multilevel models for change Is the individual growth trajectory discontinuous? Wage trajectories of male HS dropouts

LNW earned a GED GED receipt Upon GED receipt, wages rise immediately by 4.2% Post-GED receipt, wages rise annually by 5.2% (vs. 4.2% pre-receipt) White/ Latino Black Race At dropout, no racial differences in wages Racial disparities increase over time because wages for Blacks increase at a slower rate 12 th grade dropouts 9 th grade dropouts Highest grade completed Those who stay longer have higher initial wages This differential remains constant over time Displaying prototypical discontinuous trajectories (Log Wages for HS dropouts pre- and post-GED attainment)

Ginexi, Howe & Caplan (2000) 254 interviews at unemployment offices (within 2 mos of job loss) 2 other 3-8 mos mos Assessed CES-D scores and unemployment status (UNEMP) at each wave RQ: Does reemployment affect the depression trajectories and if so how? The person-period dataset Unemployed all 3 waves Reemployed by wave 2 Reemployed by wave 3 Hypothesizing that the TV predictor’s effect is constant over time:  2i Add the TV predictor to the level-1 model to register these shifts Level 1: Level 2: Including a time-varying predictor: Trajectories of change after unemployment

Everyone starts on the declining UNEMP=1 line If you get a job you drop 5.11 pts to the UNEMP=0 line Lose that job and you rise back to the UNEMP=1 line CESD UNEMP=1 UNEMP=0 Months since job loss Assume its effect is constant When UNEMP=1, CES-D declines over time When UNEMP=0, CES-D increases over time??? CESD UNEMP=1 UNEMP=0 Months since job loss Allow its effect to vary over time Everyone starts on the declining UNEMP=1 line Get a job and you drop to the flat UNEMP=0 line Effect of UNEMP is 6.88 on layoff and declines over time (by 0.33/month) CESD UNEMP=1 UNEMP=0 Months since job loss Finalize the model Must these lines be parallel?: Might the effect of UNEMP vary over time? Is this increase real?: Might the line for the re- employed be flat? This is the “best fitting” model of the set Determining if the time-varying predictor’s effect is constant over time 3 alternative sets of prototypical CES-D trajectories

Wheaton, Roszell & Hall (1997) Asked 1,393 Canadians whether (and when) each first had a depression episode 27.8% had a first onset between 4 and 39 RQ: Is there an effect of PD, and if so, is it long-term or short-term? Age fitted hazard Age fitted hazard Well known gender effect Effect of PD coded as TV predictor, but in two different ways: long-term & short-term Postulating a discrete-time hazard model Parental death treated as a short-term effect Odds of onset are 462% higher in the year a parent dies Parental death treated as a long-term effect Odds of onset are 33% higher among people who parents have died Using time-varying predictors to test competing hypotheses about a predictor’s effect: Risk of first depression onset: The effect of parental death

Foster (2000): Tracked hospital stay for 174 teens Half had traditional coverage Half had an innovative plan offering coordinating mental health services at no cost, regardless of setting (didn’t need hospitalization to get services) RQ: Does TREAT affect the risk of discharge (and therefore length of stay)? Treatment Comparison Main effects model Predictor (ns) TREAT No statistically significant main effect of TREAT Interaction with time model ** TREAT*(log Time) *** There is an effect of TREAT, especially initially, but it declines over time Is a time-invariant predictor’s effect constant over time? Risk of discharge from an inpatient psychiatric hospital

Tivnan (1980) Played up to 27 games of Fox ‘n Geese with 17 1 st and 2 nd graders A strategy that guarantees victory exists, but it must be deduced over time NMOVES tracks the number of turns a child takes per game (range 1-20) RQ: What trajectories do children follow when learning the game? A level-1 logistic model “Standard” level-2 models Is the individual growth trajectory non-linear? Tracking cognitive development over time What features should the hypothesized model display? A smooth curve joining the asymptotes A lower asymptote, because everyone makes at least 1 move and it takes a while to figure out what’s going on An upper asymptote, because a child can make only a finite # moves each game

NMOVES Game Good readers (READ=1.58) Poor readers (READ=-1.58) Displaying prototypical logistic growth trajectories (NMOVES for poor and good readers for the Fox ‘n Geese data)

Where to go to learn more

A limitless array of non-linear trajectories awaits… Four illustrative possibilities