Sep 2005:LDA - ONS1 Event history data structures and data management Paul Lambert Stirling University Prepared for “Longitudinal Data Analysis for Social.

Slides:



Advertisements
Similar presentations
Poverty trajectories after risky life events in Germany, Spain, Denmark and the United Kingdom: a latent class approach Leen Vandecasteele Post-doctoral.
Advertisements

What is Event History Analysis?
Multilevel Event History Modelling of Birth Intervals
SC968: Panel Data Methods for Sociologists
What is Event History Analysis?
Multilevel Event History Models with Applications to the Analysis of Recurrent Employment Transitions Fiona Steele.
Survival Analysis. Key variable = time until some event time from treatment to death time for a fracture to heal time from surgery to relapse.
Non-employment and the welfare state: the UK and Germany compared J Clasen, J. Davidson, H. Ganssmann, A. Mauer Journal of European Social Policy, 16,
HSRP 734: Advanced Statistical Methods July 24, 2008.
Quantitative Longitudinal Data Paul Lambert and Vernon Gayle Stirling University Prepared for “Longitudinal Data Analysis for Social Science Researchers:
SC968: Panel Data Methods for Sociologists
Part 21: Hazard Models [1/29] Econometric Analysis of Panel Data William Greene Department of Economics Stern School of Business.
Main Points to be Covered
Data Sources The most sophisticated forecasting model will fail if it is applied to unreliable data Data should be reliable and accurate Data should be.
How Long Until …? Given a strike, how long will it last?
Causal-Comparative Research
Event History Analysis: Introduction Sociology 229 Class 3 Copyright © 2010 by Evan Schofer Do not copy or distribute without permission.
TOOLS OF POSITIVE ANALYSIS
Main Points to be Covered Cumulative incidence using life table method Difference between cumulative incidence based on proportion of persons at risk and.
German Life History Survey 201 respondents. The German life history survey (GLHS) Retrospective survey about life histories of men and women born in ,
Analysis of Complex Survey Data
PEPA is based at the IFS and CEMMAP © Institute for Fiscal Studies Identifying social effects from policy experiments Arun Advani (UCL & IFS) and Bansi.
Survival Analysis: From Square One to Square Two
17. Duration Modeling. Modeling Duration Time until retirement Time until business failure Time until exercise of a warranty Length of an unemployment.
Single and Multiple Spell Discrete Time Hazards Models with Parametric and Non-Parametric Corrections for Unobserved Heterogeneity David K. Guilkey.
Stratification and Adjustment
Rob Lancaster, Orbitz Worldwide Survival Analysis & TTL Optimization.
Understanding Trends in Occupational Sex Segregation By Daniel Guinea-Martin Advanced Centre for Scientific Research, Spain (formerly at the Office for.
Longitudinal Data Analysis for Social Science Researchers Thinking About Event Histories
Essentials of survival analysis How to practice evidence based oncology European School of Oncology July 2004 Antwerp, Belgium Dr. Iztok Hozo Professor.
HSRP 734: Advanced Statistical Methods July 10, 2008.
1 Introduction to medical survival analysis John Pearson Biostatistics consultant University of Otago Canterbury 7 October 2008.
Assessing Survival: Cox Proportional Hazards Model
Longitudinal Data Analysis Professor Vernon Gayle
Linear Scheduling Method Definition A simple diagram to show location and time at which a certain crew will be working on a given operation.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
“Further Modeling Issues in Event History Analysis by Robert E. Wright University of Strathclyde, CEPR-London, IZA-Bonn and Scotecon.
The Chicago Guide to Writing about Multivariate Analysis, 2nd Edition. Data structure for a discrete-time event history analysis Jane E. Miller, PhD.
University of Warwick, Department of Sociology, 2014/15 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Week 7 Logistic Regression I.
Modelling Longitudinal Data Survival Analysis. Event History. Recurrent Events. A Final Point – and link to Multilevel Models (perhaps).
HSRP 734: Advanced Statistical Methods July 17, 2008.
Assessing Binary Outcomes: Logistic Regression Peter T. Donnan Professor of Epidemiology and Biostatistics Statistics for Health Research.
The dynamics of poverty in Ethiopia : persistence, state dependence and transitory shocks By Abebe Shimeles, PHD.
Presenting large scale forecast results in an intuitive way An industry case study. David Edison.
University of Warwick, Department of Sociology, 2012/13 SO 201: SSAASS (Surveys and Statistics) (Richard Lampard) Survival Analysis/Event History Analysis:
1 Lecture 6: Descriptive follow-up studies Natural history of disease and prognosis Survival analysis: Kaplan-Meier survival curves Cox proportional hazards.
‘Interpreting coefficients from longitudinal models’ Professor Vernon Gayle and Dr Paul Lambert (Stirling University) Wednesday 1st April 2009.
More complex event history analysis. Start of Study End of Study 0 t1 0 = Unemployed; 1 = Working UNEMPLOYMENT AND RETURNING TO WORK STUDY Spell or Episode.
01/20151 EPI 5344: Survival Analysis in Epidemiology Cox regression: Introduction March 17, 2015 Dr. N. Birkett, School of Epidemiology, Public Health.
MARKET APPRAISAL. Steps in Market Appraisal Situational Analysis and Specification of Objectives Collection of Secondary Information Conduct of Market.
Generating Summaries from FOT Data ITS World Congress, Detroit 2014 Dr. Sami Koskinen, VTT
Stretching Your Data Management Skills Chuck Humphrey University of Alberta Atlantic DLI Workshop 2003.
Demand Forecasting Prof. Ravikesh Srivastava Lecture-11.
: LSS1 Longitudinal Studies Seminars: Longitudinal Analyses Using STATA Stirling University, Data and Variable Management Paul Lambert.
Getting the most out of interactive and developmental data Daniel Messinger
STATA WORKSHOP
Additional Regression techniques Scott Harris October 2009.
Linking data resources Paul Lambert, University of Stirling Presentation to the Scottish Civil Society Data Partnership Project (S-CSDP), Webinar 3 on.
Capture-recapture Models for Open Populations Multiple Ages.
The Mortality Effects of Health Insurance for the Near-Elderly Uninsured Jose Escarce David Geffen School of Medicine at UCLA and RAND Coauthors: Daniel.
SURVIVAL ANALYSIS WITH STATA. DATA INPUT 1) Using the STATA editor 2) Reading STATA (*.dta) files 3) Reading non-STATA format files (e.g. ASCII) - infile.
DURATION ANALYSIS Eva Hromádková, Applied Econometrics JEM007, IES Lecture 9.
[Topic 11-Duration Models] 1/ Duration Modeling.
Measures of disease frequency Simon Thornley. Measures of Effect and Disease Frequency Aims – To define and describe the uses of common epidemiological.
Longitudinal Analysis. AQMeN Introduction to Advanced Quantitative Techniques Thursday 13 th May 2010 Longitudinal Analysis Professor Vernon Gayle University.
April 18 Intro to survival analysis Le 11.1 – 11.2
Life history data Repeated measurements of states or events
Advanced quantitative methods for social scientists (2017–2018) LC & PVK Session 6 Event History Analysis / survival (and other tools for social and individual.
Jeffrey E. Korte, PhD BMTRY 747: Foundations of Epidemiology II
Presentation transcript:

Sep 2005:LDA - ONS1 Event history data structures and data management Paul Lambert Stirling University Prepared for “Longitudinal Data Analysis for Social Science Researchers”, Stirling University, 2-6 th September 2006

Sep 2005:LDA - ONS2 Event history data structures and data management 1. Structures of event history data 2. Measuring and analysing life-courses

Sep 2005:LDA - ONS3 Event history data analysis Alternative data sources –Panel / cohort (more reliable) –Retrospective (cheaper, but recall errors) Aka: ‘Survival data analysis’; ‘Failure time analysis’; ‘hazards’; ‘risks’;.. Specific analytical techniques required (SPSS has some; Stata has more) Focus shifts to length of time in a ‘state’ - analyses determinants of time in state

4 Event history dataset PersonEventDurationStart N_p=4N_e=3

Sep 2005:LDA - ONS5 Key to event histories is ‘state space’

Sep 2005:LDA - ONS6 Event history data permutations Single state single episode –Eg Duration in first post-school job till end Single episode competing risks –Eg Duration in job until promotion / retire / unemp. Multi-state multi-episode –Eg adult working life histories Time varying covariates –Eg changes in family circumstances as influence on employment durations

Sep 2005:LDA - ONS7 Using Stata with event history data See the Stata manual ‘Survival Analysis and Epidemiological Tables’ stset: declares survival time data: stset dur, failure(status) (each case is an episode; variable dur is the length of the episode; variable status indicates whether record was right censored - value 0 means it was censored, ie, the end of event didn’t occur within observation period) Many specialist event history data analysis functions built into Stata Common pitfall: panel ~= duration data

Sep 2005:LDA - ONS8 ‘Wide’ versus ‘Long’ in event history data Relevant to multi-state or multi-episode data  ‘Wide’ = state space typologies / sequences  ‘Long’ = multiple states stacked above each other  Model controls for residual heterogeneity

Sep 2005:LDA - ONS9 Continuous v’s Discrete time Continuous time (‘spell files’, ‘event oriented’)  One episode per case, time in case is a variable Discrete time  One episode per time unit, type of event and event occurrence as variables  More flexible: time-varying covariates Stata –Oriented to continuous time data –Discrete time data formats are usually analysed by first transforming to continuous time

Sep 2005:LDA - ONS10

11

Sep 2005:LDA - ONS12 Transforming between continuous and discrete time SPSS –Discrete to continuous: simple aggregation –Continuous to discrete: bespoke programme Stata –Discrete to continuous: simplete aggregation (collapse) –Continuous to discrete: stsplit

Sep 2005:LDA - ONS13 Continuous v’s Discrete time Continuous  Gives full data on length of time period  More manageable data Discrete time  More flexible: time-varying covariates  More easily estimated: basic logistic regression with residual heterogeneity  Usually some time period detail is lost  [Germany: annual panels as discrete time EH data]

Sep 2005:LDA - ONS14 Multiple comparisons –Event history: multivariate=chopping up data – SPSS and Stata usually do this automatically (may use ‘select if’ / ‘by’)

Sep 2005:LDA - ONS15 Continuous time and time varying covariates? Continuous time‘Episode-splitting’summary idstartdurcohabIdspdurcohidcoh ?From mth Discrete time captures Time-Varying variables easily. Continuous time needs some work: Summary totals obtained through aggregation Episode splitting / discretising - automated in Stata, not SPSS

Sep 2005:LDA - ONS16 Ties In continuous time data, no two events should end at precisely the same time Estimation of survival functions assumes no tied ending points In practice, measurement units mean ties are common Analytial solution –Breslow 1974 (Stata default) –Alternative calculations

Sep 2005:LDA - ONS17 Event history data structures and data management 1. Structures of event history data 2. Measuring and analysing life-courses

Sep 2005:LDA - ONS18

Sep 2005:LDA - ONS19 Obtaining event history data We need State (simple characterisation of circumstance) –Start state (single v’s multi-state) –End state (single v’s competing risks) Duration (start and end time) Censoring indicator (right-censoring) –Did the event end during observation, or not? Other covariates –Must be able to map them to the event record

Sep 2005:LDA - ONS20 Obtaining event history data Data collection –‘Diary’ records / sequences of events –See questionnaire schedules Data construction : calculating duration –Dur = (end – start + 1) –Time[end | start] = data in an absolute unit –E.g. {calendar months; years; days} since 1900 –Stata and SPSS date conversion functions

Sep 2005:LDA - ONS21 Some UK event history datasets British Household Panel Study (see separate ‘combined life history’ files) National Birth Cohort Studies Family and Working Lives Survey Social Change and Economic Life Initiative Youth Cohort Studies

Sep 2005:LDA - ONS22 Descriptive : Kaplan-Meir survival

Sep 2005:LDA - ONS23 Modelling: Cox’s regression

Sep 2005:LDA - ONS24 Models – Loglinear models [e.g. Gilbert 1993; Vermunt 1997] Event histories are categorical measures Discrete time approach allows log-linear model MenWomen End of spell (last record) Year

Sep 2005:LDA - ONS25 Conceptualising parametric assumptions Stata is good for this:

Sep 2005:LDA - ONS26 Sequence analyses Descriptive techniques to characterise state sequences –‘career centred mode of analysis’ (Taris 2000) Categorical sequences –(c.f. growth curves / regression trajectories) Various possible implementations: –Latent class growth curves –Optimal matching analysis –Discriminant analysis –Sequence analysis Relies upon a theory of the nature of the career and state spaces

Sep 2005:LDA - ONS27 Overview: Event history data structures Focus upon speed until event Importance of censoring Categorisation of social science information Causality –Causal effects upon speed to transitions ‘event history models provide a time-relatd empirical representation of the structure of causal arguments’ [Blossfeld and Rohwer 2003, p24] But: restrictive state spaces and analytical options make for a limited description..