Longitudinal Data Analysis Professor Vernon Gayle

Slides:



Advertisements
Similar presentations
An Introduction to the UK Data Archive and the Economic and Social Data Service November 2007 Jack Kneeshaw, UKDA.
Advertisements

Introduction to longitudinal studies in the UK
Training opportunities – What do I need? And where can I get it? Vernon Gayle
What is Event History Analysis?
ESRC Future Strategy for Resources and Methods Professor Ian Diamond Chief Executive ESRC.
Depression and work incapacity in Scotland: Evidence from the Scottish Health and British Household Panel Surveys Matt Sutton Will Whittaker Health Methodology.
Mixed methods in longitudinal research Nick Buck UK Longitudinal Studies Centre University of Essex.
The Research Value of Longitudinal Data Vernon Gayle
Introduction to ESDS Longitudinal and ESDS International survey data and resources Jack Kneeshaw Economic and Social Data Service City University, 27 April.
Multilevel Event History Modelling of Birth Intervals
What is Event History Analysis?
Multilevel Event History Models with Applications to the Analysis of Recurrent Employment Transitions Fiona Steele.
University of Warwick 30th April 2004 Proposals for a Longitudinal Survey of Ethnic Minorities for the UK – results from a scoping study Anne Green University.
Longitudinal Data Analysis for Social Science Researchers Introduction to Panel Models
Methods of Economic Investigation Lecture 2
Work–life harmonisation and fertility in Australia: an event history analysis using HILDA data Hideki Nakazato, Konan University, Kobe, Japan
1 Growing Up in the 1990s – An exploration of the educational experiences of cohorts of Rising 16s in the BHPS Vernon Gayle, University of Stirling & ISER.
Social Stratification: the enduring concept that shapes the lives of Britain’s youth - Empirical analysis using the British Household Panel Survey Roxanne.
Research designs and methods
Dependent questioning on the Longitudinal Study of Young People in England (LSYPE) Iain Noble, Strategic Analysis, DfES Patten Smith, BMRB.
1 Scottish Social Survey Network: Master Class 1 Data Analysis with Stata Dr Vernon Gayle and Dr Paul Lambert 23 rd January 2008, University of Stirling.
The impact of job loss on family dissolution Silvia Mendolia, Denise Doiron School of Economics, University of New South Wales Introduction Objectives.
Demystifying Data Reference Helping non-specialists make sense of data.
Quantitative methods for researching lives through time Heather Laurie Institute for Social and Economic Research University of Essex
The Social Profile of Rural Britain: Insights from longitudinal datasets Heather Joshi Gareth Hughes & Brian Dodgeon Centre for Longitudinal Studies Institute.
M. Fall, JP. Lorgnet et alii 26/02/2010 Individual Dynamics of Poverty, a study tackling changes in poverty in France via the SILC survey.
Employment Decisions of European Women After Childbirth Chiara Pronzato (ISER) EPUNet Conference, May 9th 2006.
ESRC: Overview and Priorities Maria Sigala. ESRC in Context ▶ Non-Departmental Public Body, established in 1965 ▶ The major public sector funder of social.
Longitudinal Data Analysis for Social Science Researchers Research Value of Longitudinal Data
Analysis of Clustered and Longitudinal Data
1 Types and Sources of Data UAPP 702 Research Methods for Urban & Public Policy Based on notes by Steven W. Peuquet, Ph.D.
Single and Multiple Spell Discrete Time Hazards Models with Parametric and Non-Parametric Corrections for Unobserved Heterogeneity David K. Guilkey.
SIT008 – Research Design in Practice Week 5 Luke Sloan Cross-Sectional & Longitudinal Designs Week 5 Luke Sloan Cross-Sectional & Longitudinal Designs.
Understanding Society: the UK Household Longitudinal Study Professor Vernon Gayle ISER, Essex University
Centre for Market and Public Organisation Understanding the effect of public policy on fertility Mike Brewer (Institute for Fiscal Studies) Anita Ratcliffe.
Research Terminology for The Social Sciences.  Data is a collection of observations  Observations have associated attributes  These attributes are.
Using the Health Survey for England to examine ethnic differences in obesity, diet and physical activity Vanessa Higgins & Angela Dale Centre for Census.
GEODE, 16 Jan 2007 Occupational Analysis – Issues and Examples Grid Enabled Occupational Data Environment GEODE Project workshop, 16 th January 2007 Vernon.
The dynamics of shared care in the UK Stephen McKay Professor of Social Research School of Social Policy University of Birmingham, UK International Society.
Longitudinal Data: An introduction to some conceptual issues Vernon Gayle.
Understanding Society – an overview. Title | Date Background Understanding Society is a longitudinal study based on a household panel design Basic design.
Following lives from birth and through the adult years The Circumstances of Early Motherhood Denise D. Hawkes 19 th December 2007 The.
Sep 2005:LDA - ONS1 Event history data structures and data management Paul Lambert Stirling University Prepared for “Longitudinal Data Analysis for Social.
EE325 Introductory Econometrics1 Welcome to EE325 Introductory Econometrics Introduction Why study Econometrics? What is Econometrics? Methodology of Econometrics.
ESDS resources for managing data Jack Kneeshaw Economic and Social Data Service University of Essex, 27 January 2009.
Longitudinal data analysis for social science researchers University of Stirling 5 th September 2006 Prof. John Field, Stirling University Longitudinal.
An Introduction to the Economic and Social Data Service January 2006 Jack Kneeshaw, ESDS.
Finding Microlevel Data for Economists at Princeton University: Education and Labor.
Early Motherhood in the UK: Micro and Macro Determinants Denise Hawkes and Heather Joshi Centre for Longitudinal Research Institute of Education University.
Finding and analysing variables in the CLS cohorts Brian Dodgeon Centre for Longitudinal Studies Institute of Education University College London.
Essex Dependent Interviewing Workshop 17/09/2004 British Household Panel Survey.
‘Interpreting coefficients from longitudinal models’ Professor Vernon Gayle and Dr Paul Lambert (Stirling University) Wednesday 1st April 2009.
More complex event history analysis. Start of Study End of Study 0 t1 0 = Unemployed; 1 = Working UNEMPLOYMENT AND RETURNING TO WORK STUDY Spell or Episode.
1 Lifestyle, participation, identity and life satisfaction Nick Buck Institute for Social and Economic Research
Academic perspectives: Quantitative and qualitative paradigms in studying migrant youth identity Paul Lambert (University of Stirling) Presentation to.
Statistics Canada Citizenship and Immigration Canada Methodological issues.
Demand Forecasting Prof. Ravikesh Srivastava Lecture-11.
: LSS1 Longitudinal Studies Seminars: Longitudinal Analyses Using STATA Stirling University, Data and Variable Management Paul Lambert.
Household Structure and Household Structure and Childhood Mortality in Ghana Childhood Mortality in Ghana Winfred Avogo Victor Agadjanian Department of.
STATA WORKSHOP
UKHLS Consultation Launch, 19/06/07, RSS Initial Conditions and Life Histories Heather Laurie ISER
Abcdefghijkl Scottish School Leavers’ Survey A brief overview by Barry Stalker.
Longitudinal Analysis. AQMeN Introduction to Advanced Quantitative Techniques Thursday 13 th May 2010 Longitudinal Analysis Professor Vernon Gayle University.
The benefits of a longitudinal approach to policy-making Introduction to Understanding Society, 29 th June 2016, Cardiff University Professor Melanie Jones.
Eliminating Reproductive Risk Factors and Reaping Female Education and Work Benefits: A Constructed Cohort Analysis of 50 Developing Countries Qingfeng.
Section 2: Longitudinal study samples
Look ahead: Advanced modelling and learning resources
Section 2: Types of longitudinal studies
Section 1: What are longitudinal studies?
Key Challenges Strategic Advisor For Data Resources
Presentation transcript:

Longitudinal Data Analysis Professor Vernon Gayle University of Stirling vernon.gayle@stir.ac.uk www.longitudinal.stir.ac.uk

Structure of this Presentation Introduction Longitudinal Data (the simple concept) Longitudinal Social Science Datasets Temporal Data Collection Data Collection Modes Longitudinal Data Structures Benefits of Longitudinal Data Longitudinal Models (15 years in 2 slides) Conclusions Where Next?

Introduction For many social research projects cross-sectional data will be sufficient Most social research projects can be improved by the analysis of longitudinal data Some research questions require longitudinal data

Introduction Some research questions require longitudinal data Flows into and out of childhood poverty The effects of family migration on the woman’s subsequent employment activities Numerous policy intervention examples Numerous examples relating to ‘individual’ development

Introduction Longitudinal research is the lifeblood of the study of individual development. It has been pointed out many times that the most important questions concerning individual development can be answered only be applying a longitudinal design whereby the same individuals are followed through time. (Bergman & Magnusson 1990)

Longitudinal Data Much of the time we are only interested in how some social phenomena affects a later outcome Primary school Standard Grade experiences results (S4) We are using longitudinal data but standard cross-sectional techniques are still suitable Analytically this is fairly trivial

Repeated outcome measures (one per contact) Longitudinal Data Repeated outcome measures (one per contact) ID Year Age Employment 1 1991 16 Student 1992 17 Student 1 1993 18 Student 1 1994 19 Unemployed 1 1995 20 Employed (ft) 1 1996 21 Employed (ft) 1 1997 22 Employed (ft) 1 1998 23 Maternity Leave 1 1999 24 Family Care 2001 25 Employed (pt) Analysis of repeated measures (i.e. multiple outcomes) is more complex Specialist techniques are required!

Longitudinal Social Science Datasets Micro social science datasets Large n (e.g. BHPS 10k adults 5k households) Small t (contacts once per year) Large x (hundreds of variables) Contrast with time-series datasets Unemployment rates; inflation; share prices etc. Macro level (often countries) Small n (one country; 10 EU states) Frequent contacts (months by month for many years) Few variables, sometime just one (e.g. inflation)

Longitudinal Social Science Datasets Two main forms of micro social longitudinal datasets 1 Panel Dataset Repeated contacts data collection Sociologist Paul Lazarsfeld opinion research in 1930s Common example is the Household Panel Study

Longitudinal Social Science Datasets Cohort Study Repeated contacts data collection Principally concerned with charting the development of a particular ‘group’ from a certain point in time (simply a specific form of panel design in my view) A birth cohort of babies born in a particular year A youth cohort, a group of pupils who completed compulsory education in the same year A group of newly qualified doctor

Longitudinal Social Science Datasets Panel Dataset Examples (Household Panel Studies) US Panel Study of Income Dynamics (PSID) began in 1968 http://psidonline.isr.umich.edu/ Germany Socio-Economic Panel (SOEP) began in 1984 http://www.diw.de/en/soep British Household Panel Survey BHPS (1991 onwards) 5k households, 10k adults, http://www.iser.essex.ac.uk/survey/bhps

Longitudinal Social Science Datasets New British Panel Survey Understanding Society (US) Also known as the UK Household Longitudinal Study (UKHLS) Began in January 2009 Incorporates and extends the BHPS 40k UK households (4k Scottish Households) 4k households in a special ethnic minorities sample Innovations include linking to administrative data; spatial data; biometric data; qualitative data; child data (from age 10) http://www.understandingsociety.org.uk/

Longitudinal Social Science Datasets Cohort Dataset Examples Birth Cohort Studies 1946, 1958, 1970 & Millennium Cohort Study http://www.cls.ioe.ac.uk/ http://www.nshd.mrc.ac.uk/ Youth Cohort Study of England and Wales (YCS) http://www.data-archive.ac.uk/findingData/ycsTitles.asp BMA Cohort Study of Newly Qualified Doctors http://www.bma.org.uk/healthcare_policy/cohort_studies/index.jsp

Longitudinal Social Science Datasets Scottish Datasets Examples Growing Up in Scotland (GUS) A birth cohort study of 8k children http://www.crfr.ac.uk/gus/ Scottish Longitudinal Study A panel study of 274k people based on Census records http://www.lscs.ac.uk/sls/

Temporal Data Collection Prospective Retrospective Most studies collect a mixture of both

Longitudinal Data Structures A simple example of a panel (repeated contacts) dataset ID Year Age Gender Employment Marital Status 1 1991 16 Female Student Single 1 1992 17 Female Student Single 1 1993 18 Female Student Single 1 1994 19 Female Unemployed Single 1 1995 20 Female Employed (ft) Cohabiting 1 1996 21 Female Employed (ft) Cohabiting 1 1997 22 Female Employed (ft) Cohabiting 1 1998 23 Female Maternity Leave Married 1 1999 24 Female Family Care Married 1 2001 25 Female Employed (pt) Separated

Longitudinal Data Structures “Long” format dataset ID Year Age Gender Employment Marital Status 1991 16 1 1992 17 1 1993 18 1 1994 19 1 1995 20 1 1996 21 1 1997 22 1998 23 1 1999 24 1 2001 25

Longitudinal Data Structures Repeated outcome measures (one per contact) ID Year Age Gender Employment Marital Status 1 1991 16 Female Student Single 1 1992 17 Female Student Single 1 1993 18 Female Student Single 1 1994 19 Female Unemployed Single 1 1995 20 Female Employed (ft) Cohabiting 1 1996 21 Female Employed (ft) Cohabiting 1 1997 22 Female Employed (ft) Cohabiting 1 1998 23 Female Maternity Leave Married 1 1999 24 Female Family Care Married 1 2001 25 Female Employed (pt) Separated

Longitudinal Data Structures Time constant explanatory variables ID Year Age Gender Employment Marital Status 1 1991 16 Female Single 1 1992 17 Female Single 1 1993 18 Female Single 1 1994 19 Female Single 1 1995 20 Female Cohabiting 1 1996 21 Female Cohabiting 1 1997 22 Female Cohabiting 1 1998 23 Female Married 1 1999 24 Female Married 1 2001 25 Female Separated

Longitudinal Data Structures Time changing explanatory variables ID Year Age Gender Employment Marital Status 1 1991 16 Female Student Single 1 1992 17 Female Student Single 1 1993 18 Female Student Single 1 1994 19 Female Unemployed Single 1 1995 20 Female Employed (ft) Cohabiting 1 1996 21 Female Employed (ft) Cohabiting 1 1997 22 Female Employed (ft) Cohabiting 1 1998 23 Female Maternity Leave Married 1 1999 24 Female Family Care Married 1 2001 25 Female Employed (pt) Separated

Longitudinal Data Structures Time to an event Time to first childbirth 1991 1998 X ID=1

Benefits of Longitudinal Data Some research questions require longitudinal data Micro-level change over time Flows into and out of childhood poverty The effects of family migration Policy intervention examples ‘Individual’ development

Benefits of Longitudinal Data Additional methodological benefits Temporal ordering of events (direction of causality) Improved control for omitted explanatory variables (residual heterogeneity) Improved control for the effects of previous states (state dependence) Exploring the effects of both ageing and cohort membership (age-period-cohort effects)

Longitudinal Models Two main modelling approaches in social science research 1. Event history analysis, time to an event Also known as duration analysis; survival analysis; failure time analysis; duration economics; hazard modelling Generally time is continuous and we model the probability of an event occurring given that it has not already occurred (hazard)

Longitudinal Models Two main modelling approaches in social science research Panel data analysis Regression models suitable for repeated observations Time generally conceptualised as being discrete Extension of standard regression models (glm) Closely related to multilevel modelling (glmm) More advanced versions (e.g. dynamic models) Alternative terminology variance components models; hierarchical linear models; cross-sectional time series; random effects modelling

Conclusion For many social research projects cross-sectional data will be sufficient Most social research projects can be improved by the analysis of longitudinal data Some research questions require longitudinal data

Conclusion Longitudinal are not a panacea but data facilitate The study of micro-level change social change over time (and also social stability) A better understanding of the temporal ordering of events (direction of causality) Improved control for omitted explanatory variables (residual heterogeneity) Improved control for the effects of previous states (state dependence) Exploration of the effects of both ageing and cohort membership (age-period-cohort effects)

Where Next? More advanced skills are required Extension from standard modelling techniques Software (I recommend Stata) www.longitudinal.stir.ac.uk Annotated reading list http://www.longitudinal.stir.ac.uk/refs/reading_lda_08.pdf AQMeN training planned for early 2011