MODELLING REPEATED MEASURES ON FAMILY MEMBERS IN GEOGRAPHICAL AREAS IAN PLEWIS UNIVERSITY OF MANCHESTER PRESENTATION TO RESEARCH METHODS FESTIVAL OXFORD,

Slides:



Advertisements
Similar presentations
Continuity and Change in the Generation(s) and Life-course of Social Exclusion John Hobcraft University of York.
Advertisements

Multilevel modelling short course
Social Statistics Estimation and complex survey design Ian Plewis, CCSR, University of Manchester.
Multilevel Event History Models with Applications to the Analysis of Recurrent Employment Transitions Fiona Steele.
Latent normal models for missing data Harvey Goldstein Centre for Multilevel Modelling University of Bristol.
Seven Deadly Sins of Dyadic Data Analysis David A. Kenny February 14, 2013.
MCUAAAR: Methods & Measurement Core Workshop: Structural Equation Models for Longitudinal Analysis of Health Disparities Data April 11th, :00 to.
Cross Sectional Designs
The choice between fixed and random effects models: some considerations for educational research Claire Crawford with Paul Clarke, Fiona Steele & Anna.
Longitudinal and Multilevel Methods for Models with Discrete Outcomes with Parametric and Non-Parametric Corrections for Unobserved Heterogeneity David.
Random effects as latent variables: SEM for repeated measures data Dr Patrick Sturgis University of Surrey.
Nonresponse bias in studies of residential mobility Elizabeth Washbrook, Paul Clarke and Fiona Steele University of Bristol Research Methods Festival,
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Multilevel survival models A paper presented to celebrate Murray Aitkin’s 70 th birthday Harvey Goldstein ( also 70 ) Centre for Multilevel Modelling University.
Multiple Regression Fenster Today we start on the last part of the course: multivariate analysis. Up to now we have been concerned with testing the significance.
Risk of Low Birth Weight Associated with Family Poverty in Korea Bong Joo Lee Se Hee Lim Department of Social Welfare, Seoul National University. A Paper.
Latino fathers’ childbearing intentions: The view from mother-proxy vs. father self-reports Lina Guzman, Jennifer Manlove, & Kerry Franzetta.
Logistic Regression Multivariate Analysis. What is a log and an exponent? Log is the power to which a base of 10 must be raised to produce a given number.
School of Veterinary Medicine and Science Multilevel modelling Chris Hudson.
Depression, Partnership Quality and Partnership Breakdown An analysis of the Millennium Cohort Study Anna Garriga Kathleen Kiernan University of York.
Discrete-time Event History Analysis Fiona Steele Centre for Multilevel Modelling Institute of Education.
A multilevel approach to geography of innovation Martin Srholec TIK Centre University of Oslo DIME International Workshop.
Multilevel Modeling Soc 543 Fall Presentation overview What is multilevel modeling? Problems with not using multilevel models Benefits of using.
Elaboration Elaboration extends our knowledge about an association to see if it continues or changes under different situations, that is, when you introduce.
Foster Care Reunification: The use of hierarchical modeling to account for sibling and county correlation Emily Putnam-Hornstein, MSW Center for Social.
10. Introduction to Multivariate Relationships Bivariate analyses are informative, but we usually need to take into account many variables. Many explanatory.
Analysis of Clustered and Longitudinal Data
Peter Congdon, Centre for Statistics and Department of Geography, Queen Mary University of London. 1 Spatial Path Models with Multiple.
Practical statistics for Neuroscience miniprojects Steven Kiddle Slides & data :
Introduction to Multilevel Modeling Using SPSS
1 Faculty of Social Sciences Induction Block: Maths & Statistics Lecture 5: Generalisability of Social Research and the Role of Inference Dr Gwilym Pryce.
Exploring the role of the family in multilevel models of school effectiveness and student achievement using Swedish registry data Rob French Longitudinal.
GEODE, 16 Jan 2007 Occupational Analysis – Issues and Examples Grid Enabled Occupational Data Environment GEODE Project workshop, 16 th January 2007 Vernon.
Lecture 5 “additional notes on crossed random effects models”
From GLM to HLM Working with Continuous Outcomes EPSY 5245 Michael C. Rodriguez.
Chapter 1: The What and the Why of Statistics
Parents’ basic skills and children’s test scores Augustin De Coulon, Elena Meschi and Anna Vignoles.
The What and the Why of Statistics The Research Process Asking a Research Question The Role of Theory Formulating the Hypotheses –Independent & Dependent.
Chapter 1: The What and the Why of Statistics  The Research Process  Asking a Research Question  The Role of Theory  Formulating the Hypotheses  Independent.
Modelling non-independent random effects in multilevel models Harvey Goldstein and William Browne University of Bristol NCRM LEMMA 3.
Handling Attrition and Non- response in the 1970 British Cohort Study Tarek Mostafa Institute of Education – University of London.
Application 3: Estimating the Effect of Education on Earnings Methods of Economic Investigation Lecture 9 1.
The Choice Between Fixed and Random Effects Models: Some Considerations For Educational Research Clarke, Crawford, Steele and Vignoles and funding from.
Academic Research Academic Research Dr Kishor Bhanushali M
Lost but not forgotten : attrition in the Étude longitudinale du développement des enfants du Québec (ÉLDEQ), Julien BÉRARD-CHAGNON and Simona.
Roghayeh parsaee  These approaches assume that the study sample arises from a homogeneous population  focus is on relationships among variables 
AN EXAMPLE OF COOPERATION & SOME WIDER ISSUES Ian Plewis (Bedford Group, Institute of Education) & Stephen Morris (Social Research Division, Department.
10. Introduction to Multivariate Relationships Bivariate analyses are informative, but we usually need to take into account many variables. Many explanatory.
Module III Multivariate Analysis Techniques- Framework, Factor Analysis, Cluster Analysis and Conjoint Analysis Research Report.
Item-Non-Response and Imputation of Labor Income in Panel Surveys: A Cross-National Comparison ITEM-NON-RESPONSE AND IMPUTATION OF LABOR INCOME IN PANEL.
Chapter 5 Multilevel Models
Tutorial I: Missing Value Analysis
1 Family Variables: Latent Class & Latent Transtion Alan C. Acock Presented at the Conference on Research with Dyads and families Purdue University May,
HYPOTHESIS TESTING FOR DIFFERENCES BETWEEN MEANS AND BETWEEN PROPORTIONS.
6. Ordered Choice Models. Ordered Choices Ordered Discrete Outcomes E.g.: Taste test, credit rating, course grade, preference scale Underlying random.
Biostatistics Regression and Correlation Methods Class #10 April 4, 2000.
[Part 5] 1/43 Discrete Choice Modeling Ordered Choice Models Discrete Choice Modeling William Greene Stern School of Business New York University 0Introduction.
Multivariate Statistics Latent Growth Curve Modelling. Random effects as latent variables: SEM for repeated measures data Dr Patrick Sturgis University.
Children’s Emotional and Behavioral Problems and Their Parents’ Labor Supply Patrick Richard, Ph.D., M.A. Nicholas C. Petris Center on Health Markets and.
Methods of Presenting and Interpreting Information Class 9.
Factors Influencing Response Rates to ALSPAC Questionnaires
Handling Attrition and Non-response in the 1970 British Cohort Study
Bivariate & Multivariate Regression Analysis
Stephen W. Raudenbush University of Chicago December 11, 2006
Krystle Lange & Regan A. R. Gurung University of Wisconsin, Green Bay
Does cognitive ability in childhood predict fertility
A New Approach to the Study of Teams: The GAPIM
Cross Sectional Designs
From GLM to HLM Working with Continuous Outcomes
Types of questions TVEM can answer
Presentation transcript:

MODELLING REPEATED MEASURES ON FAMILY MEMBERS IN GEOGRAPHICAL AREAS IAN PLEWIS UNIVERSITY OF MANCHESTER PRESENTATION TO RESEARCH METHODS FESTIVAL OXFORD, 2 JULY 2008

There are research questions in the social sciences that require descriptions and explanations of variability in one or more outcomes within and between families.

Some of these questions can be addressed by partitioning and modelling: i.variation within individuals (when we have repeated measures); ii.variation between individuals within families (with the appropriate design); iii.variation between families within areas/neighbourhoods, schools, cross-classifications etc. (with a clustered or spatial design).

The Millennium Cohort Study (MCS), for example, meets these three criteria because, as well as being longitudinal, data are collected from the main respondent (mothers), their partner (if in the household), the cohort child (or children if multiple birth) and older sibs. Also, MCS was originally clustered by ward (although residential mobility reduces the spatial clustering over time).

So we can model y tijk, assumed continuous, where: t = 1…T ijk are measurement occasions (level 1); i = 1…I jk are individual family members (level 2); j = 1..J k are families (level 3); k = 1..K are neighbourhoods (level 4). This is the usual nested or hierarchical structure.

For variables that change with age or time, educational attainment for example, we can use a polynomial growth curve formulation:

Do growth rates vary systematically with individual, family and neighbourhood characteristics? We model variation in the random effects b qijk in terms of variables at: i.the individual level, e.g. gender, ii.the family level, e.g. family income, iii.the neighbourhood level, e.g. Index of Multiple Deprivation.

We might prefer to model the variation in the time-varying outcome in terms of one or more time-varying explanatory variables. For example, we might be interested in how individuals’ smoking behaviour varies as their income changes. This model raises the tricky issue of endogeneity of individual (u ijk ) and family effects (v 0jk ) generated by unobserved heterogeneity at these two levels that is correlated with x, especially if we are interested in the causal effect of income on behaviour.

The growth curve and conditional models are compared in, for example: Plewis, Multivariate Behavioral Research, 2001.

These two approaches both ignore the fact that family members have labels: mother, father, oldest sibling etc. Often, we would like to know how the behaviour and characteristics of one family member are related to those of other family members. a.The influences of parents on children and children on parents: mental health and behaviour. b.The influence of one parent (or, more generally, partner) on another: quitting (or starting) smoking. c.The influence of one sibling on others: risky behaviour. d.The association of parents’/partners’ characteristics on each other’s behaviour: educational qualifications and health behaviours.

In these cases, a multivariate approach (within a multilevel framework) can be more informative as Raudenbush, Brennan and Barnett, J. Family Psych. (1995) first pointed out.

Their model is based on repeated measures of men and women as members of intact couples: Correlations between males and females within individuals - r(e M e F ) - and at the family level - r(u 0 u 1 ) - can be estimated. The equalities of within and between variances for men and women can be tested. Time varying variables can be introduced if appropriate.

This basic model can be extended to: 1.Situations where ‘growth’ isn’t obviously applicable, as might be the case for binary and categorical variables (mover-stayer or mixture models). 2.Couples plus children. 3.Changing household structures (two parents to one parent for example).

Suppose we have repeated measures of whether (and how much) mothers, fathers and their adolescent children smoke and that we are interested in influences across family members over time, and of educational qualifications (and area of residence) on smoking behaviour.

M, t-1 M, t F, t-1 F, t M, Ed. A, t A, t-1 F, Ed. M, smok F, smok

t = 2..T i ; min(T i = 3); neighbourhood level omitted for ease of exposition.

The model allows for correlated random effects at the individual level (u i for M, F and A) and also correlated residuals at each occasion (e ti also for M, F and A), multivariate Normal in each case.

Estimation and specification issues: 1.Simultaneity – y 2 is predicted by y 1 but y 2 is a predictor of y 3 etc. so we need a FIML estimation method. Not a problem with continuous x and y but perhaps more problematic for non-linear models. MCMC needed? 2.Autocorrelation structure for level one residuals? E(e t e t-1 ) < 0?

3.Endogeneity of family residuals (u i ) – could use y 1 as a control for this but then min(T i ) =4. 4.Missing data – essentially complete case analysis without imputation. 5.Assumption of Normality for random effects. Clark and Etilé, J. Health Economics, 2006 use a similar model but estimate using a modified EM procedure with non- parametric individual random effects.

Estimates from bivariate probit model: MCS waves 1 and 2, intact couples MOTHERPARTNER Educ. Quals., mother (0.017) (0.017) Educ. Quals., partner (0.017) (0.016) Smoking, mother, S1 2.6 (0.073) 0.41 (0.083) Smoking, partner, S10.45 (0.062) 2.6 (0.054) Smoking interaction, S (0.10)-0.30 (0.10) Within h/h correlation0.21

Effects of gaining, losing, changing partner – previous research suggests that there are some for smoking but papers have not linked one family member to another. Focus here on women because men generally not followed up in cohort studies. Studies like BHPS might be more informative.

Changes in household structure, MCS, waves 1 to 2 and 1 to 3. Waves 1 to 2 (%)Waves 1 to 3 (%) Same couple7774 Same single parent1110 Change partner Acquire partner Lose partner4.55.1

Can accumulate data over time intervals. Can specify more complex models for women who change partners. Expect interactions between δ and other explanatory variables. Single level but can be repeated for women gaining/losing partners more than once.

CONCLUDING REMARKS 1.Multilevel modeling is a powerful technique for disentangling sources of variation. 2.Using bivariate (or multivariate) multilevel models for repeated measures data extends the range of hypotheses about within household influences on behaviour that can be tested. 3. It is important to model what actually happens to households over time and not always to rely on analyses of intact couples. 4.What hypotheses might we consider for neighbourhood effects?

POSSIBLE EXTENSIONS 1.The problem of limited dependent variables, especially for measures of amount smoked which is censored at zero. Multilevel extensions of Tobit, mixture or Heckman selection models might be useful here. 2.Combining growth curve models with models for non-smokers. For example, Carlin et al. (Biostatistics, 2001) suggest a mixture model.