The choice between fixed and random effects models: some considerations for educational research Claire Crawford with Paul Clarke, Fiona Steele & Anna.

Slides:



Advertisements
Similar presentations
Introduction Describe what panel data is and the reasons for using it in this format Assess the importance of fixed and random effects Examine the Hausman.
Advertisements

The Great Divide: fixed vs. random effects in an education context Claire Crawford with Paul Clarke, Fiona Steele & Anna Vignoles and funding from ESRC.
Contextual effects In the previous sections we found that when regressing pupil attainment on pupil prior ability schools vary in both intercept and slope.
Centre for Market and Public Organisation An application of geographical data: inequalities in school access Paul Gregg, and Neil Davies, University of.
Rebecca Allen and Anna Vignoles Institute of Education, University of London and Presentation to Bristol Choice.
REGRESSION, IV, MATCHING Treatment effect Boualem RABTA Center for World Food Studies (SOW-VU) Vrije Universiteit - Amsterdam.
CORRELATION. Overview of Correlation u What is a Correlation? u Correlation Coefficients u Coefficient of Determination u Test for Significance u Correlation.
There are at least three generally recognized sources of endogeneity. (1) Model misspecification or Omitted Variables. (2) Measurement Error.
Objectives 10.1 Simple linear regression
Random Assignment Experiments
Income and Child Development Lawrence Berger, University of Wisconsin Christina Paxson, Princeton University Jane Waldfogel, Columbia Univerity.
Linking administrative data to TALIS and PISA
Nonresponse bias in studies of residential mobility Elizabeth Washbrook, Paul Clarke and Fiona Steele University of Bristol Research Methods Festival,
Irwin/McGraw-Hill © Andrew F. Siegel, 1997 and l Chapter 12 l Multiple Regression: Predicting One Factor from Several Others.
Economics 105: Statistics GH 24 due Wednesday. Hypothesis Tests on Several Regression Coefficients Consider the model (expanding on GH 22) Is “race” as.
Psychological Distress and Timely Use of Routine Care: The Importance of Having Health Insurance and a Usual Source of Care among Women with Children Whitney.
Specification Error II
Socioeconomic factors in relation to Autism Spectrum Disorders Dheeraj Rai and Selma Idring 17 March 2014
© Institute for Fiscal Studies Children’s outcomes and family background Claire Crawford.
Empirical Analysis Doing and interpreting empirical work.
NCRM Annual Meeting January People Lorraine Dearden (Dir)John ‘Mac’ McDonald Sophia Rabe-HeskethAnna Vignoles Kirstine HansenNikos Tzavidis James.
EPUNet Conference – BCN 06 “The causal effect of socioeconomic characteristics in health limitations across Europe: a longitudinal analysis using the European.
C82MCP Diploma Statistics School of Psychology University of Nottingham 1 Designing Experiments In designing experiments we: Manipulate the independent.
BACKGROUND RESEARCH QUESTIONS  Does the time parents spend with children differ according to parents’ occupation?  Do occupational differences remain.
Evaluating Hypotheses Chapter 9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics.
Evaluating Hypotheses Chapter 9 Homework: 1-9. Descriptive vs. Inferential Statistics n Descriptive l quantitative descriptions of characteristics ~
Clustered or Multilevel Data
Topic 3: Regression.
1 The Effect of Benefits on Single Motherhood in Europe Libertad González Universitat Pompeu Fabra May 2006.
1 Research Method Lecture 11-1 (Ch15) Instrumental Variables Estimation and Two Stage Least Square ©
Mitchell Hoffman UC Berkeley. Statistics: Making inferences about populations (infinitely large) using finitely large data. Crucial for Addressing Causal.
S-005 Types of research in education. Types of research A wide variety of approaches: –Theoretical studies –Summaries of studies Reviews of the literature.
Beyond test scores: the role of primary schools in improving multiple child outcomes Claire Crawford and Anna Vignoles Institute of Education, University.
Forecasting Revenue: An Example of Regression Model Building Setting: Possibly a large set of predictor variables used to predict future quarterly revenues.
Analysis of Clustered and Longitudinal Data
The mathematics skills of school children: How does England compare to the high performing East Asian jurisdictions?
What influences English and Mathematics attainment at age 11? Evidence from the EPPSE project.
-- Preliminary, Do Not Quote Without Permission -- VALUE-ADDED MODELS AND THE MEASUREMENT OF TEACHER QUALITY Douglas HarrisTim R. Sass Dept. of Ed. LeadershipDept.
Empirical Example Walter Sosa Escudero Universidad de San Andres - UNLP.
Neil Harrison, David James and Kathryn Last The impact of a skills-led qualification on GCSE attainment: the case of ASDAN's Certificate of Personal Effectiveness.
Chapter 2 The Research Enterprise in Psychology. n Basic assumption: events are governed by some lawful order  Goals: Measurement and description Understanding.
School Dropout in Rural Vietnam: Does Gender Matter?
The Research Enterprise in Psychology. The Scientific Method: Terminology Operational definitions are used to clarify precisely what is meant by each.
NUFE 1 General Education, Vocational Education and Individual Income in Rural China HUANG Bin Center for Public Finance Research Faculty of Public Finance.
Intergenerational Poverty and Mobility. Intergenerational Mobility Leblanc’s Random Family How does this excerpt relate to what we have been talking about?
Introduction Multilevel Analysis
Parents’ basic skills and children’s test scores Augustin De Coulon, Elena Meschi and Anna Vignoles.
Key issues from yesterday and for panel Lorraine Dearden.
Welfare Regimes and Poverty Dynamics: The Duration and Recurrence of Poverty Spells in Europe Didier Fouarge & Richard Layte Presented by Anna Manzoni.
1 Data Linkage for Educational Research Royal Statistical Society March 19th 2007 Andrew Jenkins and Rosalind Levačić Institute of Education, University.
HAOMING LIU JINLI ZENG KENAN ERTUNC GENETIC ABILITY AND INTERGENERATIONAL EARNINGS MOBILITY 1.
Application 3: Estimating the Effect of Education on Earnings Methods of Economic Investigation Lecture 9 1.
Objectives 2.1Scatterplots  Scatterplots  Explanatory and response variables  Interpreting scatterplots  Outliers Adapted from authors’ slides © 2012.
The Choice Between Fixed and Random Effects Models: Some Considerations For Educational Research Clarke, Crawford, Steele and Vignoles and funding from.
© Institute for Fiscal Studies, 2008 When you are born matters: the impact of date of birth on child cognitive outcomes in England Claire Crawford, Lorraine.
Impediments to the estimation of teacher value added Steven Rivkin Jun Ishii April 2008.
Chapter 22: Building Multiple Regression Models Generalization of univariate linear regression models. One unit of data with a value of dependent variable.
Research Design. What is Research Design ? Plan for getting from the research question to the conclusion Blueprint for data collection and interpretation.
Introduction to Statistics for the Social Sciences SBS200, COMM200, GEOG200, PA200, POL200, or SOC200 Lecture Section 001, Fall 2015 Room 150 Harvill.
Lecturer’s desk Physics- atmospheric Sciences (PAS) - Room 201 s c r e e n Row A Row B Row C Row D Row E Row F Row G Row H Row A
Children’s Emotional and Behavioral Problems and Their Parents’ Labor Supply Patrick Richard, Ph.D., M.A. Nicholas C. Petris Center on Health Markets and.
GS/PPAL Section N Research Methods and Information Systems
Is High Placebo Response Really a Problem in Clinical Trials?
Stephen W. Raudenbush University of Chicago December 11, 2006
Apprenticeships for Young People in England. Is there a payoff?
Advanced Panel Data Methods
Dante Contreras Sebastián Bustos Paulina Sepúlveda
Regression Forecasting and Model Building
Instrumental Variables Estimation and Two Stage Least Squares
Advanced Panel Data Methods
Presentation transcript:

The choice between fixed and random effects models: some considerations for educational research Claire Crawford with Paul Clarke, Fiona Steele & Anna Vignoles

Motivation Appropriate modelling of pupil achievement – Pupils clustered within schools → hierarchical models Two popular choices: fixed and random effects Which approach is best in which context? – May depend whether primary interest is pupil or school characteristics – But idea is always to move closer to a causal interpretation

Outline of talk Why SEN? Fixed and random effects models in the context of our empirical question Data and results Conclusions

Special educational needs (SEN) One in four Year 6 pupils (25% of 10 year olds) in England identified as having SEN – With statement (more severe): 3.7% – Without statement (less severe): 22.3% SEN label means different things in different schools and for different pupils – Huge variation in numbers of pupils labelled across schools – Assistance received also varies widely Ongoing policy interest (recent Green Paper)

Why adjust for school effects? Want to estimate causal effect of SEN on pupil attainment no matter what school they attend Need to adjust for school differences in SEN labelling – e.g. children with moderate difficulties more likely to be labelled SEN in a high achieving school than in a low achieving school (Keslair et al, 2008; Ofsted, 2004) – May also be differences due to unobserved factors Hierarchical models can account for such differences – Fixed or random school effects?

Fixed effects vs. random effects Long debate: – Economists tend to use FE models – Educationalists tend to use RE/multi-level models But choice must be context and data specific

Basic model FE: u s is school dummy variable coefficient RE: u s is school level residual – More flexible and efficient than FE, but: – Additional assumption required: E [u s |X is ] = 0 That is, no correlation between unobserved school characteristics and observed pupil characteristics Both: models assume: E [e is |X is ] = 0 – That is, no correlation between unobserved pupil characteristics and observed pupil characteristics

Relationship between FE, RE and OLS FE: RE: Where:

How to choose between FE and RE Very important to consider sources of bias: – Is RE assumption (i.e. E [u s |X is ] = 0 ) likely to hold? Other issues: – Number of clusters – Sample size within clusters – Rich vs. sparse covariates – Whether variation is within or between clusters What is the real world consequence of choosing the wrong model?

Sources of selection Probability of being SEN may depend on: – Observed school characteristics e.g. ability distribution, FSM distribution – Unobserved school characteristics e.g. values/motivation of SEN coordinator – Observed pupil characteristics e.g. prior ability, FSM status – Unobserved pupil characteristics e.g. education values and/or motivation of parents

Intuition I If probability of being labelled SEN depends ONLY on observed school characteristics: – e.g. schools with high FSM/low achieving intake are more or less likely to label a child SEN Random effects appropriate as RE assumption holds (i.e. unobserved school effects are not correlated with probability of being SEN)

Intuition 2 If probability of being labelled SEN also depends on unobserved school characteristics: – e.g. SEN coordinator tries to label as many kids SEN as possible, because they attract additional resources Random effects inappropriate as RE assumption fails (i.e. unobserved school effects are correlated with probability of being SEN) FE accounts for these unobserved school characteristics, so is more appropriate – Identifies impact of SEN on attainment within schools rather than between schools

Intuition 3 If probability of being labelled SEN depends on unobserved pupil/parent characteristics: – e.g. some parents may push harder for the label and accompanying additional resources; – alternatively, some parents may not countenance the idea of their kid being labelled SEN Neither FE nor RE will address the endogeneity problem: – Need to resort to other methods, e.g. IV

Other considerations Other than its greater efficiency, the RE model may be favoured over FE where: – Number of observations per cluster is large e.g. ALSPAC vs. NPD – Most variation is between clusters e.g. UK (between) vs. Sweden (within) – Have rich covariates

Can tests help? Hausman test: – Commonly used to test the RE assumption i.e. E [u s |X is ] = 0 – But really testing for differences between FE and RE coefficients Over-interpretation, as coefficients could be different due to other forms of model misspecification and sample size considerations (Fielding, 2004) – Test also assumes: E [e is |X is ] = 0

Data Avon Longitudinal Study of Parents and Children (ALSPAC) – Recruited pregnant women in Avon with due dates between April 1991 and December 1992 – Followed these mothers and their children over time, collecting a wealth of information: Family background (including education, income, etc) Medical and genetic information Clinic testing of cognitive and non-cognitive skills Linked to National Pupil Database

Looking at SEN in ALSPAC Why is ALSPAC good for looking at this issue? – Availability of many usually unobserved individual and school characteristics: e.g. IQ, enjoyment of school, education values of parents, headteacher tenure

Descriptive statistics 17% of sample are identified as having SEN at age 10 Individual characteristicsSchool characteristics Standardised KS1 APS-0.104**% eligible for FSM-0.002** IQ (age 8)-0.003**H’teacher tenure: 1-2 yrs-0.044** SDQ (age 7)0.012**H’teacher tenure: 3-9 yrs-0.046** Mum high qual vocational-0.028*H’teacher tenure: 10+ yrs Mum high qual O-level Mum high qual A-level-0.033* Mum high qual degree-0.019Observations5,417 Notes: relationship between selected individual and school characteristics and SEN status. Omitted categories are: mum’s highest qualification is CSE level; head teacher tenure < 1 year.

SEN results Fixed effects Random effects Intra-school correlation % difference M1: KS1 APS only ** [0.025] ** [0.025] M2: M1 + admin data ** [0.025] ** [0.025] M3: M2 + typical survey data ** [0.025] ** [0.024] M4: M3 + rich survey data ** [0.024] ** [0.024] M5: M4 + school level data ** [0.024] ** [0.024] Notes: ** indicates significance at the 1% level; * at the 5% level. Robust standard errors are shown in parentheses.

Summary of SEN results SEN appears to be strongly negatively correlated with progress between KS1 and KS2 – SEN pupils score around 0.3 SDs lower Choice of model does not seem to matter here – FE and RE give qualitatively similar results – Suggests correlation between probability of having SEN and unobserved school characteristics is not important Consistency across specifications suggests regression assumption is also likely to hold

Summary of FSM results In contrast to the SEN results, the estimated effects of FSM on attainment decrease as richer data is used – Suggests that the regression assumption may fail in models with few controls, such as those based on admin data There are also relatively larger differences between FE and RE models until we add school characteristics – Suggests that the RE assumption is less likely to hold here

Conclusions Approach each problem with agnostic view on model – Should be determined by theory and data, not tradition FE should be preferred when the selection of pupils into schools is poorly understood or data is sparse RE should be preferred when the selection of pupils into schools is well understood and data is rich Worth remembering that neither FE nor RE deals with correlation between observed and unobserved individual characteristics

FSM results Fixed effects Random effects Intra-school correlation % difference M1: KS1 APS only ** [0.028] ** [0.028] 11.5 M2: M1 + admin data ** [0.028] ** [0.027] 13.1 M3: M2 + typical survey data ** [0.029] ** [0.028] 15.7 M4: M3 + rich survey data ** [0.028] ** [0.028] 14.6 M5: M4 + school level data ** [0.028] ** [0.028] 6.7 Notes: ** indicates significance at the 1% level; * at the 5% level. Robust standard errors are shown in parentheses.