Trends in Inequality of Educational Opportunity in the Netherlands 1900-1980: The Effect of Missing Data Maarten L. Buis & Harry B.G. Ganzeboom Department.

Slides:



Advertisements
Similar presentations
Handling attrition and non- response in longitudinal data Harvey Goldstein University of Bristol.
Advertisements

Treatment of missing values
CountrySTAT Team-I November 2014, ECO Secretariat,Teheran.
Research on Improvements to Current SIPP Imputation Methods ASA-SRM SIPP Working Group September 16, 2008 Martha Stinson.
Forecasting Models With Linear Trend. Linear Trend Model If a modeled is hypothesized that has only linear trend and random effects, it will be of the.
Grandparenting and health in Europe: a longitudinal analysis Di Gessa G, Glaser K and Tinker A Institute of Gerontology, Department of Social Science,
Approaches for Addressing Issues of Missing Data in the Statistical Modeling of Adolescent Fertility Dudley L. Poston, Jr. Texas A&M University & Eugenia.
Some birds, a cool cat and a wolf
CJT 765: Structural Equation Modeling Class 3: Data Screening: Fixing Distributional Problems, Missing Data, Measurement.

QUALITATIVE AND LIMITED DEPENDENT VARIABLE MODELS.
Before doing comparative research with SEM … Prof. Jarosław Górniak Institute of Sociology Jagiellonian University Krakow.
Lecture 23 Multiple Regression (Sections )
How to deal with missing data: INTRODUCTION
Modeling Achievement Trajectories When Attrition is Informative Betsy J. Feldman & Sophia Rabe- Hesketh.
Statistical Methods for Missing Data Roberta Harnett MAR 550 October 30, 2007.
PEAS wprkshop 2 Non-response and what to do about it Gillian Raab Professor of Applied Statistics Napier University.
Business Forecasting Chapter 4 Data Collection and Analysis in Forecasting.
Multiple imputation using ICE: A simulation study on a binary response Jochen Hardt Kai Görgen 6 th German Stata Meeting, Berlin June, 27 th 2008 Göteborg.
Modeling Menstrual Cycle Length in Pre- and Peri-Menopausal Women Michael Elliott Xiaobi Huang Sioban Harlow University of Michigan School of Public Health.
Cristina Iannelli Moray House School of Education Edinburgh University Education and Social Mobility : Scottish Evidence.
Copyright 2010, The World Bank Group. All Rights Reserved. Estimation and Weighting, Part I.
G Lecture 121 Analysis of Time to Event Survival Analysis Language Example of time to high anxiety Discrete survival analysis through logistic regression.
Workshop on methods for studying cancer patient survival with application in Stata Karolinska Institute, 6 th September 2007 Modeling relative survival.
Guide to Handling Missing Information Contacting researchers Algebraic recalculations, conversions and approximations Imputation method (substituting missing.
Copyright © 2012 Wolters Kluwer Health | Lippincott Williams & Wilkins Chapter 19 Process of Quantitative Data Analysis and Interpretation.
1 Introduction to Survey Data Analysis Linda K. Owens, PhD Assistant Director for Sampling & Analysis Survey Research Laboratory University of Illinois.
Multiple Imputation (MI) Technique Using a Sequence of Regression Models OJOC Cohort 15 Veronika N. Stiles, BSDH University of Michigan September’2012.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
G Lecture 11 G Session 12 Analyses with missing data What should be reported?  Hoyle and Panter  McDonald and Moon-Ho (2002)
CAREER PEAKS IN THE NETHERLANDS FOR LABOUR MARKET ENTRY COHORTS FROM THE 1950s 1960s, 1970s Testing Goldthorpe’s assumption of occupational maturity around.
Basics of Data Cleaning
Handling Attrition and Non- response in the 1970 British Cohort Study Tarek Mostafa Institute of Education – University of London.
1 Introduction to Survey Data Analysis Linda K. Owens, PhD Assistant Director for Sampling & Analysis Survey Research Laboratory University of Illinois.
1 1 Slide © 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Oslo, 24–26 September 2012 Work Session on Statistical Data Editing APPLICATION OF THE DEVELOPED SAS MACRO FOR EDITING AND IMPUTATION AT.
SW 983 Missing Data Treatment Most of the slides presented here are from the Modern Missing Data Methods, 2011, 5 day course presented by the KUCRMDA,
© John M. Abowd 2007, all rights reserved General Methods for Missing Data John M. Abowd March 2007.
1 G Lect 13W Imputation (data augmentation) of missing data Multiple imputation Examples G Multiple Regression Week 13 (Wednesday)
Investigating Patterns Cornell Notes & Additional Activities.
The Impact of Missing Data on the Detection of Nonuniform Differential Item Functioning W. Holmes Finch.
1 G Lect 13M Why might data be missing in psychological studies? Missing data patterns Overview of statistical approaches Example G Multiple.
Missing Values Raymond Kim Pink Preechavanichwong Andrew Wendel October 27, 2015.
A REVIEW By Chi-Ming Kam Surajit Ray April 23, 2001 April 23, 2001.
Simulation Study for Longitudinal Data with Nonignorable Missing Data Rong Liu, PhD Candidate Dr. Ramakrishnan, Advisor Department of Biostatistics Virginia.
Item-Non-Response and Imputation of Labor Income in Panel Surveys: A Cross-National Comparison ITEM-NON-RESPONSE AND IMPUTATION OF LABOR INCOME IN PANEL.
Special Topics in Educational Data Mining HUDK5199 Spring term, 2013 March 13, 2013.
Tutorial I: Missing Value Analysis
 Seeks to determine group membership from predictor variables ◦ Given group membership, how many people can we correctly classify?
Pre-Processing & Item Analysis DeShon Pre-Processing Method of Pre-processing depends on the type of measurement instrument used Method of Pre-processing.
Economics 173 Business Statistics Lecture 28 © Fall 2001, Professor J. Petry
DATA STRUCTURES AND LONGITUDINAL DATA ANALYSIS Nidhi Kohli, Ph.D. Quantitative Methods in Education (QME) Department of Educational Psychology 1.
HANDLING MISSING DATA.
Missing data: Why you should care about it and what to do about it
Handling Attrition and Non-response in the 1970 British Cohort Study
Multiple Imputation using SOLAS for Missing Data Analysis
The Centre for Longitudinal Studies Missing Data Strategy
Maximum Likelihood & Missing data
Introduction to Survey Data Analysis
Multiple Imputation.
How to handle missing data values
Dealing with missing data
Does cognitive ability in childhood predict fertility
Presenter: Ting-Ting Chung July 11, 2017
The European Statistical Training Programme (ESTP)
MEASUREMENT OF THE QUALITY OF STATISTICS
CH2. Cleaning and Transforming Data
Chapter 4: Missing data mechanisms
Clinical prediction models
Chapter 13: Item nonresponse
Presentation transcript:

Trends in Inequality of Educational Opportunity in the Netherlands : The Effect of Missing Data Maarten L. Buis & Harry B.G. Ganzeboom Department of Social Research Methodology Vrije Universiteit Amsterdam RC28, Oslo, May

Buis & Ganzeboom, Oslo Conclusions (1) A steady trend towards less IEO in the Netherlands remains visible throughout the 20th century. However, on closer scrutiny there appears to be evidence of a slower trend or even stability for the earlier and most recent cohorts. Spline analyses of trends confirms this.

Buis & Ganzeboom, Oslo Conclusions (2) Missing data in father’s occupation vary by education of respondent: MV are about 3 times more prevalent among the lowest educated than among the highest educated. One would hypothesize that this mitigates measures of IEO and the historical trend therein. Multiply imputed data for FISEI: –Level of IEO increases –(Linear) trends in IEO becomes steeper

Buis & Ganzeboom, Oslo Previous research IEO = Inequality of Educational Opportunity = association between father’s occupation and respondent’s education. Previous research: long-term linear trend towards less IEO: –Cohorts : Ganzeboom & De Graaf, 1989, De Graaf & Ganzeboom, 1990a, 1990b. –Cohorts : Ganzeboom & Luijkx, This holds for both linear regression models en sequential logits (first two transitions).

Buis & Ganzeboom, Oslo ISMF Now 51 studies on the Netherlands, collected between 1958 and 2004, N > men and women 25+. Recent additions (since 2002 and Breen 2004): 16 studies, appr. 30% of the N. Father: FISEI – International Socio-economic Index of Occupational Status. Education: level of education scaled relative to benchmarks: primary = 6, highest secondary = 12, university complete = 17.

Buis & Ganzeboom, Oslo 20056

7 Research questions How do trend and level estimates of IEO depend upon data qualities: –Measures used –Quality and nature of the sample –Non-response –Missing values

Buis & Ganzeboom, Oslo Missing values MCAR = Missing Completely at Random MAR = Missing at Random: missingness is random given the values of control (X) variables. NMAR: Not Missing at Random: missingness depends upon values of Y-variable. Rubin 1987, Little & Rubin 2002, Allison Multiple hotdeck imputation in STATA.

Buis & Ganzeboom, Oslo Complete case analysis (listwise deletion) OK, if MCAR. Biased if MAR. Inefficient (too large standard errors – this can be quite dramatic. Linear trend: –EDU = *FIS – 5.3*FIS*COH etc. (Men) (.17) (.08) (.45) –EDU = *FIS – 3.3*FIS*COH etc. (Women) (.17) (.08) (.44)

Buis & Ganzeboom, Oslo

Buis & Ganzeboom, Oslo Hot deck imputation Classify all cases by combinations of predictor variables (COH, FED, MED, ISEI). Stratify the cases by these combinations. Substitute the missing FISEI by valid FISEI of random (nearest) neighbor. Key idea: do not only borrow the systematic (predicted) part, but also the error term.

Buis & Ganzeboom, Oslo Multiple hot deck imputation Do hot deck imputation several times (10- 20). Bootstrap from each stratum a sample (with replacement) of stratum size. Random selection of neighbor varies by imputation cycle. Key idea: Rubin (1987): pp Get the variance-covariance estimation right.

Buis & Ganzeboom, Oslo Key results FISEI predicted by COH (4), FED (7), MED (7), ISEI (8). 10 imputations Linear trend result: –EDU = *FIS – 6.1*FIS*COH etc. (Men) (.33) (.12) (.64) –EDU = *FIS – 3.7*FIS*COH etc. (Women) (.46) (.12) (.64)

Buis & Ganzeboom, Oslo Non-linearities Linear splines Estimates with 1, 2, 3, 4 etc. knots (and a uniform distribution). We were happy with the result with 3 knots. Test of equality of slopes: –Between trajectories –Between men and women

Buis & Ganzeboom, Oslo

Buis & Ganzeboom, Oslo Results Complete case analysis finds: –Decline in IEO occurs between cohorts 1920 and Before 1920 and after 1960, the trend can be assumed to be flat. –There is a constant difference in IEO between men and women: women’s educational attainment appr. 10% less dependent on FIS than men’s.

Buis & Ganzeboom, Oslo

Buis & Ganzeboom, Oslo Multiple hot deck imputed data Finds pattern very similar to complete case analysis. But decline of IEO between 1920 and 1960 is steeper! However, standard errors of effects have increased (despite inclusion of more information).