Could that be true? Methodological issues when deriving educational attainment from different administrative datasources and surveys Bart F.M. Bakker Manager.

Presentation transcript:

Could that be true? Methodological issues when deriving educational attainment from different administrative data sources and surveys
Bart F.M. Bakker, Manager, Section Socio-Economic State, Statistics Netherlands
Presentation for the IAOS Conference on Reshaping Official Statistics, Shanghai, October 14-16, 2008

The problem
Increasing use of administrative data for official statistics, because of:
- lower costs
- smaller response burden
- coverage of all elements of the population, for small domain statistics
Surveys are only supplementary.
The problem:
- unknown or poor quality of part of the administrative data
- unknown or poor quality of statistical outcomes if administrative sources are combined

General idea
- Administrative data are collected with one or more traditional survey techniques, so they have the same errors as traditional surveys.
- The size of the errors depends on the audits the register keeper carries out.
- Variables that are important to the register keeper are assumed to be of better quality.

An example: educational attainment
The goal of the project: determining the educational attainment of as many persons as possible, which
- can be used to derive a background variable for all kinds of research, and
- if the validity is reasonable, can be used to estimate educational attainment in small areas and small subgroups.
No single register is available.

Sources
- CRIHO: students in higher education, from 1986
- ERR: students who took an exam in general secondary education, from 1999
- Education Number Registers: students in secondary general education, from 2004
- CWI: job-seekers registered as such at the employment exchange, from 1990
- WSF: students with student grants, from 1999
- LFS: 1% samples from the population aged >15, from 1996

Table 1. The registers and their quality
Sources (columns): CRIHO, ERR, Education Registers, WSF, CWI

Measurement (object)
- Validity, register variable: good, reasonable
- Measurement error, register variable: nil, many
- Processing error, register variable: nil, few, nil, few
- Processing error, statistical variable: nil, many

Representation
- Coverage error, register target population: nil, a few schools are missing from second year alright, improvements still possible, nil
- Coverage error, statistical target population:
  - CRIHO: only public higher education in the Netherlands, from 1986
  - ERR: only (large part of) public secondary general education, from 1999
  - Education Registers: only (large part of) secondary education, from 2003
  - WSF: only higher education in the Netherlands, from 1995
  - CWI: only a large part of jobseekers, from 1990
- Linking error, statistical target population: nil, few
- Correction error, statistical target population: nil

Micro-integration: harmonisation
- Determine the classification of educational attainment
- Harmonise the copied information on the training programmes
- Derive the classification
- Derive information on whether certificates are attained
- Derive the date on which the certificates are attained

Micro-integration: correction for measurement errors
Is the educational attainment valid at the reference date?
1. A threshold (border) at which the probability that someone will attain a higher level is <5%
2. A probability <5% that someone has attained a higher level since the latest certificate was attained
Both are determined empirically with the use of life tables.
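The 5% criterion can be illustrated with a minimal sketch. The slides do not give the actual life tables, so the table below, its layout, and the function name are assumptions made purely for illustration.

```python
# Hypothetical life-table excerpt: for a given level and number of years
# since the latest certificate, the estimated probability that the person
# has attained a higher level in the meantime. All numbers are made up.
P_HIGHER_SINCE = {
    ("secondary", 0): 0.00,
    ("secondary", 1): 0.03,
    ("secondary", 2): 0.08,
    ("secondary", 3): 0.15,
}

THRESHOLD = 0.05  # the <5% criterion named on the slide


def is_valid_at_reference(level, years_since_certificate, table=P_HIGHER_SINCE):
    """Return True if the registered level can still be treated as valid.

    A combination that is missing from the table is treated as not valid,
    because no probability can be shown to lie below the threshold.
    """
    p = table.get((level, years_since_certificate))
    return p is not None and p < THRESHOLD
```

With this illustrative table, a secondary-level certificate stays valid for one year and is no longer treated as valid from the second year on.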

Micro-integration: correction for measurement errors
For one person on one reference date, more than one valid score on educational attainment can be available.
Choose the source with the best quality:
1. CRIHO, Education Number Register, ERR
2. LFS
3. WSF
CWI is used only for weighting.
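The source ranking above could be applied per person along the following lines. This is a sketch, not the procedure used by Statistics Netherlands: the data layout, the function name, and the tie-breaking rule within a tier are assumptions.

```python
# Priority tiers as listed on the slide; a lower number means better quality.
# CWI is deliberately absent because it is used only for weighting.
SOURCE_PRIORITY = {
    "CRIHO": 1,
    "Education Number Register": 1,
    "ERR": 1,
    "LFS": 2,
    "WSF": 3,
}


def choose_score(scores):
    """Pick one (source, level) pair from the valid scores of one person.

    `scores` is a list of (source, level) tuples. Sources without a
    priority (such as CWI) are ignored. Ties within a tier keep the first
    listed score, which is an arbitrary choice made for this sketch.
    """
    usable = [s for s in scores if s[0] in SOURCE_PRIORITY]
    if not usable:
        return None
    return min(usable, key=lambda s: SOURCE_PRIORITY[s[0]])
```

For example, `choose_score([("WSF", "higher"), ("LFS", "secondary")])` selects the LFS score, because LFS ranks above WSF.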

Derive educational attainment
Derive the highest educational level attained from:
- all training programmes followed before the reference date
- the certificates attained before the reference date
- validity on the reference date
- choose the source with the best quality
- downgrade the followed training programmes that did not end with a certificate
- impute with the use of age <15 years
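A minimal sketch of the derivation for one person follows. The slides do not specify how levels are coded, by how much an unfinished programme is downgraded, or which value is imputed for children, so the integer levels, the one-step downgrade, and the imputed value 0 are all assumptions.

```python
from datetime import date

LOWEST_LEVEL = 0  # assumption: value imputed for persons younger than 15


def derive_attainment(age, programmes, reference_date):
    """Return the highest level attained before the reference date.

    `programmes` is a list of dicts with the hypothetical keys
    'level' (int), 'date' (date the programme ended) and
    'certificate' (bool). Returns None if nothing is known.
    """
    if age < 15:
        return LOWEST_LEVEL  # imputation based on age
    levels = []
    for p in programmes:
        if p["date"] >= reference_date:
            continue  # only events before the reference date count
        if p["certificate"]:
            levels.append(p["level"])
        else:
            # assumption: a programme without certificate counts one level lower
            levels.append(max(p["level"] - 1, LOWEST_LEVEL))
    return max(levels) if levels else None
```

The validity check and the choice between sources are left out here to keep the sketch short.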

Results: coverage
[Figure: coverage (0% to 100%) by age. Legend text: register 15+ LFS 15+ PR register LFS 15+]

Weighting the data
Coverage shows selectivity:
- underrepresentation of vocational education at secondary level
- overrepresentation of youngsters
Weighting to the population results in two vectors: the valid scores on educational attainment on the reference date, and a weight.
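The slides do not say which weighting method is used. As one simple possibility, the sketch below applies plain post-stratification, where each person with a valid score gets the weight "population count divided by observed count" of their stratum. The strata, the counts, and the function name are hypothetical.

```python
from collections import Counter


def poststratification_weights(observed_strata, population_counts):
    """Return one weight per observed person.

    `observed_strata` lists the stratum of every person with a valid score;
    `population_counts` maps each stratum to its known population size.
    Underrepresented strata receive larger weights.
    """
    observed_counts = Counter(observed_strata)
    return [population_counts[s] / observed_counts[s] for s in observed_strata]


# Illustrative data: youngsters are overrepresented among the valid scores.
strata = ["young", "young", "young", "old"]
weights = poststratification_weights(strata, {"young": 30, "old": 40})
```

The weights in each stratum add up to that stratum's population count, so the weighted scores reproduce the population totals used for weighting.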

Conclusions
- Administrative data have the same errors as traditional surveys, and some more...
- Combining data from registers and surveys is promising, but complicated
- Always do research on the quality of the administrative data