The availability of Dutch census microdata Eric Schulte Nordholt Senior researcher and project leader of the Census Statistics Netherlands Division Social.

Slides:



Advertisements
Similar presentations
DATA FROM ADMINISTRATIVE SOURCES
Advertisements

Counting the Dutch, The Future of the Virtual Census in the Netherlands Presentation at the seminar Counting the 7 Billion 24 February 2012 * Geert Bruinooge.
United Nations Expert Group Meeting on Revising the Principles and Recommendations for Population and Housing Censuses New York, 29 October – 1 November.
REPUBLIC OF TURKEY TURKISH STATISTICAL INSTITUTE TurkStat Population and Demography Statistics Department Population and Migration Statistics Team
Combined use of data from registers and sample surveys
The Dutch Censuses of 1960, 1971 and 2001 Producing public use files in the IPUMS project Wijnand Advokaat Statistics Netherlands Division Social and Spatial.
Searching the University of Alberta Library’s Statistics Canada-based Websites 2001 Census of Canada Canadian Centre for Justice Statistics Canadian Business.
© John M. Abowd 2005, all rights reserved Sampling Frame Maintenance John M. Abowd February 2005.
United Nations Expert Group Meeting on Revising the Principles and Recommendations for Population and Housing Censuses New York, 29 October – 1 November.
1 Evaluation of New Content on the 2008 ACS: Service-Connected Disability Status and Ratings Kelly Ann Holder Housing and Household Economic Statistics.
Joint UNECE/Eurostat Work Session on Migration Statistics 3 March, 2008, Geneva, Switzerland Selected methods to improve emigration estimates MEASURING.
Becoming Canadian Citizens: Intent, process and outcome Kelly Tran, Tina Chui: Statistics Canada Stan Kustec, Martha Justus: Citizenship and Immigration.
5 Marzo 2007 EMERGING METHODOLOGIES OF CONTINUOUS USE OF REGISTERS AND GEOCODED DATABASES IN THE ITALIAN POPULATION AND HOUSING CENSUS Fabio Crescenzi,
The Application of the Concept of Uniqueness for Creating Public Use Microdata Files Jay J. Kim, U.S. National Center for Health Statistics Dong M. Jeong,
Use of survey (LFS) to evaluate the quality of census final data Expert Group Meeting on Censuses Using Registers Geneva, May 2012 Jari Nieminen.
Dutch Virtual Census Presentation at the International Seminar on Population and Housing Censuses; Beyond the 2010 Round November, 2012 Egon Gerards,
The Statistical Business Register of Macao SAR Government of Macao SAR Statistics and Census Service.
Comparing approaches of different (partly) register-based countries Eric Schulte Nordholt Senior researcher and project leader of the Census Statistics.
ISCO-08 - Current Status and plans to support implementation David Hunter Department of Statistics International Labour Office United Nations Expert Group.
Using IPUMS.org Katie Genadek Minnesota Population Center University of Minnesota The IPUMS projects are funded by the National Science.
Introduction to the Public Use Microdata Sample (PUMS) File from the American Community Survey Updated February 2013.
Record matching for census purposes in the Netherlands Eric Schulte Nordholt Senior researcher and project leader of the Census Statistics Netherlands.
JOINT UNECE-UNFPA TRAINING WORKSHOP ON POPULATION AND HOUSING CENSUSES GENEVA, 5-6 JULY 2010 GOOD PRACTICES IN DISSEMINATING POPULATION CENSUS RESULTS.
Plans for Access to UK Microdata from 2011 Census Emma White Office for National Statistics 24 May 2012.
Transition from traditional census to sample survey? (Experience from Population and Housing Census 2011) Group of Experts on Population and Housing Censuses,
Register-Based Census 2011 in Slovenia – Some Quality Aspects Danilo Dolenc Statistical Office of the Republic of Slovenia UNECE-Eurostat Expert Group.
S T A T I S T I C S A U S T R I A May 13th – 15th Register Based Census “The Austrian Principles of Redundancy” UNECE/Eurostat.
Population census micro data for research: the case of Slovenia Danilo Dolenc Statistical Office of the Republic of Slovenia Ljubljana, First Regional.
Register-based migration statistics and using additional administrative data sources Barica Razpotnik Statistical Office of the Republic of Slovenia UNECE.
Geneva, 21 May 2012 Snezana Lakcevic Statistical Office of the Republic of Serbia Head of Population Census Division Workshop on Censuses Using Registers.
The Dutch Virtual Census based on registers and already existing surveys Eric Schulte Nordholt Senior researcher and project leader of the Census Statistics.
ISR Training Jan. 21,  Canada’s largest survey  Complete population count  Gathers information on the demographic, social and economic conditions.
The Dutch Virtual Census of 2001 A New Approach by Combining Different Sources Eric Schulte Nordholt ECE Census meetings Geneva, November 2004.
Regional Workshop on International Migration Statistics Cairo, Egypt 30/6/2009-3/7/2009.
Methodology used for estimating Census tables based on incomplete information Eric Schulte Nordholt Senior researcher and project leader of the Census.
The project for developing the methodology of register- based censuses in Estonia Kristi Lehto Statistics Estonia Methodology and analysis department Senior.
CES Recommendations for 2020 round on census methodology Eric Schulte Nordholt Senior researcher and project leader of the Census Statistics Netherlands.
1 1 Anonymised Integrated Event History Datasets for Researchers Johan Heldal Statistics Norway.
Using Targeted Perturbation of Microdata to Protect Against Intelligent Linkage Mark Elliot, University of Manchester Cathie.
Why register-based statistics? Eric Schulte Nordholt Statistics Netherlands Division Social and Spatial Statistics Department Support and Development Section.
May 12-15, Evaluating the Integrated Census Israel Pnina ZADKA Central Bureau of Statistics Israel.
1 For a Population Statistical Register Characteristics and Potentials for the Official Statistics Central department for administrative data and archives.
Social Statistics Department Population and Demography Group–Population and Migration Team PRIME MINISTRY REPUBLIC OF TURKEY TURKISH STATISTICAL INSTITUTE.
Regional Seminar on Promotion and Utilization of Census Results and on the Revision on the United Nations Principles and Recommendations for Population.
Statistics Netherlands’ modernization programme: the use of administrative data, lessons learned and the way ahead. Geert Bruinooge Assistant Director.
Creating Open Data whilst maintaining confidentiality Philip Lowthian, Caroline Tudor Office for National Statistics 1.
New challenges for Social statistics, EurostatLuxemburg, 23 September 2008 New approach to migration statistics in Lithuania NEW APPROACH TO MIGRATION.
Access to microdata in the Netherlands: from a cold war to co-operation projects Eric Schulte Nordholt Senior researcher and project leader of the Census.
The Integrated Public Use Microdata Series database IPUMSwww.ipums.org Lab 1 Background on the IPUMS and SPSS.
1 1 Topics difficult to measure in a register-based census Harald Utne Census Project Statistics Norway UNECE-Eurostat Meeting on Population.
Workshop on Collection and Dissemination of Socio-economic Data from Population and Housing Censuses New Delhi, India, May 2012 United Nations Demographic.
© Statistisches Bundesamt, VI A Statistisches Bundesamt The new method of the next german Population census Johann Szenzenstein, Federal Statistical Office,
Jacco Daalmans Estimation of Dutch census tables.
13-Jul-07 State of the art of the ISCO-08 implementation.
REPUBLIC OF TURKEY TURKISH STATISTICAL INSTITUTE TurkStat Demography Statistics Department Population and Migration Statistics Group EXPERIENCES.
CENSUS MICRODATA OF TURKEY Meryem DEMIRCI, Turkish Statistical Institute, June, 2006.
Marc Hamel and Julie Trépanier May 21, 2014 Canadian Statistical Demographic Database: A research project.
Overview of External Migration Statistics in Georgia Workshop on the use of administrative data for measuring migration in Georgia April 5-6, 2016, Tbilisi,
ASDC Annual Meeting November 10, 2011 Kathleen Gabler Socioeconomic Research Associate Center for Business and Economic Research Culverhouse College of.
Methods for Data-Integration
2021 Population Census and migration statistics in Spain.
Census developments in the Netherlands
Statistics Netherlands Division Social and Spatial Statistics
Use of the business register in the Dutch labour statistics
Census Planning and Management
Telling Canada’s story in numbers Marie-Josée Major
Key Considerations for Planning and Management of Census Operations
Item 2.2 Scientific Use Files for the Time Use Survey
Stratification, calibration and reducing attrition rate in the Dutch EU-SILC Judit Arends.
Key Considerations for Planning and Management of Census Operations
Presentation transcript:

The availability of Dutch census microdata Eric Schulte Nordholt Senior researcher and project leader of the Census Statistics Netherlands Division Social and Spatial Statistics Department Support and Development Section Research and Development Workshop on Communication and Dissemination of Census Results in Geneva 16 May 2008

2 Contents Historical introduction Registers used for the virtual census Micro linkage Social Statistical Database Publicity about Dutch censuses Harmonisation Microdata availability Statistical Disclosure Control

3 Historical introduction Till 1899: Ministry of Home Affairs 1899: 8 th Census 1971: 14 th Census Till 1995: more and more surveys Last twelve years: moving to a register-based statistical office Reasons: Unwillingness (non-response) Reduction of response burden Reduction of expenses

4 Registers used for the virtual census External registers (maintenance by register holders): Population Register (PR), 16 million records demographic variables: sex, day of birth, marital status, country of birth etc. Fiscal administration (FIBASE), jobs, 7.2 million records and pensions and life insurance benefits, 2.7 million records Social Security administrations, 2 million records, auxiliary information integration process Internal registers (maintenance by Statistics Netherlands): Jobs file (employees), 6.5 million records and Self-employed persons, 790 thousand records dates of job, branch of economic activity General Business Register, records size class, (economic) activity Housing Register, about 7 million records housing variables

5 Micro linkage Linkage key: Registers Social security and Fiscal number (SoFi), unique since 26 November 2007: Citizen Service Number Surveys Sex, date of birth, address (postal code and house number) Linkage key replaced by RIN-person Linkage strategy Optimizing number of matches Minimizing number of mismatches and missed matches

6 Social Statistical Database Social Statistical Database (SSD): Set of integrated microdata files with coherent and detailed demographic and socio-economic data on persons, households, jobs and benefits No remaining internal conflicting information SSD-set: Population Register (backbone) Integrated jobs file Integrated file of (social and other) benefits Surveys, e.g. LFS Combining element: RIN-person

7 Publicity about Dutch censuses The Dutch Virtual Census of 2001 was a successful alternative for a traditional census Tables: GB/menu/themas/dossiers/volk stellingen/publicaties/2005- virtual-dutch-census-art.htmhttp:// GB/menu/themas/dossiers/volk stellingen/publicaties/2005- virtual-dutch-census-art.htm Book: GB/menu/themas/dossiers/volk stellingen/publicaties/2001-b57- e-pub.htmhttp:// GB/menu/themas/dossiers/volk stellingen/publicaties/2001-b57- e-pub.htm

8 Harmonisation (1) More information about the Dutch traditional Censuses (including those of 1960 and 1971): For 1960 and 1971 the same variables as for 2001 if not available: constructed based on existing variables in Census data Variables not internationally harmonised (e.g. sex, age, marital status, household position, country of birth, economic status, household size and country of citizenship) same classification and priority rules as for 2001

9 Harmonisation (2) Household size and country of citizenship: missing for 1960 Religious denomination (philosophy of life): only for 1960 and 1971 Place of residence one year prior to the census: only for 2001 International classifications Branch of current economic activity: ISIC / NACE Occupation: ISCO-COM Level of educational attainment: ISCED

10 Harmonisation (3) SexXXX AgeXXX Country of citizenshipXX Marital statusXXX Household positionXXX Religious denominationXX Country of birthXXX Household sizeXX Place of residence one year prior to the census X Economic statusXXX Level of educational attainment XXX OccupationXXX Branch of current economic activity XXX

11 Microdata availability One percent samples for three years (1960, 1971 and 2001) IPUMS (Integrated Public Use Microdata Series): Weighting to population totals Protecting according to rules for public use microdata files with Mu-ARGUS Microdata sets for all three years available for research! DANS (Data Archiving and Networked Services):

12 Statistical Disclosure Control (1) Microdata under contract (MUC): 1.No direct identifiers 2.Rule against spontaneous recognition: each combination of an extremely identifying variable, a very identifying variable and an identifying variable should occur at least 100 times in the population 3.Extension of this rule: maximum level of detail of some variables (occupation, level of education, branch of economic activity) is determined by the most detailed direct regional variable 4.Each region that can be distinguished in the microdata should contain at least 10,000 inhabitants 5.No direct regional variables in panel data

13 Statistical Disclosure Control (2) Identifying variables Direct (formal) identifiers Name, address, citizen service number, … Indirect identifiers, differentiated into Extremely identifying (E) Very identifying (V) Identifying (I) V E I

14 Statistical Disclosure Control (3) Examples of identifying variables Extremely identifying: Regional variables (residence, work, …) Very identifying: Sex, nationality + Extremely identifying variables Identifying: Age, occupation, education + Very identifying variables E V I

15 Statistical Disclosure Control (4) Public use microdata files: 1.Microdata must be at least one year old 2.No direct identifiers or direct regional variables 3.Only 1 kind of indirect regional variables. Values of indirect regional variables sufficiently scattered. Each area should contain at least 200,000 persons in the target population and should consist of municipalities from at least six of the twelve provinces. No dominating municipality in any area. 4.At most 15 indirect identifiers 5.No sensitive variables

16 Statistical Disclosure Control (5) Public use microdata files (continued): 6.Sampling weights should not provide additional identifying information 7.Rule against spontaneous recognition: at least 200,000 individuals in the population for each category of an identifying variable 8.Another rule against spontaneous recognition: at least 1000 individuals in the population for each category in the crossing of two identifying variables 9.At least 5 households per combination of categories of household variables 10.Records should be in random order

17 Statistical Disclosure Control (6) Microdata for remote analyses Remote execution: Scripts are sent (on line) to Statistics Netherlands and applied to the microdata; SDC is applied before returning the results (Compare with on-site microdata) Remote access: On-line access to confidentialized microdata sets (Compare with microdata under contract or on-site)

18 Time for questions and discussion