1 A theoretical framework for register-based statistics --- Can we carry on without it? Li-Chun Zhang Statistics Norway

Slides:



Advertisements
Similar presentations
The Dutch Censuses of 1960, 1971 and 2001 Producing public use files in the IPUMS project Wijnand Advokaat Statistics Netherlands Division Social and Spatial.
Advertisements

United Nations Workshop on Revision 3 of Principles and Recommendations for Population and Housing Censuses and Evaluation of Census Data, Amman 19 – 23.
United Nations Workshop on Revision 3 of Principles and recommendations for Population and Housing Censuses and Census Evaluation Amman, Jordan, 19 – 23.
A new sampling method: stratified sampling
Stratified Simple Random Sampling (Chapter 5, Textbook, Barnett, V
United Nations Expert Group Meeting on Revising the Principles and Recommendations for Population and Housing Censuses New York, 29 October – 1 November.
The Use of Administrative Sources for Economic Statistics An Overview Steven Vale Office for National Statistics UK.
Chapter 4 Demographic Data. Chapter Outline Sources Of Demographic Data Population Censuses Registration Of Vital Events Combining The Census And Vital.
United Nations Workshop on Revision 3 of Principles and recommendations for Population and Housing Censuses and Census Evaluation Amman, Jordan, 19 – 23.
1 1 Establishing a register-based statistical system Example: Population and housing censuses in Norway Statistical Training Course Use of Administrative.
GEOG3025 Census and administrative data sources 3: Integration and future development.
Target population-> Study Population-> Sample 1WWW.HIVHUB.IR Target Population: All homeless in country X Study Population: All homeless in capital shelters.
United Nations Workshop on Principles and Recommendations for a Vital Statistics System, Revision 3, for African English-speaking countries Addis Ababa,
Copyright 2010, The World Bank Group. All Rights Reserved. Estimation and Weighting, Part I.
From Sample to Population Often we want to understand the attitudes, beliefs, opinions or behaviour of some population, but only have data on a sample.
Use of survey (LFS) to evaluate the quality of census final data Expert Group Meeting on Censuses Using Registers Geneva, May 2012 Jari Nieminen.
12th Meeting of the Group of Experts on Business Registers
List frames area frames and administrative data, are they complementary or in competition? Elisabetta Carfagna University of Bologna Department of Statistics.
Use of Administrative Data in Statistics Canada’s Annual Survey of Manufactures Steve Matthews and Wesley Yung May 16, 2004 The United Nations Statistical.
Record matching for census purposes in the Netherlands Eric Schulte Nordholt Senior researcher and project leader of the Census Statistics Netherlands.
2011 CENSUS Coverage Assessment – What’s new? OWEN ABBOTT.
Emerging methodologies for the census in the UNECE region Paolo Valente United Nations Economic Commission for Europe Statistical Division International.
Quality issues on the way from survey to administrative data: the case of SBS statistics of microenterprises in Slovakia Andrej Vallo, Andrea Bielakova.
Using Multiple Methods to Reduce Errors in Survey Estimation: The Case of US Farm Numbers Jaki McCarthy, Denise Abreu, Mark Apodaca, and Leslee Lohrenz.
Collecting Electronic Data From the Carriers: the Key to Success in the Canadian Trucking Commodity Origin and Destination Survey François Gagnon and Krista.
Implementation of quality indicators in the Finnish statistics production process Kari Djerf Statistics Finland Q2008, Rome Italy.
Longitudinal Data Recent Experience and Future Direction August 2012.
The Dutch Virtual Census based on registers and already existing surveys Eric Schulte Nordholt Senior researcher and project leader of the Census Statistics.
The Dutch Virtual Census of 2001 A New Approach by Combining Different Sources Eric Schulte Nordholt ECE Census meetings Geneva, November 2004.
for statistics based on multiple sources
Use of Administrative Data Seminar on Developing a Programme on Integrated Statistics in support of the Implementation of the SNA for CARICOM countries.
Methodology used for estimating Census tables based on incomplete information Eric Schulte Nordholt Senior researcher and project leader of the Census.
A comparison of sample and register based survey: the case of labour market data De Gregorio C., Filipponi D., Martini A., Rocchetti I.
Quality Assurance Programme of the Canadian Census of Population Expert Group Meeting on Population and Housing Censuses Geneva July 7-9, 2010.
1 1 A two-phase life-cycle model of integrated statistical micro data Li-Chun Zhang Statistics Norway
1 1 A statistical approach to surrogate data Li-Chun Zhang Statistics Norway
Why register-based statistics? Eric Schulte Nordholt Statistics Netherlands Division Social and Spatial Statistics Department Support and Development Section.
© John M. Abowd 2007, all rights reserved General Methods for Missing Data John M. Abowd March 2007.
Household Surveys: American Community Survey & American Housing Survey Warren A. Brown February 8, 2007.
Pilot Census in Poland Some Quality Aspects Geneva, 7-9 July 2010 Janusz Dygaszewicz Central Statistical Office POLAND.
Comparison and integration among different sources for determining the legal foreign population stock in Italy Costanza Giovannelli Joint.
1 For a Population Statistical Register Characteristics and Potentials for the Official Statistics Central department for administrative data and archives.
United Nations Workshop on Revision 3 of Principles and Recommendations for Population and Housing Censuses and Evaluation of Census Data, Amman 19 – 23.
1 1 Topics difficult to measure in a register-based census Harald Utne Census Project Statistics Norway UNECE-Eurostat Meeting on Population.
Representativity Indicators for Survey Quality Programme: Cooperation Theme: Socio-economic sciences and Humanities Activity: Socio-economic and scientific.
Register-based statistics production Administrative data used for statistical purposes Bo Sundgren 2010 Part 1.
© Statistisches Bundesamt, VI A Statistisches Bundesamt The new method of the next german Population census Johann Szenzenstein, Federal Statistical Office,
1 A prediction approach to representative sampling Ib Thomsen & Li-Chun Zhang Statistics Norway
S T A T I S T I K A U S T R I A Quality Assessment of register-based Statistics A Quality Framework Manuela LENK Directorate.
QUALITY ASSESSMENT OF THE REGISTER-BASED SLOVENIAN CENSUS 2011 Rudi Seljak, Apolonija Flander Oblak Statistical Office of the Republic of Slovenia.
Overview and challenges in the use of administrative data in official statistics IAOS Conference Shanghai, October 2008 Heli Jeskanen-Sundström Statistics.
Beyond 2011 Administrative data sources and low-level aggregate models for producing population counts.
1 STK 4600: Statistical methods for social sciences. Survey sampling and statistical demography Surveys for households and individuals.
INFO 4470/ILRLE 4470 Visualization Tools and Data Quality John M. Abowd and Lars Vilhuber March 16, 2011.
Q2010 Special session 34 Data quality and inference under register information Discussion by Carl-Erik Särndal.
Census quality evaluation: Considerations from an international perspective Bernard Baffour and Paolo Valente UNECE Statistical Division Joint UNECE/Eurostat.
5.8 Finalise data files 5.6 Calculate weights Price index for legal services Quality Management / Metadata Management Specify Needs Design Build CollectProcessAnalyse.
Chapter 5 Sampling and Surveys. Section 5.3 Sample Surveys in the Real World.
provide information Challenges in the transition from traditional to register- based census in Austria Conference of European Statisticians.
Small area estimation combining information from several sources Jae-Kwang Kim, Iowa State University Seo-Young Kim, Statistical Research Institute July.
INFO 7470/ECON 7400/ILRLE 7400 Register-based statistics John M. Abowd and Lars Vilhuber March 4, 2013 and April 4, 2016.
Adjusting for coverage error in administrative sources in population estimation Owen Abbott Research, Development and Infrastructure Directorate.
Evaluating imputation of sex and age for substitutes in substitute households Michael Ryan 2008 UNECE Work Session on Statistical Data Editing.
Implementation of Quality indicators for administrative data
Sub-regional workshop on integration of administrative data, big data
6.1 Quality improvement Regional Course on
Quality assurance and assessment in the vital statistics system
Kaija Ruotsalainen Statistics Finland
Presentation transcript:

1 A theoretical framework for register-based statistics --- Can we carry on without it? Li-Chun Zhang Statistics Norway

Statistical data by combination of sources: Coverage, content & relevance

Quality: Statistical vs. administrative register Wallgren & Wallgren (2007, Wiley): –“An administrative register is maintained to store records on all objects to be administered.” (Ideally) –“A statistical register is based on data from administrative registers that have been processed to suit statistical purposes.” A defining distinction in perspectives –Administrative register: Individual data of all importance –Statistical register: Properties at various aggregated levels  Quality of register-based statistics  Micro-data quality of a statistical register Notable lag of theoretical framework (Platek and Särndal, Holt, Nanopoulos, 2001) –A framework for quality assessment –Theoretical frameworks for different quality aspects

Process accuracy vs. statistical accuracy: Any unbiased, efficient estimators based on statistical registers? Process accuracy –Matching/mismatching rate –Extent of duplicates –Amount of missing values –… Statistical accuracy –Coverage –Relevance –Inherent stochastic variation  An example of the UK claimant register (Holt, 2007, TAS) –people claiming unemployment related benefits –entire population of claimants (say 1.5 million) –no sampling error and arguably a perfect measure –derived once each month on the same working day –daily variation about 10,000 in this count

A historic parallel: Survey sampling before Neyman (1934) The representative method (Kiær, 1895) with a three-stage design using 1890 census as frame: –1st: 128 counties and 23 towns throughout the country –2nd: cohorts of males of age 17, 22, 27, 32, etc. –3rd: persons with surname initial A, B, C, L, M, N ISI-committee 1924 report: “I think I may venture to say that nowadays there is hardly one statistician, who in principle will contest the legitimacy of the representative method”. (Jensen) Representative sampling (Neyman, 1934):

Comparisons to non-sampling errors in sample survey and census Unidentified units in register & non-response in survey –Related to under-coverage –Yes, imputation. But a quite different theory! –Example: register households  ‘Imputation’ of household identity  Which imputation methods do you use? Hot-deck? Definitional error in register source & measurement error –Related to relevance –Yes, a kind of measurement error. But bias dominates! And often clearly different in different sub-populations. –Example: register unemployment (REG_unemp) REG_unemp = ILO_unemp + Bias + Random_error Sample SurveyCensusRegister-based survey Coverage errors Relevance errors Non-response errors Integration errors Measurement errors Sampling errors Coverage errors Matching/mismatching errors Missing-link errors Aggregation errors (Partial classification)

A theory for detailed statistics: Signal or noise?

A theory for micro-data quality Reality at “Storgata 9”: –H0101: Astrid (72) - widow –H0102: Tommy (32) & Jenny (29) & Ronny (2) - cohabitation –H0201: Olav (29) & Lena (29) - cohabitation since Census 2001 –H0202: Knut (27) - single Register: –H0101: Astrid (72) - widow –H0101: Tommy (32) & Jenny (29) & Ronny (2) - cohabitation –H0101: Olav (29) - single –?: Lena (29) - single –?: Knut (27) - single Only Astrid is correctly registered. But when/how does it matter? Administrative register => Individual data of all importance => Unit-specific error Statistical register => A theory of types - How real is a record: how are variables related to each other - How representative is a record: distribution of the types Imputed cohabitation in household register