Edit and Imputation of the 2011 Abu Dhabi Census Glenn Hui and Hanan AlDarmaki Statistics Centre - Abu Dhabi UNECE CES Work Session on Statistical Data.

Slides:



Advertisements
Similar presentations
Variance Estimation When Donor Imputation is Used to Fill in Missing Values Jean-François Beaumont and Cynthia Bocci Statistics Canada Third International.
Advertisements

1 Editing the Integrated Census in Israel. EDITING THE INTEGRATED CENSUS IN ISRAEL Prepared by Eva Rotenberg, Central Bureau of Statistics, Israel (1)
Standardized Scales.
Module B-4: Processing ICT survey data TRAINING COURSE ON THE PRODUCTION OF STATISTICS ON THE INFORMATION ECONOMY Module B-4 Processing ICT Survey data.
Harvard Center for Population and Development Studies1 Census Editing and the Art of Motorcycle Maintenance Michael J. Levin Center for Population and.
The estimation strategy of the National Household Survey (NHS) François Verret, Mike Bankier, Wesley Benjamin & Lisa Hayden Statistics Canada Presentation.
United Nations Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Amman, Jordan,
Quality assurance -Population and Housing Census Alma Kondi, INSTAT, Albania.
Weighting and Imputation for CORE Social Housing Statistics Julia Bowman & Niall Goulding.
1 Editing Administrative Data and Combined Data Sources Introduction.
IMPUTING MISSING VALUES FOR HIERARCHICAL POPULATION DATA Overview of Database Research Muhammad Aurangzeb Ahmad Nupur Bhatnagar.
FINAL REPORT: OUTLINE & OVERVIEW OF SURVEY ERRORS
Brief Overview of Data Processing of Afghanistan Household Listing, Pilot Census Results, Population and Housing Census and NRVA Survey Brief Overview.
E&I for 2006 Canadian Census Mike Bankier Statistics Canada
The new HBS Chisinau, 26 October Outline 1.How the HBS changed 2.Assessment of data quality 3.Data comparability 4.Conclusions.
Eurostat Statistical Data Editing and Imputation.
Administrative Data at Statistics Canada – Current Uses and the Way Forward 27 th Voorburg Group Meeting Warsaw, Poland André Loranger October 4, 2012.
12th Meeting of the Group of Experts on Business Registers
Use of Administrative Data in Statistics Canada’s Annual Survey of Manufactures Steve Matthews and Wesley Yung May 16, 2004 The United Nations Statistical.
THE MAIN INNOVATIONS OF DATA EDITING AND IMPUTATION FOR THE 2010 ITALIAN AGRICULTURAL CENSUS G. Bianchi, R. M. Lipsi, P. Francescangeli, G. Ruocco, A.
Register-Based Census 2011 in Slovenia – Some Quality Aspects Danilo Dolenc Statistical Office of the Republic of Slovenia UNECE-Eurostat Expert Group.
New and Emerging Methods Maria Garcia and Ton de Waal UN/ECE Work Session on Statistical Data Editing, May 2005, Ottawa.
Topic (ii): New and Emerging Methods Maria Garcia (USA) Jeroen Pannekoek (Netherlands) UNECE Work Session on Statistical Data Editing Paris, France,
Challenges in Collecting Police-Reported Crime Data Colin Babyak Household Survey Methods Division ICES III - Montreal – June 20, 2007.
Jeroen Pannekoek - Statistics Netherlands Work Session on Statistical Data Editing Oslo, Norway, 24 September 2012 Topic (I) Selective and macro editing.
Editing a Mixture of Canadian 2006 Census and Tax Data Mike Bankier Statistics Canada 2006 Work Session on Statistical Data Editing
Stop the Madness: Use Quality Targets Laurie Reedman.
Topic (vi): New and Emerging Methods Topic organizer: Maria Garcia (USA) UNECE Work Session on Statistical Data Editing Oslo, Norway, September 2012.
National design, fieldwork and data harmonization for Labour Force Survey Irena Svetin Statistical Office of the Republic of Slovenia September 2014.
Valentina Stoevska ILO Department of Statistics Workshop on MDG Data Reconciliation: Employment Indicators, Beirut, July
Session topic (iii) – Editing and Imputation in the context of data integration from multiple sources and mixed modes Discussants Felipa Zabala, Orietta.
CBS-SSB STATISTICS NETHERLANDS – STATISTICS NORWAY Work Session on Statistical Data Editing Oslo, Norway, September 2012 Jeroen Pannekoek and Li-Chun.
Lyne Guertin Census Data Processing and Estimation Section Social Survey Methods Division Methodology Branch, Statistics Canada UNECE April 28-30, 2014.
May 12-15, Evaluating the Integrated Census Israel Pnina ZADKA Central Bureau of Statistics Israel.
Outlining a Process Model for Editing With Quality Indicators Pauli Ollila (part 1) Outi Ahti-Miettinen (part 2) Statistics Finland.
Paolo Valente - UNECE Statistical Division Slide 1 Technology for census data coding, editing and imputation Paolo Valente (UNECE) UNECE Workshop on Census.
United Nations Workshop on Revision 3 of Principles and Recommendations for Population and Housing Censuses and Evaluation of Census Data, Amman 19 – 23.
Topic (i): Selective editing / macro editing Discussants Orietta Luzi - Italian National Statistical Institute Rudi Seljak - Statistical Office of Slovenia.
Evaluating the Quality of Editing and Imputation: the Simulation Approach M. Di Zio, U. Guarnera, O. Luzi, A. Manzari ISTAT – Italian Statistical Institute.
Expert Group Meeting on MDG, Astana, 5-8 Oct.2009 MDG 3.2: Share of women in wage employment in the non-agricultural sector Sources of discrepancies between.
United Nations Workshop on Evaluation and Analysis of Census Data, 1-12 December 2014, Nay Pyi Taw, Myanmar DATA VALIDATION-I Evaluation of editing and.
United Nations Oslo City Group on Energy Statistics OG7, Helsinki, Finland October 2012 ESCM Chapter 8: Data Quality and Meta Data 1.
Methods and software for editing and imputation: recent advancements at Istat M. Di Zio, U. Guarnera, O. Luzi, A. Manzari ISTAT – Italian Statistical Institute.
Statistics Canada Citizenship and Immigration Canada Methodological issues.
Generic Statistical Data Editing Models (GSDEMs) Workshop on the Modernisation of Official Statistics The Hague, 24 November 2015.
Processing Methodology of Tax Data at Statistics Canada Authors: François Brisebois, Richard Laroche and Rossana Manriquez (Statistics Canada) Presenter:
The 2011 Census: Estimating the Population Alexa Courtney.
Automatic Editing Data. A New Version of DIA System Prepared by J.M. Gomez Presented by D.Lorca National Statistical Institute of Spain.
Study of Editing and Imputation Practices at Statistics Finland Janika Konnu and Pauli Ollila Statistics Finland Q2010: Editing session Wednesday 5 th.
Towards the 2011 UK Census Editing Strategy Heather Wagstaff and Steven Rogers Methodology Directorate Office for National Statistics, U.K.
Census quality evaluation: Considerations from an international perspective Bernard Baffour and Paolo Valente UNECE Statistical Division Joint UNECE/Eurostat.
The development of a data editing and imputation tool set UN/ECE Work Session on Statistical Data Editing Topic (ii): Global solutions to editing Claude.
An Overview of Editing and Imputation Methods for the next Italian Censuses Gianpiero Bianchi, Antonia Manzari, Alessandra Reale UNECE-Eurostat Meeting.
Ljubljana, 11 Mai 2011UNECE Work session on SDE Topic (vii) New and emerging methods 1 Topic (vii): New and emerging methods Discussion Discussants: Rudi.
Ljubljana, 11 Mai 2011UNECE Work session on SDE Topic (vii) New and emerging methods 1 Topic (vii): New and emerging methods Introduction Session organizers:
Workshop on MDG, Bangkok, Jan.2009 MDG 3.2: Share of women in wage employment in the non-agricultural sector National and global data.
Evaluating imputation of sex and age for substitutes in substitute households Michael Ryan 2008 UNECE Work Session on Statistical Data Editing.
1 Handbook on Population and Housing Census Editing Department of Economic and Social Development United Nations Statistics Division Studies in Methods,
Generic Statistical Data Editing Models (GSDEMs)
Creation of synthetic microdata in 2021 Census Transformation Programme (proof of concept) Robert Rendell.
Canadian Census E&I – Lessons Learned from 2006 with Plans for 2011
Session D12: Multisource statistics New sources: new modelling approaches Author: Gras Fabrice, Eurostat, unit B1, Methodology and corporate architecture.
Evolving Data Processing in the Statistics Centre – Abu Dhabi
Tomaž Špeh, Rudi Seljak Statistical Office of the Republic of Slovenia
CENSUS 2013 Key experiences Budapest, June 2017.
Albania 2021 Population and Housing Census - Plans
Generic Statistical Business Process-Censuses
Data processing German foreign trade statistics
Treatment of Missing Data Pres. 8
Multi-Mode Data Collection
Presentation transcript:

Edit and Imputation of the 2011 Abu Dhabi Census Glenn Hui and Hanan AlDarmaki Statistics Centre - Abu Dhabi UNECE CES Work Session on Statistical Data Editing (Oslo, Norway, 26 September 2012)

Outline Census Overview Edit and Imputation Methodologies Societal Differences and Challenges Performance Analysis Data Editing in the 2005 Census Conclusions

2011 Abu Dhabi Census Overview First census conducted by SCAD Main collection via CAPI, October questions Three methodologies used for edit and imputation, each with its own purpose: Donor Deterministic Manual

Edit and Imputation Methodologies Donor Imputation Canadian Census Edit and Imputation System v4.5 (CANCEIS) hot deck module Substitutes invalid value with value from “donor” record Deterministic Imputation Correct data via hard-coded rules (SAS) Applied mostly for out-of-scope responses Manual Imputation Manually check and modify data. Difficult cases like very large households.

Societal and Cultural Differences Very large household sizes: ~5 persons average Contrast to typical ~2.5 averages in Western countries Error rates increase with family size; used less exacting DLTs to account for this Households of 17+ treated as individual records, with some manual imputation as well

Societal Differences continued Complex relationships in large households Extended families High proportion of household servants Multiple wives – special consistency rules required Large Expatriate Population Many live in shared living arrangements Significant portion live in employer-provided camps Shares and collectives treated as 1-person households

Imputation Performance Example Statistics Predictive Accuracy: R 2 generated by regressing true on imputed values, used to assess predictive ability. Estimation Accuracy: Difference in means of true and imputed values, m 1, used to assess aggregate imputation accuracy. Imputation performance for Age Test Data: Starting with clean data, introduced two types of errors: missing data and “interchange” errors. Most performance measures from Euredit project (Charlton, 2003) Charlton, J. C. (2003).“Evaluating New Methods for Data Editing and Imputation - Results from the Euredit Project”, UNECE Statistical Data Editing Work session. Madrid, Spain.

Data Editing in the 2005 Census 2005: Manual and Deterministic imputation Phase 1: validation edits, outlier detection via SQL Small subset imputed via deterministic imputation Phase 2: Most failed records corrected manually Comparison to performance unknown 2005: Three methodologists, several months’ preparation 15 data clerks, 4+ months 2011: Two methodologists, 5 months total

Conclusions Modern edit and imputation methodology successfully applied in distinct cultural context Reliable results Measurable changes More efficient approach Special thanks to CANCEIS E&I unit, Statistics Canada