Harvard Center for Population and Development Studies1 Census Editing and the Art of Motorcycle Maintenance Michael J. Levin Center for Population and.

Slides:



Advertisements
Similar presentations
Handling attrition and non- response in longitudinal data Harvey Goldstein University of Bristol.
Advertisements

Data Imputation United Nations Statistics Division (UNSD) 16 March 2011 Santiago, Chile.
Module B-4: Processing ICT survey data TRAINING COURSE ON THE PRODUCTION OF STATISTICS ON THE INFORMATION ECONOMY Module B-4 Processing ICT Survey data.
Customer Support Center Measurement Business Headquarters YOKOGAWA
United Nations Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Amman, Jordan,
Quality assurance -Population and Housing Census Alma Kondi, INSTAT, Albania.
Sullivan – Fundamentals of Statistics – 2 nd Edition – Chapter 1 Section 2 – Slide 1 of 22 Chapter 1 Section 2 Observational Studies, Experiments, and.
© John M. Abowd 2005, all rights reserved Analyzing Frames and Samples with Missing Data John M. Abowd March 2005.
Why sample? Diversity in populations Practicality and cost.
Census Processing Procedures Matt Sobek Funded by the National Science Foundation Minnesota Population Center.
Using SMS-Gateways for Monitoring Progress and Quality of Data Collection: Lessons Learned from the 2010 Population Census of Indonesia Thoman Pardosi.
Multiple Indicator Cluster Surveys Data Interpretation, Further Analysis and Dissemination Workshop Overview of Data Quality Issues in MICS.
Brief Overview of Data Processing of Afghanistan Household Listing, Pilot Census Results, Population and Housing Census and NRVA Survey Brief Overview.
UNSD Regional Workshop on Census Data Processing for the English speaking African Countries: Contemporary technologies for data capture, methodology and.
Edit and Imputation of the 2011 Abu Dhabi Census Glenn Hui and Hanan AlDarmaki Statistics Centre - Abu Dhabi UNECE CES Work Session on Statistical Data.
National Statistical Office, Thailand 2-6 December 2013, Hanoi, Viet Nam Census Evaluation.
Chapter 8 Introduction to Hypothesis Testing. Hypothesis Testing Hypothesis testing is a statistical procedure Allows researchers to use sample data to.
Household Surveys ACS – CPS - AHS INFO 7470 / ECON 8500 Warren A. Brown University of Georgia February 22,
Copyright 2010, The World Bank Group. All Rights Reserved. PROCESSING, Part 1 Data capture, editing, imputation and tabulation Quality assurance for census.
Joint UNECE/Eurostat Meeting on Population and Housing Censuses (28-30 October 2009) Accuracy evaluation of Nuts level 2 hypercubes with the adoption of.
Chapter 9 Audit Sampling: An Application to Substantive Tests of Account Balances This presentation focuses (like my course) on MUS. It omits the effect.
Using IPUMS.org Katie Genadek Minnesota Population Center University of Minnesota The IPUMS projects are funded by the National Science.
© John M. Abowd 2007, all rights reserved Analyzing Frames and Samples with Missing Data John M. Abowd March 2007.
Topic (ii): New and Emerging Methods Maria Garcia (USA) Jeroen Pannekoek (Netherlands) UNECE Work Session on Statistical Data Editing Paris, France,
1 Archiving Michael J. Levin Harvard Center for Population and Development Studies
European Conference on Quality in Official Statistics Session 26: Quality Issues in Census « Rome, 10 July 2008 « Quality Assurance and Control Programme.
Current Population Survey Sponsor: Bureau of Labor Statistics Collector: Census Bureau Purpose: Monthly Data for Analysis of Labor Market Conditions –CPS.
Principles of Information Systems, Sixth Edition Systems Design, Implementation, Maintenance, and Review Chapter 13.
Population and Housing Census and Survey Editing
UNSD-ESCWA Regional Workshop on Census Data Processing in the ESCWA region: Contemporary technologies for data capture, methodology and practice of data.
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys Addis Ababa,
Data Analysis.
Quality Assurance Programme of the Canadian Census of Population Expert Group Meeting on Population and Housing Censuses Geneva July 7-9, 2010.
Understanding Sampling
UNSD Regional Workshop on Census Data Processing for the English speaking African Countries: Contemporary technologies for data capture, methodology and.
1 SIPP IMPUTATION SCHEME AND DISCUSSION ITEMS Presenters: Nat McKee - Branch Chief Census Bureau Demographic Surveys Division (DSD) Income Surveys Programming.
Statistical Expertise for Sound Decision Making Quality Assurance for Census Data Processing Jean-Michel Durr 28/1/20111Fourth meeting of the TCG - Lubjana.
ICCS 2009 IDB Workshop, 18 th February 2010, Madrid 1 Training Workshop on the ICCS 2009 database Weighting and Variance Estimation picture.
Notes 1.3 (Part 1) An Overview of Statistics. What you will learn 1. How to design a statistical study 2. How to collect data by taking a census, using.
Household Surveys: American Community Survey & American Housing Survey Warren A. Brown February 8, 2007.
Paolo Valente - UNECE Statistical Division Slide 1 Technology for census data coding, editing and imputation Paolo Valente (UNECE) UNECE Workshop on Census.
Copyright 2010, The World Bank Group. All Rights Reserved. Managing Data Processing Section B.
Data Processing of the 2010 Population and Housing Census September 2008, Bangkok, Thailand National Statistical Office, Thailand.
UNSD-UNESCAP Regional Workshop on Census Data Processing: Contemporary technologies for data capture, methodology and practice of data editing, documentation.
UNSD-UNESCAP Regional Workshop on Census Data Processing: Contemporary technologies for data capture, methodology and practice of data editing, documentation.
United Nations Workshop on Evaluation and Analysis of Census Data, 1-12 December 2014, Nay Pyi Taw, Myanmar DATA VALIDATION-I Evaluation of editing and.
The Scientific Method How to Solve just about anything Chemistry.
UNSD-UNESCAP Regional Workshop on Census Data Processing: Contemporary technologies for data capture, methodology and practice of data editing, documentation.
Sullivan – Statistics: Informed Decisions Using Data – 2 nd Edition – Chapter 1 Section 2 – Slide 1 of 21 Chapter 1 Section 2 Observational Studies, Experiments,
INFO 4470/ILRLE 4470 Visualization Tools and Data Quality John M. Abowd and Lars Vilhuber March 16, 2011.
An Overview of Editing and Imputation Methods for the next Italian Censuses Gianpiero Bianchi, Antonia Manzari, Alessandra Reale UNECE-Eurostat Meeting.
Chapter 8 Sampling Methods and the Central Limit Theorem.
Evaluating imputation of sex and age for substitutes in substitute households Michael Ryan 2008 UNECE Work Session on Statistical Data Editing.
UNSD-UNESCAP Regional Workshop on Census Data Processing: Contemporary technologies for data capture, methodology and practice of data editing, documentation.
1 Handbook on Population and Housing Census Editing Department of Economic and Social Development United Nations Statistics Division Studies in Methods,
Census Planning and Management
Census Mobile Data Capture Using CSPro in Lesotho
The treatment of uncertainty in the results
Census of Population & Housing 2001 Sri Lanka
Chapter 6 Database Design
PRODUCTION PROCESS AND FLOW
Tomaž Špeh, Rudi Seljak Statistical Office of the Republic of Slovenia
Administrative Data and their Use in Economic Statistics
Generic Statistical Business Process-Censuses
CSPro: Census and Survey Processing System
The role of metadata in census data dissemination
Treatment of Missing Data Pres. 8
Manual Data Capture – Key Entry
Quality assurance and assessment in the vital statistics system
Presentation transcript:

Harvard Center for Population and Development Studies1 Census Editing and the Art of Motorcycle Maintenance Michael J. Levin Center for Population and Development Studies Harvard University

Harvard Center for Population and Development Studies2

3 The Census Process Data collection Capture Editing Tabulation and Dissemination Archiving

Harvard Center for Population and Development Studies4 History of census editing Early years – manual or nothing Computers Within record editing Between record editing Hot decking

Harvard Center for Population and Development Studies5 What is editing Editing is the systematic inspection of invalid and inconsistent responses, and subsequent manual or aurtomatic correction according to pre-determined rules. The editing team!!

Harvard Center for Population and Development Studies6 Why edit? Edited vs unedited data Always preserve original data Consider the users!!

Harvard Center for Population and Development Studies7 Table 1. Sample population by 15-year age group and sex, using unedited and edited data

Harvard Center for Population and Development Studies8 Initial data sets contain errors How over-editing is harmful  Timeliness  Finances  Distortion of true values  False sese of security

Harvard Center for Population and Development Studies9 What we have to look out for Treatment of unknowns Spurious changes Using tolerances Learning from the editing process Quality assurance Costs of editing

Harvard Center for Population and Development Studies10 Types of Correction  Manual correction Names Sex  Automatic correction Assign an unknown Assign a value Impute a value

Harvard Center for Population and Development Studies11 Types of editing  Top Down The usual way Is simple and straight forward  Multiple-variable editing approach Uses more information Is likely to be a better guess

Harvard Center for Population and Development Studies12 Two parts of a national edit Structure editing Content editing

Harvard Center for Population and Development Studies13 Methods of Correction and Imputation  When imputation is not needed – toggling sexes  Static imputation – cold deck technique  Dynamic imputation – hot deck technique

Harvard Center for Population and Development Studies14 Goals of the edit Imputed household should closely resemble failed edit household Imputed data should come from a single donor person or house resembling donee Equally good donors should have equal chances

Harvard Center for Population and Development Studies15 Figure 1. Sample editing specifications to correct sex variable, in pseudocode

Harvard Center for Population and Development Studies16 Hot Deck Geographic considerations Use of related items Order of the items changes the matrices Complexity of the imputation matrices

Harvard Center for Population and Development Studies17 In developing hot decks Imputation matrices – structure of the matrices Standardized imputation matrices Seeding the decks Big, but not too big Understanding what the matrix is doing When the matrix is too small … Occupation and industry!!

Harvard Center for Population and Development Studies18 Aids to checking edits 1. Listings 2. Writing whole households before and after with changes 3. Frequency matrices

Harvard Center for Population and Development Studies19 Figure 4. Example of a listing summary for Malawi 2008 Census [LISTING]

Harvard Center for Population and Development Studies20 Figure 5. Example of a listing summary for Lesotho 2006 Census [LISTING]

Harvard Center for Population and Development Studies21 Figure 8. Example of a write listing for Ethiopia 2007 Census [WRITE]

Harvard Center for Population and Development Studies22 Figure 10. Example of a frequency distribution for Sudan 2008 Census [FREQUENCY]

Harvard Center for Population and Development Studies23 Figure 11. Example of a frequency distribution for additional edit for Zambia 1990 Census [FREQUENCY]

Harvard Center for Population and Development Studies24 Other considerations Running the edit three times: seed, run, check Saving original responses Imputation flags

Harvard Center for Population and Development Studies25 Conclusions Edits part of the series of census procedures Usually more for aesthetics than technical enhancement Hardware and software changing rapidly The revolution continues!