United Nations Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Amman, Jordan,

Slides:



Advertisements
Similar presentations
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Asunción,
Advertisements

1 Editing the Integrated Census in Israel. EDITING THE INTEGRATED CENSUS IN ISRAEL Prepared by Eva Rotenberg, Central Bureau of Statistics, Israel (1)
Data Imputation United Nations Statistics Division (UNSD) 16 March 2011 Santiago, Chile.
Unido.org/statistics International workshop on industrial statistics 8 – 10 July, Beijing Non response in industrial surveys Shyam Upadhyaya.
Burton Reist Chief, 2020 Research and Planning Office U.S. Census Bureau 2014 SDC and CIC Steering Committee Meeting March 5, Census Updates.
Module B-4: Processing ICT survey data TRAINING COURSE ON THE PRODUCTION OF STATISTICS ON THE INFORMATION ECONOMY Module B-4 Processing ICT Survey data.
Harvard Center for Population and Development Studies1 Census Editing and the Art of Motorcycle Maintenance Michael J. Levin Center for Population and.
Organised by United Nations Statistics Division (UNSD) in conjunction with the African Centre for Statistics Addis Ababa, Ethiopia, 14 – 18 September 2009.
The estimation strategy of the National Household Survey (NHS) François Verret, Mike Bankier, Wesley Benjamin & Lisa Hayden Statistics Canada Presentation.
1 The Structure of Error Components in 2010 Census Coverage Error Estimation: P-sample estimates Mary H. Mulry & Bruce D. Spencer U.S. Census Bureau Northwestern.
Quality assurance -Population and Housing Census Alma Kondi, INSTAT, Albania.
1 The 2010 Census Coverage Measurement Survey Patrick J. Cantwell U.S Census Bureau Annual Meeting of the Association of Public Data Users September 25,
Kevin Deardorff Assistant Division Chief, Decennial Management Division U.S. Census Bureau 2014 SDC / CIC Conference April 2, Census Updates.
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Bangkok,
NLSCY – Non-response. Non-response There are various reasons why there is non-response to a survey  Some related to the survey process Timing Poor frame.
© John M. Abowd 2005, all rights reserved Analyzing Frames and Samples with Missing Data John M. Abowd March 2005.
UNSD Regional Workshop on Census Data Processing for the English speaking African Countries: Contemporary technologies for data capture, methodology and.
Edit and Imputation of the 2011 Abu Dhabi Census Glenn Hui and Hanan AlDarmaki Statistics Centre - Abu Dhabi UNECE CES Work Session on Statistical Data.
United Nations Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Amman, Jordan,
Arun Srivastava. Types of Non-sampling Errors Specification errors, Coverage errors, Measurement or response errors, Non-response errors and Processing.
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT OSMAN BIN SAIF Session 15.
Copyright 2010, The World Bank Group. All Rights Reserved. PROCESSING, Part 1 Data capture, editing, imputation and tabulation Quality assurance for census.
Sampling. Concerns 1)Representativeness of the Sample: Does the sample accurately portray the population from which it is drawn 2)Time and Change: Was.
Copyright 2010, The World Bank Group. All Rights Reserved. Estimation and Weighting, Part I.
Nonresponse issues in ICT surveys Vasja Vehovar, Univerza v Ljubljani, FDV Bled, June 5, 2006.
Multiple Indicator Cluster Surveys Survey Design Workshop Sampling: Overview MICS Survey Design Workshop.
Central egency for public mobilization and statistics.
Overview of error model for estimates of foreign-born immigration using data from the American Community Survey Mary H. Mulry U.S. Census Bureau 2011 International.
© John M. Abowd 2007, all rights reserved Analyzing Frames and Samples with Missing Data John M. Abowd March 2007.
European Conference on Quality in Official Statistics Session 26: Quality Issues in Census « Rome, 10 July 2008 « Quality Assurance and Control Programme.
Editing a Mixture of Canadian 2006 Census and Tax Data Mike Bankier Statistics Canada 2006 Work Session on Statistical Data Editing
MGT-491 QUANTITATIVE ANALYSIS AND RESEARCH FOR MANAGEMENT OSMAN BIN SAIF Session 16.
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Bangkok,
UNSD-ESCWA Regional Workshop on Census Data Processing in the ESCWA region: Contemporary technologies for data capture, methodology and practice of data.
Copyright 2010, The World Bank Group. All Rights Reserved. Reducing Non-Response Section B 1.
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys Addis Ababa,
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Asunción,
UNSD Regional Workshop on Census Data Processing for the English speaking African Countries: Contemporary technologies for data capture, methodology and.
1 SIPP IMPUTATION SCHEME AND DISCUSSION ITEMS Presenters: Nat McKee - Branch Chief Census Bureau Demographic Surveys Division (DSD) Income Surveys Programming.
© John M. Abowd 2007, all rights reserved General Methods for Missing Data John M. Abowd March 2007.
May 12-15, Evaluating the Integrated Census Israel Pnina ZADKA Central Bureau of Statistics Israel.
Paolo Valente - UNECE Statistical Division Slide 1 Technology for census data coding, editing and imputation Paolo Valente (UNECE) UNECE Workshop on Census.
United Nations Workshop on Revision 3 of Principles and Recommendations for Population and Housing Censuses and Evaluation of Census Data, Amman 19 – 23.
UNSD-UNESCAP Regional Workshop on Census Data Processing: Contemporary technologies for data capture, methodology and practice of data editing, documentation.
United Nations Workshop on Evaluation and Analysis of Census Data, 1-12 December 2014, Nay Pyi Taw, Myanmar DATA VALIDATION-I Evaluation of editing and.
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Addis.
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys Asunción,
1 A Study of Sources for the Error Structure in Estimates of Census Coverage Error Components Mary H. Mulry U.S. Census Bureau 2009 International Total.
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Asunción,
UNSD-UNESCAP Regional Workshop on Census Data Processing: Contemporary technologies for data capture, methodology and practice of data editing, documentation.
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Bangkok,
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Asunción,
Overview of Census Evaluation through Demographic Analysis Pres. 3 United Nations Regional Workshop on the 2010 World Programme on Population and Housing.
The 2011 Census: Estimating the Population Alexa Courtney.
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Addis.
1/22#/ Post Enumeration Survey for Population Census Jaewon Lee Statistical Research Institute Statistics Korea.
United Nations Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Amman, Jordan,
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Bangkok,
United Nations Regional Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Addis.
Evaluating imputation of sex and age for substitutes in substitute households Michael Ryan 2008 UNECE Work Session on Statistical Data Editing.
1 Handbook on Population and Housing Census Editing Department of Economic and Social Development United Nations Statistics Division Studies in Methods,
Methodologies and Procedures for Evaluating Coverage and Content Error Pres. 6 United Nations Regional Workshop on the 2010 World Programme on Population.
Post Enumeration Surveys Pres. 2
The European Statistical Training Programme (ESTP)
Generic Statistical Business Process-Censuses
Treatment of Missing Data Pres. 8
Tabulation and Dual System of Estimation (DSE) Pres. 9
Chapter 13: Item nonresponse
Methodologies and Procedures for Evaluating Coverage and Content Error Pres. 6 United Nations Regional Workshop on the 2010 World Programme on Population.
Presentation transcript:

United Nations Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Amman, Jordan, November, 2010 Treatment of Missing Data Pres. 8

United Nations Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Amman, Jordan, November, 2010 Treatment of Missing Data Why are some data missed?  Refusals  Item non-response  Time constraints  Paucity of resources  Lax enumerators  Units not found  Insufficient data for matching, etc.

United Nations Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Amman, Jordan, November, 2010 Treatment of Missing Data Four types of missing data  Unit missing data - Household non-interview  Item missing data - When some information for household or person is available and some information is not available  Unresolved match or residence status – When match or residence status in P-sample could not be determined for PES Estimation  Unresolved enumeration status – When correct or erroneous enumeration status in E-sample could not be determined for PES estimation

How to treat missing data ?  A. doing nothing  B. use only the complete records  C. use a weighting method  D. impute a missing value  E. probability for unresolved status United Nations Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Amman, Jordan, November, 2010

A. Doing nothing  If missing data are very few, it may not have significant effect on data usages and one can ignore them  Requires to work with an incomplete dataset with missing data United Nations Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Amman, Jordan, November, 2010

B. Use only the complete records  Easy but risky option. The subset of respondents may be: Non representative of the total population under study  Estimates may be seriously biased, unless non-response doesn’t depend on any of the variables of interest  This option can be envisaged only for a rapid descriptive analysis United Nations Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Amman, Jordan, November, 2010

C. Use a weighting method  Unit non-response: Increase the respondents’ weight to compensate for the non- respondents. The objective is to produce roughly unbiased estimates  Item non-response: Possible to use reweighting methods but the main disadvantage is to have different weights for the same record (one for each of the variables). That’s why it is generally not used for item non-response United Nations Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Amman, Jordan, November, 2010

D. Imputation  The process of imputation changes one or more responses or missing values in a record or several records to ensure internally coherent records result  Before using any imputation method, the best strategy is to start with manual study of responses; imputation can then handle the remaining unresolved edit failures  Two methods of imputation: Cold Deck and Hot Deck  Cold Deck Imputation: Used mainly for missing or unknown values (not for inconsistent/invalid values) Values are imputed on a proportional basis from a distribution of valid responses (e.g., from previous census) In doing so, cold deck draws values from a fixed (but possibly outdated) distribution of values United Nations Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Amman, Jordan, November, 2010

D. Imputation (contd.)  Hot Deck or Dynamic Imputation: Used for both missing data and inconsistent/invalid items Uses one or more variables to estimate the likely response based on data about individuals with similar characteristics The “donor set” (or imputation matrix) constantly changes through updating; therefore, imputations dynamically change during the process of editing all the records Thus, hot deck draws from a distribution that dynamically changes with each imputation and eventually (through modifications) “approaches” the distribution of current data set Caution: if the different items for a particular record have unknown values, hot deck may not use the same “donor” to impute for both missing values; in this case, it is preferable to use the same donor for both items United Nations Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Amman, Jordan, November, 2010

E. Probability for unresolved status  Unresolved match or residence status in P-sample: Estimate probabilities of match (residence) status  Form cells/groups to estimate probabilities  Each cell be homogenous with respect to probability to be estimated  Different/hetrogenous Probabilities between cells/groups  Use reasons for field follow-up to form cells  Unresolved enumeration status in E-sample: Estimate probabilities of correct enumeration  Form cells/groups to estimate probabilities  Each cell be homogenous with respect to probability to be estimated  Different/hetrogenous probabilities between cells/groups  Use reasons for field follow-up to form cells United Nations Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Amman, Jordan, November, 2010

E. Probability for unresolved status  Example: Estimate probability of match (residence) status for a cell  Total cases sent for field follow-up = 100  Number of cases resolved after field follow-up = 80  Number of matched cases out of 80 =48  Number of nonmatched cases out of 80 = = 32  Probability of match for an unresolved case is = 48/80 = 0.60  Probability of nonmatch for an unresolved case is = 32/80 =0.40  Unresolved enumeration status in E-sample: Estimate probabilities of correct enumeration for a cell  Same methodology as described for match status United Nations Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Amman, Jordan, November, 2010

Summary results of missing data operations  Essential to evaluation, process planning and management: i) number of cases of each type of error; ii) unit non- response rates; iii) non-response rates for each item; iv) imputation rates for each item; v) unresolved status by type, ….  Important to generate edit trail showing all data changes and substituted values with their tallies  If original value of data is changed in any way; flags should be added onto each item that is changed or imputed  This information is critical for planning of future censuses; e.g., As a means to investigate age threshold below which female with “child ever born” triggers a query edit and to decide if threshold should be modified for future rounds United Nations Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Amman, Jordan, November, 2010

A useful reference Handbook on Population and Housing Census Editing Rev. 1 Available on the UNSD website and currently under printing United Nations Workshop on the 2010 World Programme on Population and Housing Censuses: Census Evaluation and Post Enumeration Surveys, Amman, Jordan, November, 2010

Thank You!