Imputation in the 2001 Census Robert Beatty NILS User Forum 11 December 2009.


Coverage: how the Census deals with missing households, missing people within households, and incomplete returns.

Coverage: the Census is statutory under the Census Act (Northern Ireland) 1969, with penalties for non-compliance. Therefore it counts everyone. Doesn’t it?

Coverage: population in thousands
Year | Published Census figure | MYE
1991 | 1,578 (enumerated) | 1,607 (best estimate)

Coverage - international. Australia 2006: 96% coverage; they don’t impute but adjust MYEs. New Zealand 2006: 95% response rate; NZ imputed for non-response, but only on 4 key variables. Canada: ‘adjust for non-responding households’, which requires knowing about occupied households.

Adjustment issues: 1991 coverage was 98%, but what about inference about the population? Non-response is not homogeneous: it is concentrated among young adults, lower social classes and deprived areas.

Coverage: acknowledge under-enumeration (1991 Census 1,578k; MYE 1,607k). Decision to adjust the Census 2001 database. Objective: all Census outputs to fully reflect the whole population. ‘One Number Census’: Census = MYE.

Coverage: population in thousands
Year | Published Census figure | MYE
1991 | 1,578 (enumerated) | 1,607
2001 | 1,685 (adjusted) | 1,689

Coverage: ‘One Number Census’ method. The basic principle is to use a large-scale Census Coverage Survey (CCS) to estimate under-enumeration in sampled areas, then apply the survey estimates elsewhere.

Census Coverage Survey: the UK was split into about 100 Estimation Areas (EAs), each of about 0.5m population, with three in Northern Ireland. About 200 postcodes / 3,000 households per Estimation Area. Three socio-economic strata within each EA, with a separate analysis in each stratum.

Census Coverage Survey: fieldwork about 3 weeks after Census day. Face-to-face interviews by trained interviewers, who were given a map of the postcode boundary and asked to re-enumerate the postcode. Short questionnaire focused on coverage.

Matching: forms scanned into system. Special matching software developed. Database retrieval system. CCS returns carefully matched with Census returns; error rate estimated to be under 0.1 per cent.

Dual System Estimator (DSE): use matched Census and CCS data. The DSE estimates the adjustment for those missed in both Census and CCS.
 | Counted by CCS: Yes | Counted by CCS: No | Total
Counted by Census: Yes | $n_{11}$ | $n_{10}$ | $n_{1+}$
Counted by Census: No | $n_{01}$ | $n_{00}$ | $n_{0+}$
Total | $n_{+1}$ | $n_{+0}$ | $n_{++}$
DSE estimate for the area (under certain assumptions): $n_{++} = \dfrac{n_{1+} \times n_{+1}}{n_{11}}$
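
The slide does not list the ‘certain assumptions’. A common reading, taken here as an assumption rather than from the source, is that being counted by the CCS is independent of being counted by the Census. Under that assumption the estimator, and the implied count of people missed by both, follow directly:

```latex
% Assumption (not stated on the slide): Census and CCS capture are independent,
% so the Census coverage rate among CCS-counted people equals the overall rate.
\frac{n_{11}}{n_{+1}} \approx \frac{n_{1+}}{n_{++}}
\quad\Longrightarrow\quad
\hat{n}_{++} = \frac{n_{1+}\, n_{+1}}{n_{11}},
\qquad
\hat{n}_{00} = \hat{n}_{++} - n_{11} - n_{10} - n_{01} = \frac{n_{10}\, n_{01}}{n_{11}}
```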

DSE: simple example (fish pond). Day 1: catch 950 fish, mark with a red dot. Day 2: catch 900 fish, mark with a blue dot. Matched: 855 had both blue and red dots. Question: how many fish are in the pond?

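Dual System Estimator (DSE)
 | Counted Day 2: Yes | Counted Day 2: No | Total
Counted Day 1: Yes | 855 | 95 | 950
Counted Day 1: No | 45 | $n_{00}$ | $n_{0+}$
Total | 900 | $n_{+0}$ | $n_{++}$
DSE estimate of the actual number of fish: $n_{++} = \dfrac{950 \times 900}{855} = 1{,}000$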
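
The fish pond arithmetic can be checked with a few lines of Python. This is a minimal sketch, not the One Number Census software; the function names `dual_system_estimate` and `missed_by_both` are made up for illustration.

```python
def dual_system_estimate(first_count, second_count, matched):
    """Estimate the total population from two counts and their overlap.

    first_count  -- number counted on the first occasion (n_1+)
    second_count -- number counted on the second occasion (n_+1)
    matched      -- number counted on both occasions (n_11)
    """
    if matched <= 0:
        raise ValueError("at least one matched case is needed")
    if matched > min(first_count, second_count):
        raise ValueError("matched cannot exceed either count")
    return first_count * second_count / matched


def missed_by_both(first_count, second_count, matched):
    """Estimate the number missed on both occasions (n_00)."""
    only_first = first_count - matched    # n_10
    only_second = second_count - matched  # n_01
    return only_first * only_second / matched


# Fish pond figures from the slide: 950 on day 1, 900 on day 2, 855 matched.
total_fish = dual_system_estimate(950, 900, 855)
never_caught = missed_by_both(950, 900, 855)
print(total_fish, never_caught)
```

The second function gives the same information from a different angle: the fish seen at least once (855 + 95 + 45) plus the estimated number never caught add up to the estimated total.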

Analysis: separately for each age-sex group, within each stratum, within each EA. Apply the DSE method to each sampling point (postcode) within the CCS area. Estimate the function DSE = f(observed count). Apply it to all other sampling points within the stratum (within the EA), and aggregate.

Ratio Estimation: a regression-type estimator. Each dot represents a CCS area. Use the Census figure to estimate the “true” figure.
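
One simple form of such an estimator is a ratio through the origin: the DSE totals in the sampled areas are divided by the Census counts in those same areas, and the resulting ratio scales up the Census count for the whole stratum. The sketch below assumes that form; the slide does not give the exact function used, and the function name and all figures are invented for illustration.

```python
def ratio_estimate(sample_census_counts, sample_dse_estimates, stratum_census_total):
    """Scale a stratum's Census count by the DSE-to-Census ratio seen in the sample."""
    if len(sample_census_counts) != len(sample_dse_estimates):
        raise ValueError("need one DSE estimate per sampled area")
    census_sum = sum(sample_census_counts)
    if census_sum <= 0:
        raise ValueError("sampled Census counts must sum to a positive number")
    # Multiply before dividing so integer inputs stay exact for as long as possible.
    return stratum_census_total * sum(sample_dse_estimates) / census_sum


# Hypothetical figures: three sampled postcodes in one age-sex group and stratum.
sample_census = [100, 150, 250]   # Census counts in the sampled postcodes
sample_dse = [105, 160, 260]      # DSE estimates for the same postcodes
stratum_total = ratio_estimate(sample_census, sample_dse, 10_000)
print(stratum_total)
```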

The One Number Census process

Imputing households: use dummy forms as location. Use dummy forms as ‘constraint’? Dependence on enumerators. Ireland 2006: 15% of properties vacant.

One Number Census outcome: 2001 Census response rate of 95%. 4.3% in wholly imputed households, mostly linked to dummy forms (3.0%). 0.4% additional people in already enumerated households. Imputed 80,000 people.

Coverage: population in thousands
Year | Published Census figure | MYE
1991 | 1,578 (enumerated) | 1,607
2001 | 1,685 (adjusted) | 1,689

Response rates by age

Quality of returns: so far, considered non-respondents (person and household imputation). What about the quality of returns actually made? Decision taken to go for ‘complete’ returns: item imputation.

Edit and Impute - Edit: a limited number of ‘hard’ edits (e.g. can’t be married if aged under 16). A larger number of ‘soft’ edits (quality).
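
A hard edit of this kind is a rule that a record must never break. The sketch below shows the married-under-16 rule as a simple check; the field names, the `"married"` code and the function name are assumptions for illustration, and the slide does not say what the 2001 system did with a failing value, so the check only reports the failure.

```python
def failed_hard_edits(person):
    """Return the names of the hard edit rules that this person record breaks."""
    failures = []
    age = person.get("age")
    # Rule from the slide: a person aged under 16 cannot be married.
    if age is not None and age < 16 and person.get("marital_status") == "married":
        failures.append("married_under_16")
    return failures


print(failed_hard_edits({"age": 12, "marital_status": "married"}))
print(failed_hard_edits({"age": 30, "marital_status": "married"}))
```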

Edit and Impute - Impute: general principle of a ‘complete’ data set. No ‘Not stated’ entries in outputs. Item imputation used, through a donor imputation system. No different in principle to systems used in sample surveys.
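
In donor imputation, a missing item is filled by copying the value from a similar record that does have it. The sketch below is a basic within-class version under stated assumptions: donors must agree with the recipient on a chosen set of matching variables, and one is picked at random. The 2001 system's actual matching variables and donor selection rules are not described in the slides, and the function name, field names and data here are hypothetical.

```python
import random


def impute_from_donors(records, item, match_keys, rng):
    """Fill missing `item` values by copying from a random donor record
    that agrees with the recipient on every key in `match_keys`."""
    # Build donor pools: observed values of `item`, grouped by matching class.
    donors = {}
    for rec in records:
        if rec.get(item) is not None:
            key = tuple(rec[k] for k in match_keys)
            donors.setdefault(key, []).append(rec[item])

    completed = []
    for rec in records:
        new_rec = dict(rec)  # leave the input records untouched
        if new_rec.get(item) is None:
            pool = donors.get(tuple(rec[k] for k in match_keys))
            if pool:  # records with no donor in their class stay missing
                new_rec[item] = rng.choice(pool)
                new_rec["imputed"] = True
        completed.append(new_rec)
    return completed


# Hypothetical records: two have no economic activity recorded.
people = [
    {"age_group": "16-24", "sex": "F", "economic_activity": "student"},
    {"age_group": "16-24", "sex": "F", "economic_activity": None},
    {"age_group": "65+", "sex": "M", "economic_activity": "retired"},
    {"age_group": "65+", "sex": "M", "economic_activity": None},
]
completed = impute_from_donors(
    people, "economic_activity", ["age_group", "sex"], random.Random(0)
)
for rec in completed:
    print(rec)
```

Flagging imputed values, as the `imputed` key does here, keeps it possible to report item imputation rates by variable afterwards.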

Edit and Impute - Impute: level of item imputation differed by variable. Not applied to religion.

Summary: objective in 2001 that Census outputs should reflect the whole population. Person and household imputation: 5% of persons imputed. Complete records generated for all returns through ‘item’ imputation.

I told them in 1951 it was just you, me and the dog, but they keep coming back every 10 years to check.

Looking forward: date for your diaries … 27 March 2011

Any questions?

Usual residence definition: historically, present on Census night. Most countries now use ‘usually resident’. Definitions do exist (UN). 2001: self-assessed. 2011: instructions, ‘intention to stay’.