Canadian Census E&I – Lessons Learned from 2006 with Plans for 2011

Slides:



Advertisements
Similar presentations
IMPACT OF ONLINE EDITS AND INTERNET FEATURES IN THE 2006 CANADIAN CENSUS Presented by Mike Bankier on behalf of: Danielle Laroche and Chantal Grondin Statistics.
Advertisements

The estimation strategy of the National Household Survey (NHS) François Verret, Mike Bankier, Wesley Benjamin & Lisa Hayden Statistics Canada Presentation.
Edit and Imputation of the 2011 Abu Dhabi Census Glenn Hui and Hanan AlDarmaki Statistics Centre - Abu Dhabi UNECE CES Work Session on Statistical Data.
2012 Applied Demography Conference Session 2C Julien Bérard-Chagnon Demography Division Statistics Canada Monday, January 9, 2012 Using tax data to estimate.
E&I for 2006 Canadian Census Mike Bankier Statistics Canada
National Household Survey: collection, quality and dissemination Laurent Roy Statistics Canada March 20, 2013 National Household Survey 1.
Editing a Mixture of Canadian 2006 Census and Tax Data Mike Bankier Statistics Canada 2006 Work Session on Statistical Data Editing
Use of Administrative Data Seminar on Developing a Programme on Integrated Statistics in support of the Implementation of the SNA for CARICOM countries.
Quality Assurance Programme of the Canadian Census of Population Expert Group Meeting on Population and Housing Censuses Geneva July 7-9, 2010.
Estimation of preliminary unemployment rates by means of multiple imputation UN/ECE-Work Session on Data Editing Vienna, April 2008 Thomas Burg, Statistics.
Lyne Guertin Census Data Processing and Estimation Section Social Survey Methods Division Methodology Branch, Statistics Canada UNECE April 28-30, 2014.
A Quality Driven Approach to Managing Collection and Analysis
An Overview of Editing and Imputation Methods for the next Italian Censuses Gianpiero Bianchi, Antonia Manzari, Alessandra Reale UNECE-Eurostat Meeting.
Evaluating imputation of sex and age for substitutes in substitute households Michael Ryan 2008 UNECE Work Session on Statistical Data Editing.
Workshop on World Programme for the Census of Agriculture 2020 Amman, Jordan May 2016 Theme 8: Demographic and social characteristics Technical Session.
Weighting and imputation PHC 6716 July 13, 2011 Chris McCarty.
Regional Roundtable on
Senior Consultant, The Annie E. Casey Foundation
Census 2016 Counting Ireland’s Housing Stock
Marketing Research Aaker, Kumar, Leone and Day Eleventh Edition
Introduction and Methodology
Peter Linde, Interviewservice Statistics Denmark
Introduction to Hypothesis Test – Part 2
2011 Census The First Results
Statistics Netherlands Division Social and Spatial Statistics
I n f o r m a t i o n e n Wir bewegen
Population, Family and Community
Editing and Imputing Income Data in the 2008 Integrated Census prepared by Yael Klejman Israel Central Bureau of Statistics UNITED NATIONS ECONOMIC.
UNECE Work Session on Gender Statistics Belgrade November, 2017
LISA, Anticipating the Next Generation of Longitudinal Data
Update and Overview of Administrative Records for the 2020 Census
Data collection with Internet
Nonresponse Bias in a Nationwide Dual-Mode Survey
IPUMS CPS Summer Data Workshop June 4, 2018 Kari Williams
LISA, Anticipating the Next Generation of Longitudinal Data
An Active Collection using Intermediate Estimates to Manage Follow-Up of Non-Response and Measurement Errors Jeannine Claveau, Serge Godbout and Claude.
The European Statistical Training Programme (ESTP)
Survey phases, survey errors and quality control system
Multi-Mode Data Collection Approach
GEOG 204 Introductory GIS for the Social Sciences
Survey phases, survey errors and quality control system
Demographic Analysis and Evaluation
The European Statistical Training Programme (ESTP)
2011 Census - Household Results for Wales
2011 POPULATION AND HOUSING CENSUS PREPARATORY WORKS
Census Planning and Management
Population Estimation Beyond 2021
Generic Statistical Business Process-Censuses
Data collection with Internet
Metro ACEs Data 2018 Community Health Needs Assessment
Integrating Gender into Population and Housing Censuses
Turkish Statistical Institute
Multi-Mode Data Collection Approach
PROCESSING OF DATA The collected data in research is processed and analyzed to come to some conclusions or to verify the hypothesis made. Processing of.
Field procedures and non-sampling errors
Andrew Jenkins and Rosalind Levačić
Treatment of Missing Data Pres. 8
Changes in the Canadian Census of Population Program
Key Considerations for Planning and Management of Census Operations
FAMILY GENERATION BY REGISTERS – APPROVED METHODS AND IMPROVEMENTS FOR THE AUSTRIAN CENSUS 2021 Group of Experts on Population and Housing Censuses.
Barış DULKADİR TURKSTAT Expert
Multi-Mode Data Collection
Chapter 13: Item nonresponse
Data collection with Internet
Recommended Tabulations of the Principles and Recommendations for Population and Housing Censuses, Rev. 2 Session 4 United Nations Statistics Division.
SMALL AREA ESTIMATION FOR CITY STATISTICS
Chapter 5: The analysis of nonresponse
Innovations on the Canadian Census
Turkish Statistical Institute Demographic Statistics Department
Key Considerations for Planning and Management of Census Operations
Presentation transcript:

Canadian Census E&I – Lessons Learned from 2006 with Plans for 2011 Mike Bankier, Statistics Canada, bankier@statcan.ca Work Session on Statistical Data Editing Vienna Austria, April 21-23 2008

Outline of Talk Changes Made for 2006 Census Impact of adjusting occupancy status and imputation of total non-response households Processing of demographic variables with an emphasis on age Possible enhancements to E&I for 2011

Changes to 2006 Census 73% of dwellings mailed questionnaires 18% of dwellings responded by Internet 85% gave permission to link to tax form Questionnaires captured using ICR Non-Response Follow-Up (NRFU) done from centralized offices Failed Edit Follow-Up (FEFU) done from call centres

2006 Census Changes These new approaches reduced the field staff required by 46% Because of widespread labour shortages in some regions, the collection period was extended from mid-July to the end of Aug. (Census day May15) National NR rate 2.8% in 2006 vs 1.6% in 2001

Dwelling Classification Survey Mistakes made in field classifying dwellings as occupied or unoccupied. Sample of dwellings revisited to reassess occupancy status for dwellings where no response received DCS estimated 17.4% of 934,564 dwelling classified as unoccupied were occupied and 29.1% of 366,527 dwellings classified as occupied but with no responses were actually unoccupied Occupancy status for individual dwellings adjusted. Resulted in a 3.6% increase in the number of occupied dwellings and a 5.2% decrease in the number of unoccupied dwellings

Imputation of Total NR Households After the DCS adjustment, total non-response dwellings had all responses imputed by borrowing unimputed responses from another household Using a single donor for total non-response was less likely to produce implausible results Weighting used in 2001 to convert unoccupied dwellings to occupied - it could transfer population from one city block to another and be noticed by users

Demographic E&I Demographic E&I does minimum change imputation for blanks and inconsistencies so later program can form Census families All demographic variables for all persons in household are imputed simultaneously using CANCEIS Three types of Census families Couples without children Couples with children Lone Parents with children

Couple Editing Concepts For a couple, they should be both adults (age >=15) and both married or both common-law and have appropriate relationships to Person 1

Child/Parent Editing Concepts For a child/parent pair At least one parent must be 15 or more years older than the child and A female parent must not be more than 50 years older than a child and The relationships to Person 1 should be appropriate

0.85% In Wrong 5 Year Age Range - Data Capture Error

Analysis of Imputation of Age AGEU and AGE represent respectively the age of the person before and after minimum change donor imputation 99.11% had AGEU = AGE 0.61% had AGEU = Blank/Invalid 0.28% had AGEU ≠ AGE because of an inconsistency between AGEU and another variable

AGE Imputation for WIFE

Female Lone Parent vs Child Ages Before Imputation

Female Lone Parent vs Child Ages After Imputation

WIFE vs Child Ages Before Imputation

WIFE vs Child Ages After Imputation

Number of Children by Age Difference With Mother

2011 Changes – Small Domains Small domain (e.g. centenarians, same sex married couples) can have upwards bias because of response or data capture errors for persons outside the small domain Sometimes no alternate source of data to verify the small domain count and the domain is too large to be manually reviewed 100%

2011 Changes – Small Domains Manually review 20% sample of persons age 95+ to determine those with incorrect age For other 80% of persons age 95+, use nearest neighbour imputation to determine those with incorrect age Then in 2nd step, blank out incorrect ages and impute

2011 Changes – Use Failed Records as Donors Sometimes stratum failure rate is so high that number of donors is insufficient Failed records could be used as donors since frequently failed record is missing just one or two responses and would be suitable for imputing other responses

2011 Changes - More Minimum Change Donor Imputation Will do more minimum change donor imputation and less deterministic imputation where possible Will combine modules so more variables are imputed simultaneously where possible

Concluding Remarks Sophisticated E&I programs can do a better job detecting and resolving edit failures With this comes the responsibility to make few assumptions regarding the characteristics of the non-respondents or those giving inconsistent responses The impact of imputation should be made clear to users E&I should not be viewed as a panacea such that data quality standards can be lowered