An Active Collection using Intermediate Estimates to Manage Follow-Up of Non-Response and Measurement Errors Jeannine Claveau, Serge Godbout and Claude.

Slides:



Advertisements
Similar presentations
By: Saad Rais, Statistics Canada Zdenek Patak, Statistics Canada
Advertisements

1 ESTIMATION IN THE PRESENCE OF TAX DATA IN BUSINESS SURVEYS David Haziza, Gordon Kuromi and Joana Bérubé Université de Montréal & Statistics Canada ICESIII.
The Challenge of Integrating New Surveys into an Existing Business Survey Infrastructure Éric Pelletier Statistics Canada ICES-III Montréal, Québec, Canada.
Paul Smith Office for National Statistics
Survey of Electronic Commerce and Technology: Past, Present and Future Challenges Jason Raymond Third International Conference on Establishment Surveys.
Integrated Business Statistics Program (IBSP) Introduction Daniela Ravindra Director, Enterprise Statistics Division November 9th, 2010.
Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle Statistics Canada ICES-III Montréal, Québec, Canada.
The estimation strategy of the National Household Survey (NHS) François Verret, Mike Bankier, Wesley Benjamin & Lisa Hayden Statistics Canada Presentation.
Quality indicators for measuring and enhancing the composition of survey response Q2008 – Special topic session, July 9 Jelke Bethlehem and Barry Schouten.
The Many Ways of Improving the Industrial Coding for Statistics Canada’s Business Register Yanick Beaucage ICES III June 2007.
Optimizing CATI Call Scheduling International Total Survey Error Workshop Hidiroglou, M.A., with Choudhry, G.H., Laflamme, F. Statistics Canada 1 Statistics.
MISUNDERSTOOD AND MISUSED
STAT262: Lecture 5 (Ratio estimation)
FINAL REPORT: OUTLINE & OVERVIEW OF SURVEY ERRORS
08/08/2015 Statistics Canada Statistique Canada Paradata Collection Research for Social Surveys at Statistics Canada François Laflamme International Total.
André Loranger New York, June 2014 The Integrated Business Statistics Program at Statistics Canada Presentation to the UNCEEA Assistant Chief Statistician.
18/08/2015 Statistics Canada Statistique Canada Responsive Collection Design (RCD) for CATI Surveys and Total Survey Error (TSE) François Laflamme International.
Combining administrative and survey data: potential benefits and impact on editing and imputation for a structural business survey UNECE Work Session on.
Administrative Data at Statistics Canada – Current Uses and the Way Forward 27 th Voorburg Group Meeting Warsaw, Poland André Loranger October 4, 2012.
Q2010, Helsinki Development and implementation of quality and performance indicators for frame creation and imputation Kornélia Mag László Kajdi Q2010,
Use of Administrative Data in Statistics Canada’s Annual Survey of Manufactures Steve Matthews and Wesley Yung May 16, 2004 The United Nations Statistical.
The Future of Administrative Data ICES III End Panel Discussion Don Royce Statistics Canada June 2007.
Impact of using fiscal data on the imputation strategy of the Unified Enterprise Survey of Statistics Canada Ryan Chepita, Yi Li, Jean-Sébastien Provençal,
Collecting Electronic Data From the Carriers: the Key to Success in the Canadian Trucking Commodity Origin and Destination Survey François Gagnon and Krista.
Eurostat Overall design. Presented by Eva Elvers Statistics Sweden.
A Strategy for Prioritising Non-response Follow-up to Reduce Costs Without Reducing Output Quality Gareth James Methodology Directorate UK Office for National.
Current and Future Applications of the Generic Statistical Business Process Model at Statistics Canada Laurie Reedman and Claude Julien May 5, 2010.
Stop the Madness: Use Quality Targets Laurie Reedman.
Prioritizing Follow-up of Non- Respondents Using Scores for the Canadian Quarterly Survey of Financial Statistics for Enterprises Pierre Daoust Statistics.
Prioritizing Follow-up for the Canadian Quarterly Survey of Financial Statistics for Enterprises Pierre Daoust Statistics Canada ICES III, Montréal Statistique.
A Theoretical Framework for Adaptive Collection Designs Jean-François Beaumont, Statistics Canada David Haziza, Université de Montréal International Total.
Eurostat Weighting and Estimation. Presented by Loredana Di Consiglio Istituto Nazionale di Statistica, ISTAT.
SNA seminar in the Caribbean Integrated questionnaires Marie Brodeur Director General, Industry Statistics Branch, Statistics Canada St. Lucia February,
A Quality Driven Approach to Managing Collection and Analysis
Integrated Approach Processing Marie Brodeur Director General, Industry Statistics Branch, Statistics Canada St. Lucia February, 2014 SNA seminar in the.
United Nations Oslo City Group on Energy Statistics OG7, Helsinki, Finland October 2012 ESCM Chapter 8: Data Quality and Meta Data 1.
Multivariate selective editing via mixture models: first applications to Italian structural business surveys Orietta Luzi, Guarnera U., Silvestri F., Buglielli.
Unified Enterprise Survey New Horizons International Conference on Establishment Surveys Daniela Ravindra and Marie Brodeur Montreal, June 2007 Statistics.
METIS 2011 Workshop Session III – National Implementation of the GSBPM Alice Born and Tim Dunstan Thursday October 6, 2011 Implementation of the GSBPM.
Administrative Data at Statistics Canada – Current Uses and the Way Forward Wesley Yung and Peter Lys, Statistics Canada.
Metadata models to support the statistical cycle: IMDB
Chapter 1 Introduction and Data Collection
Implementation of Quality indicators for administrative data
Theme (v): Managing change
Redesigning French structural business statistics, using more administrative data ICESIII, Montréal, june 2007.
Implementation of a more efficient way of collecting data SBS: use of administrative data Statistics Belgium June 2009.
Étienne Saint-Pierre and Serge Godbout, Statistics Canada
Planning the change to a targeted survey design
Survey phases, survey errors and quality control system
ESTP COURSE ON PRODCOM STATISTICS
Survey phases, survey errors and quality control system
Metadata in the modernization of statistical production at Statistics Canada Carmen Greenough June 2, 2014.
Assessing Quality of Paradata to Better Understand the Data Collection Process for CAPI Social Surveys François Laflamme Milana Karaganis European Conference.
PRODCOM SURVEY IN THE UNITED KINGDOM
Sampling Methods.
Business Statistics: A First Course (3rd Edition)
Istat - Structural Business Statistics
ANALYSIS OF POSSIBILITY TO USE TAX AUTHORITY DATA IN STS. RESULTS
Metadata used throughout statistics production
New Techniques and Technologies for Statistics 2017  Estimation of Response Propensities and Indicators of Representative Response Using Population-Level.
Sampling and estimation
The Swedish survey on turnover in the service sector
Changes in the Canadian Census of Population Program
METIS 2011 Workshop Session III – National Implementation of the GSBPM
2.7 Annex 3 – Quality reports
Multi-Mode Data Collection
Étienne Saint-Pierre, Statistics Canada
Adaptive mixed-mode design WP1
Innovations on the Canadian Census
Presentation transcript:

An Active Collection using Intermediate Estimates to Manage Follow-Up of Non-Response and Measurement Errors Jeannine Claveau, Serge Godbout and Claude Turmelle Statistics Canada International Total Survey Error Workshop Québec, June 20, 2011

Outline Introduction Quality Indicators (QI) Measure of Impact (MI) Scores Future Work 2 Statistics Canada • Statistique Canada Statistics Canada • Statistique Canada 20/11/2018 20/11/2018

Unified Enterprise Surveys (UES) UES consists of 58 annual business surveys integrated in terms of content, collection and data processing Collect information on enterprise financial variables Collection period: February to early October Telephone pre-contact used for new units in the sample Mail questionnaires for initial data gathering Telephone follow-up conducted to collect data from non- respondent and to resolve failed edits 3 Statistics Canada • Statistique Canada Statistics Canada • Statistique Canada 20/11/2018 20/11/2018

Unified Enterprise Surveys (UES) Score function is used to prioritize telephone follow-up for non-response Score based on weighted sampling revenue For most of the UES surveys: no score function used for failed edits follow-up Collection Processing System: Blaise Paradata in Blaise Transaction History files 4 Statistics Canada • Statistique Canada Statistics Canada • Statistique Canada 20/11/2018 20/11/2018

Integrated Business Statistics Program (IBSP) IBSP is under development to redesign and expand UES to integrate other enterprise surveys and sub-annual surveys Goal: Reduce operating costs Enhance quality assurance IBSP will integrate 120 surveys by 2016 (phase 1: 2014) Electronic questionnaire (electronic data collection) will be the principal collection mode offered to enterprise 5 Statistics Canada • Statistique Canada Statistics Canada • Statistique Canada 20/11/2018 20/11/2018

Current UES – Processing Model Collection, processing and analysis are run sequentially Estimates produced at very end only Collection ends at set date Sampling Collection Processing Analysis Dissemination Statistics Canada • Statistique Canada 20/11/2018 20/11/2018 6

Statistics Canada • Statistique Canada IBSP – Estimates Model Collection, processing and analysis will be run in parallel Estimates will be produced and re-run periodically Collection could end earlier when pre-specified quality target has been met Collection Sampling Dissemination Processing Analysis 7 Statistics Canada • Statistique Canada 20/11/2018 20/11/2018 7

Active Collection Role: Manage follow-up of non-response and measurement errors (failed edits) Responsive Design (Laflamme and Karaganis, 2010) or Dynamic adaptative approach (Schouten, Calinescu and Luiten, 2011) that uses data available during collection to modify collection strategy Estimates and quality indicators will be produced periodically throughout collection: e.g. monthly basis Then scores measuring impact on estimates and on quality indicators are calculated to allocate and prioritize telephone follow-up 8 Statistics Canada • Statistique Canada Statistics Canada • Statistique Canada 20/11/2018 20/11/2018

Basic Collection Strategy Initial Sample S Production of Intermediate Estimates Successive Designs d0 d1 d2 di-2 di-1 NR1 NR2 NR3 NRi-1 NRi Observed NR and Response R1 R2 R3 Ri-1 Ri Statistics Canada • Statistique Canada 20/11/2018

Parameter and Estimator Variables of interest: Set of I key variables Parameters of interest: Stratified expansion estimators: Sampling variances: (under a stratified Bernoulli design): Where i, k and h identify respectively the I variables, the Nh units and the H strata Nh = stratum population size ph = unit sampling probability within stratum nh = the stratum sample size 10 Statistics Canada • Statistique Canada Statistics Canada • Statistique Canada 20/11/2018 20/11/2018

Non-Response Response propensity model: Estimation: Auxiliary data and paradata would be used to estimate response propensities Estimation: In case on non-response, we will either use imputation or reweighting to account for missing data Response propensities could be used to form imputation or reweighting homogeneous classes for reducing the non- response bias (Haziza and Beaumont, 2007) Stratified expansion estimators: 11 Statistics Canada • Statistique Canada Statistics Canada • Statistique Canada 20/11/2018 20/11/2018

Quality Indicators (QI) Role: Monitor collection progress Help to allocate and prioritize collection efforts Can be item-based Specific to a variable of interest Variance, CV Item response rate of a variable of interest Bias: MSE: 12 Statistics Canada • Statistique Canada Statistics Canada • Statistique Canada 20/11/2018 20/11/2018

Quality Indicators (QI) Can be covariate-based Derived from statistics on the estimated response propensities given the covariates X Independent from the variables of interest Examples of covariate-based QIs (Schouten, 2011) : Mean response propensity: R-indicator: Standardized Maximal Bias: Standardized Maximal Variance: Standardized Maximal MSE: 13 Statistics Canada • Statistique Canada Statistics Canada • Statistique Canada 20/11/2018 20/11/2018

Measure of Impact (MI) Scores Types of Scores Common types: Edit-related and estimate-related score functions Example: Predicted difference in estimates (Hedlin, 2008) Proposal: Generalize the MI Score to include quality-related score functions For an estimated parameter (estimate or quality indicator) Definition: Where is the estimated parameter after changing reported values and/or covariates of unit k respectively to and/or and is a scaling factor 14 Statistics Canada • Statistique Canada Statistics Canada • Statistique Canada 20/11/2018 20/11/2018

Measure of Impact (MI) Scores MI Score for an estimated total: Requires predicted values to compare to reported values Proposal: Use imputation to obtain predicted values Used to prioritize units for failed edit follow-up 15 Statistics Canada • Statistique Canada Statistics Canada • Statistique Canada 20/11/2018 20/11/2018

Measure of Impact (MI) Scores MI Score for item-based quality indicators MI Score for estimated sampling variance for expansion estimators Specific to a variable of interest Also use imputation to obtain predicted values Linked directly to quality of output estimates Prioritize units for failed edit follow-up 16 Statistics Canada • Statistique Canada Statistics Canada • Statistique Canada 20/11/2018 20/11/2018

Measure of Impact (MI) Scores MI Score for item-based quality indicator MI Score for covariate-based quality indicator Used to prioritize units for both non-response and failed edit follow- up 17 Statistics Canada • Statistique Canada Statistics Canada • Statistique Canada 20/11/2018 20/11/2018

Active Collection Management A large number of variables to monitor Monitoring all of them will be a challenge Not all equally important Identify a limited number of key variables For each key variable Quality monitored using item-based QIs and MI Scores For the non-key variables Quality controlled using covariate-based QIs 18 Statistics Canada • Statistique Canada Statistics Canada • Statistique Canada 20/11/2018 20/11/2018

Active Collection Management MI scores for each estimated parameter and quality indicator are considered local scores In order to prioritize units for telephone follow-up, global score per unit is needed Derive global MI Score (Hedlin, 2008) Sum, maximum or Euclidian distance could be used Some QIs are appropriate for evaluating the impact of non- response and others for the impact of edit failures Derive one global score for non-response follow-up and one global score for failed edit follow-up 19 Statistics Canada • Statistique Canada Statistics Canada • Statistique Canada 20/11/2018 20/11/2018

Control Quality with Covariated-Based QIs Goal: Increase the average of the response propensities while improving their homogeneity. 20 Statistics Canada • Statistique Canada Statistics Canada • Statistique Canada 20/11/2018 20/11/2018

Control Quality with Covariated-Based QIs Goal: Increase the average of the response propensities while improving their homogeneity. 21 Statistics Canada • Statistique Canada Statistics Canada • Statistique Canada 20/11/2018 20/11/2018

Control Quality with Covariated-Based QIs Goal: Increase the average of the response propensities while improving their homogeneity. 22 Statistics Canada • Statistique Canada Statistics Canada • Statistique Canada 20/11/2018 20/11/2018

Summary Current Approach Proposed Approach Follow-up and editing A score function with no link with estimates Prioritization based on frame (static) information Follow-up and editing for influential units based on estimates and quality Prioritization based on frame, paradata (dynamic) and estimates Processing Results (and quality measures) are known only at the end of the process Produce results (and quality measures) during collection to manage collection Cut-off collection Based on weighted response rate Based on achieved quality of estimates 23 Statistics Canada • Statistique Canada Statistics Canada • Statistique Canada 20/11/2018 20/11/2018 23

Quality Indicators (QI) Measure of Impact (MI) Scores Summary Quality Indicators (QI) Measure of Impact (MI) Scores Quality (accuracy) specific to a domain and an estimate Impact of a unit on an estimate or on a quality indicator Monitor collection and analysis progress Allocate and prioritize collection and analysis efforts Proactively identify problems Assess quality of produced estimates Close active collection Non-response and failed-edit follow-up 24 Statistics Canada • Statistique Canada Statistics Canada • Statistique Canada 20/11/2018 20/11/2018

Summary Covariate-based QIs Item-based QIs Independent of survey variables Related to survey variables Used to all variables Used with MI Scores to monitor specified key variables Mean response propensity R-indicator Standardized Maximal Bias, Variance and MSE Other… Item response rate Variance, CV Bias, MSE 25 Statistics Canada • Statistique Canada Statistics Canada • Statistique Canada 20/11/2018 20/11/2018

Non-response follow-up Summary Non-response follow-up MI Scores Failed Edit follow-up One global score Item response rate Mean response propensity R-indicator Variance, CV Item-based MSE Standardized Maximal Bias, Variance and MSE Estimated total Estimated sampling variance 26 Statistics Canada • Statistique Canada Statistics Canada • Statistique Canada 20/11/2018 20/11/2018

Future Work Methodology development Response propensity model: development of a model based on data and paradata Item-based and covariate-based QIs Validation of the proposed strategy Conduct simulation studies and develop prototypes using current UES environment Summer 2011 prototype: response rates, imputation rate, CV and MI scores Next prototype: Other local and global MI scores and QIs 27 Statistics Canada • Statistique Canada Statistics Canada • Statistique Canada 20/11/2018 20/11/2018

Discussion What quality indicators are appropriate to measure the risks of potential bias in the estimates? What is the best way to use quality indicator (e.g. R-indicator) to monitor collection of highly skewed business surveys? The proposed approach obviously affects the response propensities throughout collection. Although we can adjust the estimator later on to take this into account, is it something we should move away from? Or should we take advantage of it? In the proposed approach, are there any additional aspects that should be considered? 28 Statistics Canada • Statistique Canada Statistics Canada • Statistique Canada 20/11/2018 20/11/2018

Statistics Canada • Statistique Canada Merci / Thank You For more information,  Pour plus d’information, please contact: veuillez contacter : Jeannine Claveau jeannine.claveau@statcan.gc.ca Serge Godbout serge.godbout@statcan.gc.ca Claude Turmelle claude.turmelle@statcan.gc.ca Statistics Canada • Statistique Canada 20/11/2018