Input Data Warehousing Canada’s Experience with Establishment Level Information Presentation to the Third International Conference on Establishment Statistics.

Slides:



Advertisements
Similar presentations
Use of Tax Data in the Unified Enterprise Survey (UES) Workshop on Use of Administrative Data in Economics Statistics Marie Brodeur Moscow November, 2006.
Advertisements

Guidelines on Integrated Economic Statistics United Nations Statistics Division Regional Seminar on Developing a Programme for the Implementation Programme.
Towards a Better Integration of Survey and Tax Data in the Unified Enterprise Survey Claude Turmelle Statistics Canada ICES-III Montréal, Québec, Canada.
United Nations Oslo City Group on Energy Statistics 8 th Oslo Group Meeting, Baku, Azerbaijan September 2013 ESCM Chapter 8: Data Quality and Metadata.
Chapter 6 Database Design
Chapter 2 Database Environment Pearson Education © 2014.
An Integrated Approach to Economic Statistics “ The Canadian Experience” UNSD – IBGE Workshop on Manufacturing Statistics Kevin Roberts Rio de Janeiro,
Jim Tebrake New York, March 2014 The Integrated Business Statistics Program at Statistics Canada Director General Statistics Canada Statistics Canada Statistique.
André Loranger New York, June 2014 The Integrated Business Statistics Program at Statistics Canada Presentation to the UNCEEA Assistant Chief Statistician.
United Nations Economic Commission for Europe Statistical Division Applying the GSBPM to Business Register Management Steven Vale UNECE
Institutional arrangements and legal framework for energy statistics United Nations Statistics Division International Workshop on Energy Statistics
Database Environment 1.  Purpose of three-level database architecture.  Contents of external, conceptual, and internal levels.  Purpose of external/conceptual.
Setting up a National Warehouse of Official Statistics in India P C Mohanan Deputy Director general National Statistical Organisation Ministry of Statistics.
Administrative Data at Statistics Canada – Current Uses and the Way Forward 27 th Voorburg Group Meeting Warsaw, Poland André Loranger October 4, 2012.
Sub-session 1B: General Overview of CRVS systems.
Metadata: Integral Part of Statistics Canada Quality Framework International Conference on Agriculture Statistics October 22-24, 2007 Marcelle Dion Director.
Database Systems – Data Warehousing
Copyright 2010, The World Bank Group. All Rights Reserved. Integrating Agriculture into National Statistical Systems Section A 1.
Case Studies: Statistics Canada (WP 11) Alice Born Statistics UNECE Workshop on Statistical Metadata.
1 Business Register: Quality Practices Eddie Salyers
The Canadian Integrated Approach to Economic Surveys Marie Brodeur, Peter Koumanakos, Jean Leduc, Éric Rancourt, Karen Wilson Statistics Canada International.
Data Warehousing at STC MSIS 2007 Geneva, May 8-10, 2007 Karen Doherty Director General Informatics Branch Statistics Canada.
1 Quality Assurance In moving information from statistical programs into the hands of users we have to guard against the introduction of error. Quality.
Judy Lee Enterprise Statistics Division Statistics Canada I 1 Developing Metadata Standards in an Integration Project at Statistics Canada United Nations.
Statistics Canada’s Real Time Remote Access Solution 2011 MSIS Meeting – Karen Doherty May 2011.
The Future of Administrative Data ICES III End Panel Discussion Don Royce Statistics Canada June 2007.
Integration of Annual Economic Collections – The Australian Experience ICESIII, Canada, 2007 Presented by Eden Brinkley.
Reform and Modernization of Russian Statistics. New Challenges in Data Collection and Compilation International Seminar on Modernizing Official Statistics:
CZECH STATISTICAL OFFICE Na padesátém 81, CZ Praha 10, Czech Republic 1 Subsystem QUALITY in Statistical Information System Czech.
Statistics Canada Statistique Canada Integrated Questionnaire Design Friends of the Chair Group on Integrated Economic Statistics Marie Brodeur, Michel.
Use of Administrative Data Seminar on Developing a Programme on Integrated Statistics in support of the Implementation of the SNA for CARICOM countries.
Editing of linked micro files for statistics and research.
1 Chapter 1 Introduction to Databases Transparencies.
Copyright 2010, The World Bank Group. All Rights Reserved. Principles, criteria and methods Part 2 Quality management Produced in Collaboration between.
SNA seminar in the Caribbean Integrated questionnaires Marie Brodeur Director General, Industry Statistics Branch, Statistics Canada St. Lucia February,
Developing and applying business process models in practice Statistics Norway Jenny Linnerud and Anne Gro Hustoft.
Integrated Approach Processing Marie Brodeur Director General, Industry Statistics Branch, Statistics Canada St. Lucia February, 2014 SNA seminar in the.
United Nations Oslo City Group on Energy Statistics OG7, Helsinki, Finland October 2012 ESCM Chapter 8: Data Quality and Meta Data 1.
Copyright 2010, The World Bank Group. All Rights Reserved. Recommended Tabulations and Dissemination Section B.
Joseph Lukhwareni Statistics South Africa Reengineering projects focusing on metadata and the statistical cycle Statistics South Africa, South Africa 3-5.
Integrated metadata systems History Status Vision Roadmap
1 Statistical business registers as a prerequisite for integrated economic statistics. By Olav Ljones Deputy Director General Statistics Norway
RECENT DEVELOPMENT OF SORS METADATA REPOSITORIES FOR FASTER AND MORE TRANSPARENT PRODUCTION PROCESS Work Session on Statistical Metadata 9-11 February.
26 January 2016CountrySTAT Training for the Philippines Introduction to FAOSTAT and CountrySTAT 1 Overview of the FAOSTAT and CountrySTAT Candido J. Astrologo,
Overview and challenges in the use of administrative data in official statistics IAOS Conference Shanghai, October 2008 Heli Jeskanen-Sundström Statistics.
Role of the IMDB in the CBA and IM Strategy Presented to Information Management Committee Standards Division June
Unified Enterprise Survey New Horizons International Conference on Establishment Surveys Daniela Ravindra and Marie Brodeur Montreal, June 2007 Statistics.
Chapter 2 Database Environment.
Central Warehousing Work Session of Friends of the Chair Group Bern, Switzerland 6 – 8 June 2007 Marie Brodeur/Michel Girard Statistics Canada.
The business process models and quality issues at the Hungarian Central Statistical Office (HCSO) Mr. Csaba Ábry, HCSO, Methodological Department Geneva,
Administrative Data at Statistics Canada – Current Uses and the Way Forward Wesley Yung and Peter Lys, Statistics Canada.
Development Account: 6th Tranche Strengthening the capacity of National Statistical Offices (NSOs) in the Caribbean Small Island Developing States to fulfill.
Metadata requirements for archiving structured data Alice Born Statistics Canada Joint UNECE/Eurostat/OECD Work Session on Statistical Metadata (9-11 April.
Metadata models to support the statistical cycle: IMDB
Implementation of Quality indicators for administrative data
MANAGEMENT OF STATISTICAL PRODUCTION PROCESS METADATA IN ISIS
Session 2: Institutional arrangements for energy statistics
Chapter 2 Database Environment Pearson Education © 2009.
Guidelines on Integrated Economic Statistics
Generic Statistical Business Process Model (GSBPM)
Unified Enterprise Survey
Database Environment Transparencies
Tomaž Špeh, Rudi Seljak Statistical Office of the Republic of Slovenia
Metadata in the modernization of statistical production at Statistics Canada Carmen Greenough June 2, 2014.
Working on coherence and consistency of an output database
Guidelines on Integrated Economic Statistics
Guidelines on Integrated Economic Statistics
Guidelines on Integrated Economic Statistics
The role of metadata in census data dissemination
Presentation transcript:

Input Data Warehousing Canada’s Experience with Establishment Level Information Presentation to the Third International Conference on Establishment Statistics Montreal, QC June

Overview Introduction of data warehousing as a concept Approaches to holding data Introduction to the Statistics Canada’s Unified Enterprise Statistics (UES) Program Centralized warehousing of UES data Example of the data warehouse at work

Subject-matter areas need or generate different types of information Data to support collection Questionnaires and supporting metadata Frame and sample information Status of each respondent during collection Survey data Administrative data Post-collection processing Edits (metadata) Imputation specifications Allocation specifications Generation of “clean” datasets Tabulation of estimates/analysis of results Value of estimate Data quality indicators Suppression patterns Analysis of coherence Input Data

Input Data Warehouse A copy of statistical input data specifically structured for querying and reporting Collection Post-collection processing Tabulation of estimates

Approaches to organizing information holdings Decentralized In a completely decentralized approach, each subject matter area maintains its own input data Centralized Centralized data warehouse contains all input data from all subject matter program areas All program areas need to use common concepts and standards for classification, or else a concordance would have to be found among these systems. These are extremes along a continuum

Centralized approach Advantages Economies of scale should lead to reduced overall development and maintenance costs Some human resource issues are eased (knowledge and skills retention and transfer) Eases integration of data to support data analysis, coherence analysis, etc. Allows subject-matter divisions to specialize in data analysis rather than data management

Decentralized approach Advantages Specialized subject matter expertise readily available Subject matter areas are not dependent on a central authority to make changes therefore flexibility is increased Care and control of the data is clearly established

Questions to address in moving to a more centralized environment What purpose does it serve? What must be done to the statistical model to ensure compatibility with other data sources? What mechanisms need to be in place to ensure productive client-service relationship? Who is custodian of the data? Do the benefits in moving to a more centralized environment truly outweigh the costs?

Statistics Canada and the Unified Enterprise Survey Program In the late 1990’s, Statistics Canada undertook a major program to improve the quality of the provincial economic accounts released by the Agency and the annual business surveys that feed into accounts These surveys were integrated in order to increase the quality of data produced from these surveys in terms of Consistency Coherence Breadth Depth

Features of the UES Improved frame (business register) Sampling made to be consistent across surveys and improved coverage Harmonized content and common collection applications Administrative data are to be used instead of survey data if possible and if the data are of good quality Common post-collection processing systems Common storage of data Central contact management system Improvements in outputs

Moving to a more centralized environment What is the purpose? The UES data warehouse forms a repository of all the files created through the processing phases of UES and accompanying metadata. This supports the work of analysts and survey managers in subject matter divisions, collection managers, statistical methodologists and users in the System of National Accounts

Moving to a more centralized environment What must be done to the statistical model to ensure compatibility with other data sources? The statistical model for UES surveys forced the harmonization of concepts, definitions and classifications across surveys Integration of survey and administrative data required the mapping of tax data to survey data (harmonized conceptually as well as characteristically)

Moving to a more centralized environment What mechanisms need to be in place to ensure productive client-service relationship? Project management structure for the UES that crosses functional boundaries Change management function to ensure seamless integration of surveys into UES

Moving to a more centralized environment Who is custodian of the data? ESD controls access to all common systems. Subject matter divisions are exclusively responsible for dissemination, including the determination of aggregations and data suppressions (due to quality and confidentiality)

Moving to a more centralized environment Do the benefits in moving to a more centralized environment truly outweigh the costs? Reduction in development costs Development of best practices that can be shared across the bureau Single point of access for input data improves security of all UES related data Rationalization of hardware to minimize the number of servers

The UES Data Warehouse UES Warehouse is centrally managed within Enterprise Statistics Division Major components of the data warehouse include: Metadata repository Processing metadata Central data store (CDS) External data Data that originate outside UES but have been integrated in the UES framework

The UES Data Warehouse Systems interfacing with the data warehouse: Unified Tracking and Retrieval Tool (USTART) Integrated Questionnaire Metadata System (IQMS) UES Processing Interface Working Estimation Environment (WEE) interface Macro-data adjustment Facility

Operational applications Operational monitoring Coherence analysis Baseline information for operational research Quality measures (i.e. response rate analysis) Integrated data analysis

Response rates in collection

Final response rates

The centralized system in action Outcomes The centralized input data warehouse provides a centralized tool that allows users to track performance on a consistent basis Same method Same source data

Conclusion The centralized data warehouse offers benefits to statistical programs There are a number of conditions that must be fulfilled for success Purpose Data compatibility Client-service relationship Custodian of data Cost-benefit