Generic Statistical Data Editing Models (GSDEMs)

Slides:



Advertisements
Similar presentations
StEPS at EIAWhere We Are Now Paula Weir and Sue Harris Energy Information Administration, U.S. Department of Energy ICES3 Topic Contributed Session: Generalized.
Advertisements

United Nations Economic Commission for Europe Statistical Division High-Level Group Achievements and Plans Steven Vale UNECE
United Nations Economic Commission for Europe Statistical Division Standards-based Modernisation An update on the work of the High-level Group for the.
Producing and managing metadata Workshop on Writing Metadata for Development Indicators Lusaka, Zambia 30 July – 1 August 2012 Writing Metadata for Development.
United Nations Economic Commission for Europe Statistical Division Applying the GSBPM to Business Register Management Steven Vale UNECE
Environment Change Information Request Change Definition has subtype of Business Case based upon ConceptPopulation Gives context for Statistical Program.
Generic Statistical Information Model (GSIM) Thérèse Lalor and Steven Vale United Nations Economic Commission for Europe (UNECE)
Development of metadata in the National Statistical Institute of Spain Work Session on Statistical Metadata Genève, 6-8 May-2013 Ana Isabel Sánchez-Luengo.
Deliverable 2.6: Selective Editing Hannah Finselbach 1 and Orietta Luzi 2 1 ONS, UK 2 ISTAT, Italy.
Topic (ii): New and Emerging Methods Maria Garcia (USA) Jeroen Pannekoek (Netherlands) UNECE Work Session on Statistical Data Editing Paris, France,
Sander Scholtus and Leon Willenborg Editing and Imputation in the Memobust Handbook.
CBS-SSB STATISTICS NETHERLANDS – STATISTICS NORWAY Work Session on Statistical Data Editing Oslo, Norway, September 2012 Jeroen Pannekoek and Li-Chun.
United Nations Economic Commission for Europe Statistical Division High-Level Group Achievements and Plans Steven Vale UNECE
Topic (i): Selective editing / macro editing Discussants Orietta Luzi - Italian National Statistical Institute Rudi Seljak - Statistical Office of Slovenia.
Michelle Simard, Thérèse Lalor Statistics Canada CSPA Project Manager UNECE Work Session on Statistical Data Confidentiality Helsinki, October 2015 Confidentialized.
Modernisation Activities DIME-ITDG – February 2015 Item 7.
Work Session on Statistical Metadata 2013 Session III: Metadata in the Statistical Business Process Better documenting statistical business processes:
Generic Statistical Information Model (GSIM) Jenny Linnerud
New and Emerging Methods UN/ECE Work Session on Statistical Data Editing Vienna April 21-23, 2008.
Generic Statistical Data Editing Models (GSDEMs) Workshop on the Modernisation of Official Statistics The Hague, 24 November 2015.
Ljubljana, 11 Mai 2011UNECE Work session on SDE Topic (vii) New and emerging methods 1 Topic (vii): New and emerging methods Introduction Session organizers:
United Nations Economic Commission for Europe Statistical Division GSBPM and Other Standards Steven Vale UNECE
Modernization Committee on Products and Sources: Proposal for HLG project on Data Integration 5 th High -Level Group Workshop on the modernization of Production.
Statistical process model Workshop in Ukraine October 2015 Karin Blix Quality coordinator
United Nations Economic Commission for Europe Statistical Division GSBPM in Documentation, Metadata and Quality Management Steven Vale UNECE
United Nations Economic Commission for Europe Statistical Division Standards-based Modernisation Steven Vale UNECE
United Nations Economic Commission for Europe Statistical Division CSPA: The Future of Statistical Production Steven Vale UNECE
Session topic (i) – Editing Administrative and Census data Discussants Orietta Luzi and Heather Wagstaff UNECE Worksession on Statistical Data Editing.
Theme (iv): Standards and international collaboration
Modernisation Story of Statistics Slovenia
UNECE-CES Work session on Statistical Data Editing
Data Integration in Official Statistics 2017 Project Proposal
UNECE Data Integration Project
Theme (v): Managing change
Theme (i): New and emerging methods
Achievements in 2016 Data Integration Linked Open Metadata
UNITED NATIONS ECONOMIC COMMISSION FOR EUROPE CONFERENCE OF EUROPEAN STATISTICIANS Work Session on Statistical Data Editing April 2017 The Hague,
HLG-MOS Moving to the Next Level
Modernization Maturity Model
Experiences in Cooperation training using GSBPM
Navigating the application of Modernisation Frameworks when using Commercial Of The Shelf products. This presentation will provide a walkthrough of.
GSIM Implementation at Statistics Finland Session 1: ModernStats World - Where to begin with standards based modernisation? UNECE ModernStats World Workshop.
Survey phases, survey errors and quality control system
Generic Statistical Business Process Model (GSBPM)
Survey phases, survey errors and quality control system
GSIM The Generic Statistical Information Model
A handbook on validation methodology Marco Di Zio Istat
6.1 Quality improvement Regional Course on
The Generic Statistical Information Model
Statistical organisations should use standardised and industrial processes for the production of statistics in order to be more efficient. The statistical.
Modernising Official Statistics
Blue Sky Thinking Network – Looking Back & Ahead
Jeroen Pannekoek, Sander Scholtus and Mark van der Loo
CSPA: The Future of Statistical Production
Applying the ESS EARF in a VIP project: The ESS.VIP Validation example
Recent developments in the Generic Statistical Business Process Model: Revisions and Quality Indicators Alice Born, Statistics Canada,
Presentation to SISAI Luxembourg, 12 June 2012
GSBPM AND ISO AS QUALITY MANAGEMENT SYSTEM TOOLS: AZERBAIJAN EXPERIENCE Yusif Yusifov, Deputy Chairman of the State Statistical Committee of the Republic.
Topic (ii): Global Solutions to Editing
Business architecture
The future of Statistical Production
ESTP Training Course “Enterprise Architecture and the different EA layers, application to the ESS context ” Rome, 16 – 19 October 2017.
CSPA Common Statistical Production Architecture Motivations: definition and benefit of CSPA and service oriented architectures Carlo Vaccari Istat
CSPA Common Statistical Production Architecture Motivations: definition and benefit of CSPA and service oriented architectures Carlo Vaccari Istat
CSPA Templates for sharing services
CSPA Templates for sharing services
process and supporting information
High-Level Group for the Modernisation of Official Statistics
Hands-on GSIM Mauro Scanu ISTAT
Presentation transcript:

Generic Statistical Data Editing Models (GSDEMs) One of the outputs of ….. the Generic Statistical Data Editing Models is an output Workshop on the Modernisation of Official Statistics The Hague, 24 November 2015

Aim of GSDEMs Data editing is one of the most expensive and time-consuming parts of the statistical production process. It uses advanced methodology and algorithms with opportunities of sharing methods and tools. The use of multiple and new sources poses new challenges for data editing. To exchange ideas, experiences and practices, we need common concepts, models and definitions The GSDEMs are standard references for statistical data editing, consistent with existing standards (GSBPM and GSIM). They aim to facilitate understanding, communication, practice and development. Data editing is an important field for modernisation of statistics, for several reasons. To face these issues together and to cooperate, we need common concepts, models and definitions. That is what GSDEMs are intended to provide. Thereby staying of course fully consistent with other standards, such as GSBPM and GSIM.

The Task Team A Generic Process Framework for Statistical Data Editing was proposed at the 2014 UNECE Work Session on Data Editing GSDEMs have been developed by an international task team (Finland, France, Italy, Norway, Netherlands, UNECE) Under authority of the High Level Group for the Modernisation of Official Statistics (HLG-MOS). Resulting in a report describing: Data editing business functions (what) Methods to perform these functions (how) Organisation of the process (process flow models) The idea of creating a framework for data editing originated at the 2014 Work Session on Data Editing. Following that idea, GSDEMS have been developed by an international task team with 6 contributing institutions. The work was done under the authority of the HLG-MOS It resulted in a report describing: Data editing business functions (what id done in data editing), the methodology used in data editng (how) and the organisation of the process (in process flow models).

Data Editing Functions Data editing functions perform different kinds of tasks (GSBPM), and can be classified by purpose in three broad categories. Review (of input data) Assessing the consistency and plausibility of input data Quality measurement and validation Selection (for further processing) Selection of values or units for specified further treatment Amendment (manual editing, imputation) Changing or replacing data values to improve data quality The business functions in data editing perform different kinds of tasks, that can be found in GSBPM. They can be classified by their purpose in three broad categories: Review, Selection and Amendment. Review functions asses the consistency and plausibility of the data Selection functions select data values or units for specific further treatment Amendment function actually change data to improve the quality Each with a number of lower-level, more specific functions.

Methods for data editing functions Review Evaluation of pre-specified edit-rules (Im)plausibility of values Outlier detection algorithms Selection Automatic selection by influence scores Selection for manual review Graphical inspection of estimates Amendment Regression Imputation Estimation of missing values Imputation by donor values ⋯ ⋯ ⋯ ⋯ GSDEMs not only describe what is done by the specific data editing functions but also how it is done. This means that commonly used methodology for each of the functions is listed. Often more methods can be used for the same function depending on the circumstances ⋯ ⋯

Process flow: Organising the data editing process A data editing process consists of functions with specified methods that are executed in an organised way. The organisation is described by subdivided it into: Process steps : Sub-processes each consisting of a number of specified functions. Amendment and Review but no Selection. To describe the routing of the data through the process steps Process controls : Selection but no amendment functions. Do not modify data values but specify different streams of data through the process flow. The report contains process-flows for a number of scenarios: Structural Business Statistics, Short-term statistics, Business census, Household statistics, Statistics based on multiple sources. A data editing process consists of a considerable number of functions with specified methods that need to be executed in an organised way. To describe the process, GSDEMs divide it into sub-processes or process step. But we also need to describe the navigation of the data through these process steps and for that we use the controls.

Example: SDE flow model for Households statistics

Example: SDE flow model for statistics using data integration