IT Directors’ Group Meeting 1 19-20 October 2010 Sharing data validation tools in the ESS Christine WIRTZ – Head of Unit B3 Georges PONGAS – Unit B3 Daniel.

Slides:



Advertisements
Similar presentations
Introduction to SDMX Seminar Eurostat/ECLAC 02 October 2012 August Götzfried Head of Unit, Eurostat B5 Management of statistical data and metadata.
Advertisements

Eurostat The ESS.VIP Validation and its implementation in waste statistics Q2014 – Session 13 4 June 2014 Hartmut Schrör, Eurostat.
ESS VIP project on Validation
Background Data validation, a critical issue for the E.S.S.
WP.5 - DDI-SDMX Integration E.S.S. cross-cutting project on Information Models and Standards Marco Pellegrino, Denis Grofils Eurostat METIS Work Session6-8.
Eurostat Unit B3 – Statistical Information Technologies Data transmission tools and services 15/05/ eDAMIS The standard solution for transmitting.
Implementation of SDMX for data and metadata exchange SDMX Basics Course 12 April 2013 Daniel Suranyi Eurostat B5 Management of statistical data and metadata.
Eurostat Expression language (EL) in Eurostat SDMX - TWG Luxembourg, 5 Jun 2013 Adam Wroński.
13-Jul-07 Implementation of SDMX for data and metadata exchange Balance of Payments Working Group 2-3 April 2012 Daniel Suranyi Eurostat B5 Management.
Francesco Rizzo (ISTAT - Italy) SDMX ISTAT FRAMEWORK GENEVE May 2007 OECD SDMX Expert Group.
Editing Building Block (EBB) Validation Tool for FDI and ITS Balance of Payments Working Group 02 April 2012 Unit B4, IT for Statistical Production Georges.
Slide 1 Eurostat Unit B3 – Statistical Information Technologies CoRD Meeting – 4 June 2007 Agenda Item 8 Preliminary ideas for a 2011 census hub Giuseppe.
Eurostat 6. SDMX: A non-technical overview of the SDMX architecture and IT tools 1 Raynald Palmieri Eurostat Unit B5: “Central data and metadata services”
Eurostat achievements and challenges Emanuele Baldacci, Director European Commission - Eurostat Director Methodology; Corporate statistical.
SDMX and Metadata SDMX Basics Course 12 April 2013 Daniel Suranyi Eurostat B5 Management of statistical data and metadata.
Statistical data editing - UNECE work session – OSLO September 2012 Proposal of a revised approach for data validation within the European Statistical.
IS4STAT Vittorio Viaggi Eurostat
1 The EDIT System, Overview European Commission – Eurostat.
SDMX IT Tools SDMX use in practice in NA
EDIT – Eurostat’s editing tool
Implementation of SDMX for Balance of Payments Balance of Payments Working Group 9-10 April 2013 BP Daniel Suranyi Eurostat B5 Management of statistical.
1 Item 2.1.b of the agenda IT Governance in the ESS and related issues Renewal of mandates STNE Adam WROŃSKI Eurostat, Unit B5.
Eurostat Report on SDMX Reference Infrastructure User Group 1 st meeting in Luxembourg Sept 2012 Item 5.2 of the agenda November 2012IT Director's.
15-16 December 2010 CGST Meeting 1 IT Developments TRIS 1 – TRIS 1 / TRIS 2 Item 7.1 on the agenda 1 TRIS = TRansport Information System.
Eurostat Sharing data validation services Item 5.1 of the agenda.
Implementation of SDMX for data and metadata exchange SDMX Basics Course October 2012 Daniel Suranyi Eurostat B5 Management of statistical data and.
Eurostat 6. SDMX: A non-technical overview of the SDMX architecture and IT tools 1 Raynald Palmieri Eurostat Unit B5: “Central data and metadata services”
SDMX Basics course, March 2016 Eurostat SDMX Basics course, March Introducing the Roadmap Marco Pellegrino Eurostat Unit B5: “Data and.
ΕΚΤ Access to Knowledge ΕΚΤ Access to Knowledge R&D Statistics Information System: An Interoperability Tail between CERIF and SDMX Dimitris Karaiskos Dimitrios.
UNECE-CES Work session on Statistical Data Editing
The evolution of the SDMX infrastructure and services
Training Course on EDIT 2013
The CVD Metadata Handler
SDMX Opportunities MED Meeting 14 May 2013 Daniel Suranyi Eurostat B5
EUROSTAT Unit B3 IT for statistical production Ewa Stacewicz
Workshop on the Validation of Waste Statistics
Progress report on the Single Entry Point Vincent TRONET Unit B3
Disseminating statistics: Internet and Publications course
Census Hub: Progress report
Eurostat – Units E2, B5 Cristina BLANARU
SDMX Implementation for PESTICIDES
Data Validation in the ESS Context
Draft EP/Council Regulation for processes, standards and
SISAI STATISTICAL INFORMATION SYSTEMS ARCHITECTURE AND INTEGRATION
Task Force on Annual Financial Accounts
Data Transmission Tools & Services EDAMIS, SDMX, Validation
ESS.VIP VALIDATION An ESS.VIP project for mutual benefits
Implementation of SDMX in the ESS
Giuliano Amerini Unit E6 (Transport)
SDMX as basis for water data reporting
ITDG meeting of of October 2011
Sharing data validation activities in the ESS.
Validation services developed in the ESS
ESS VIP ICT Project Task Force Meeting 5-6 March 2013.
SDMX : General introduction H. Linden, Eurostat, Unit B5
Unit D2: Regional Statistics New Eurostat rules Data Transmissions
Agricultural Data Collection System
Item of the Agenda Towards an integrated Eurostat metadata handler – Eurostat SDMX Registry services for Member States Francesco Rizzo Unit B3 13.
WG on Statistical Confidentiality (TRansport Information System)
ESS.VIP Validation Item 5.1
Legislative strategy for cross-cutting ESS legislation
Item 7.3 (b) SDMX for UOE data collection
EDAMIS - current status / further development
SDMX Implementation The National Accounts use case
Modernisation of Validation in the ESS Collaboration with countries
EDIT data validation system Ewa Stacewicz EUROSTAT VALIDATION TEAM
eDAMIS Validation Possibilities
Validation Activities in the ESS What you will hear today…
GESMES and SDMX-ML - Practical issues
Daniel Suranyi, Krassimir Ivanov
Presentation transcript:

IT Directors’ Group Meeting October 2010 Sharing data validation tools in the ESS Christine WIRTZ – Head of Unit B3 Georges PONGAS – Unit B3 Daniel SURANYI – Unit B5 Item 3.3.b of the agenda

19-20 October 2010IT Directors’ Group Meeting 2 Background ITDG 2009: –Eurostat presented new ESS vision with a specific view on IT architecture and IT tools Harmonising statistical production processes  welcomed, BUT –considered very ambitious –should be medium to longer term perspective Sharing of IT tools –Implicit and crucial aspect of future infrastructure of the ESS –Virtual sharing OR sharing of real software could be envisaged –Challenges: IT standards, interdependence of actors –Linked to SDMX framework  Data validation services  appropriate and logical step

19-20 October 2010IT Directors’ Group Meeting 3 Data validation in the ESS Data validation takes place: –In Member States – before transmission –In Eurostat – before further dissemination and processing Several steps in validation: –Format validation –Codes validation –Data validation 1st level: basic checks – existence of mandatory fields, range checks, consistency of info inside file 2nd level: consistency with historical data / data from other sources (other countries, other statistics) 3rd level: expert validation / in-depths analysis

19-20 October 2010IT Directors’ Group Meeting 4 Data validation tools developed by Eurostat eVE = eDamis Validation Engine –Allows for a final check before transmitting data to Eurostat –Covers format, codes and basic checks –For files in SDMX-ML format; linked to DSD EBB = Editing Building Block –Allows importing of external reference files –Can be configured for 2nd level validation –For files with an agreed format applied by all data senders (csv, flr,sdmx-ml, sdmx-edi) Different ways of coding validation rules Validation of confidential data currently limited

19-20 October 2010IT Directors’ Group Meeting 5 VIP “Data validation” VIP on efficiency gains in the validation process Initial focus on Agriculture Statistics (Animal Production and Farm Structure Survey 2010); Ultimate aim: improve efficiency in the production chain from MS to Eurostat through improvements in the validation process Looks at different approaches to achieve efficiency gains: –Implementing validation tools –Rebalancing validation tasks – ‘the sooner the better’ approach –Policy decisions and guidelines on the roles of different actors

19-20 October 2010IT Directors’ Group Meeting 6 EBB = Editing Building Block

19-20 October 2010IT Directors’ Group Meeting 7 EBB = Editing Building Block Main Functionalities: Acceptance of various file formats and number of variables (limited by the DBMS column number capacity) Validation programs are parametric Not only validation but also variable creation Possibility to manipulate incoming datasets Information is persistent (data+metadata) and reusable

19-20 October 2010IT Directors’ Group Meeting 8 Functionality in detail File management: Fixed length records Variable length records (delimited) Sdmx-ML Gesmes files Scripting and web services Web version (dec 2010) and stand alone version

19-20 October 2010IT Directors’ Group Meeting 9 Validation rules, Computations Rules are logical expressions followed by: The rule name The rule severity The rule warning message A possible modification or creation of data depending on the rule result. Rules can be horizontal or vertical (inter record) Special computations (outliers) Output statistics (summary) and details for errors (what error where in the dataset).

19-20 October 2010IT Directors’ Group Meeting 10 Dataset operations Copy file, select part of file Split file Aggregate Rename Merge Append Reorder lines or columns

19-20 October 2010IT Directors’ Group Meeting 11 The Architecture

19-20 October 2010IT Directors’ Group Meeting 12 Applied in the domains Foreign Trade Esspross AES, CVTS BOP EHIS Transport

19-20 October 2010IT Directors’ Group Meeting 13 eVE = eDAMIS Validation Engine

19-20 October 2010IT Directors’ Group Meeting 14 eDAMIS Validation Engine Validation at the Single Entry Point Based on SDMX No installation or configuration in Member States eDAMIS Web Forms: Real-Time Validation (in Production for some years) eDAMIS Web Portal: New Validation Engine (available since eDAMIS 3.0, July 2010) eDAMIS Web Application –Server side validation for all eWA versions –Local validation in eWA 3.1 (using rules from the server)

19-20 October 2010IT Directors’ Group Meeting 15 eVE – Data Validation Features Same Validation Rule Syntax as Web Forms Within one file and reference period Different rule sets per reference period possible Country specific rules Mandatory values, Range checks Basic expressions Validation of confidential datasets (Portal or eWA 3.1) Full automatic transmission and validation workflow

19-20 October 2010IT Directors’ Group Meeting 16 Workflow eDAMIS Validation Engine eWA eWP eDAMIS Server Validation SDMX Registry DSDs Eurostat Production Unit MS Database Web Service SDMX Report Browser SDMX Converter CSV Settings

19-20 October 2010IT Directors’ Group Meeting 17 Projects in Member States Fisheries Pilot (May 2010) –Workshops in Sweden, Latvia, UK, Romania –Remote Testing in Netherlands (CBS) –SDMX based collection starts in December 2010 Aviation Pilot (September 2010) –Workshop with Statistik Austria Results from both Pilot Projects –Implementation of SDMX is simpler than expected –Countries visited appreciated simple usage of eVE

19-20 October 2010IT Directors’ Group Meeting 18 Conclusion Tools have been developed that could be shared and tested For SDMX-ML data collections: eVE offers basic validation without further configuration EBB can be integrated without changing data transmission formats. It allows for more complex validation. More sophisticated validation requires further multi- disciplinary reflection.

19-20 October 2010IT Directors’ Group Meeting 19 Your feedback on: How to use these tools ESS-wide? Suggestions for directions of improvements