Validation at Insee.

Validation at Insee
1 – Validation prior to transmission to the EU
2 – French uses of Eurostat validation tools
3 – Additional support needs in the area of validation services
4 – Lessons learned from our efforts in modernising national validation systems

1 – Validation prior to transmission to the EU
- Local solutions, the same as for national dissemination: validation at each level of management
- A French specificity: studies
  - The data feed a specific publication
  - Figures are checked at each stage of the writing process
- Various tools
  - Excel / LibreOffice spreadsheets and macros
  - SAS programs
  - R programs
- Tools differ from one statistics producer to another
- No single, homogeneous, shared solution yet
  - Is one really possible for such a wide variety of statistical production?
  - Challenging: an upcoming project?

2 – Uses of Eurostat validation tools
- EDIT tool: standalone versions (on each PC)
  - Because of the nature of the data: confidential
  - For Business demography, IFATS, EGR, Tourism, EHISS (…)
  - Input in CSV
- STRUVAL and CONVAL
  - For SDMX files
  - Short-Term Statistics, NA, Asylum, Air
  - Encapsulated in eDAMIS

EDIT: standalone solution
- Installation on each PC (15 + 5)
- Upload of the data (CSV)
- Checking operations (called jobs)
- Download of the feedback
- Corrections and new EDIT checking operations until successful feedback is received

EDIT: feedback
- The feedback gives information on the kinds of errors and where they are in the file
  - "Not so easy to understand at the beginning"
- Difference between critical and non-critical errors → send an explanatory comment
- Limits of the standalone version: the work cannot be shared between several people, so the process cannot be optimised. For example, one person cannot upload all the data while another follows the checks.
  → The LC has no visibility on the checks
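The critical / non-critical distinction can be pictured with a minimal Python sketch. It assumes one possible reading of the slide: critical errors must be corrected, while non-critical ones may stay if an explanatory comment is sent. The `Finding` class and `ready_to_send` function are hypothetical names and are not part of the EDIT tool.

```python
from dataclasses import dataclass


@dataclass
class Finding:
    """One line of validation feedback (hypothetical structure, not EDIT's format)."""
    line: int            # where the problem is in the file
    rule: str            # identifier of the rule that fired
    critical: bool       # critical errors block the transmission
    comment: str = ""    # explanatory comment for a non-critical error


def ready_to_send(findings):
    """Assumed policy: no critical finding left, every non-critical one commented."""
    blocking = [f for f in findings if f.critical]
    uncommented = [f for f in findings if not f.critical and not f.comment.strip()]
    return not blocking and not uncommented
```

Under this assumed policy, a file with a single non-critical finding becomes sendable as soon as a comment is attached, which matches the correct-and-recheck loop described for the standalone version.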

STRUVAL and CONVAL
- Used through eDAMIS
- No specific installation needed
- Use of the V dataset (sandbox, not in production)
- Feedback (as with the EDIT tool) available in eDAMIS (in the "Received files" menu)
- Only the correct files are sent to production

STRUVAL and CONVAL
- STRUVAL: structural validation
- Implemented since the second half of 2016
- Checks:
  - Structure of the header → if it is not correct, the validation process stops
  - Right dimensions and attributes
  - Codes (whether or not they respect the codelists)
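The three structural checks can be illustrated with a small Python sketch. It is not STRUVAL's implementation: as a simplification, a list of column names stands in for the SDMX message header, and the dimensions, attributes and codelists below are illustrative values chosen for the example, not a real data structure definition.

```python
# Hypothetical structure definition: names and codes are illustrative only.
REQUIRED_DIMENSIONS = ["FREQ", "REF_AREA", "TIME_PERIOD"]
REQUIRED_ATTRIBUTES = ["OBS_STATUS"]
CODELISTS = {
    "FREQ": {"A", "Q", "M"},
    "REF_AREA": {"FR", "DE", "IT"},
    "OBS_STATUS": {"A", "E", "P", "M"},
}


def check_structure(header, rows):
    """Return a list of error messages; an empty list means the file passed."""
    errors = []
    # Step 1: the header. If a required column is missing, stop immediately,
    # mirroring the "validation process stops" behaviour described above.
    required = REQUIRED_DIMENSIONS + REQUIRED_ATTRIBUTES
    missing = [column for column in required if column not in header]
    if missing:
        return [f"header: missing columns {missing}"]
    # Step 2: every coded value must belong to its codelist.
    for line_no, row in enumerate(rows, start=1):
        record = dict(zip(header, row))
        for column, allowed in CODELISTS.items():
            if record[column] not in allowed:
                errors.append(
                    f"row {line_no}: {column}={record[column]!r} not in codelist"
                )
    return errors
```

Calling `check_structure` with a header that lacks `REF_AREA` returns a single header error and never reaches the codelist step, whereas a well-formed header with an unknown `FREQ` code produces one error per offending row.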

STRUVAL and CONVAL
- CONVAL: content validation
- Recently implemented for NA; a bit early to give an opinion
- Example of unsuccessful CONVAL feedback: incorrect combination of OBS_VALUE and OBS_STATUS (an inter-dimension / attribute check)
- For NA we passed both validation systems, but there were still errors :-)
- Upcoming?
  - Inter-series checks (Y / Y-1)
  - Thresholds for some values and variations
  - As in the EDIT tool? Possible with SDMX?
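The two kinds of content check mentioned here, an OBS_VALUE / OBS_STATUS combination check and a Y / Y-1 variation threshold, might look like the following Python sketch. The rule "a missing value must carry status M" and the 50 % threshold are assumptions made for illustration; they are not CONVAL's actual rules, and the function names are hypothetical.

```python
def check_value_status(obs_value, obs_status):
    """Inter-attribute check (assumed rule): missing value <=> status 'M'."""
    if obs_value is None and obs_status != "M":
        return "missing OBS_VALUE requires OBS_STATUS 'M'"
    if obs_value is not None and obs_status == "M":
        return "OBS_STATUS 'M' requires an empty OBS_VALUE"
    return None  # the combination is accepted


def check_yearly_variation(series, max_change=0.5):
    """Inter-series check (Y / Y-1): flag years whose relative change is too large.

    `series` maps a year (int) to a value; max_change=0.5 is an arbitrary threshold.
    Returns a list of (year, relative_change) pairs.
    """
    warnings = []
    for year in sorted(series):
        previous = series.get(year - 1)
        if previous in (None, 0):
            continue  # no usable reference value for the previous year
        change = abs(series[year] - previous) / abs(previous)
        if change > max_change:
            warnings.append((year, round(change, 3)))
    return warnings
```

For a series that goes from 110 to 220 between two consecutive years, `check_yearly_variation` flags the second year with a relative change of 1.0. Such a finding would more naturally be a warning calling for an explanation than a blocking error.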

3 – Additional support needed in the area of validation services
EDIT tool
- First of all, a question: what is the future of EDIT alongside STRUVAL and CONVAL?
  - An intermediate solution until SDMX becomes the format of all datasets?
  - Is EDIT still being extended to new domains?
- For now: essentially improvements to the existing tool
  - To reduce the entry cost
  - A more user-friendly GUI
  - A more visible support team
  - Guidelines and wiki need to be promoted
  - Webinars, maybe more often

3 – Additional support needed in the area of validation services
EDIT tool
- About validation rules
  - Share the rules more widely
  - Some rules are too strict; sometimes more flexibility is needed (but we understand this is difficult to apply)
- For confidential data: a secured application instead of the standalone version, which is difficult to maintain and time-consuming

3 – Additional support needed in the area of validation services
For STRUVAL / CONVAL:
- Encapsulating the validation tool in eDAMIS is a good idea
- Easy to use even with a basic knowledge of SDMX
- Warning: load on eDAMIS on peak days (29/09/17) ⇒ delays in feedback
- Some errors can still exist even though successful feedback has been received

4 – Lessons learned from our efforts in modernising national validation systems
- Reinforced our confidence in our national checking operations
- Pointed out missing controls
- Reduced the number of transmissions
- And the "ping-pong" exchanges with Eurostat

4 – Lessons learned from our efforts in modernising national validation systems
- Validation by the NSIs is a crucial need for Eurostat
  - To reduce the workload of its teams
  - To harmonise the data received (so that they can be used)
- But the final users of Eurostat are also the NSIs themselves
  - A virtuous circle
  - Useful for data sharing

4 – Lessons learned from our efforts in modernising national validation systems
- Eurostat's validation tools are already a business process, a level not yet reached in France
- We are currently redesigning our dissemination model
  → a centralised dissemination warehouse fed by various databases
  → validation process: a challenging issue
- The EU has raised awareness of it
- A bit early yet, but we will surely benefit from Eurostat's experience in implementing CONVAL and STRUVAL

Thank you for your attention! Any questions?