EDIT data validation system
Ewa Stacewicz, EUROSTAT VALIDATION TEAM


The first module of the training covers the functionality EDIT offers to its users.

Content of the presentation
A. Introduction
B. Objects in EDIT, main principles of EDIT
C. Ways of using EDIT
D. Using EDIT: integration with other tools
E. EDIT demo

In the introduction we describe how EDIT works and what can be done with it. We then look at how EDIT can be used together with other existing tools. Turning to what is inside EDIT, we first look at the main objects it handles: their definitions, their purpose, and how they link to each other. Since validation is EDIT's core activity, we then see how EDIT performs it and how its facilities can be used to validate data and obtain validation reports. Finally, we give some hints about the future developments foreseen for EDIT.

EDIT Introduction
- Data validation and editing tool provided by EUROSTAT, used in several ESS.VIP projects
- Allows users to:
  - import data
  - run validation programs
  - browse or export validation results
- Friendly web-based user interface
- Available as a standalone application: it can be installed locally on the user's PC and used to validate confidential data

EDIT implementations
FSS (SO & NSNE), R & D, ENERGY, ICT, BOP-IIP
New in 2017
FSS (SO & NSNE), R & D, ENERGY, ICT, BOP-IIP, UNIDEMO (ACQ and IMM)
Updated in 2017
FSS microdata, ASYLUM, RESPER, AES, CVTS, COD, EGR, TOURISM, PRODCOM, SBS, BOP STATISTICS
EUROSTAT: 27 statistical domains use EDIT
Member States and other institutions: 13 statistical domains use EDIT

EDIT principles
- User, programmer, administrator roles
- Basic EDIT use
- Accepted data formats

User role in EDIT
- Upload a dataset according to a Format
- Create a validation Job
- Browse the Error Report or export it
[Diagram: VALIDATION FLOW (1-click), with the elements data file, EDIT dataset, validation Program, validation Job]

Programmer role
The programmer manages the metadata that the user needs to execute programs:
- Implements Formats (DSD)
- Develops Programs for dataset validation, containing validation rules and dataset operations, and prepares Lookups for code list checks
- Sets up the unattended mode configuration

Dataset Instance (Dataset): a collection of data rows that follows the structure defined in a Format. It is a two-dimensional table of rows and columns; the columns correspond to the fields defined in the Format.

EDIT can import DSDs and code lists:
- EDIT acts as a client of the SDMX Registry Web Services to fetch DSD files and code list data.
- The DSD file is broken down into EDIT components.
- Key families are translated to EDIT Formats.
- Code lists are translated to EDIT Lookups.
- An EDIT Program is created that performs lookup validations and basic checks on the dimension fields.
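The lookup validation described on this slide can be illustrated with a minimal Python sketch. This is not the EDIT Scripting Language: the names `Format` and `check_lookups`, the field names and the code lists below are assumptions made purely for illustration.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Format:
    """Hypothetical stand-in for an EDIT Format: an ordered list of field names."""
    fields: list[str]
    # Maps a field name to its Lookup, i.e. the set of codes allowed for it.
    lookups: dict[str, set[str]] = field(default_factory=dict)


def check_lookups(fmt: Format, rows: list[list[str]]) -> list[tuple[int, str, str]]:
    """Return (row_number, field_name, message) for every failed check."""
    errors = []
    for row_number, row in enumerate(rows, start=1):
        # Basic check: each row must carry one value per field of the Format.
        if len(row) != len(fmt.fields):
            errors.append((row_number, "", f"expected {len(fmt.fields)} fields, got {len(row)}"))
            continue
        # Lookup check: coded fields must contain a code from their Lookup.
        for name, value in zip(fmt.fields, row):
            allowed = fmt.lookups.get(name)
            if allowed is not None and value not in allowed:
                errors.append((row_number, name, f"code {value!r} not in lookup"))
    return errors


# Illustrative data only; these code lists are not taken from a real DSD.
fmt = Format(
    fields=["FREQ", "COUNTRY", "VALUE"],
    lookups={"FREQ": {"A", "Q", "M"}, "COUNTRY": {"FR", "LT"}},
)
rows = [["A", "FR", "5824"], ["A", "XX", "5930"], ["Q", "LT"]]
errors = check_lookups(fmt, rows)
for error in errors:
    print(error)
```

In this sketch the second row fails the lookup check on `COUNTRY` and the third row fails the basic field-count check, while fields without a Lookup (here `VALUE`) are not checked.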

Programmer role: Program development
- Custom EDIT Scripting Language designed for data validation
- Programs contain validation rules and dataset operations
- Coming in 2018: VTL translator integrated with EDIT
[Diagram labels: VALIDATION RULES IN VTL, EDIT VALIDATION PROGRAM]

Administrator role
The administrator manages domains, users and permissions.
Domain: a self-contained grouping of EDIT elements available to a group of users.
- A Domain contains Formats, Datasets and Programs
- Objects in different Domains cannot interact with each other
- Users have access to all Datasets and Job results within a Domain
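The domain rules on this slide can be sketched as a small Python model. The class and function names are hypothetical, and the assumption that a user may hold access to more than one domain is ours, not something the slides state.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class EditObject:
    """Hypothetical EDIT element (Format, Dataset or Program) owned by one domain."""
    kind: str
    name: str
    domain: str


def can_interact(a: EditObject, b: EditObject) -> bool:
    """Objects may interact only when they belong to the same domain."""
    return a.domain == b.domain


def visible_datasets(user_domains: set[str], objects: list[EditObject]) -> list[str]:
    """A user sees every Dataset of the domains they have access to."""
    return [o.name for o in objects if o.kind == "Dataset" and o.domain in user_domains]


# Illustrative objects; the names are invented for this example.
objects = [
    EditObject("Dataset", "sbs_2008_lt", "SBS"),
    EditObject("Program", "sbs_checks", "SBS"),
    EditObject("Dataset", "tourism_2008_lt", "TOURISM"),
]
print(can_interact(objects[0], objects[1]))
print(can_interact(objects[1], objects[2]))
print(visible_datasets({"SBS"}, objects))
```

Here the SBS Program can run against the SBS Dataset but not against the TOURISM one, and a user with access to the SBS domain only sees the SBS Dataset.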

Accepted data formats

GESMES (BOP ITS, BOP FDI), multi-year 2007, 2008, 2009 observations:

    UNA:+.? '
    UNB+UNOC:3+FR2+4D0+100929:1637+IREF000243++GESMES/TS'
    UNH+MREF000001+GESMES:2:1:E6'
    BGM+74'
    NAD+Z02+ECB'
    NAD+MR+4D0'
    NAD+MS+FR2'
    IDE+10+EUROSTAT_BOP_01 reporting'
    DSI+BOP_FDI_A'
    STS+3+7'
    DTM+242:201009291637:203'
    DTM+Z02:20072009:702'
    IDE+5+EUROSTAT_BOP_01'
    GIS+AR3'
    GIS+1:::-'
    ARR++A:FR:N:2:330:N:4A:E:9999:9999:20072009:702:0:A:F+0:A:F+0:A:F'
    ARR++A:FR:N:2:330:N:4F:E:9999:9999:20072009:702:0:A:F+0:A:F+0:A:F'
    ARR++A:FR:N:2:330:N:7Z:E:9999:9999:20072009:702:0:A:F+0:A:F+0:A:F'
    ARR++A:FR:N:2:330:N:A1:E:1100:9999:20072009:702:5824:A:F+5930:A:F+4204:A:F'
    ARR++A:FR:N:2:330:N:A1:E:1495:9999:20072009:702:5828:A:F+5932:A:F+4206:A:F'

CSV, with or without header (SBS, CVTS, TOURISM):

    9H; 2008; LT; 2; B-N_X_K642; 11930; 16236; ; ; ; ; UNIT; ; ; ; ; ; TT0; ; ; ; ; D08
    9H; 2008; LT; 3; B-N_X_K642; 11930; 1001; ; ; ; ; UNIT; ; ; ; ; ; TT; ; ; ; ; D08
    9H; 2008; LT; 4; B-N_X_K642; 11930; 529; ; ; ; ; UNIT; ; ; ; ; ; TT; ; ; ; ; D08
    9H; 2008; LT; 30; B-N_X_K642; 11930; 17766; ; ; ; ; UNIT; ; ; ; ; ; TT; ; ; ; ; D08
    9H; 2008; LT; 2; B-E; 11930; 1138; ; ; ; ; UNIT; ; ; ; ; ; TT; ; ; ; ; D08
    9H; 2008; LT; 3; B-E; 11930; 104; ; ; ; ; UNIT; ; ; ; ; ; TT; ; ; ; ; D08
    9H; 2008; LT; 4; B-E; 11930; 61; ; ; ; ; UNIT; ; ; ; ; ; TT; ; ; ; ; D08

FLR:

    2010010011 010252000405595911005909580E 01ZZZZZ 2691.966 2734482.0 0.0
    2010010011 010252000405595911004009600E 01ZZZZZ 237.543 341202.0 0.0
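As an illustration of the CSV variant, the following Python sketch reads two of the sample rows above with the standard `csv` module. It says nothing about how EDIT itself imports files; the function name `read_rows` and its `has_header` parameter are assumptions, and the fields are addressed by position because the slide does not name the columns.

```python
from __future__ import annotations

import csv
import io

# Two rows copied from the CSV sample on the slide (semicolon-separated, no header).
SAMPLE = """\
9H; 2008; LT; 2; B-N_X_K642; 11930; 16236; ; ; ; ; UNIT; ; ; ; ; ; TT0; ; ; ; ; D08
9H; 2008; LT; 3; B-N_X_K642; 11930; 1001; ; ; ; ; UNIT; ; ; ; ; ; TT; ; ; ; ; D08
"""


def read_rows(text: str, has_header: bool = False) -> list[list[str]]:
    """Parse semicolon-separated text; strip padding and optionally skip a header row."""
    reader = csv.reader(io.StringIO(text), delimiter=";")
    rows = [[value.strip() for value in row] for row in reader if row]
    return rows[1:] if has_header else rows


rows = read_rows(SAMPLE)
print(len(rows), "rows,", len(rows[0]), "fields per row")
print(rows[0][:7])
```

Empty fields are kept as empty strings, so every row keeps the same number of columns and can be checked against a format definition by position.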

Ways of using EDIT
- Standalone: running on a PC
- Client-server: running in a Data Centre
- As a web service: called by other applications (CONVAL)
[Diagram labels: INPUT FILE, VAL PROGRAM_ID, VAL REPORT]
Status: available for internal use and for testing with some countries

You can use EDIT over the Internet by authenticating with an ECAS username and password. EDIT can also be downloaded and used on a computer that is not connected to the Internet. This suits confidential data and has the advantage that local resources can be managed in a configured way; for example, you can rerun a validation program more quickly.

EDIT variants for external users
- EDIT Public Web server: ECAS authentication; non-confidential data
- EDIT standalone version for local installation: local authentication; all components run locally (Tomcat server and PostgreSQL DB); confidentiality is preserved
- Users shown on the slide: user from MS, user from ESTAT
- In preparation: remote EDIT validation service for confidential data

EDIT variants for ESTAT users
- EDIT ESTAT server in the secure environment: user from ESTAT; confidential data
- EDIT ESTAT server in the standard environment: user from ESTAT; non-confidential data
- Unattended mode
- EDAMIS back channel
- Integration with other systems in Eurostat

EDAMIS Integration
- EDAMIS can send data to EDIT by placing the files in a configurable location.
- EDIT detects metadata based on the EDAMIS naming convention.
- EDIT performs the processing in unattended mode.
- EDIT acts as a client of the EDAMIS Feedback Channel Web Service to publish the results of a job execution.
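The unattended flow above (pick up files from a configured location, derive metadata from the file name, validate, publish the result) could be sketched in Python as follows. Everything here is an assumption for illustration: the file name pattern `DOMAIN_COUNTRY_YEAR.csv` is invented and is not the real EDAMIS naming convention, and `validate` and `publish` are placeholder callables standing in for the EDIT job and the Feedback Channel Web Service.

```python
from __future__ import annotations

import re
import tempfile
from pathlib import Path
from typing import Callable

# HYPOTHETICAL pattern (DOMAIN_COUNTRY_YEAR.csv); not the real EDAMIS convention.
FILENAME_PATTERN = re.compile(r"^(?P<domain>[A-Z]+)_(?P<country>[A-Z]{2})_(?P<year>\d{4})\.csv$")


def detect_metadata(filename: str) -> dict[str, str] | None:
    """Derive metadata from the file name, or None if it does not match."""
    match = FILENAME_PATTERN.match(filename)
    return match.groupdict() if match else None


def process_inbox(
    inbox: Path,
    validate: Callable[[Path, dict[str, str]], str],
    publish: Callable[[str, str], None],
) -> int:
    """Validate every recognised file in the inbox and publish its result."""
    processed = 0
    for path in sorted(inbox.iterdir()):
        if not path.is_file():
            continue
        metadata = detect_metadata(path.name)
        if metadata is None:
            continue  # name does not follow the (assumed) convention; skip it
        publish(path.name, validate(path, metadata))
        processed += 1
    return processed


# Demo with a temporary directory standing in for the configurable location.
published: list[tuple[str, str]] = []
with tempfile.TemporaryDirectory() as tmp:
    inbox = Path(tmp)
    (inbox / "SBS_LT_2008.csv").write_text("9H; 2008; LT\n")
    (inbox / "notes.txt").write_text("ignored\n")
    count = process_inbox(
        inbox,
        validate=lambda path, meta: f"validated {meta['domain']} {meta['country']} {meta['year']}",
        publish=lambda name, result: published.append((name, result)),
    )
print(count, published)
```

Passing `validate` and `publish` in as callables keeps the file-handling logic independent of how validation and feedback are actually implemented.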

EDIT & EDAMIS integration: feedback modalities
Manual feedback
[Diagram labels: Valid Data, EDIT, Dom. Manager, Error Report]
Examples: SBS, ENERGY, FSS, COD

EDIT & EDAMIS integration: feedback modalities
Automatic feedback
[Diagram labels: Valid Data, EDIT, Error Report]
Examples: ASYLUM, Tourism, EGR, RD & GBAORD, ICT

Integration of pre-validation in EDAMIS
- Purpose: improve the user experience and the quality of the data sent
- Status: part of EDAMIS 4, available Q2 2018

EDAMIS notification

Validation report
It contains:
- Job results: information about the job plus an overview of the validation results
- Error statistics: summary of the errors
- Error report: detailed list of errors
- Acceptance/rejection algorithm implemented in the program
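How the parts of such a report relate to each other can be sketched in Python: the error statistics are an aggregation of the detailed error list, and an acceptance rule turns them into an accept or reject decision. The record layout, the severity levels and the threshold rule below are assumptions; the slides do not describe the actual algorithm, which is defined per validation program.

```python
from __future__ import annotations

from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class ValidationError:
    """One line of the detailed error report (hypothetical layout)."""
    row: int
    rule: str
    severity: str  # "error" or "warning" (assumed severity levels)


def error_statistics(errors: list[ValidationError]) -> Counter:
    """Summarise the detailed report as counts per (rule, severity)."""
    return Counter((e.rule, e.severity) for e in errors)


def accept(errors: list[ValidationError], total_rows: int, max_error_rate: float = 0.0) -> bool:
    """Assumed rule: accept if the share of rows with an 'error' stays within the threshold."""
    if total_rows == 0:
        return False
    bad_rows = {e.row for e in errors if e.severity == "error"}
    return len(bad_rows) / total_rows <= max_error_rate


# Illustrative error list for a dataset of 10 rows.
errors = [
    ValidationError(2, "LOOKUP_COUNTRY", "error"),
    ValidationError(2, "RANGE_VALUE", "warning"),
    ValidationError(5, "LOOKUP_COUNTRY", "error"),
]
stats = error_statistics(errors)
print(dict(stats))
print("accepted (strict):", accept(errors, total_rows=10))
print("accepted (25% tolerance):", accept(errors, total_rows=10, max_error_rate=0.25))
```

With two of ten rows in error, the strict rule rejects the dataset while a 25% tolerance accepts it; warnings do not count towards rejection in this sketch.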

Errors statistics

Detailed error report
Switch to EDIT and show the result of the job launched earlier.

Detailed statistics report

More info on EDIT
- Take part in one of the next EDIT webinars: registration via ESTAT-VALIDATION@ec.europa.eu
- For upcoming webinars, check the section Editing & Validation Events on: https://webgate.ec.europa.eu/fpfis/mwikis/ESSValidServ

Thank you for your attention! Any questions?