SDMX Global Conference, 28-30 September 2015: SDMX into the future. VTL (Validation and Transformation Language): a new technical standard for enhancing data validation and processing.


SDMX Global Conference, September 2015
SDMX into the future
VTL (Validation and Transformation Language)
A new technical standard for enhancing data validation and processing
Vincenzo Del Vecchio - Bank of Italy
Marco Pellegrino - Eurostat
SDMX TWG & VTL Task Force

Approach
SDMX originally focused on data collection and dissemination
Current trend: support more stages of the statistical production process (Generic Statistical Business Process Model)

What is VTL
A standard language for defining validation and transformation rules
– Validation (now)
– Transformation (partially now, to be enriched at a later stage)
Main goals:
– Define and preserve validation and transformation rules
– Exchange and share rules
– Apply rules in industrialized processes
– Apply to several standards (e.g. SDMX, DDI, GSIM) thanks to a generic information model

The VTL Information Model
VTL is a “stand-alone” specification
– It can be used with SDMX, DDI, GSIM or potentially anything else
– It can be used on its own
VTL has its own information model
– All kinds of data are modelled as mathematical functions having independent variables (Identifiers) and dependent variables (Measures and Attributes)
– The GSIM information model is used as a basis
– It can be mapped against SDMX
– It can be mapped against DDI
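The "data as a mathematical function" idea can be sketched in Python. This is only an illustration under stated assumptions, not part of the VTL specification: the `DataPoint` type, the `lookup` helper, the component names and all values are hypothetical.

```python
from typing import NamedTuple


class DataPoint(NamedTuple):
    """Dependent variables of one observation (hypothetical structure)."""
    measures: dict    # e.g. the observed value
    attributes: dict  # e.g. a qualifier of the observation


# A data set viewed as a function: Identifiers -> (Measures, Attributes).
# Identifiers here are (time, age, civil_status); all values are made up.
population = {
    ("2014", "Y15-64", "married"): DataPoint({"persons": 1200}, {"obs_status": "A"}),
    ("2014", "Y15-64", "single"): DataPoint({"persons": 800}, {"obs_status": "E"}),
}


def lookup(data_set, *identifiers):
    """Evaluate the function: one combination of identifier values gives one data point."""
    return data_set[identifiers]


point = lookup(population, "2014", "Y15-64", "single")
print(point.measures["persons"], point.attributes["obs_status"])
```

Because the identifiers form the dictionary key, each combination of identifier values maps to exactly one data point, which is what makes the data set a function in this sketch.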

Main VTL drivers (1)
Business orientation
– Designed for use by subject matter experts
Integrated approach
– Any kind of data
– Independent of the phase of the process
– A single language for validation and calculation

Main VTL drivers (2)
IT implementation independence
– Independent of IT tools
– Allowing multiple tools
– Resilient to tool changes
Active role for processing
– Formal (described by means of BNF)
– Able to drive the validation & calculation software
Extensible and customizable

VTL 1.0 Operators

VTL features (1)
Declarative language based on Expressions
D4 := Check( (D1 - D2) = D3 )
– D1, D2, D3: Operands
– D4: Result
– +, >: Operators
Operates on Data Sets (SDMX Dataflows)
D1, D2, D3, D4 are typically Data Sets, e.g.:
– D1: population at time T by age and civil status
– D2: population at time T-1 by age and civil status
– D3: population flows between T-1 and T by age and civil status
– D4: consistency of population figures (true / false), by age and civil status
… and on parts of Data Sets, e.g. Time Series, Cross Sections, single Data Points
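To make the data-set-level semantics concrete, the check can be mimicked in Python. This is a minimal sketch that assumes each data set is a dictionary keyed by (age, civil status); the helper names `minus` and `check_equal` and all figures are hypothetical, and this is not how a VTL engine is implemented.

```python
def minus(left, right):
    """Point-wise difference over the identifier combinations both operands share."""
    return {key: left[key] - right[key] for key in left.keys() & right.keys()}


def check_equal(left, right):
    """Boolean data set: True where the two operands hold the same value."""
    return {key: left[key] == right[key] for key in left.keys() & right.keys()}


# Keys are (age, civil_status); the numbers are made up for illustration.
D1 = {("Y15-64", "married"): 1210, ("Y15-64", "single"): 805}  # population at time T
D2 = {("Y15-64", "married"): 1200, ("Y15-64", "single"): 800}  # population at time T-1
D3 = {("Y15-64", "married"): 10, ("Y15-64", "single"): 7}      # flows between T-1 and T

# Python rendering of: D4 := Check( (D1 - D2) = D3 )
D4 = check_equal(minus(D1, D2), D3)
for key in sorted(D4):
    print(key, D4[key])
```

The result has the same identifiers as the operands and one true / false value per combination, matching the description of D4 as a consistency data set by age and civil status.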

VTL features (2)
Supports operations on many types of statistical data, e.g.:
– Dimensional and Unit data
– Survey and register data
– Quantitative and qualitative data
– …
… and can combine them, e.g.:
– D1: Securities Register (by security id)
– D2: Securities Holdings (by security holder, security id, date)
D3 := merge (D1, D2, on (D1#sec_id = D2#sec_id), return D2#sec_holder, D2#sec_id, D1#sec_type)
(produces D3 by adding to D2 the security type taken from D1)
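The effect of the merge can be approximated in Python as a join on the security id. This is a sketch under assumptions: the function name `merge_on_sec_id`, the `amount` measure and the sample rows are hypothetical, holdings without a matching register entry are dropped (an inner join, which the slide does not state), and the date identifier is kept in the key so that rows stay unique.

```python
def merge_on_sec_id(register, holdings):
    """Add the security type from the register to each matching holding."""
    merged = {}
    for (sec_holder, sec_id, date), measures in holdings.items():
        if sec_id not in register:
            continue  # assumption: unmatched holdings are left out of the result
        merged[(sec_holder, sec_id, date)] = {
            **measures,
            "sec_type": register[sec_id]["sec_type"],
        }
    return merged


# D1: Securities Register, identified by security id (values are made up).
D1 = {"S1": {"sec_type": "bond"}, "S2": {"sec_type": "share"}}

# D2: Securities Holdings, identified by (security holder, security id, date).
D2 = {
    ("H1", "S1", "2015-06-30"): {"amount": 100},
    ("H2", "S2", "2015-06-30"): {"amount": 50},
    ("H2", "S9", "2015-06-30"): {"amount": 75},  # S9 is not in the register
}

D3 = merge_on_sec_id(D1, D2)
for key in sorted(D3):
    print(key, D3[key])
```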

VTL features (3)
Can concatenate expressions
D4 := Check( (D1 - D2) = D3 )
D5 := if D4 = False then D2 else D1
(the result of the former is an operand of the latter)
Considers validation as a kind of Transformation (calculation), in order to
– Use a common language
– Use validations and calculations together, e.g.:
Validation: D4 := Check( (D1 - D2) = D3 )
Calculation: D5 := if D4 = False then D2 else D1
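The chaining of a validation into a calculation can be sketched in Python as follows. The dictionaries, the helper `if_then_else` and all figures are hypothetical; the sketch only illustrates that the boolean result D4 is consumed as an ordinary operand when D5 is computed.

```python
def if_then_else(condition, when_true, when_false):
    """Pick, for each identifier combination, a value from one of two data sets."""
    return {
        key: (when_true[key] if flag else when_false[key])
        for key, flag in condition.items()
    }


# Keys are (age, civil_status); the numbers are made up for illustration.
D1 = {("Y15-64", "married"): 1210, ("Y15-64", "single"): 805}  # population at time T
D2 = {("Y15-64", "married"): 1200, ("Y15-64", "single"): 800}  # population at time T-1
D3 = {("Y15-64", "married"): 10, ("Y15-64", "single"): 7}      # flows between T-1 and T

# Validation: D4 := Check( (D1 - D2) = D3 )
D4 = {key: (D1[key] - D2[key]) == D3[key] for key in D1}

# Calculation: D5 := if D4 = False then D2 else D1
check_failed = {key: ok is False for key, ok in D4.items()}
D5 = if_then_else(check_failed, D2, D1)
for key in sorted(D5):
    print(key, D5[key])
```

Where the check fails, D5 takes the value from D2; where it passes, D5 takes the value from D1.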

The Transformations graph
[Figure: a graph in which Data Sets (D i) are connected by Transformations (T j). It spans Collection activities n.1, n.2 and n.3, Analysis & research models, Statistical products and Publications. Legend: D i = Data Set i, T j = Transformation j.]
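One way to read such a graph is as a dependency graph that fixes a valid calculation order. The Python sketch below uses the standard-library `graphlib` module (Python 3.9 or later) on the two expressions from the VTL features (3) slide rather than on the figure itself, whose edges are not recoverable here; the `depends_on` mapping is an illustrative assumption.

```python
from graphlib import TopologicalSorter

# Each result data set maps to the operand data sets it is calculated from:
#   D4 := Check( (D1 - D2) = D3 )
#   D5 := if D4 = False then D2 else D1
depends_on = {
    "D4": {"D1", "D2", "D3"},
    "D5": {"D4", "D1", "D2"},
}

# static_order() yields every data set after all of its operands.
order = list(TopologicalSorter(depends_on).static_order())
print(order)
```

Any order produced this way lists the collected data sets D1, D2 and D3 before D4, and D4 before D5, so each transformation runs only once its inputs exist.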

VTL features (4)
VTL 1.0 allows:
– Persistent and temporary results
– Operations on mono and multi measure data
– Dealing with missing data
– Dealing with Attributes and their propagation rules
VTL 1.1 will introduce:
– Other operators, mainly for validation purposes
– Reusable rules
– Bug fixing, fine tuning

VTL status
VTL 1.0: published in March 2015
– VTL: part 1, part 2
– BNF (Extended Backus-Naur Form) technical notation
VTL 1.1 (language extensions): work in progress
SDMX implementation: work in progress
– Messages for exchanging VTL rules
– Registry for storing VTL rules
– Web services for retrieving VTL rules

Governance and Standards Alignment
VTL is maintained by the SDMX TWG through the VTL Task Force
– Extensions will be considered for inclusion in future versions
VTL has already produced some feedback to GSIM for next version
– VTL can be mapped against SDMX
– VTL can be directly utilized by DDI in those places where computations are included
– As GSIM processing Rules

SDMX into the future
Contribute to VTL 1.1!
Comments on VTL 1.0 and suggestions for improvement can be sent to the SDMX Technical Working Group
Thanks for your attention!