SDMX IT Tools Introduction

Slides:



Advertisements
Similar presentations
Introduction to SDMX Seminar Eurostat/ECLAC 02 October 2012 August Götzfried Head of Unit, Eurostat B5 Management of statistical data and metadata.
Advertisements

United Nations Economic Commission for Europe Statistical Division Towards a Generic Statistical Business Process Model Steven Vale, UNECE.
United Nations Economic Commission for Europe Statistical Division Exploring the relationship between DDI, SDMX and the Generic Statistical Business Process.
United Nations Economic Commission for Europe Statistical Division Towards a Generic Statistical Business Process Model Steven Vale, UNECE.
United Nations Economic Commission for Europe Statistical Division Standards-based Modernisation An update on the work of the High-level Group for the.
Background Data validation, a critical issue for the E.S.S.
Background Defining and mapping business processes in statistical organisations started at least 10 years ago –“Statistical value chain” –“Survey life-cycle”
WP.5 - DDI-SDMX Integration
WP.5 - DDI-SDMX Integration E.S.S. cross-cutting project on Information Models and Standards Marco Pellegrino, Denis Grofils Eurostat METIS Work Session6-8.
Case Studies: Statistics Canada (WP 11) Alice Born Statistics UNECE Workshop on Statistical Metadata.
Generic Statistical Information Model (GSIM) Thérèse Lalor and Steven Vale United Nations Economic Commission for Europe (UNECE)
Overview of SDMX: Statistical Data and Metadata eXchange Technical and Content Standards for Statistical Data Ann McPhail, Division Chief Statistics Department,
SDMX and DDI Working Together Technical Workshop 5-7 June 2013
Background to the Generic Statistical Information Model (GSIM) Briefing Pack December
Eurostat – Directorate B: Corporate statistical and IT services SDMX Basics Training – 2013 SDMX basics Marco Pellegrino Eurostat, Directorate B.
B A C K G R O U N D B R I E F I N G A N D N E X T S T E P S METIS Geneva, February 2004 Statistical Data and Metadata Exchange Initiative.
 Metadata Technology North America  200 Prosperity Dr., Knoxville, TN USA  +1 (877) DDI – SDMX   Metadata Standards and Official.
13-Jul-07 Implementation of SDMX for data and metadata exchange Balance of Payments Working Group 2-3 April 2012 Daniel Suranyi Eurostat B5 Management.
Promoting the use of SDMX WPTGS November Presentation contents: 1. What is SDMX? 2. SDMX: NSI Perspective 3. OECD SDMX work 4. How SDMX is used.
Eurostat achievements and challenges Emanuele Baldacci, Director European Commission - Eurostat Director Methodology; Corporate statistical.
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
Eurostat 4. SDMX: Main objects for data exchange 1 Raynald Palmieri Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, October.
2.An overview of SDMX (What is SDMX? Part I) 1 Edward Cook Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, October 2015.
Eurostat Standardisation within the ESS: SDMX present and future Luxembourg, October 2015 Marco Pellegrino Eurostat, Statistical Office of the European.
Modernization of official statistics Eric Hermouet Statistics Division, ESCAP
An overview of SDMX November Presentation contents: 1. What is SDMX? 2. SDMX: NSI Perspective 3. OECD SDMX work 4. How SDMX is used in sharing and.
Aim: “to support the enhancement and implementation of the standards needed for the modernisation of statistical production and services”
SDMX IT Tools SDMX use in practice in NA
GSBPM and GAMSO Steven Vale UNECE
Eurostat November 2015 Eurostat Unit B3 – IT and standards for data and metadata exchange Jean-Francois LEBLANC Christian SEBASTIAN SDMX IT Tools SDMX.
Eurostat 1.SDMX: Background and purpose 1 Edward Cook Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, October 2015.
GSIM, DDI & Standards- based Modernisation of Official Statistics Workshop – DDI Lifecycle: Looking Forward October 2012.
United Nations Economic Commission for Europe Statistical Division Standards-based Modernization of Official Statistics Steven Vale UNECE
OECD Expert Group on Statistical Data and Metadata Exchange (Geneva, May 2007) Update on technical standards, guidelines and tools Metadata Common.
United Nations Economic Commission for Europe Statistical Division Enhanced Generic Models to Support the Standardisation of Statistical Production Steven.
Statistical Data and Metadata Exchange SDMX Metadata Common Vocabulary Status of project and issues ( ) Marco Pellegrino Eurostat
Eurostat November 2015 Eurostat Unit B3 – IT and standards for data and metadata exchange Jean-Francois LEBLANC Christian SEBASTIAN SDMX IT Tools Common.
1 Joint UNECE/EUROSTAT/OECD METIS Work Session (Geneva, March 2010) The On-Going Review of the SDMX Technical Specifications Marco Pellegrino, Håkan.
SDMX Basics course, March 2016 Eurostat SDMX Basics course, March Introducing the Roadmap Marco Pellegrino Eurostat Unit B5: “Data and.
Statistical process model Workshop in Ukraine October 2015 Karin Blix Quality coordinator
1 High Level Seminar for Eastern Europe, Caucasus and Central Asia Countries (EECCA). Quality in Statistics: Metadata Tbilisi, Georgia, June 2012.
United Nations Economic Commission for Europe Statistical Division GSBPM in Documentation, Metadata and Quality Management Steven Vale UNECE
United Nations Economic Commission for Europe Statistical Division Standards-based Modernisation Steven Vale UNECE
IAEA International Atomic Energy Agency Implementing SDMX for Energy Domain: From Discussion to Actual Implementation and Testing Andrii Gritsevskyi Oslo.
Contents Introducing the GSBPM Links to other standards
Metadata Standards for Statistical Classifications
Generic Statistical Business Process Model GSBPM
Interoperable data formats: SDMX
SDMX Opportunities MED Meeting 14 May 2013 Daniel Suranyi Eurostat B5
The Re3gistry software and the INSPIRE Registry
SDMX: A brief introduction
11. The future of SDMX Introducing the SDMX Roadmap 2020
Metadata in the modernization of statistical production at Statistics Canada Carmen Greenough June 2, 2014.
2. An overview of SDMX (What is SDMX? Part I)
Applying the Generic Statistical Business Process Model to Business Register Maintenance Steven Vale UNECE
2. An overview of SDMX (What is SDMX? Part I)
Statistical Information Technology
SDMX as basis for water data reporting
August Götzfried Eurostat unit B 4
The Generic Statistical Business Process Model
Introducing the GSBPM Steven Vale UNECE
SDMX : General introduction H. Linden, Eurostat, Unit B5
Contents Introducing the GSBPM Links to other standards
Item 7.3 (b) SDMX for UOE data collection
Item 7.11 SDMX Progress report
1. SDMX: Background and purpose
Generic Statistical Information Model (GSIM)
SDMX IT Tools SDMX Registry
Palestinian Central Bureau of Statistics
Presentation transcript:

SDMX IT Tools Introduction Jean-Francois LEBLANC Christian SEBASTIAN November 2015 Eurostat Unit B3 – IT and standards for data and metadata exchange

Table of contents Where are we? Standardization Why do we need a model? GSBPM Generic Statistical Business Process Model Phases Key features Other uses Standards – Relations GSIM Generic Statistical Information Model

Table of contents SDMX & DDI SDMX Summary Why? Benefits Costs Opportunities Impacts From 1.0 to 2.1 The SDMX components SDMX in practice Summary

Where are we? Dramatic changes in the environment of official statistics producers (e.g. data deluge) Modernization of statistical information system seen as a question of survival for the sector of official statistics Standardization viewed as a key enabler for modernization "Standards-based” industrialization of statistical production

2. Standardization Why is it necessary? What does it imply? Harmonization Reusability and interoperability Shared solutions across statistical institutes What does it imply? Common processes Common tools Common methodologies Why is it necessary? Harmonization of statistical data (leading to more comparable data) Reusable, interoperable data for users Economies of scale across statistical institutes internationally - shared solutions Good vendor support for the industry

2. Standardization Industry Standards Other major standards GSBPM - Generic Statistical Business Process Model GSIM - Generic Statistical Information Model SDMX - Statistical Data and Metadata eXchange DDI - Data Documentation Initiative Other major standards RDF - Resource Description Framework LOD - Linked Open Data JSON - JavaScript Object Notation XBRL - eXtensible Business Reporting Language GSBPM GSIM SDMX DDI SDMX Preferred standard for exchange and sharing of data and metadata in the global statistical community (UNSC, 2008) – Widely used in the ESS for aggregated data DDI Standard for the documentation of data, initially focused on archiving micro-data in the area of social sciences – widely used in national data archives – extended to support the full life-cycle of data RDF W3C standard for web-based discovery, dissemination, and linking – an alternative to XML Allows for after-the-fact linking of any types of resources on the Web, e.g. linking press releases and speeches with relevant data from a statistical organization Powerful querying language for distributed searches on the web Very popular with “open data” and “open government” initiatives JSON Web-developer-friendly alternative to XML. JSON version of SDMX XBRL Standard for reporting accounting information and banking supervision data XML-based standard No formal model Communities standardize the taxonomies to support their needs Mapping to other models requires an understanding of the implied model of the community Good tools are required to hide this complexity

3. Why do we need a model? To define and describe statistical processes in a coherent way To standardize process terminology To compare and benchmark processes within and between organisations To identify synergies between processes To inform decisions on systems architectures and organisation of resources

4. GSBPM Generic Statistical Business Process Model Applicable to all activities undertaken by producers of official statistics -> data outputs Used by National and international statistical organisations Independent of data source, can be used for: Surveys / censuses Administrative sources / register-based statistics Mixed sources

4.1 GSBPM - Phases

Not a linear model 4.2 GSBPM – Key features Sub-processes do not have to be followed in a strict order It is a matrix with many possible paths, including iterative loops within and between phases Some iterations of a regular process may skip certain sub-processes

4.3 GSBPM – Other uses Harmonizing statistical computing systems Facilitating sharing of statistical software Framework for process quality management Structure for storage of documents Measuring operational costs

5. Standards - Relations Statistical Information concepts concepts Statistics production GSBPM GSIM Technology Methods Conceptual Practical Statistical how-to Production how-to SDMX, DDI, RDF, ISO-11179, …

6. GSIM Generic Statistical Information Model Other standards DDI SDMX Implementation standards Conceptual model

7. SDMX & DDI DDI offers a very rich model for the documentation of micro-data SDMX offers a very integrated exchange platform for statistical outputs (IT architectures, tools, web services) integration of the complete production process The combined use of both standards could allow a higher level of

8. SDMX Statistical Data and Metadata eXchange UNSD World Bank

SDMX is the global answer to this. 8.1 SDMX – Why? The exchange of statistical data and metadata is complex, resource intensive and expensive In the past, national and international organisations had developed specific approaches and solutions Opportunities and challenges related to new technologies for machine to machine exchange were coming up, e.g. XML, web services. SDMX is the global answer to this.

8.2 SDMX - Benefits Efficiency Reduced burden after low investment Consistent and comparable data and metadata messages produced by different organizations Harmonized statistical processes, offering new ways of data and metadata exchange (such as data hubs) Web-based dissemination formats are provided that are computer “readable” and easier to update.

8.3 SDMX - Costs Development/maintenance of the SDMX standards and guidelines done by the international sponsoring institutions (supported by NSIs) Standards are public and open source IT tools are created by sponsoring or other organizations and made freely available Capacity building by individual sponsoring institutions User community input by means of open process Low investment cost – gradual implementation

8.4 SDMX - Opportunities Streamline data flows Across domains Central management (SDMX Registry) Across domains Across organizations Simplification Standardization Harmonization Software tools Data sharing Data structures Concepts Code lists

8.5 SDMX - Impacts Reduced reporting burden via common formats adopted by international organizations for data and metadata exchange User-friendly access when publishing national data and metadata on the web via global standards for data formats, catalogs/registries and associated services Improved management and analysis of data via global guidelines for metadata vocabularies and repositories in common formats Replicable models and tools for statistical information systems at national levels

SDMX recognised and supported as the preferred standard 8.6 SDMX – From 1.0 to 2.1 Version 2.0 SDMX-EDI SDMX-ML SDMX Registry 2008 SDMX accepted at UN level Version 1.0 GESMES/TS SDMX recognised and supported as the preferred standard Version 1.0 Version 2.0 Version 2.1 September 2004 November 2005 February 2008 April2011

8.7 The SDMX Components Describe statistics in a standard way Objects and their relationships Data Structure Definition (DSD), Concepts, Code List Central management and standard access SDMX Registry, SDMX Web Services Cross Domain Concepts Cross Domain Code Lists Statistical Domains Metadata Common Vocabulary Push Provider generates and sends file to receiver Pull Provider opens web service to data Receiver downloads regularly Hub Special case of pull: receiver downloads on end user request 22 22

9. Summary To enable a modernized statistical production, standards are the key Standards at different levels are being used in an increasingly coherent way GSBPM and GSIM provide conceptual models and facilitate communication SDMX, DDI and other standards provide implementation models which can be used in a coordinated way There are now more technologies than just GESMES and XML: a coherent overall model is critical

Introduction