SDMX in the Vietnam Ministry of Planning and Investment - A Data Model to Manage Metadata and Data ETV2 Component 5 – Facilitating better decision-making.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Presented to: By: Date: Federal Aviation Administration Registry/Repository in a SOA Environment SOA Brown Bag #5 SWIM Team March 9, 2011.
Federal Department of Home Affairs FDHA Federal Statistical Office FSO Meeting of the OECD Expert Group on SDMX September, OECD, Paris Centralized.
SDMX training session on basic principles, data structure definitions and data file implementation 29 November
Dr Gordon Russell, Napier University Unit Data Dictionary 1 Data Dictionary Unit 5.3.
Managing Data Resources
©Ian Sommerville 2006Software Engineering, 8th edition. Chapter 8 Slide 1 System models.
DDI 3.0 Conceptual Model Chris Nelson. Why Have a Model Non syntactic representation of the business domain Useful for identifying common constructs –Identification,
Ch1: File Systems and Databases Hachim Haddouti
©Silberschatz, Korth and Sudarshan1.1Database System Concepts Chapter 1: Introduction Purpose of Database Systems View of Data Data Models Data Definition.
Modified from Sommerville’s originalsSoftware Engineering, 7th edition. Chapter 8 Slide 1 System models.
Reducing Metadata Objects Dan Gillman November 14, 2014.
Supplement 02CASE Tools1 Supplement 02 - Case Tools And Franchise Colleges By MANSHA NAWAZ.
Secure Systems Research Group - FAU Web Services Standards Presented by Keiko Hashizume.
10 December, 2013 Katrin Heinze, Bundesbank CEN/WS XBRL CWA1: DPM Meta model CWA1Page 1.
Neuchâtel Terminology Model: Classification database object types and their attributes Revision 2013 and its relation to GSIM Prepared by Debra Mair, Tim.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
The Status of Electronic Archives in Norway Riga, 22. march 2007.
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 18 Slide 1 Software Reuse.
WP.5 - DDI-SDMX Integration
Data, Metadata and Quality Management Framework ( Quality and Information Management Framework at the Vietnamese Ministry of Planning and Investment )
WP.5 - DDI-SDMX Integration E.S.S. cross-cutting project on Information Models and Standards Marco Pellegrino, Denis Grofils Eurostat METIS Work Session6-8.
Survey Data Management and Combined use of DDI and SDMX DDI and SDMX use case Labor Force Statistics.
©Ian Sommerville 2000 Software Engineering, 6th edition. Chapter 7 Slide 1 System models l Abstract descriptions of systems whose requirements are being.
Chapter 4 System Models A description of the various models that can be used to specify software systems.
System models Abstract descriptions of systems whose requirements are being analysed Abstract descriptions of systems whose requirements are being analysed.
Metadata management and statistical business process at Statistics Estonia Work Session on Statistical Metadata (Geneva, Switzerland 8-10 May 2013) Kaja.
By: Md Rezaul Huda Reza 5Ps for SE Process Project Product People Problem.
Report on UNSD activities since the last meeting of the Expert Group on International Economic and Social Classifications Meeting of the Expert Group on.
METADATA HARMONISATION SDMX Training BANK INDONESIA SEPTEMBER 2015 YOGYAKARTA, INDONESIA.
CHRIS NELSON METADATA TECHNOLOGY WORK SESSION ON STATISTICAL METADATA GENEVA 6-8 MAY 2013 Designing a Metadata Repository Metadata Technology Ltd.
CountryData Technologies for Data Exchange SDMX Information Model: An Introduction.
SDMX Standards Relationships to ISO/IEC 11179/CMR Arofan Gregory Chris Nelson Joint UNECE/Eurostat/OECD workshop on statistical metadata (METIS): Geneva.
Metadata Models in Survey Computing Some Results of MetaNet – WG 2 METIS 2004, Geneva W. Grossmann University of Vienna.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Chapter 7 System models.
System models l Abstract descriptions of systems whose requirements are being analysed.
Modified by Juan M. Gomez Software Engineering, 6th edition. Chapter 7 Slide 1 Chapter 7 System Models.
Sommerville 2004,Mejia-Alvarez 2009Software Engineering, 7th edition. Chapter 8 Slide 1 System models.
South Africa Case Study Update Matile Malimabe Executive Manager: Standards Acting Executive Manager: Data Management & Technology.
Model and Representations
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
Eurostat 4. SDMX: Main objects for data exchange 1 Raynald Palmieri Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, October.
Tutorial on XML Tag and Schema Registration in an ISO/IEC Metadata Registry Open Forum 2003 on Metadata Registries Tuesday, January 21, 2003; 4:45-5:30.
1 Integration of the Eurostat and ESS Metadata Systems A. Götzfried Head of Unit B6 Eurostat.
SDMX IT Tools Introduction
2.An overview of SDMX (What is SDMX? Part I) 1 Edward Cook Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, October 2015.
Improving the visualisation of statistics: The need for an SDMX-based visualisation framework Xavier Sosnowska Luxembourg, 6 May 2008.
Eurostat November 2015 Eurostat Unit B3 – IT and standards for data and metadata exchange Jean-Francois LEBLANC Christian SEBASTIAN SDMX IT Tools SDMX.
7b. SDMX practical use case: Census Hub
Copyright (c) 2014 Pearson Education, Inc. Introduction to DBMS.
Eurostat Sharing data validation services Item 5.1 of the agenda.
11 th Open Forum for Metadata Registries May 2008 Metadata Management on a Shoe String in the Vietnamese Ministry of Planning and Investment by Michael.
SDMX Basics course, March 2016 Eurostat SDMX Basics course, March Introducing the Roadmap Marco Pellegrino Eurostat Unit B5: “Data and.
Statistical process model Workshop in Ukraine October 2015 Karin Blix Quality coordinator
Managing Data Resources File Organization and databases for business information systems.
Introduction To DBMS.
Prepared by: Galya STATEVA, Chief expert
SDMX Information Model
SDMX: A brief introduction
ESCWA SDMX Workshop Session: Constraints.
2. An overview of SDMX (What is SDMX? Part I)
2. An overview of SDMX (What is SDMX? Part I)
SDMX Information Model: An Introduction
LOD reference architecture
Statistical Information Technology
Generic Statistical Information Model (GSIM)
Work Session on Statistical Metadata (Geneva, Switzerland May 2013)
SDMX IT Tools SDMX Registry
Presentation transcript:

SDMX in the Vietnam Ministry of Planning and Investment - A Data Model to Manage Metadata and Data ETV2 Component 5 – Facilitating better decision-making in the Ministry of Planning and Investment EUROPEAN TECHNICAL ASSISTANCE PROGRAMME FOR VIETNAM (ETV2) Ministry of Planning and Investment, Ministry of Finance and Ministry of Science and Technology in partnership with the European Commission VNM/AIDCO/2002/0589

ETV2 Component 5 Goals Assist Ministry of Planning and Investment (MPI) to improve monitoring, analysis and decision-making capability Focus on access to and quality of MPI information Major data-related Issues Diverse data from multiple sources from provinces, other ministries, businesses, etc inconsistent formats, definitions overlapping data indifferent areas of MPI No facilities to share and reuse data poor data and metadata management no central storage or registration of data

We are evaluating an SDMX-based solution To hold and manage the metadata mostly structural metadata in SDMX terms but reference metadata will be added To link the data to the metadata To provide an environment where the data can be managed in the context of the metadata for capture and storage for searching and browsing for retrieval and access

Why SDMX? This does not seem to be its home territory data and metadata exchange the data does come from lots of other organisations but this is not the focus of the current work MPI has a pool of unmanaged, mission-critical data with very little management of structural metadata and with virtually no reference metadata although plenty of need for it

Why SDMX? The SDMX Conceptual Model is essentially about linking metadata to data this model provides a framework for sharing and re-using structural metadata a context within which data can be managed and used, via the structural metadata a context within which reference metadata (eg quality information) can be introduced and managed SDMX also provides basic tools to support use of this Conceptual Model It is a considerable challenge to build these features into most other approaches you have to devise and support your own SDMX-like model

The MPI Data Regular tables (and other material) supposed to come regularly from the various source organisations mostly do not come from all in timely fashion little control over how they come usually , often paper! may come as different versions at different times! quality and version status is an issue MPI works in Excel data that comes electronically generally comes in Excel where metadata exists it exists as Excel templates mostly stored on local machines hard to share within MPI departments, impossible to share across them much of the data has security management requirements

SDMX from the Conceptual Model Perspective SDMX is usually described from a data exchange perspective the terminology is a bit abstruse the UML makes a developing a detailed understanding a challenge I like to look at SDMX as a Conceptual Model and to relate all the SDMX jargon to important concepts in the conceptual model I have a few slides to explain the model and to show how we might use it at MPI still work in progress

Hierarchical Code Set (Classifications) Concept Scheme Concept Code List Data Structure Definition Metadata Structure Definition Data Flow Metadata Flow Metadata Set Data Set Data Provider Provision Agreement Category Scheme Category Data Provisioning Metadata Reference Metadata Data Structural Metadata Structural Metadata contains information used to structure data and metadata. This covers Concept and Category schemes, Code Lists and Hierarchical Classifications, and Structure Definitions for Data and Metadata Sets. The basic structural metadata components are the Item Scheme and the Structure Definition. Reference Metadata is non-structural metadata that gives more information about an object to make its interpretation more meaningful. Quality, methodological, and conceptual information are examples of Reference Metadata. SDMX allows Reference Metadata to be published, shared, and reused. Potentially many data sets from many providers conforming to structure of data flow Provision Agreements are agreements by providers to deliver data to a schedule according to a flow structure Possibly multiple category schemes to categorise flows and allow searching and indexing of data and metadata flows Structures are built from Concepts and associated Code Lists that define the valid content Organises simple code lists into hierarchies. Concepts are organised into Concept Schemes Data and Metadata Flows are linked to a structure that defines the format of the corresponding Data Sets and Metadata Sets The observed phenomena at specific points as identified by the values of concepts comprising the key of the observations. Codes and their names and descriptions. The major SDMX artefacts Flows – the heart of SDMX

The SDMX top-level model

This is a description of an data or metadata flow – an abstracted data or metadata set that will potentially occur for many periods and from many providers (eg a regular table received by MPI from various sources)

The SDMX top-level model This is a description of an data or metadata flow – an abstracted data or metadata set that will potentially occur for many periods and from many providers (eg a regular table received by MPI from various sources) This is an instance data or metadata set from a particular provider at a particular time, eg, a particular table from Ninh Binh province, for a particular period

The SDMX top-level model This is a description of an data or metadata flow – an abstracted data or metadata set that will potentially occur for many periods and from many providers (eg a regular table received by MPI from various sources) This is an instance data or metadata set from a particular provider at a particular time, eg, a particular table from Ninh Binh province, for a particular period Provision Agreements indicate what Providers will provide what subset, when, how often, and how

The SDMX top-level model This is a description of an data or metadata flow – an abstracted data or metadata set that will potentially occur for many periods and from many providers (eg a regular table received by MPI from various sources) This is an instance data or metadata set from a particular provider at a particular time, eg, a particular table from Ninh Binh province, for a particular period Provision Agreements indicate what Providers will provide what subset, when, how often, and how This identifies the Data Providers, giving indicative and contact information and linking to Provision Agreements and actual data and metadata sets

The SDMX top-level model This describes the structure of the data or metadata flow – all the metadata needed to request and understand an instance of the flow (an actual data or metadata set). Links to all other structural metadata. This categorises all the defined data and metadata flows, providing a structuring framework and a basis for searching. Links to other structural metadata.

The SDMX top-level model

SDMX at MPI What we envisage is an SDMX Registry/Repository code sets and classifications environment for standardising and harmonising Data Structure Definitions for all the regular data sets received by MPI the Data Flows Categorisation schemes to index the Data Flows a data storage environment to hold the Data Sets initially probably a simple file store possible a database store possibly a star-schema store with star schema design generated automatically from structural metadata provides options for different cuts through data flows

SDMX at MPI What we envisage (cont) intelligent interfaces to Excel using the structural metadata to support browsing and retrieval of data automatic generation of Excel templates data capture, registration, and management structural metadata browsing and management reference metadata definition and management reference metadata attachment

More Information In the workshop sessions