CountryData Technologies for Data Exchange SDMX Information Model: An Introduction.

Slides:



Advertisements
Similar presentations
Status on the Mapping of Metadata Standards
Advertisements

SDMX in the Vietnam Ministry of Planning and Investment - A Data Model to Manage Metadata and Data ETV2 Component 5 – Facilitating better decision-making.
SDMX training session on basic principles, data structure definitions and data file implementation 29 November
United Nations Statistics Division Principles and concepts of classifications.
Overview of key concepts and features
ESCWA SDMX Workshop Session: SDMX and Data. Session Objectives At the end of this session you will: –Know the SDMX model of a data structure definition.
WP.5 - DDI-SDMX Integration
WP.5 - DDI-SDMX Integration E.S.S. cross-cutting project on Information Models and Standards Marco Pellegrino, Denis Grofils Eurostat METIS Work Session6-8.
Survey Data Management and Combined use of DDI and SDMX DDI and SDMX use case Labor Force Statistics.
Sdmx web services Strutural data
SDMX AND DATA DISSEMINATION SDMX Training BANK INDONESIA SEPTEMBER 2015 YOGYAKARTA, INDONESIA.
METADATA HARMONISATION SDMX Training BANK INDONESIA SEPTEMBER 2015 YOGYAKARTA, INDONESIA.
CHRIS NELSON METADATA TECHNOLOGY WORK SESSION ON STATISTICAL METADATA GENEVA 6-8 MAY 2013 Designing a Metadata Repository Metadata Technology Ltd.
UNECE METIS work session on statistical metadata Luxembourg, 9 to 11 April SDMX as a source of standardised terminology: MCV and cross-domain concepts.
SDMX Standards Relationships to ISO/IEC 11179/CMR Arofan Gregory Chris Nelson Joint UNECE/Eurostat/OECD workshop on statistical metadata (METIS): Geneva.
Technical Overview of SDMX and DDI : Describing Microdata Arofan Gregory Metadata Technology.
GSIM implementation in the Istat Metadata System: focus on structural metadata and on the joint use of GSIM and SDMX Mauro Scanu
Francesco Rizzo (ISTAT - Italy) SDMX ISTAT FRAMEWORK GENEVE May 2007 OECD SDMX Expert Group.
Environment Change Information Request Change Definition has subtype of Business Case based upon ConceptPopulation Gives context for Statistical Program.
SDMX DATA STRUCTURE DEFINITION SDMX Training BANK INDONESIA SEPTEMBER 2015 YOGYAKARTA, INDONESIA.
1 Eurostat Unit B5 – Statistical Information Technologies SDMX Basics – October 2011 SDMX Basics Core Elements Information Model Data Structure Definition.
Session: General Statistical Business Process Model (GSBPM)
XML 2nd EDITION Tutorial 4 Working With Schemas. XP Schemas A schema is an XML document that defines the content and structure of one or more XML documents.
1 The Theoretical Framework. A theoretical framework is similar to the frame of the house. Just as the foundation supports a house, a theoretical framework.
1 DATA PRESENTATION AND SEASONAL ADJUSTMENT - DATA AND METADATA PRESENTATION TERMINOLOGY - DATA PRESENTATION AND SEASONAL ADJUSTMENT - DATA AND METADATA.
Basics David Barraclough OECD SDMX Coordinator
Eurostat 1 7a. Practical use case 1: Pesticides Use Project Blanaru Cristina Eurostat Unit B5: “Central data and metadata services” SDMX Basics course,
Model and Representations
A Data Structure Definition for Eurostat's short-term business statistics Jan Planovsky, Eurostat SDMX Global Conference Bangkok, September 2015.
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
Eurostat 4. SDMX: Main objects for data exchange 1 Raynald Palmieri Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, October.
SDMX IT Tools Introduction
2.An overview of SDMX (What is SDMX? Part I) 1 Edward Cook Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, October 2015.
1 SDMX Global Conference September 2015 SDMX into the future VTL (Validation and Transformation Language) A new technical standard for enhancing.
Statistical Data and Metadata Exchange SDMX Metadata Common Vocabulary Status of project and issues ( ) Marco Pellegrino Eurostat
© Metadata Technology ESCWA SDMX Workshop Session: Reference Metadata and Metadata Structure Definition.
CountryData SDMX for Development Indicators MDG Data Structure Definition and CountryData.
Eurostat November 2015 Eurostat Unit B3 – IT and standards for data and metadata exchange Jean-Francois LEBLANC Christian SEBASTIAN SDMX IT Tools SDMX.
ΕΚΤ Access to Knowledge ΕΚΤ Access to Knowledge R&D Statistics Information System: An Interoperability Tail between CERIF and SDMX Dimitris Karaiskos Dimitrios.
IAEA International Atomic Energy Agency Implementing SDMX for Energy Domain: From Discussion to Actual Implementation and Testing Andrii Gritsevskyi Oslo.
4. SDMX: Main objects for data exchange
Exchanging Reference Metadata using SDMX
SDMX Information Model
SDMX: A brief introduction
ESTP Training Course 8 & 9 April 2014 Fabien JACQUET Eurostat B5
ESCWA SDMX Workshop Session: Constraints.
CountryData SDMX for Development Indicators
2. An overview of SDMX (What is SDMX? Part I)
SDMX Information Model
2. An overview of SDMX (What is SDMX? Part I)
SDMX Information Model: An Introduction
Developing a Data Model
Statistical Information Technology
August Götzfried Eurostat unit B 4
ESS VIP ICT Project Task Force Meeting 5-6 March 2013.
SDMX for MDG Indicators
SODI Live Demonstration
SDG Data Structure Definition
Item 7.3 (b) SDMX for UOE data collection
7. Introduction to the main SDMX objects for metadata exchange
Developing SDMX artefacts for data exchange, sharing and dissemination
SDMX: Frequently Asked Questions
SDMX Information Model
SDMX Converter Abdulla Gozalov, UNSD.
SDG Data Structure Definition
SDMX IT Tools SDMX Information Model, Structures and Messages
SDMX training Francesco Rizzo June 2018
GSIM overview Mauro Scanu ISTAT
Presentation transcript:

CountryData Technologies for Data Exchange SDMX Information Model: An Introduction

SDMX Information Model An abstract model, from which actual implementations are derived. Implemented in XML and GESMES, but could be implemented in other syntaxes.

STATISTICAL DATA & METADATA Time series data representation Cross-sectional data representation Statistical Data (Figures) Statistical Metadata (Identifiers, Descriptors) Structural metadata Reference metadata Statistical Metadata (Methodology, Quality) Source: Eurostat

Structural vs Reference Metadata Structural Metadata: Identifiers and Descriptors, e.g. Data Structure Definition Concept name Code Reference Metadata: Describes contents and quality of data, e.g. Indicator definition Comments and limitations

Data Structure Definition (DSD) Represents a data model used in exchange Defines dataset structure A DSD contains: Concepts that pertain to the data Code lists, which represent the concepts Dimensional structure, which describes roles of the concepts Groups, which define higher levels of aggregation. Also known as Key Family, but this term was discontinued in SDMX 2.1

ELEMENTS OF A DATA STRUCTURE DEFINITION Source: Eurostat

Concept “A unit of knowledge created by a unique combination of characteristics”* Each concept describes something about the data. * Source: Metadata Common Vocabulary

Concepts: An Example Ref. Area Indicator Period Obs. Value Unit Multiplier

Concept Roles Dimension: concept is used to identify a time series/observation Indicator, Reference Area, Time Attribute: concept conveys additional information, but does not identify a time series or observation Unit multiplier Measure: concept represents the phenomenon being measured Observation value

Representation When data are transferred, its descriptor concepts must have valid values. A concept can be Coded Un-coded with format Un-coded free text

Concept Scheme “The descriptive information for an arrangement or division of concepts into groups based on characteristics, which the objects have in common.” Places concepts into a maintainable unit. Optional in SDMX 2.0, mandatory in SDMX 2.1.

Code “A language-independent set of letters, numbers or symbols that represent a concept whose meaning is described in a natural language.” Codes are language-neutral and may include descriptions in multiple languages.

Code Lists Code lists provide representation for concepts, in terms of Codes. Agreement on code lists is often the most difficult part of developing a DSD. Code lists must be harmonized among all data providers that will be involved in exchange.

Code Lists: Some Examples

Dimensional Structure Lists concepts for: Dimensions Attributes Measures Links concepts and code lists. Defines groups. Defines attribute attachment levels. “DSD” may refer to dimensional structure alone, or the entire data structure definition.

Special Dimensions Time dimension provides observation time. If a DSD describes time series data, it must have one Time dimension. Frequency dimension describes interval between observations. If there is a Time dimension, one other dimension must be marked as Frequency dimension.

Groups In SDMX, groups define partial keys which can be used to attach information to. Attributes can be attached at observation, series, group, or dataset level. The parsimony principle calls for attributes to be attached to the highest applicable level. In MDG/CountryData DSD, groups are not used.

Time Series A set of observations of a particular variable, taken at different points in time. Observations that belong to the same time series, differ in their TIME dimension. All other dimension values are identical. Observation-level attributes may differ across observations.

Time Series: Demonstration

Cross-Sectional Data In simple terms, cross-sectional series (or “section”) is a set of observations of various variables, taken at a particular point in time. A non-time dimension (or a set of dimensions) is chosen along which a set of observations is constructed. Used less frequently than time series representation But census data is an important example

Time Series View vs Cross- Sectional View The Sex dimension was chosen as the cross-sectional measure. Note that Time is still applicable.

Keys in SDMX Series key uniquely identify a time series Consists of all dimensions except TIME Group key uniquely identifies a group of time series Consists of a subset of the series key

Dataset “…can be understood as a collection of similar data, sharing a structure, which covers a fixed period of time.”* Generally a collection of time series or cross- sectional series Dataset serves as a container for series data in SDMX data messages. *Source: Metadata Common Vocabulary

Metadata in SDMX Can be stored or exchanged separately from the object it describes, but be linked to it Can be indexed and searched Reported according to a defined structure

Metadata Structure Definition (MSD) MSD Defines: The object type to which metadata can be associated E.g. DSD, Dimension, Partial Key. The components comprising the object identifier of the target object E.g. CountryData MSD allows metadata to be attached to each indicator for each country Concepts used to express metadata (“metadata attributes”). E.g. Indicator Definition, Quality Management

Metadata Structure Definition and Metadata Set: an example Component: SERIES (phenomenon to be measured) Component ID: REF_AREA (Reference Area) Concept: STAT_CONC_DEF (Indicator Definition) Concept: METHOD_COMP (Method of Computation) Target Identifier Metadata Attributes METADATA SET SERIES=SH_STA_BRTC (Births attended by skilled health personnel) REF_AREA=KHM (Cambodia) METADATA STRUCTURE DEFINITION STAT_CONC_DEF=“It refers to the proportion of deliveries that were attended by skilled health personnel including physicians, medical assistants, midwives and nurses but excluding traditional birth attendants.” METHOD_COMP=“The number of women aged with a live birth attended by skilled health personnel (doctors, nurses or midwives) during delivery is expressed as a percentage of women aged with a live birth in the same period. ” Component: SERIES (phenomenon to be measured) Component ID: REF_AREA (Reference Area) Concept: STAT_CONC_DEF (Indicator Definition) Concept: METHOD_COMP (Method of Computation)

Dataflow and Metadataflow Dataflow defines a “view” on a Data Structure Definition Can be constrained to a subset of codes in any dimension Can be categorized, i.e. can have categories attached In its simplest form defines any data valid according to a DSD Similarly, Metadataflow defines a view on a Metadata Structure Definition.

Category and Category Scheme Category is a way of classifying data for reporting or dissemination Subject matter-domains are commonly implemented as Categories, such as “Demography”, “National Accounts” Category Scheme groups Categories into a maintainable unit.

SDMX INFORMATION MODEL – DATA & METADATA FLOW DATA & METADATA FLOWS Structure Definition Category Scheme Category Constraint Provision Agreement Data Provider Data & Metadata set Source: Eurostat

Data Provider and Provision Agreement Data Provider is an organization that produces and disseminates data and/or reference metadata. Provision Agreement links a Data Provider and a Data/Metadata Flow. I.e. a Data Provider agrees to provide data as specified by a Dataflow. Like Dataflows, Provision Agreements can be categorized and constrained.

SDMX Messages Any SDMX-related data are exchanged in the form of documents called messages. An SDMX message can be either in the XML or GESMES/TS format. There are several types of SDMX messages, each serving a particular purpose, e.g. Structure message is used to send structural information such as DSD, MSD, Concept Scheme, etc. Compact Message (SDMX 2.0) is used to send data. SDMX messages in the XML format are referred to as SDMX-ML messages.