Logical information model LIM Geneva june

Slides:



Advertisements
Similar presentations
Status on the Mapping of Metadata Standards
Advertisements

<<Date>><<SDLC Phase>>
Copyright © 2009 PMI RiskSIGNovember 5-6, 2009 “Project Risk Management – An International Perspective” RiskSIG - Advancing the State of the Art A collaboration.
DDI 3.0 Conceptual Model Chris Nelson. Why Have a Model Non syntactic representation of the business domain Useful for identifying common constructs –Identification,
Creating Architectural Descriptions. Outline Standardizing architectural descriptions: The IEEE has published, “Recommended Practice for Architectural.
GSBPM and GSIM as the basis for the Common Statistical Production Architecture Steven Vale UNECE
1 Business Exchange Structures Concepts.
United Nations Economic Commission for Europe Statistical Division Standards-based Modernisation An update on the work of the High-level Group for the.
Environment Change Information Request Change Definition has subtype of Business Case based upon ConceptPopulation Gives context for Statistical Program.
Neuchâtel Terminology Model: Classification database object types and their attributes Revision 2013 and its relation to GSIM Prepared by Debra Mair, Tim.
Background Data validation, a critical issue for the E.S.S.
WP.5 - DDI-SDMX Integration E.S.S. cross-cutting project on Information Models and Standards Marco Pellegrino, Denis Grofils Eurostat METIS Work Session6-8.
NSI 1 Collect Process AnalyseDisseminate Survey A Survey B Historically statistical organisations have produced specialised business processes and IT.
Generic Statistical Information Model (GSIM) Thérèse Lalor and Steven Vale United Nations Economic Commission for Europe (UNECE)
SOFTWARE DESIGN (SWD) Instructor: Dr. Hany H. Ammar
GSIM implementation in the Istat Metadata System: focus on structural metadata and on the joint use of GSIM and SDMX Mauro Scanu
Metadata Models in Survey Computing Some Results of MetaNet – WG 2 METIS 2004, Geneva W. Grossmann University of Vienna.
Environment Change Information Request Change Definition has subtype of Business Case based upon ConceptPopulation Gives context for Statistical Program.
United Nations Economic Commission for Europe Statistical Division Introduction to Steven Vale UNECE
Marco Oksman SDMX Transformation Component Applying CSPA.
StatisticsCanadaStatistiqueCanada Presentation of seasonally adjusted series STESEG Task Force on Data Presentation and Seasonal Adjustment Bernard Lefrançois.
Model and Representations
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
1 SDMX Global Conference September 2015 SDMX into the future VTL (Validation and Transformation Language) A new technical standard for enhancing.
Aim: “to support the enhancement and implementation of the standards needed for the modernisation of statistical production and services”
GSIM, DDI & Standards- based Modernisation of Official Statistics Workshop – DDI Lifecycle: Looking Forward October 2012.
The FDES revision process: progress so far, state of the art, the way forward United Nations Statistics Division.
Statistical Data and Metadata Exchange SDMX Metadata Common Vocabulary Status of project and issues ( ) Marco Pellegrino Eurostat
United Nations Economic Commission for Europe Statistical Division GSBPM and Other Standards Steven Vale UNECE
Results of the voting for future work priorities Workshop on International Collaboration for Standards-based Modernisation Geneva, 5-7 May 2015.
Engineering, 7th edition. Chapter 8 Slide 1 System models.
Metadata models to support the statistical cycle: IMDB
5 steps to align your talent strategy to the organisational strategy
DDI and GSIM – Impacts, Context, and Future Possibilities
Contents Introducing the GSBPM Links to other standards
BOMs – Base Object Models
Active Data Management in Space 20m DG
Abstract descriptions of systems whose requirements are being analysed
Metadata Standards for Statistical Classifications
Exchanging Reference Metadata using SDMX
Using the Checklist for SDMX Data Providers
Patterns emerging from chaos
ESS Vision 2020 Validation: Implementation of deliverables
Generic Statistical Business Process Model (GSBPM)
11. The future of SDMX Introducing the SDMX Roadmap 2020
DoDAF 2.x Meta Model (DM2) Conceptual Level
Measuring Data Quality and Compilation of Metadata
GSBPM, GSIM, and CSPA.
2. An overview of SDMX (What is SDMX? Part I)
ESS Vision 2020: ESS.VIP Validation
2. An overview of SDMX (What is SDMX? Part I)
The Generic Statistical Information Model
The problem we are trying to solve
Portfolio, Programme and Project
GSBPM and Data Life Cycle
Applying the ESS EARF in a VIP project: The ESS.VIP Validation example
Recent developments in the Generic Statistical Business Process Model: Revisions and Quality Indicators Alice Born, Statistics Canada,
Presentation to SISAI Luxembourg, 12 June 2012
Towards an integrated view of modernisation models
Metadata on quality of statistical information
Meta-analysis, systematic reviews and research syntheses
Generic Statistical Information Model (GSIM)
The future of Statistical Production
CSPA Specifications Overview
DDI and GSIM – Impacts, Context, and Future Possibilities
CSPA Templates for sharing services
CSPA Templates for sharing services
Palestinian Central Bureau of Statistics
Hands-on GSIM Mauro Scanu ISTAT
GSIM overview Mauro Scanu ISTAT
Presentation transcript:

Logical information model LIM Geneva 22-24 june

Why LIM The gap between the conceptual nature of the GSIM and the practical implementation focus of the CSPA was too wide. To bridge this gap, a new layer, a Logical Information Model (LIM), was needed.

Stats Agency Environment CSPA Service How to make best use of GSIM in the context of CSPA Stats Agency Environment

GLUE

CSPA Logical Information Model

LIM The aim of the LIM is to translate the conceptual GSIM information objects into physical specifications of the information that flows in and out of statistical services. LIM describes the information objects and logical relationships required to support a CSPA service, in a manner which is consistent with GSIM. LIM is independent of the terminology used in existing standards such as SDMX and DDI.

Scope for LIM Not all GSIM information objects will make it to LIM. The LIM is only concerned with the service and as such will not be taking all GSIM information objects down to LIM level.

The way forward: Build the LIM as we need it!

LIM development criteria CSPA services development roadmap Statistical agencies internal service development roadmap Reusability factor Coverage provided by existing standards

LIM possible physical representations The picture shows the CSPA LIM and its possible physical representations. LIM will make use of a number of standard logical models, e.g. DDI, SDMX, and others.  In some cases there will be an overlap between standard logical models, i.e. a particular LIM object could be represented by different standards.  There are also some areas in the standards that are outside LIM. Those areas consist of objects around which there is no clear usage agreement among practitioners: people in different domains represent the same information by means of different logical objects. Those ambiguous areas will be reduced over time as we include more objects in LIM and we disambiguate their usage for data exchange among CSPA services.  We will identify which existing standards (if any) are relevant to the GSIM (conceptual) information objects which are in scope of the particular service. Logical modelling for CSPA will align to the maximum practical extent with the logical models associated with the candidate standards. In cases where complete alignment with existing standards is not practical, the usual decision will be for the LIM to align with one or other of the choices on a "best fit" basis.  Depending on what information is being represented in practice, DDI and SDMX are currently expected to provide the primary basis for the physical representation of statistical information (e.g. data and metadata) in CSPA.  In future, VTL also for validation, perhaps Datacube for Linked Data.

Adding attributes to existing GSIM classes: Base classes and Code List All attributes in Identifiable Artefact, except local ID, are mandatory. This required a change in the cardinality of some attributes w.r.t. GSIM and the moving of some attributes to other information objects.   After mapping to DDI, three new attributes were added: Local ID, Version Date and Version Rationale. A Code List is a type of Node Set that contain Code Items, i.e. a combination of the meaning of a Category with a Code representation. Specific Code Items can be created to support imputation of missing values. The Code List inherits all identification and administrative information from Identifiable Artefact.

Adding class extensions to GSIM: Process Execution A Process Control defines the flow within Process Steps and between different Process Steps. In particular, it can describe the flow within a service and also between different services. Process Input and Process Output are used to identify the information objects to be passed across the service boundary. There are three types of Process Input in LIM: Transformable Inputs, Process Support Inputs, and Parameter Inputs. And three types of Process Output: Transformed Output, Process Execution Log and Process Metric. These six classes are additions to GSIM 1.1. There are three types of Process Input in LIM: Transformable Inputs, which is any object passed into a service that will be manipulated by the execution of the service, e.g. Data Sets and structural metadata. Process Support Inputs, which is a resource that is referenced or used to guide the service in completing its execution, e.g. code lists, background data, auxiliary data sets, or classifiers used as part of the service Parameter Inputs: Inputs used to specify which configuration should be used for a specific Process Step, which has been designed to be configurable. There are three types of Process Output in LIM: Transformed Output, which is the product of the actions that were executed by the Service, e.g. the updated ("value added") version of one or more Transformable Inputs supplied to the Service (Data Set where values that were missing previously have been imputed). Process Execution Log captures, which captures the output of a Process Step not directly related to the Transformed Output it produced, e.g. data that was recorded during the real time execution of the Process Step. Process Metric, which records the quality or methodological measurements about the execution of a service. One purpose for a Process Metric may be to provide a quality measure related to the Transformed Output, e.g. measure of how many records were imputed, and a measure of how much difference, statistically, the imputed values make to the Data Set overall

Developing LIM for each service: Stage 1: Determine the information inputs and outputs (GSIM objects) Check if the inputs/outputs are covered by LIM If yes, service builder uses relevant part of the LIM, as well as a recommended physical representation If no, contact CSPA implementation group for guidance, possible LIM development Look at which standards are in scope of the request Stage 2: LIM development team consults more broadly on needs On acceptance of development, make it CSPA mandated

Status for LIM

Candidate Services

Further development of LIM CSPA Implemention group Subgroup LIM Variables identified as high priority Help avoid risk of implementors using different Variable incorrectly -> easier to share services Neuchâtel terminology model definitions

Where to find information http://www1.unece.org/stat/platform/display/CSPA/Common+Statistical+Production+Architecture

Thank you! Eva Holm - Statistics Sweden David Barraclough – OECD Flavio Rizzolo – Statistics Canada Based on original slides by Therese Lalor