The problem we are trying to solve

Slides:



Advertisements
Similar presentations
GSIM, CSPA, and Related Activities of the High-Level Group
Advertisements

GSBPM and GSIM as the basis for the Common Statistical Production Architecture Steven Vale UNECE
United Nations Economic Commission for Europe Statistical Division Standards-based Modernisation An update on the work of the High-level Group for the.
The European Statistical System Vision Infrastructure Programme Daniel Defays, Director Directorate B, Eurostat Eurostat Workshop on the Modernisation.
Common Statistical Production Architecture An statistical industry architecture will make it easier for each organisation to standardise and combine the.
The future of Statistical Production CSPA. We need to modernise We have a burning platform with: rigid processes and methods; inflexible ageing technology;
Environment Change Information Request Change Definition has subtype of Business Case based upon ConceptPopulation Gives context for Statistical Program.
Background Data validation, a critical issue for the E.S.S.
GSIM Stakeholder Interview Feedback HLG-BAS Secretariat January 2012.
WP.5 - DDI-SDMX Integration
WP.5 - DDI-SDMX Integration E.S.S. cross-cutting project on Information Models and Standards Marco Pellegrino, Denis Grofils Eurostat METIS Work Session6-8.
NSI 1 Collect Process AnalyseDisseminate Survey A Survey B Historically statistical organisations have produced specialised business processes and IT.
Case Studies: Statistics Canada (WP 11) Alice Born Statistics UNECE Workshop on Statistical Metadata.
Standardisation Informal summary of ABS Perspective.
Metadata-driven Business Process in the Australian Bureau of Statistics Aurito Rivera, Simon Wall, Michael Glasson – 8 May 2013.
United Nations Economic Commission for Europe Statistical Division Standards and Statistical Production Architectures Steven Vale UNECE
Statistical Metadata Strategy and GSIM Implementation in Canada Statistics Canada.
The future of Statistical Production CSPA. 50 task team members 7 task teams CSPA 2015 project.
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
SDMX IT Tools Introduction
Aim: “to support the enhancement and implementation of the standards needed for the modernisation of statistical production and services”
Generic Statistical Information Model (GSIM) Jenny Linnerud
2013 HLG Project: Common Statistical Production Architecture.
GSIM, DDI & Standards- based Modernisation of Official Statistics Workshop – DDI Lifecycle: Looking Forward October 2012.
On Implementing CSPA Specifications for Editing and Imputation Services Donato Summa, Monica Scannapieco, Diego Zardetto, Istat, Italy Istituto Nazionale.
The future of Statistical Production CSPA. This webinar on CSPA (common statistical production architecture) is part of a series of lectures on the main.
United Nations Economic Commission for Europe Statistical Division GSBPM and Other Standards Steven Vale UNECE
The future of Statistical Production CSPA. We need to modernise We have a burning platform with: rigid processes and methods; inflexible ageing technology;
Statistical process model Workshop in Ukraine October 2015 Karin Blix Quality coordinator
United Nations Economic Commission for Europe Statistical Division Standards-based Modernisation Steven Vale UNECE
United Nations Economic Commission for Europe Statistical Division CSPA: The Future of Statistical Production Steven Vale UNECE
Common Statistical Production Architecture
DDI and GSIM – Impacts, Context, and Future Possibilities
Towards connecting geospatial information and statistical standards in statistical production: two cases from Statistics Finland Workshop on Integrating.
Contents Introducing the GSBPM Links to other standards
Understanding the communities readiness to transform
Navigating the application of Modernisation Frameworks when using Commercial Of The Shelf products. This presentation will provide a walkthrough of.
Interoperable data formats: SDMX
IT Directors Meeting November 2012
Generic Statistical Business Process Model (GSBPM)
GSBPM, GSIM, and CSPA.
Logical information model LIM Geneva june
Metadata in the modernization of statistical production at Statistics Canada Carmen Greenough June 2, 2014.
Modernization of Statistical data processes
Modernising Official Statistics
X-DIS/XBRL Phase 2 Kick-Off
The Generic Statistical Business Process Model
CSPA: The Future of Statistical Production
SISAI STATISTICAL INFORMATION SYSTEMS ARCHITECTURE AND INTEGRATION
Contents Introducing the GSBPM Links to other standards
Presentation to SISAI Luxembourg, 12 June 2012
SISAI STATISTICAL INFORMATION SYSTEMS ARCHITECTURE AND INTEGRATION
Generic Statistical Information Model (GSIM)
The future of Statistical Production
ESTP Training Course “Enterprise Architecture and the different EA layers, application to the ESS context ” Rome, 16 – 19 October 2017.
Introduction to the Common Statistical Production Architecture Alice Kovarikova High-Level Workshop on Modernization of Official Statistics, Nizhny Novgorod,
CSPA Common Statistical Production Architecture Motivations: definition and benefit of CSPA and service oriented architectures Carlo Vaccari Istat
Data Architecture project
CSPA Specifications Overview
DDI and GSIM – Impacts, Context, and Future Possibilities
CSPA Common Statistical Production Architecture Motivations: definition and benefit of CSPA and service oriented architectures Carlo Vaccari Istat
CSPA Templates for sharing services
ESS Enterprise Architecture
CSPA Templates for sharing services
process and supporting information
Palestinian Central Bureau of Statistics
High-Level Group for the Modernisation of Official Statistics
Hands-on GSIM Mauro Scanu ISTAT
GSIM overview Mauro Scanu ISTAT
Presentation transcript:

The problem we are trying to solve NSI 1 Collect Process Analyse Disseminate Survey A Survey B 2 big barriers that hinder modernisation of statistical organisations are: Rigid processes and methods; and Inflexible and ageing technology environment. Historically statistical organisations have produced specialised business processes and IT systems

How does Architecture help? Many statistical organisations are modernising and transforming using Enterprise Architecture Enterprise Architecture shows what the business needs are and where the organisation wants to be, then aligns efforts accordingly It can help to remove silos and improve collaboration across an organisation Architecture is intrinsically linked to a transformation in a complex system and is concerned with maintaining coherence across a set of activities

EA helps you get to this NSI 1 Collect Process Survey A Survey B Analyse Survey A Survey B Survey C Disseminate

…but if each statistical organisation works by themselves…..

…we get this…. Canada Collect Process Analyse Disseminate Sweden

This makes it hard to share and reuse! Canada Collect Process Analyse Disseminate Sweden ?

…but if statistical organisations work together? HLG sponsor project: Desired Project Outcomes Increased: interoperability in Official Statistics through the sharing of processes and components ability to find real/genuine collaboration opportunities ability to make international decisions and investments sharing of architectural/design knowledge and practices

This makes it easier to share and reuse! Collect Process Analyse Disseminate ? Sweden Canada

Architecture Proof of Concept 2 Strands to the project Architecture Proof of Concept

The Architecture

CSPA Definition Common Statistical Production Architecture (CSPA): framework about Statistical Services to create an agreed top level description of the 'system' of producing statistics which is in alignment with the modernization initiative CSPA provides a template architecture for official statistics, describing: What the official statistical industry wants to achieve How the industry can achieve this, i.e. principles that guide how statistics are produced What the industry will have to do, compliance with the CSPA

Business Architecture Information Architecture Application Architecture An important concept in architecture is the separation of concerns. For that reason, the architecture is separated into a number of layers:   Business Architecture: what the industry does and how it is done (statistics in our case) [mention: link to Statistical Network] Information Architecture: understanding of the information, its flows and uses across the industry, and how that information is managed Application Architecture: the set of practices used to select, define or design software components and their relationships Technology Architecture:  the infrastructure technology underlying (supporting) the business and application layers Technology Architecture

GSIM object structures A statistical service GSIM object structures (formats) GSBPM -process At the core of the architecure is the concept of statistical Service: It is a set of activities that are self contained and which can be executed autonomously It required well define input and provide well define output. It can be parametrised Link to GSBPM and GSIM IN the statistical world, we use GSBPM to describe our activities and we use GSIM to describe the objects (information, parameters) and their structure GSIM object instances http://www.illeccio.com/page.php?lang=en&product_id=37

The concept of Plug and Play Standardised Service: Standardised input and output Meet generic nonfunctional requirements Can be easily used and reused in a number of different processes We use the analogy of the lego block the connector are standardised, they can be easily assembled the block can be used and reused for many deferent purpose (wall, castle, ….) It usually have an IT conotation but this is not necessary : service are broader that this

Proof of Concept

Choosing the PoC components Lego pieces could be: Wrapped legacy/existing Brand new OR For pragmatic reason – ensuring the architecture is valid for our current environments – wrapped legacy/exisitng has been selected Timing Good stuff – just not easy to share

The Proof of Concept 5 countries played the role of Builders 3 countries played the role of Assemblers Editrules CANCEIS Blaise G Code SCS Agreed components for the proof of concept

What Did the Services Do? DataEdit: Localization of erors CANCEIS: Localization of errors, editing, imputation Blaise: Administration of questionnaire and collection of data G Code: An auto-coding service SCS: An auto-coding service

Using DDI in the Proof of Concept

CSPA Service Design and Implementation E.g. DDI or SDMX

Learning curves Proof of Concept required knowledge about: The tool which was wrapped (CANCEIS, Blaise etc) GSIM implementation standards (DDI in this case)

What did we prove?

CSPA is practical and can be implemented by various agencies in a consistent way

What was the CSPA POC Experience with DDI? Being a lifecycle-oriented project, the CSPA POC agreed to use DDI 3.1, the latest production version of DDI Lifecycle The services focused in two areas: questionnaires and (mostly) editing of microdata (re-coding, localization of errors, imputation) DDI Lifecycle was the natural choice DDI maps reasonably well to GSIM DDI profiles and “implementers guide” now being produced

DDI Lessons Learned (1) For data editing, DDI Lifecycle can be massive overkill Much of the required detail is simply not needed (better in 3.2) Data editing is a relatively “metadata-light” application A few data files needed to be described, for input data sets, edited data sets, and reports (tables of which variables were imputed, or where errors might be located) These files were mostly very simple .CSV files We also needed a codelist (codes and categories) for the coding services A *really simple* data set description is needed No interest in study-level information: it is not used by these applications This document will be included in DDI 4.* and later

DDI Lessons Learned (2) It is important to maintain the continuity of metadata across the lifecycle The editing phases of the lifecycle do not use a lot of metadata The tools often consume metadata, but do not produce much! (SAS, etc.) Study-level metadata is often fairly static Variables, logical records, physical data description, statistics can be “recovered” from post-process set-ups, etc. Otherwise, the processing phases of the life-cycle can be a “metadata black hole”!

DDI Lessons Learned (3) CSPA as an architecture is services-oriented The definition of services is broad (TOGAF), but web services and RESTful services both fit the definition being used DDI is not service-oriented: there are no standard service interfaces Most “files” were passed into the CSPA POC services as location references DDI was passed in wholesale in XML form This would not be necessary is we had a standard RESTful syntax, etc. Metadata could be obtained as needed by the services at run-time from minimal input parameters

DDI Lessons Learned (4) The CSPA architecture is designed to support more than just data-production processes Also “support” functions such as classification management In GSIM, the Study Unit maps neatly to a cycle of data production There is no good corresponding container for support functions: Study Unit is about data production Resource Packages represent reusable resources, and map against other things in GSIM For the CSPA POC, this was not an issue: all services were data-oriented

An Interesting Decision: Rules Language For the CSPA POC, many GSIM inputs were “Rules” For imputation For editing For validation There was no good “rules language” for expressing these in a standard way Decision was made, for future work, to use the platform-neutral “Expression Language” now being developed For use with SDMX and DDI, or as “stand-alone” Second face-to-face meeting will be in Basel, end of January 2014

Summary DDI was able to support the CSPA POC use cases Too complex, and too steep a learning curve Standard DDI services interfaces should be developed Need to think about the overall data production lifecycle and how to persist the metadata Need to consider the GSIM objects not only for cyclical data production, but also for “support” functions such as metadata management