Data Architecture project

Slides:



Advertisements
Similar presentations
Enterprise Architecture Rapid Assessment
Advertisements

ISO TC184/SC4 Future architecture Rotterdam Progress on the Future SC4 Architecture PWI Friday 13 th November 2009.
Connecting People With Information DoD Net-Centric Services Strategy Frank Petroski October 31, 2006.
United Nations Economic Commission for Europe Statistical Division High-Level Group Achievements and Plans Steven Vale UNECE
GSBPM and GSIM as the basis for the Common Statistical Production Architecture Steven Vale UNECE
Experiences from the Australian Bureau of Statistics (ABS)
Developing Enterprise Architecture
GSIM Stakeholder Interview Feedback HLG-BAS Secretariat January 2012.
The future of Statistical Production CSPA. 50 task team members 7 task teams CSPA 2015 project.
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
SDMX IT Tools Introduction
Generic Statistical Information Model (GSIM) Jenny Linnerud
United Nations Economic Commission for Europe Statistical Division GSBPM and Other Standards Steven Vale UNECE
Enterprise Architectures Course Code : CPIS-352 King Abdul Aziz University, Jeddah Saudi Arabia.
Enterprise Architectures Course Code : CPIS-352 King Abdul Aziz University, Jeddah Saudi Arabia.
ROMA 23 GIUGNO 2016 MODERNISATION LAB - FOCUSSING ON MODERNISATION STRATEGIES IN EUROPE: SOME NSIS’ EXPERIENCES Insert the presentation title Modernisation.
United Nations Economic Commission for Europe Statistical Division CSPA: The Future of Statistical Production Steven Vale UNECE
ESS Enterprise Architecture Reference Framework Jean-Marc Museux, Eurostat 2016 UNECE CSPA Workshop on CSPA Geneva
UNECE / Eurostat Workshop on Implementing CSPA – Geneva – June 2016 ESSNet on Sharing Common Functionalities
Implementing ModernStats Standards Linked Open Metadata
GAMSO in context Denis GROFILS & Jean-Marc MUSEUX, Eurostat
Investment Intentions Survey 2016
UNECE-CES Work session on Statistical Data Editing
Data Integration in Official Statistics 2017 Project Proposal
Achievements in 2016 Data Integration Linked Open Metadata
Development of Strategies for Census Data Dissemination
Data Architecture World Class Operations - Impact Workshop.
Thérèse Lalor Statistical Management and Modernisation Unit
Data Architecture (CSDA): where we are today June 2017
Contents Introducing the GSBPM Links to other standards
Modernization Maturity Model and Roadmap
ServiceNow Implementation Knowledge Management
Modernization Maturity Model
The Open Group Architecture Framework (TOGAF)
Enterprise Data Model Enterprise Architecture approach Insights on application for through-life collaboration 2018 – E. Jesson.
ESS Validation State of Play and next steps
EOSC Governance Development Forum
"IT principles" Context, roadmap
ESTP TRAINING ON EGR Luxembourg – December 2014
Eurostat activities update
SISAI STATISTICAL INFORMATION SYSTEMS ARCHITECTURE AND INTEGRATION
ESS roadmap on Linked Open Data State of play
11. The future of SDMX Introducing the SDMX Roadmap 2020
OBJECT-ORIENTED APPROACH TO OFFICIAL STATISTICS
SISAI STATISTICAL INFORMATION SYSTEMS ARCHITECTURE AND INTEGRATION
GSBPM, GSIM, and CSPA.
ESSnet on Linked Open Statistics
June 2017 Carlo Vaccari, project leader
Marie Haldorson, Statistics Sweden
Metadata in the modernization of statistical production at Statistics Canada Carmen Greenough June 2, 2014.
2. An overview of SDMX (What is SDMX? Part I)
ESS Vision 2020: ESS.VIP Validation
The Generic Statistical Information Model
LOSD Publication Deirdre Lee
ESS.VIP VALIDATION An ESS.VIP project for mutual benefits
LOD reference architecture
3rd WGM Meeting 3 May 2018 Item 2.3 Possible standards for ESS Validation.
Energy Statistics Compilers Manual
Employee engagement Delivery guide
CSPA: The Future of Statistical Production
Task Force Household Budget Survey Innovative tools and sources
Presentation to SISAI Luxembourg, 12 June 2012
Generic Statistical Information Model (GSIM)
Item 2.2 of the agenda IT Working Group meeting 2016
ESTP Training Course “Enterprise Architecture and the different EA layers, application to the ESS context ” Rome, 16 – 19 October 2017.
Australian and New Zealand Metadata Working Group
ESS Enterprise Architecture
process and supporting information
High-Level Group for the Modernisation of Official Statistics
GSIM overview Mauro Scanu ISTAT
Presentation transcript:

Data Architecture project

Purpose and Scope Original scope: To develop a reference framework for data architectures for the official statistics industry, as a part of the Enterprise Architecture which is one of the fundaments of the design of the Common Statistical Production Architecture (CSPA). Focus on: Different sources, new sources Capabilities Data/metadata Guidelines and practices

Data Architecture Purpose Support Statistical Organizations in the design, integration, production and dissemination of official statistics based on both traditional and new types of data sources How to organize and structure their processes and systems for efficient and effective management of data and metadata, from the external sources through the internal storage and processing up to the dissemination of the statistical end-products

Activities 2017 20 people from 11 organisations 2 face-to-face meetings: May @ISTAT in Rome October @CBS in Heerlen Other meetings also at NTTS – Bruxelles and at CSPA Workshop - Wiesbaden Fortnightly plenary Calls on Webex and other sub-groups Calls

Deliverables Reference Architecture: First draft available in May Version 1.0 available on the wiki Use-cases: descriptions of 5 different UC with standard template available Guidelines: Impossible to get in time: Guidelines need Use-cases and Capabilities that was available in stable version by the end of 2017

Reference Architecture Structure of the document: Management summary Purpose and scope Usage and users of Data Architecture Key principles Capabilities and Building Blocks: Core Capabilities Cross-cutting Capabilities Notes on Semantic layer Categories of data

Users and usage

Key principles 1. Information is managed as an asset throughout its lifecycle 2. Information is accessible 3. Data is described to enable reuse 4. Information is captured and recorded at the point of creation/receipt 5. Use an authoritative source 6. Use agreed models and standards 7. Information is secured appropriately For each principle: some statements detailing the principle the rationale behind the principle the implications for NSIs

Capabilities Defining: Core capabilities in four conceptual layers: Ingestion, Transformation, Integration, Access & Consumption Cross-cutting (aver-arching) capabilities: Metadata Management, Data Governance, Provenance & Lineage, Security & Authorization Building Blocks (Conceptual and Logical) are used to realize/implement Capabilities

Diagrams 2017 The following diagrams depict the relationships between the concepts of capabilities and conceptual and logical building blocks that are used to present the CSDA. The CSDA identifies the building blocks at the conceptual level that are mapped to the capabilities that are required to fulfill the activities conducted by a statistical organisation.

Use-cases 2017 Five use-cases has been described: Integrate a new CSPA service (INSEE) Semantic virtualisation (ISTAT) Privacy preserved processing of Sensitive Data (CBS) Use-case from UN-GWG Transformation (ONS)

Intersections: UN-GWG The goal of the UNSD Global Platform project is to build a public-private partnership that should extend initiatives to make better use of innovative data sources at the national and regional levels by developing a Business Case and building a PoC platform DA defines all the capabilities which will be needed by the Global Platform, starting point for the proof of concept. DA principles can be used to guide the data management process design and development for the Global Platform DA focus on the integration of new data sources should prove useful input into the development of the Global Platform’s integration and analysis capabilities The Global Platform can be a particular use-case as example of how the application of DA can benefit a solution development

Other intersections Interesting comparison between ISTAT and CBS vision on metadata: ISTAT use-case where metadata are used for mapping data coming from different sources, using semantic web standards like OWL CBS where a formal model is used to implement metadata for Data Lake implementation More general discussion on the use of standards coming from outside the statistical community (W3C): implications on Open Data policy and more generally on Dissemination policy Interactions with Data Integration project: adopt DI use-cases? Check them against reference DA?

Project 2018 Complete Data Architecture (DA) 2017 project: missing Guidelines Exploit contacts and intersections: Data Architecture for new data sources (UN-GWG) Metadata and Data Integration use-cases Ontologies to be framed in a Data Architecture Other initiatives on DA like Central Banks Survey on Software used: understand how today Data Architecture is implemented in NSIs

Work-packages 2018 WP1: Updated version of CSDA Activity: extend and improve the Common Statistical Data Architecture where appropriate The work package will test the architecture against further use cases, and include further details on: the influence of new data sources on DA the connections between Metadata systems and DA the relationships between DA and semantic data

Work-packages 2018 WP2: Data Architecture Guidelines Activity: improve the ability of NSOs to implement the CSDA Guidelines with recommendations and steps for the introduction of the CSDA in statistical organisations. Guidelines will include: descriptions of data artefacts such as Statistical Data Dictionaries, to drive the definition of data structures and metadata relationship of the CSDA with data governance themes like the data-life cycle management best practices to ensure data quality and to share technical solutions (like CSPA services and algorithms) roadmap for the adoption and implementation of DA

WP1 - Updated version of CSDA WP2 - Data Architecture Guidelines Complete 2017 activities Integrate with other outcomes Focus on new data sources and practical implementation New contacts and new participants: CA, ME, PL, RS

2018 Project Current activities: Check-list on Data characteristics focused on differences between traditional an new data sources: contributions from Eurostat, Italy, Netherlands, Poland, Serbia, Slovenia New schemas: trying to connect GSBPM phases with Capabilities, data characteristics and Metadata (GSIM) (see below)

Checklist items for comparison between new and traditional data: 1.1. Content (nature of the data) 1.2. Storage/accessibility 1.3. Structure 1.4. Quality 1.5. Process 1.6. Linked Data 1.7. Data on the Web Best Practices 2. Data Source 2.1. Legal 2.2. Stability 3. Ingesting Statistical Organisation 3.1. Maturity 3.2. Size 3.3. Legal situation

META DATA Ingestion RAW DATA WORKING DATA Provisioning VALIDATED DATA Admin. data Survey data Big data Ingestion RAW DATA Building Block META DATA Transformation WORKING DATA Provisioning VALIDATED DATA Micro data Macro data Registers

2018 Project Work in progress, many contributions To be delivered by November Face-to-face Meeting in Belgrade 15-18 May outcome