Harry Goossens Centre of Competence on Data Warehousing.

Slides:



Advertisements
Similar presentations
ESSnet on Data Warehousing - WP2 Overview Amsterdam September 2013.
Advertisements

ESSnet on Data Warehousing Centre of Competence
Best practice case: Comparing the implementations of the Irish CDM and the Dutch DSC ESSnet on microdata linking and data warehousing in statistical production.
International Seminar on Modernizing Official Statistics:
S-DWH Architecture (Recap):
Producing and managing metadata Workshop on Writing Metadata for Development Indicators Lusaka, Zambia 30 July – 1 August 2012 Writing Metadata for Development.
What is Business Analysis Planning & Monitoring?
Setting up a National Warehouse of Official Statistics in India P C Mohanan Deputy Director general National Statistical Organisation Ministry of Statistics.
1 ESSnet Workshop 2012 Cavour Conference Centre, Rome 3-4 December 2012 Final conclusions from the workshop Donatella Fazio, Istat.
Standardisation in the European Statistical System Barteld Braaksma, Cecilia Colasanti, Piero Demetrio Falorsi, Wim Kloek, Miguel Angel Martínez Vidal,
WP.5 - DDI-SDMX Integration
WP.5 - DDI-SDMX Integration E.S.S. cross-cutting project on Information Models and Standards Marco Pellegrino, Denis Grofils Eurostat METIS Work Session6-8.
1 Meeting on the Management of Statistical Information Systems (MSIS 2010) (Daejeon, Republic of Korea, April 2010) NIS ICT Strategy in the Production.
Bernadett Szekeres Quality management, Methodology Department, HCSO
Case Studies: Statistics Canada (WP 11) Alice Born Statistics UNECE Workshop on Statistical Metadata.
Data Warehousing at STC MSIS 2007 Geneva, May 8-10, 2007 Karen Doherty Director General Informatics Branch Statistics Canada.
Metadata management and statistical business process at Statistics Estonia Work Session on Statistical Metadata (Geneva, Switzerland 8-10 May 2013) Kaja.
Overview of SDMX: Statistical Data and Metadata eXchange Technical and Content Standards for Statistical Data Ann McPhail, Division Chief Statistics Department,
M ETADATA OF NATIONAL STATISTICAL OFFICES B ELARUS, R USSIA AND K AZAKHSTAN Miroslava Brchanova, Moscow, October, 2014.
SDMX and DDI Working Together Technical Workshop 5-7 June 2013
m. rugsėjo 25 d. Questionnaire on the use of software tools in S-DWH Centre of competence on data warehousing Questionnaire on the.
The Adoption of METIS GSBPM in Statistics Denmark.
CASE STUDY: STATISTICS NORWAY (SSB) Jenny Linnerud and Anne Gro Hustoft Joint UNECE/Eurostat/OECD work session on statistical metadata (METIS) Luxembourg.
Deliverable 2.6: Selective Editing Hannah Finselbach 1 and Orietta Luzi 2 1 ONS, UK 2 ISTAT, Italy.
Modernisation of ESS infrastructure: The ESS instruments - a review E. di Meglio – P. Jacques – J.M. Museux.
United Nations Economic Commission for Europe Statistical Division Part B of CMF: Metadata, Standards Concepts and Models Jana Meliskova UNECE Work Session.
Data warehousing and online analytical processing- Ref Chap 4) By Asst Prof. Muhammad Amir Alam.
ESS-net DWH ESSnet DWH - Metadata in the S-DWH Harry Goossens – Statistics Netherlands Head Data Service Centre / ESSnet Coordinator
Data Warehouse Development Methodology
2 Copyright © Oracle Corporation, All rights reserved. Defining Data Warehouse Concepts and Terminology.
Explaining the statistical data warehouse (S-DWH)
1 Conclusions from Sessions 4, 5, 6 Rapporteurs: Donatella Fazio, Istat Maria Grazia Calza, Istat Arianna Carciotto, Istat ESSnet Workshop 2012 Cavour.
Jenny Linnerud, 27/10/2011, Cologne1 ESSnet CORE Common Reference Environment ESSnet workshop in Cologne 27th and 28th of October 2011.
Statistics New Zealand’s End-to-End Metadata Life-Cycle ”Creating a New Business Model for a National Statistical Office if the 21 st Century” Gary Dunnet.
ESSnet on microdata linking and data warehousing in statistical production: Metadata Quality in the Statistical Data Warehouse.
InSPIRe Australian initiatives for standardising statistical processes and metadata Simon Wall Australian Bureau of Statistics December
Statistics New Zealand's Move to Process-oriented Statistics Production Julia Gretton and Tracey Savage IAOS Conference Shanghai, China, October 2008.
African Centre for Statistics United Nations Economic Commission for Africa Data compilation and management proposal for Africa Statistical Commission.
Work packages SGA II ESSnet on microdata linking and data warehousing in statistical production Harry Goossens – Statistics Netherlands Head Data Service.
ESS-VIP ICT Project Work Package III Task Force Meeting, Luxembourg, 5 March 2013.
Metadata By N.Gopinath AP/CSE Metadata and it’s role in the lifecycle. The collection, maintenance, and deployment of metadata Metadata and tool integration.
United Nations Oslo City Group on Energy Statistics OG7, Helsinki, Finland October 2012 ESCM Chapter 8: Data Quality and Meta Data 1.
ESS-net DWH ESSnet on microdata linking and data warehousing in statistical production Harry Goossens – Statistics Netherlands Head Data Service Centre.
Metadata Framework for a Statistical Data Warehouse
11 Centre of knowledge and expertise Data Warehousing ESSnet (DWH ESSnet)
RECENT DEVELOPMENT OF SORS METADATA REPOSITORIES FOR FASTER AND MORE TRANSPARENT PRODUCTION PROCESS Work Session on Statistical Metadata 9-11 February.
Standardisation in the European Statistical System inventory of normative documents and the standard-setting process – results of the ESSnet on Standardisation.
ESS-net DWH ESSnet on microdata linking and data warehousing in statistical production.
METIS 2011 Workshop Session III – National Implementation of the GSBPM Alice Born and Tim Dunstan Thursday October 6, 2011 Implementation of the GSBPM.
1 Copyright © Oracle Corporation, All rights reserved. Business Intelligence and Data Warehousing.
MBA/1092/10 MBA/1093/10 MBA/1095/10 MBA/1114/10 MBA/1115/10.
Statistical process model Workshop in Ukraine October 2015 Karin Blix Quality coordinator
2 Copyright © 2006, Oracle. All rights reserved. Defining Data Warehouse Concepts and Terminology.
Slide 1 Data Warehousing in CIM  2000 YourNameHere Data Warehousing in Computer Integrated Manufacturing Steve Daino IEM 5303.
Business Intelligence Overview
UNECE-CES Work session on Statistical Data Editing
The ESS vision, ESSnets and SDMX
Implementing the ESS Vision 2020
YTY − an integrated production system for business statistics
Metadata in the modernization of statistical production at Statistics Canada Carmen Greenough June 2, 2014.
ESSnet on Data Warehousing 4th Workshop Maia Ennok 20th. of March 2013
Methodology Working Group Luxemburg
SDMX in the S-DWH Layered Architecture
ESS VIP ICT Project Task Force Meeting 5-6 March 2013.
Data Warehousing Concepts
ESTP course on Statistical Metadata – Introductory course
METIS 2011 Workshop Session III – National Implementation of the GSBPM
Work Session on Statistical Metadata (Geneva, Switzerland May 2013)
Joint UNECE/Eurostat/OECD
Palestinian Central Bureau of Statistics
Presentation transcript:

Harry Goossens Centre of Competence on Data Warehousing

Centre of Competence on DWH 2 Active support of ESS member states, putting results ESSnet DWH in practice

ESSnet DWH– Short Recap 3 A central ‘statistical data store’ for managing all available data of interest, regardless of its source, enabling the NSI to:  produce necessary information (= statistics !)  (re)use available data to create new data / new outputs  execute analysis and perform reporting  A warehouse approach to statistics: Provide an architectural model of the statistical data flow, from data collection to statistical output. Workshop CoC on DWH, Helsinki, 24 – 25 September2014

Introduction: Why S-DWH ? The Challenges 4 Rapidly changing info-demand Decrease costs & admin burden Integration & re-use available data sources Shorter life cycle, Quicker delivery Increase efficiency & flexibility Statistical Production Workshop CoC on DWH, Helsinki, 24 – 25 September2014

Introduction: Why S-DWH ? The goal Reusing Statistical Data External data sources 5 Data Collection Make optimal use of all available data sources (existing & new) Workshop CoC on DWH, Helsinki, 24 – 25 September2014

Output: Huge set of deliverables The ESSnet DWH produced:  Architectural framework for the S-DWH: ­ Business architecture (3.1 and 3.3) ­ Information systems architecture (3.5 and 1.6) ­ Technical architecture (3.4)  Metadata framework (1.1)  Metadata guidelines & recommendations on ­ Use of metadata models (1.3) and functionalities (1.4) ­ Metadata quality (1.2) and governance (1.5)  Methodological recommendations  Workshop CoC on DWH, Helsinki, 24 – 25 September2014

7 ISO 11179

The layered architecture of the S-DWH 8 Distinguishes S-DWH Workshop CoC on DWH, Helsinki, 24 – 25 September2014

GSIM: General Statistical Information Model 9 Workshop CoC on DWH, Helsinki, 24 – 25 September2014

The Business architecture of a S-DWH 10 BUSINESS PRODUCTION STRUCTURES CONCEPTS Workshop CoC on DWH, Helsinki, 24 – 25 September2014

Information Systems Architecture 11 Workshop CoC on DWH, Helsinki, 24 – 25 September2014

Information Systems Architecture 12 Source Layer Integration Layer Interpretation Layer Access Layer Staging Data ICT - Survey SBS - Survey ET- Survey... ADMIN Usually of temporary nature, contents can be erased or archived after the S-DWH has been loaded successfully Workshop CoC on DWH, Helsinki, 24 – 25 September2014

Information Systems Architecture 13 Workshop CoC on DWH, Helsinki, 24 – 25 September2014

Information Systems Architecture 14 Workshop CoC on DWH, Helsinki, 24 – 25 September2014

Information Systems Architecture 15 Workshop CoC on DWH, Helsinki, 24 – 25 September2014

The layered architecture 16 Reflects to 2 different IT environments of the S-DWH: 1.Operational to support semi-automatic computer interaction systems. 2.Analytical, the actual data warehouse to maximize free human interaction Workshop CoC on DWH, Helsinki, 24 – 25 September2014

The layered architecture 17 Workshop CoC on DWH, Helsinki, 24 – 25 September2014

The source layer 18  Gathering point for all data to be used in the S-DWH ­ Internal sources: surveys, data from processing programms ­ External sources: admin data, collected for other purposes  No specific, predefined data model ­ Depends on design of datacollection process ­ Well structured and/or simply flat files  Important role as ‘Gatekeeper’ ­ Ensuring that data getting in the S-DWH always has metadata matching minimum requirements and quality Workshop CoC on DWH, Helsinki, 24 – 25 September2014

The source layer 19 Workshop CoC on DWH, Helsinki, 24 – 25 September2014

The integration layer 20  Operation system(s) used to process day-to-day operations, translating source data into useful content in S-DWH, commonly called ETL: ­ EXTRACT ­ TRANSFORM ­ LOAD Source Layer Integration Layer Integration Layer Integration Layer Interpretration & Analysis Layer Workshop CoC on DWH, Helsinki, 24 – 25 September2014

The integration layer 21  As the focus is on processing, data should be stored in a generalized and normalized data model, optimized for OLPT  ETL needs active metadata  Integration layer produces reference, process and statistical metadata  The efficiency of the processing in the integration layer strongly depends on the quality of the metadata comming from the source layer. Workshop CoC on DWH, Helsinki, 24 – 25 September2014

The integration and layer 22 Workshop CoC on DWH, Helsinki, 24 – 25 September2014

The interpretation and analysis layer 23  Contains relevant (micro) data, processed and structured ­ to be optimized for analysis ­ as base for the planned output of NSI  Specially designed for statistical experts  Built to support data manipulation of large, complex search operations Workshop CoC on DWH, Helsinki, 24 – 25 September2014

The interpretation and analysis layer 24  Data modelling based on analysis & real time output ­ dimensional datamodels ­ highly denormalized, redundancy ­ sometimes cubes  Metadata normally added, with few changes ­ variable definitions, deriviation rules ­ estimation rules, confidentiality rules Workshop CoC on DWH, Helsinki, 24 – 25 September2014

The interpretation and analysis layer 25 Workshop CoC on DWH, Helsinki, 24 – 25 September2014

The access layer 26  Designed for the final presentation, dissemination and delivery of statistical information.  Used by a wide range of users and computer instruments.  Data is optimized to present and compile data effectively.  Data may be presented in data cubes with different formats, specialized to support different tools and software: ’Data marts’ Workshop CoC on DWH, Helsinki, 24 – 25 September2014

The access layer 27 Workshop CoC on DWH, Helsinki, 24 – 25 September2014

28 What support in your opinion is most important for the Centre to provide ?  Review***  Advice & Consultancy****  Implementing deliverables*  Knowledge repository****  Information broker**  Supporting Business Case** C0C on DWH: Desired services Workshop CoC on DWH, Helsinki, 24 – 25 September2014

Main route 29 Focus on middle Phases Workshop CoC on DWH, Helsinki, 24 – 25 September2014

30

Main goals 31  Contacting ESS members for identifying and prioritizing relevant projects and support requests  Provide ad-hoc support and consultancy to ESS members on their specific subjects as requested.  Active dissemination and implementation in daily practice of the deliverables of the ESSnet Workshop CoC on DWH, Helsinki, 24 – 25 September2014

Main goals 32  Set up the knowledge repository in the ESSnet DWH domain of the CROS portal, incl. S-DWH best practice cases in ESS member states.  Further elaboration of specific deliverables from the ESSnet DWH that are characterized as continuous activities (’living documents’).  Keeping up-to-date the Handbook Workshop CoC on DWH, Helsinki, 24 – 25 September2014

Status 33  1st line help desk function established via CROS Portal:   Inventory of needs & tools ­ At final workshop ­ Face-to-face ­ uestionnaire, wider group  Various support provided  Roadmap for Design Workshop CoC on DWH, Helsinki, 24 – 25 September2014

Support actions 34  CSO Ireland: −Architecture and Metadata −Review Corporate Data Vault  Destatis Germany: −Metadata support  CSO Poland −General advise on S-DWH topic −Support session planned Dec/Jan  INE Portugal: −Contacts renewed, partner in COE Workshop CoC on DWH, Helsinki, 24 – 25 September2014

Lessons learned 35  Work in progress, but slowly  Most NSI in early stage of redesign, only few practice experiences to share  Availability of resources  Different format & more flexibility needed  Need for exchanging expertise and experiences still actual Workshop CoC on DWH, Helsinki, 24 – 25 September2014