at Statistics Netherlands

Slides:



Advertisements
Similar presentations
Practical Database Design Methodology and Use of UML Diagrams
Advertisements

Commentary and exploration of the MINERVA 10 Quality Principles Antonella FresaBerlin, 31 August 2004 Ministerial NEtwoRk for Valorising Activising in.
SDMX in the Vietnam Ministry of Planning and Investment - A Data Model to Manage Metadata and Data ETV2 Component 5 – Facilitating better decision-making.
Provenance-Aware Storage Systems Margo Seltzer April 29, 2005.
Disseminating Statistics: Internet and Publications INE – Madrid, 3-5 March 2008 Ulrich Wieland, Eurostat How to link publications and Internet in order.
Dr Gordon Russell, Napier University Unit Data Dictionary 1 Data Dictionary Unit 5.3.
Best practice case: Comparing the implementations of the Irish CDM and the Dutch DSC ESSnet on microdata linking and data warehousing in statistical production.
Chapter 4 Relational Databases Copyright © 2012 Pearson Education, Inc. publishing as Prentice Hall 4-1.
Chapter 4 Relational Databases Copyright © 2012 Pearson Education 4-1.
LEVERAGING THE ENTERPRISE INFORMATION ENVIRONMENT Louise Edmonds Senior Manager Information Management ACT Health.
MIS 710 Module 0 Database fundamentals Arijit Sengupta.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
WP.5 - DDI-SDMX Integration
The value added of a national statistical institute Max Booleman Marleen Verbruggen.
Met a-data Resources in Europe: within NSIs and from Dosis Projects Wilfried Grossmann Department of Statistics and Decision Support Systems University.
StatLine 4 metadata implementation Edwin de Jonge Statistics Netherlands.
0 A Workable Solution for Basic Metadata January 9, 2006.
10/18/2015 NORTEL NETWORKS CONFIDENTIAL – FOR TRAINING PURPOSES ONLY Global Documentation Evolution System Overview and End-to-End Process Training.
February 17, 1999Open Forum on Metadata Registries 1 Census Corporate Statistical Metadata Registry By Martin V. Appel Daniel W. Gillman Samuel N. Highsmith,
ESS-net DWH ESSnet DWH - Metadata in the S-DWH Harry Goossens – Statistics Netherlands Head Data Service Centre / ESSnet Coordinator
Explaining the statistical data warehouse (S-DWH)
Case Study Statistics Netherlands Max Booleman Statistics Netherlands METIS, 2010.
IS 325 Notes for Wednesday August 28, Data is the Core of the Enterprise.
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
A modular metadata-driven statistical production system The case of price index production system at Statistics Finland Pekka Mäkelä, Mika Sirviö.
Copyright (c) 2014 Pearson Education, Inc. Introduction to DBMS.
ESS-net DWH ESSnet on microdata linking and data warehousing in statistical production.
Harry Goossens Centre of Competence on Data Warehousing.
Elaborating on the Business Architecture of SN Robbert Renssen Statistics Netherlands Standard Process Steps.
Copyright © 2007, Oracle. All rights reserved. Managing Items and Item Catalogs.
Standard Process Steps in Statistics Robbert Renssen Statistics Netherlands Robbert Renssen and Astrea Camstra, Statistics Netherlands.
Fundamental of Database Systems
Introduction To DBMS.
Chapter (12) – Old Version
ARIS Extension Pack TOGAF April 2016
An introduction to MEDIN Data Guidelines September 2016
Modern Systems Analysis and Design Third Edition
PLM, Document and Workflow Management
The importance of being Connected
Modern Systems Analysis and Design Third Edition
Contents Introducing the GSBPM Links to other standards
Accelerate define.xml using defineReady - Saravanan June 17, 2015.
An introduction to MEDIN Data Guidelines.
Chapter 4 Relational Databases
Creating ADaM Friendly Analysis Data from SDTM Using Meta-data by Erik Brun & Rico Schiller (CD ) H. Lundbeck A/S 13-Oct
Interoperable data formats: SDMX
Data management and the Production of Statistics Geert Bruinooge Deputy Director General Statistics Netherlands Seminar on Innovations in Official Statistics.
Modern Systems Analysis and Design Third Edition
Tomaž Špeh, Rudi Seljak Statistical Office of the Republic of Slovenia
Metadata in the modernization of statistical production at Statistics Canada Carmen Greenough June 2, 2014.
ESSnet on Data Warehousing 4th Workshop Maia Ennok 20th. of March 2013
Standard Process Steps in Statistics
Modern Systems Analysis and Design Third Edition
Data Model.
Database Systems Instructor Name: Lecture-3.
IS-ENES Cases Seven use cases are listed as data lifecycle steps A B C
Max Booleman Statistics Netherlands
LOD reference architecture
SDMX in the S-DWH Layered Architecture
Statistical Information Technology
Metadata The metadata contains
Subject Name: SOFTWARE ENGINEERING Subject Code:10IS51
ESTP course on Statistical Metadata – Introductory course
Generic Statistical Information Model (GSIM)
Modern Systems Analysis and Design Third Edition
Work Session on Statistical Metadata (Geneva, Switzerland May 2013)
Practical Database Design and Tuning Objectives
Data + Research Elements What Publishers Can Do (and Are Doing) to Facilitate Data Integration and Attribution David Parsons – Lawrence, KS, 13th February.
The Generic Statistical Business Process Model Steven Vale, UNECE
Palestinian Central Bureau of Statistics
Presentation transcript:

at Statistics Netherlands Data Service Centrum Implementation of the Data Service Centre at Statistics Netherlands Harry Goossens Programm Manager DSC

Agenda Why, What, How ? The CBS Metadata model The CBS Business Architectuur Steady States Implementation Daily practice Demo

Data Service Centre: What is it ? Fundamental corner stone of the CBS Business Architecture Central ‘vault’ with Steady States, linking: statistical data (facts & figures) conceptual metadata (description) technical metadata (user’s guide) documentation Implementation of the Dutch metadata model

DSC: The concept No data without metadata Based upon dedicated metadata model Strict distinction between the data that are actually processed and the metadata that describe the definitions, the quality and the process activities Steady states are explicitly designed for re-use. The metadata (of steady states) are generally accessible and are standardised as much as possible

DSC: What offers it ? Generic services: Catalogue: searching & finding Metadata management Centralised data distribution Authorisation management Automatic process interfacing Archiving of statistical datasets Version management

DSC: Conceptual metadata Metadata that describes the data in a generic, non specific way, in all the various phases of the statistical proces: Input description of received data; terminology of the supplier (internal and external) Processing description of data produced from various statistical processes; internal (and international) standards/guidelines Output description of publishable outputdata; definition of (sub)populations, outputvariables, object types content description,

CBS Metadata model - micro

CBS Metadata model - macro

DSC Metamodel - simplified Variabele Dataontwerp 1 : n 1 : 1 Technische Metafile (XML) Context Variabele 1 : n Documentatie (Word, PDF,….) Datasets (ASCII CSV, Fixed)

CBS Business Architecture Strategy Design DSC – Metadata Catalogue Chain management Statistics Production Steady States DSC - Data Storage

Steady States

What are steady states ? A steady state is a data set together with information for its correct interpretation. Rectangular Rows represent units (micro) or classes of units (macro) Columns represent variables Heading: population, time Dataset design (vary time): in design phase Dataset design is like a template of a table: only borders and heading 1 Dataset design, n Datasets

Why steady states ? Reduce storage: Store once Re-use many times Secure the statistical proces: Each steady state is a guaranteed fall back point Improve consistency: Every following process uses the same dataset Improve flexibility Enables independent, generic proces design

Implementation Micro model simplyfied for practical use Skip / Combine objects Reduce attributes Why Documentum ? Completely object orientated, enables to implement DSC metamodel Largly configurable (user interface, authorisation, etc.) Large flexibillity Proven technology TaskSpace

DSC system - Functionalities Taylor made user interface (TaskSpace) maintenance of meta objects constraints mostly in interface, not in DB/repository Specific flexible search engine various entrances, easy to extend Import & Export of datasets modified according to model Interface for bulk-import of metadata based on standard XML schema conceptual and technical meta

Daily practice - Challenges Available metadata quality often poor Great variety, each statistic own way of describing Often tool based (SPSS), more technical then logical Definition = question from survey Minimum mapping with DSC model No real urge Although rated IMPORTANT, low priority No clear ownership, resonsibility not felt Extra work without direct gain (burden)

Daily practice - Road map Explaining the concept & metadata model Requirements, guidelines Stocktaking What meta is available ? How extensive or poor ? What quality, actuality ? Re-usability Mapping on the model (Re)Design Datadesigns Matching attributes

Daily practice - Chances Look for added value Problems ? Wishes ? (Long) Wanted improvements ? Re-usability Define pilot Quick hands on experience, short cycle Good estimation time & resources ‘Proof of the design pudding’

Daily practice – At work Excel template Porch / Ballot Visual check on guidelines Automated check on completeness, inconsistencies, relations VARIABLE – CONTEXT VARIABLE Corrections Bulk import TMF Meta XML

Screenshot

Screenshot: Metadata attributes of a Data design

Screenshot: and this is how we stor the statistical datasets