ESTP course on Statistical Metadata – Introductory course

Slides:



Advertisements
Similar presentations
Practical Database Design Methodology and Use of UML Diagrams
Advertisements

Commentary and exploration of the MINERVA 10 Quality Principles Antonella FresaBerlin, 31 August 2004 Ministerial NEtwoRk for Valorising Activising in.
New Services for Data Creators and Providers Louise Corti, Head ESDS Qualidata/ Outreach & Training Alasdair Crockett, ESDS Data Services Manager.
SDMX in the Vietnam Ministry of Planning and Investment - A Data Model to Manage Metadata and Data ETV2 Component 5 – Facilitating better decision-making.
Provenance-Aware Storage Systems Margo Seltzer April 29, 2005.
Disseminating Statistics: Internet and Publications INE – Madrid, 3-5 March 2008 Ulrich Wieland, Eurostat How to link publications and Internet in order.
Dr Gordon Russell, Napier University Unit Data Dictionary 1 Data Dictionary Unit 5.3.
Best practice case: Comparing the implementations of the Irish CDM and the Dutch DSC ESSnet on microdata linking and data warehousing in statistical production.
Alternative Ways of Presenting Historical Census Data Luuk Schreven & Anouk de Rijk &
LEVERAGING THE ENTERPRISE INFORMATION ENVIRONMENT Louise Edmonds Senior Manager Information Management ACT Health.
MIS 710 Module 0 Database fundamentals Arijit Sengupta.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
WP.5 - DDI-SDMX Integration
Metadata management and statistical business process at Statistics Estonia Work Session on Statistical Metadata (Geneva, Switzerland 8-10 May 2013) Kaja.
M ETADATA OF NATIONAL STATISTICAL OFFICES B ELARUS, R USSIA AND K AZAKHSTAN Miroslava Brchanova, Moscow, October, 2014.
The value added of a national statistical institute Max Booleman Marleen Verbruggen.
StatLine 4 metadata implementation Edwin de Jonge Statistics Netherlands.
0 A Workable Solution for Basic Metadata January 9, 2006.
February 17, 1999Open Forum on Metadata Registries 1 Census Corporate Statistical Metadata Registry By Martin V. Appel Daniel W. Gillman Samuel N. Highsmith,
ESS-net DWH ESSnet DWH - Metadata in the S-DWH Harry Goossens – Statistics Netherlands Head Data Service Centre / ESSnet Coordinator
Explaining the statistical data warehouse (S-DWH)
Case Study Statistics Netherlands Max Booleman Statistics Netherlands METIS, 2010.
ICS (072)Database Systems: An Introduction & Review 1 ICS 424 Advanced Database Systems Dr. Muhammad Shafique.
IS 325 Notes for Wednesday August 28, Data is the Core of the Enterprise.
Copyright (c) 2014 Pearson Education, Inc. Introduction to DBMS.
ESS-net DWH ESSnet on microdata linking and data warehousing in statistical production.
Harry Goossens Centre of Competence on Data Warehousing.
Elaborating on the Business Architecture of SN Robbert Renssen Statistics Netherlands Standard Process Steps.
Fundamental of Database Systems
CS4222 Principles of Database System
Introduction To DBMS.
Navigating Your Way Through the EFT, Nesstar and Beyond 20/20 (WDS)
Chapter (12) – Old Version
Investment Intentions Survey 2016
ARIS Extension Pack TOGAF April 2016
An introduction to MEDIN Data Guidelines September 2016
Contents Introducing the GSBPM Links to other standards
Accelerate define.xml using defineReady - Saravanan June 17, 2015.
An introduction to MEDIN Data Guidelines.
Investment Intentions Survey 2016
Chapter 4 Relational Databases
Interoperable data formats: SDMX
Data management and the Production of Statistics Geert Bruinooge Deputy Director General Statistics Netherlands Seminar on Innovations in Official Statistics.
File Systems and Databases
at Statistics Netherlands
Generic Statistical Business Process Model (GSBPM)
Tomaž Špeh, Rudi Seljak Statistical Office of the Republic of Slovenia
Metadata in the modernization of statistical production at Statistics Canada Carmen Greenough June 2, 2014.
ESSnet on Data Warehousing 4th Workshop Maia Ennok 20th. of March 2013
Standard Process Steps in Statistics
Data Model.
Database Systems Instructor Name: Lecture-3.
IS-ENES Cases Seven use cases are listed as data lifecycle steps A B C
Max Booleman Statistics Netherlands
LOD reference architecture
SDMX in the S-DWH Layered Architecture
SDMX Tools Overview and architecture
Statistical Information Technology
Metadata The metadata contains
The Generic Statistical Business Process Model
Prepared by Peter Boško, Luxembourg June 2012
Mapping Data Production Processes to the GSBPM
Metadata used throughout statistics production
The role of metadata in census data dissemination
Work Session on Statistical Metadata (Geneva, Switzerland May 2013)
Practical Database Design and Tuning Objectives
Data + Research Elements What Publishers Can Do (and Are Doing) to Facilitate Data Integration and Attribution David Parsons – Lawrence, KS, 13th February.
The Generic Statistical Business Process Model Steven Vale, UNECE
Introduction to reference metadata and quality reporting
Palestinian Central Bureau of Statistics
Presentation transcript:

ESTP course on Statistical Metadata – Introductory course Statistics Netherlands, The Hague,18-19 February 2013

at Statistics Netherlands Implementation of the Data Service Centre at Statistics Netherlands Harry Goossens Programm Manager DSC 2

Agenda Why, What, How ? The CBS Metadata model The CBS Business Architectuur Steady States Implementation Daily practice Demo 3

Data Service Centre: What is it ? Fundamental corner stone of the CBS Business Architecture Central ‘vault’ with Steady States, linking: statistical data (facts & figures) conceptual metadata (description) technical metadata (user’s guide) documentation Implementation of the Dutch metadata model

DSC: The concept No data without metadata Based upon dedicated metadata model Strict distinction between the data that are actually processed and the metadata that describe the definitions, the quality and the process activities Steady states are explicitly designed for re-use. The metadata (of steady states) are generally accessible and are standardised as much as possible

Generic services: Catalogue: searching & finding Metadata management DSC: What offers it ? Generic services: Catalogue: searching & finding Metadata management Centralised data distribution Authorisation management Automatic process interfacing Archiving of statistical datasets Version management

DSC: Conceptual metadata Metadata that describes the data in a generic, non specific way, in all the various phases of the statistical proces: Input - description of received data; - terminology of the supplier (internal and external) Processing - description of data produced in various statistical processes; - internal (and international) standards / guidlines Output - description of publishable output data - definition of (sub)populations, outputvariables, object types content description,

DSC Metamodel (simplified) Variable Data Design 1 : 1 1 : n Technical Metadata file (XML) Context Variable 1 : n Documentation (Word, PDF,….) Datasets (ASCII CSV, Fixed) Codelists (XML) 1 : 1

Mission, Policy and Strategic Objectives CBS Business Architecture (simplified) Mission, Policy and Strategic Objectives Design Process Design Data External users with information needs DSC Metadata Catalogue Metadata Management External Suppliers of data Collect Data Process Data Disseminate Data DSC Data storage Data (steady states)

What are steady states ? A steady state is a data set together with information for its correct interpretation. Rectangular - Rows represent units (micro) or classes of units (macro) - Columns represent variables Heading: population, time Dataset design (vary time): in design phase Dataset design is like a template of a table: only borders and heading 1 Dataset design, n Datasets

Why steady states ? Reduce storage: Secure the statistical proces: Store once Re-use many times Secure the statistical proces: Each steady state is a guaranteed fall back point Improve consistency: Every following process uses the same dataset ‘Single point of truth’ principle Improve flexibility Enables independent, generic proces design

Implementation Micro model simplyfied for practical use Skip / Combine objects Reduce attributes Why Documentum ? Completely object orientated, enables to implement DSC metamodel Largly configurable (user interface, authorisation, etc.) Large flexibillity Proven technology TaskSpace

DSC system - Functionalities Taylor made user interface (TaskSpace) Maintenance of meta objects Constraints mostly in interface, not in DB/repository Specific flexible search engine Various entrances, easy to extend Import & Export of datasets Modified according to model Interface for bulk-import of metadata Based on standard XML schema Conceptual and technical meta

Daily practice - Challenges Available metadata quality often poor Great variety, each statistic own way of describing Often tool based (SPSS), more technical then logical Definition = question from survey Minimum mapping with DSC model No real urge Although rated IMPORTANT, low priority No clear ownership, resonsibility not felt Extra work without direct gain (burden)

Daily practice - Road map Explaining the concept & metadata model Requirements, guidelines Stocktaking What meta is available ? How extensive or poor ? What quality, actuality ? Re-usability Mapping on the model (Re)Design Datadesigns Matching attributes

Daily practice - Chances Look for added value Problems ? Wishes ? (Long) Wanted improvements ? Re-usability Define pilot Quick hands on experience, short cycle Good estimation time & resources ‘Proof of the design pudding’

Daily practice – At work (1) Excel template, Nesstar Publisher Porch / Ballot Visual check on guidelines Automated check on completeness, inconsistencies, relations VARIABLE – CONTEXT VARIABLE etc. Advise for corrections/improvemnts: by owner (statistics) ! Define and set authorisations Groups for import, export, metadata maintenance

Daily practice – At work (2) Metadata in DSC-system Define datadesign Import TMF (xml) Bulkimport Variables (xml) Import dataset (s) Search Export data & metadata