Metadata use in the Statistical Value Chain

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Status on the Mapping of Metadata Standards
SDMX training session on basic principles, data structure definitions and data file implementation 29 November
Making the Case for Metadata at SRS-NSF National Science Foundation Division of Science Resources Statistics Jeri Mulrow, Geetha Srinivasarao, and John.
TC3 Meeting in Montreal (Montreal/Secretariat)6 page 1 of 10 Structure and purpose of IEC ISO - IEC Specifications for Document Management.
United Nations Economic Commission for Europe Statistical Division Applying the GSBPM to Business Register Management Steven Vale UNECE
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
WP.5 - DDI-SDMX Integration E.S.S. cross-cutting project on Information Models and Standards Marco Pellegrino, Denis Grofils Eurostat METIS Work Session6-8.
NSI 1 Collect Process AnalyseDisseminate Survey A Survey B Historically statistical organisations have produced specialised business processes and IT.
Metadata management and statistical business process at Statistics Estonia Work Session on Statistical Metadata (Geneva, Switzerland 8-10 May 2013) Kaja.
Using ISO/IEC to Help with Metadata Management Problems Graeme Oakley Australian Bureau of Statistics.
4 April 2007METIS Work Session1 Metadata Standards and Their Support of Data Management Needs Daniel W. Gillman Bureau of Labor Statistics Paul Johanis.
Development of metadata in the National Statistical Institute of Spain Work Session on Statistical Metadata Genève, 6-8 May-2013 Ana Isabel Sánchez-Luengo.
Met a-data Resources in Europe: within NSIs and from Dosis Projects Wilfried Grossmann Department of Statistics and Decision Support Systems University.
CountryData Technologies for Data Exchange SDMX Information Model: An Introduction.
Describing Statistical registers in SDMX and DDI: A Comparison Arofan Gregory Metadata Technology Eurostat, June 4-6, 2013 Luxembourg.
Metadata Models in Survey Computing Some Results of MetaNet – WG 2 METIS 2004, Geneva W. Grossmann University of Vienna.
Lecturer: Gareth Jones. How does a relational database organise data? What are the principles of a database management system? What are the principal.
Eurostat Expression language (EL) in Eurostat SDMX - TWG Luxembourg, 5 Jun 2013 Adam Wroński.
Francesco Rizzo (ISTAT - Italy) SDMX ISTAT FRAMEWORK GENEVE May 2007 OECD SDMX Expert Group.
Developing Statistical Information Systems and XML Information Technologies - Possibilities and Practicable Solutions Geneva,
Metadata Common Vocabulary a journey from a glossary to an ontology of statistical metadata, and back Sérgio Bacelar
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
Eurostat 4. SDMX: Main objects for data exchange 1 Raynald Palmieri Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, October.
Metadata Working Group Jean HELLER EUROSTAT Directorate A: Statistical Information System Unit A-3: Reference data bases.
2.An overview of SDMX (What is SDMX? Part I) 1 Edward Cook Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, October 2015.
GEM METADATA DEVELOPMENT Xiaoping Wang, Macrosearch Allen Macklin, PMEL and Bernard Megrey, AFSC.
Joseph Lukhwareni Statistics South Africa Reengineering projects focusing on metadata and the statistical cycle Statistics South Africa, South Africa 3-5.
Fundamentals, Design, and Implementation, 9/e Appendix B The Semantic Object Model.
Statistical Data and Metadata Exchange SDMX Metadata Common Vocabulary Status of project and issues ( ) Marco Pellegrino Eurostat
Ontology Technology applied to Catalogues Paul Kopp.
Chapter 10 Structuring System Requirements: Conceptual Data Modeling
Chapter (12) – Old Version
MANAGEMENT OF STATISTICAL PRODUCTION PROCESS METADATA IN ISIS
Prepared by: Galya STATEVA, Chief expert
Progress Update MSIS: Bratislava, April 2005
SDMX Information Model
MSDs and combined metadata reporting

Chapter 10 Structuring System Requirements: Conceptual Data Modeling
Transportation Research Thesaurus:
eSciDoc – Content model requirements
Country Specific Notes Agenda point 11
The new metadata structure & Country Specific Notes
2. An overview of SDMX (What is SDMX? Part I)
Development of production routines for Crime & Criminal justice statistics Arsela Sturc SOGETI.
2. An overview of SDMX (What is SDMX? Part I)
Data Model.
NewCronos what policy and architecture contents consultation evolution
Metadata Framework as the basis for Metadata-driven Architecture
State of progress Eurostat web site 3 Dissemination WG 29 April 2004
Reference Data and Metadata Warehouses
SDMX Information Model: An Introduction
The problem we are trying to solve
August Götzfried Eurostat unit B 4
ESS VIP ICT Project Task Force Meeting 5-6 March 2013.
Item 3.1 – Overview of 2015 and 2016 UOE data collections
Data validation handbook
SDMX and implications for the ESS
Chapter 10 Structuring System Requirements: Conceptual Data Modeling
The role of metadata in census data dissemination
Work Session on Statistical Metadata (Geneva, Switzerland May 2013)
Practical Database Design and Tuning Objectives
Presentation of Project Joint meeting of the ESS.VIP.BUS ICT Project
EDIT data validation system Ewa Stacewicz EUROSTAT VALIDATION TEAM
Petr Elias Czech Statistical Office
Introduction to reference metadata and quality reporting
7. Introduction to the main SDMX objects for metadata exchange
SDMX IT Tools SDMX Registry
Presentation transcript:

Metadata use in the Statistical Value Chain UNECE-Eurostat-OECD Meeting on Management of Statistical Information Systems MSIS 2008 Luxembourg, 7-9 April 2008 Georges Pongas Adam Wroński 07-Apr-08

Content Introduction Operational Characteristics of Metadata Technical Characteristics of the Metadata Metadata types needed in the various steps of the SVC (statistical value chain) Conclusion 7-Apr-08 Metadata use in the Statistical Value Chain

Seven SVC steps Expression of the need Data collection design Specification and development of the tools needed for the data collection Data collection Data editing and imputation Data processing Data dissemination 7-Apr-08 Metadata use in the Statistical Value Chain

Basics Leave out the statistical notions from the technical (implementation oriented) characteristics of the metadata. Design metadata technical characteristics so the same metadata structures can cover both statistical and non-statistical requirements 7-Apr-08 Metadata use in the Statistical Value Chain

Operational Characteristics of Metadata Static nature Long production process Located in various places (resources) Critical link with statistical data depends on statistical data changes Strong coupling of structural metadata with the statistical data Large number of metadata entities needed in SVC 7-Apr-08 Metadata use in the Statistical Value Chain

Technical Characteristics of Metadata Terminology often complex Technical characteristics and statistical notions frequently mixed 7-Apr-08 Metadata use in the Statistical Value Chain

Statistical Notions and Metadata Examples Classification, keyword list and set of information related to the SDDS standard Correspondence table between two classifications & table containing the links (access rights) between the user names and the statistical datasets of a database The only difference is the context, i.e., the user interface Thus develop separately: a common set of functionalities and the interface layer for an application 7-Apr-08 Metadata use in the Statistical Value Chain

Metadata Technical Structure Categories Three categories proposed: Simple Metadata Entities (SME) Binary Relationships (BR) Clustered Metadata Entities (CME) 7-Apr-08 Metadata use in the Statistical Value Chain

Simple Metadata Entities (SME) simple key variable number of attributes appropriate for vertical type storage Example 1 Example 2 Entity NACE user name Entity element 2122 gpongas Attribute name English label phone no Attribute value “Mining” 430139 7-Apr-08 Metadata use in the Statistical Value Chain

Examples of SMEs SDDS documents Dublin Core Classifications Keywords Administrative entities Programs Publications 7-Apr-08 Metadata use in the Statistical Value Chain

Binary Relationships (BR) Two types: Between two different entities correspondence tables, access rights definitions Inside the same entity thesauri, classification hierarchies, links between regulations, statistical documents Example Relationship id UN thesaurus First entity id EUROPE First entity role Parent Second entity id FR Second entity role Child Reason of link Broader term 7-Apr-08 Metadata use in the Statistical Value Chain

Clustered Metadata Entities (CME) Complex entities characterised by variable keys’ cardinality and references to other entities of type CME, SME and BR Description techniques XML schema is appropriate 7-Apr-08 Metadata use in the Statistical Value Chain

Examples SDMX, Gesmes definitions Dataset definitions Annotations to dataset cells Confidentiality definitions linked to datasets 7-Apr-08 Metadata use in the Statistical Value Chain

Metadata in the various steps of the SVC 7-Apr-08 Metadata use in the Statistical Value Chain

Collection Metadata Mostly of type BR and SME Among others they contain: source agencies data files descriptions codelists validation rules linked to initial data checks 7-Apr-08 Metadata use in the Statistical Value Chain

Editing, Imputation and Processing Metadata More complex than the collection metadata (more CME entities needed) Among others they contain: Dataset definitions Formulas, programs, scripts Conditional and ordinary annotations Dissemination feeding information 7-Apr-08 Metadata use in the Statistical Value Chain

Dissemination Metadata The most complex metadata types are located here. They contain almost all the previously described metadata plus their own Reasons for this complexity Dissemination contains all the statistical domains It must cover all user types It has tight delivery deadlines It must offer navigation presentation and extraction facilities of great friendliness 7-Apr-08 Metadata use in the Statistical Value Chain

Among others dissemination metadata contain Sitemap description Release calendars Dataset links to publication tables Questionnaires definitions linked to datasets Units of measurement Ready made queries 7-Apr-08 Metadata use in the Statistical Value Chain

Conclusion Separation of statistical notions (context) and structure (functionality) of metadata gives minimisation of structural metadata types consequently it makes easier to build and implement a complex statistical (metadata and data) system 7-Apr-08 Metadata use in the Statistical Value Chain