Meta-data Resources in Europe: within NSIs and from DOSIS Projects. Wilfried Grossmann, Department of Statistics and Decision Support Systems, University of Vienna.

Presentation transcript:

Slide 1: Meta-data Resources in Europe: within NSIs and from DOSIS Projects
Wilfried Grossmann, Department of Statistics and Decision Support Systems, University of Vienna

Metadata Resources in Europe, slide 2: Contents
- Introduction
- Contents of Meta-data
- IT Structures for Meta-data
- Processing Meta-data
- Conclusions

Slide 3: Introduction
Continuing hot topics in the meta-data discussion:
- Content orientation versus IT orientation
There is a lack of communication between these two groups.

Slide 4: Introduction
- Meta-data providers versus meta-data users
Who provides which type of information for whom?

Slide 5: Contents of Meta-data
What kind of objects should be documented?
- Basic statistical structures
- Variables
- Values
- Data sets
____________________
- Statistical output
- Statistical systems
- Statistical processing

Slide 6: Contents of Meta-data
Approaches towards meta-data content:
- The template oriented approach
- The data warehouse approach
- The process oriented approach

Slide 7: Contents of Meta-data
The template oriented approach
Templates defined by a number of working groups:
- For micro data and data sets: DDI, Dublin Core
- For (economic) macro data: OECD, IMF, ECE (Internet)

Slide 8: Contents of Meta-data
The template oriented approach
The OECD template:
- Concepts and sources
- Data collection
- Data manipulation by national source
- Data quality
- Data transmission
- International standards
- Data storage and manipulation by OECD
- Output preparation and delivery by OECD

Slide 9: Contents of Meta-data
The template oriented approach
The IMF template:
- Coverage
- Periodicity
- Timeliness
- Quality of disseminated data
- Integrity of disseminated data
- Access by the public

Slide 10: Contents of Meta-data
The template oriented approach
Although the OECD approach seems more reliable from a statistical point of view, the IMF template is currently favoured by international organisations (EUROSTAT).

Slide 11: Contents of Meta-data
The warehouse approach
- Integration of the data inside the NSIs in a data warehouse
- Output and dissemination as first step
- Meta-data are oriented towards the needs of the data warehouse

Slide 12: Contents of Meta-data
The warehouse approach
Projects in this direction exist in many NSIs.
Best documentation: Australian Office
- Definitional meta-data
- Procedural meta-data
- Operational meta-data
- Systems meta-data
- Datasets meta-data

Slide 13: Contents of Meta-data
The process oriented approach
- Combines statistical and IT considerations
- Statistical data are considered not as final products but as the result of a process chain
- More detailed consideration of statistical terminology

Slide 14: Contents of Meta-data
The process oriented approach
- Starting point was the SCB-DOC model (Rosen and Sundgren, 1991)
- A sequence of templates accompanying the statistical production process
- Ongoing activities at Statistics Sweden
- A number of NSIs want to adopt the model

Slide 15: Contents of Meta-data
The process oriented approach
The IDARESA model: an object oriented representation based on SCB-DOC, with emphasis on possible semi-automatic processing.

Slide 16: Contents of Meta-data
The process oriented approach
The US Bureau of the Census model (Gillman, Appel et al., running project):
A statistical system is defined as an identifiable process ... to produce one or more deliverables.

Slide 17: Contents of Meta-data
Summary
The process oriented approach seems to be favourable for a number of reasons.
Two examples:
- Classification servers
- Data quality

Slide 18: Contents of Meta-data
Summary: Classification server
A classification server should:
- Support unified use of terminology inside NSIs or international organisations
- Support harmonisation between (international) standard classifications and locally defined (adapted) classifications

Slide 19: Contents of Meta-data
Summary: Classification server
Requirements for a classification server:
- A database supporting easy and user-friendly manipulation of hierarchy trees
- A mapping tool supporting the definition of correspondence tables between classifications
- A management strategy for implementation
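The first two requirements can be illustrated with a minimal Python sketch. It is not taken from any of the servers named in the talk: the class names, method names and the codes "A", "A1", "X", "X1", "X2" are all hypothetical, and the fallback to the nearest mapped ancestor is only one possible design choice.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Classification:
    """A classification held as a hierarchy tree (code -> parent code)."""
    name: str
    parent: dict[str, Optional[str]] = field(default_factory=dict)

    def add(self, code: str, parent: Optional[str] = None) -> None:
        # A child may only be attached to a code that already exists.
        if parent is not None and parent not in self.parent:
            raise KeyError(f"unknown parent code: {parent}")
        self.parent[code] = parent

    def ancestors(self, code: str) -> list[str]:
        """Return the codes from the direct parent up to the root."""
        path = []
        current = self.parent[code]
        while current is not None:
            path.append(current)
            current = self.parent[current]
        return path

    def children(self, code: str) -> list[str]:
        return [c for c, p in self.parent.items() if p == code]


@dataclass
class CorrespondenceTable:
    """Links codes of a source classification to codes of a target one."""
    source: Classification
    target: Classification
    links: dict[str, set[str]] = field(default_factory=dict)

    def link(self, source_code: str, target_code: str) -> None:
        if source_code not in self.source.parent:
            raise KeyError(f"unknown source code: {source_code}")
        if target_code not in self.target.parent:
            raise KeyError(f"unknown target code: {target_code}")
        self.links.setdefault(source_code, set()).add(target_code)

    def translate(self, source_code: str) -> set[str]:
        """Map a code; fall back to the nearest ancestor that has a link."""
        for code in [source_code, *self.source.ancestors(source_code)]:
            if code in self.links:
                return self.links[code]
        return set()


# Hypothetical standard classification and a locally adapted one.
standard = Classification("STANDARD")
standard.add("A")
standard.add("A1", parent="A")

local = Classification("LOCAL")
local.add("X")
local.add("X1", parent="X")
local.add("X2", parent="X")

table = CorrespondenceTable(source=local, target=standard)
table.link("X", "A")
table.link("X1", "A1")
```

Here `table.translate("X2")` has no direct link and therefore resolves through its parent "X". The third requirement, a management strategy, is organisational and is not something code can capture.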

Slide 20: Contents of Meta-data
Summary: Classification server
Up to now there are only a few successful implementations, and only for partial solutions:
- EUROSTAT (SIMONE server)
- New Zealand

Slide 21: Contents of Meta-data
Summary: Data Quality
- Criteria for quality of statistics are well known (relevance, accuracy, timeliness, accessibility, comparability, coherence, completeness)
- The problem:
  - Achieve quality in the production process
  - Document quality by appropriate meta-data

Slide 22: Contents of Meta-data
Summary: Data Quality
Experience shows that documentation quality is rather poor as soon as documentation is separated from the production process.
Example of an integration project: the SIDI approach by ISTAT

Slide 23: IT Structures for Meta-data
The Internet and data warehouses offer new opportunities for:
- Meta-data and data repositories
- Meta-data access and exchange
These lead towards a more open policy in data dissemination.

Slide 24: IT Structures for Meta-data
Meta-data repositories
Approaches towards repositories:
- The thesaurus approach
- The template oriented approach
- The data warehouse oriented approach

Slide 25: IT Structures for Meta-data
Meta-data repositories
Example of a thesaurus oriented approach: EUROSTAT servers for concepts and definitions
- Advantage: available on the Internet
- Problem: navigation is not so easy

Slide 26: IT Structures for Meta-data
Meta-data repositories
Contents:
- Descriptions (dictionaries)
- Semantics (coverage, standard classifications, coherence of information)
- Administration (responsible persons)
- Selection (keywords, search facilities)
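As a rough illustration of how these four content groups might sit together in one repository record, the following Python sketch uses a plain data class. All field names, the `search` helper and the sample entry are hypothetical and do not describe any of the repositories named in the talk.

```python
from dataclasses import dataclass, field


@dataclass
class RepositoryEntry:
    """One documented object, with the four content groups as fields."""
    name: str
    description: str            # Descriptions (dictionary text)
    coverage: str               # Semantics: what the item covers
    classifications: list[str]  # Semantics: standard classifications used
    responsible: str            # Administration: responsible person
    keywords: set[str] = field(default_factory=set)  # Selection


def search(entries: list[RepositoryEntry], keyword: str) -> list[RepositoryEntry]:
    """Return the entries tagged with the keyword, ignoring case."""
    wanted = keyword.lower()
    return [e for e in entries if wanted in {k.lower() for k in e.keywords}]


# Hypothetical sample entry.
entries = [
    RepositoryEntry(
        name="turnover",
        description="Total invoiced sales during the reference period.",
        coverage="Enterprises in the business register",
        classifications=["activity classification (hypothetical)"],
        responsible="Unit for business statistics",
        keywords={"Business", "Sales"},
    )
]
```

A call such as `search(entries, "sales")` then returns the single sample entry, while an unknown keyword returns an empty list.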

Slide 27: IT Structures for Meta-data
Meta-data repositories
Example of the template oriented approach: StatBase
- Supports access to meta-data as well as data and reports
- Meets quite well the requirements of the OECD data template
- No direct connection between data and meta-data

Slide 28: IT Structures for Meta-data
Meta-data repositories
Example of the warehouse oriented approach: StatLine (CBS)
- Based on data access from multidimensional tables (cubes)
- Accompanying meta-information is only in Dutch
- Extraction of special meta-data items is not as easy as in StatBase

Slide 29: IT Structures for Meta-data
Meta-data access and exchange
Ongoing work in access and exchange:
- New standards for access and exchange
- Accessing distributed sources
- Combination of information

Slide 30: IT Structures for Meta-data
Meta-data access and exchange
Current trends in standardization:
- Traditional standards for data and meta-data exchange, like GESMES or CLASET, will probably switch to an XML platform
- New standards from the Object Management Group (OMG)

Slide 31: IT Structures for Meta-data
Meta-data access and exchange
Example: MOF (Meta Object Facility)
- Extensible framework for meta-data model definition
- Programming interface for storage and access of meta-data
- Integration facilities across domains
But note: this is a general approach for warehouses, not necessarily tied to statistics.

Slide 32: IT Structures for Meta-data
Meta-data access and exchange
Example of accessing and processing distributed sources: ADDSIA
- Accessing and processing distributed sources for analysis purposes
- Minimum requirements for standardisation in advance
- Orientation towards statistical problems

Slide 33: Processing Meta-data
Goal:
- Data and meta-data are processed together

Slide 34: Processing Meta-data
Advantages:
- Reduction of documentation effort
- More consistency in meta-data
Requirements:
- Software tools supporting this view
- Operational models for meta-data

Slide 35: Processing Meta-data
Up to now there are only prototypes, with emphasis on different aspects of processing:
- The planning approach
- The throughput approach
- The transformation approach

Slide 36: Processing Meta-data
The planning approach
- Develop software tools (workbench) for setting up meta-data documentation
BRIDGE/IMIM:
- A desktop for planning surveys and statistical production
- Meta-data generated in the planning phase are managed by the system
- No data are processed

Slide 37: Processing Meta-data
The planning approach
- Improvement and adaptation of meta-data models for new tasks like quality and use of administrative sources
SIDI (Statistics Italy):
- Integration of quality in the statistical production process
- Standardization of the production process

Slide 38: Processing Meta-data
The throughput approach
Use as much meta-data as possible from OldMeta-data to obtain NewMeta-data.
CBS (ongoing work):
- Use BLAISE meta-data as input
- Produce StatLine meta-data as output
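The idea of carrying as much as possible from old meta-data into new meta-data can be sketched in a few lines of Python. The field names below are invented for illustration only; they are not the actual BLAISE or StatLine meta-data formats, and the CBS work is not described in enough detail here to reproduce it.

```python
# Hypothetical mapping: old field name -> new field name.
FIELD_MAP = {
    "field_name": "variable",
    "question_text": "description",
    "answer_codes": "categories",
}


def carry_over(old_meta: dict, field_map: dict = FIELD_MAP) -> tuple[dict, list[str]]:
    """Build new meta-data from old meta-data.

    Returns the new record plus the names of target fields that could not
    be filled and therefore still need manual documentation.
    """
    new_meta = {new: old_meta[old] for old, new in field_map.items() if old in old_meta}
    missing = sorted(new for old, new in field_map.items() if old not in old_meta)
    return new_meta, missing


# Hypothetical old record that lacks answer codes.
old_record = {"field_name": "AGE", "question_text": "What is your age?"}
new_record, still_needed = carry_over(old_record)
```

Reporting the unfilled fields makes explicit how much documentation effort remains after the automatic carry-over.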

Slide 39: Processing Meta-data
The transformation approach
Define meta-data algorithms for all types of data algorithms:
- Throughput meta-data
- Modified meta-data
- New meta-data
- Meta-data summarization
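A minimal Python sketch of one data algorithm paired with its meta-data algorithm follows. The operation (a sum by group), the meta-data field names and the sample rows are assumptions made for illustration; the interpretation of "summarization" as a compact summary of the input is also an assumption, not the definition used in the projects named in the talk.

```python
from collections import defaultdict


def aggregate(rows: list[dict], meta: dict, by: str, measure: str) -> tuple[list[dict], dict]:
    """Sum `measure` within each value of `by` and derive the matching meta-data."""
    # Data algorithm: group totals.
    totals = defaultdict(float)
    for row in rows:
        totals[row[by]] += row[measure]
    new_rows = [{by: key, measure: total} for key, total in sorted(totals.items())]

    # Meta-data algorithm, run alongside the data algorithm.
    new_meta = {
        "source": meta["source"],                   # throughput: copied unchanged
        "unit": meta["unit"],                       # throughput: copied unchanged
        "variables": [by, measure],                 # modified: dropped variables removed
        "derivation": f"sum of {measure} by {by}",  # new: created by this step
        "input_records": len(rows),                 # summarization of the input
    }
    return new_rows, new_meta


# Hypothetical input data and meta-data.
rows = [
    {"region": "North", "sector": "a", "turnover": 10.0},
    {"region": "North", "sector": "b", "turnover": 5.0},
    {"region": "South", "sector": "a", "turnover": 7.0},
]
meta = {
    "source": "business survey (hypothetical)",
    "unit": "thousand EUR",
    "variables": ["region", "sector", "turnover"],
}
result_rows, result_meta = aggregate(rows, meta, by="region", measure="turnover")
```

Because the meta-data are produced by the same function call as the data, the documentation cannot drift away from the processing step it describes.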

Slide 40: Processing Meta-data
The transformation approach
IDARESA project:
- Meta-data algorithms for elementary database operations
ISMIS:
- Identification of added value in meta-data (new meta-data)
- Pursuit of the production process inside EUROSTAT

Slide 41: Processing Meta-data
The transformation approach

Slide 42: Conclusions
Is there progress in meta-data research and development?
Yes, but rather slow, because:
- There is a lack of co-ordination in research (probably improved by a forthcoming meta-data working group)
- There is an information gap between meta-data research groups and NSIs
- NSIs seem to prefer their own solutions