Metadata projects and tasks at Statistics Finland METIS 2010 Saija Ylönen

Slides:



Advertisements
Similar presentations
SDMX in the Vietnam Ministry of Planning and Investment - A Data Model to Manage Metadata and Data ETV2 Component 5 – Facilitating better decision-making.
Advertisements

Project management Project manager must;
Information and Business Work
Chapter 4 Database Management Systems. Chapter 4Slide 2 What is a Database Management System (DBMS)?  Database An organized collection of related data.
LEVERAGING THE ENTERPRISE INFORMATION ENVIRONMENT Louise Edmonds Senior Manager Information Management ACT Health.
Test Automation: An Architected Approach Dan Young March 17th, 2005
United Nations Economic Commission for Europe Statistical Division Applying the GSBPM to Business Register Management Steven Vale UNECE
Malaysian Grid for Learning October DC 2004, Shanghai, China. © 2004 MIMOS Berhad. All Rights Reserved Metadata Management System DC2004: International.
9 Feb 2004Mikko Mäkinen & Saija Ylönen Joint UNECE/Eurostat/OECD work session on statistical metadata (METIS) Geneva, 9-11 February 2004, Topic (ii): Metadata.
Teaching Metadata and Networked Information Organization & Retrieval The UNT SLIS Experience William E. Moen School of Library and Information Sciences.
Ihr Logo Data Explorer - A data profiling tool. Your Logo Agenda  Introduction  Existing System  Limitations of Existing System  Proposed Solution.
ABSTRACT Zirous Inc. is a growing company and they need a new way to track who their employees working on various different projects. To solve the issue.
The Data Attribution Abdul Saboor PhD Research Student Model Base Development and Software Quality Assurance Research Group Freie.
Metadata management and statistical business process at Statistics Estonia Work Session on Statistical Metadata (Geneva, Switzerland 8-10 May 2013) Kaja.
Using XML technologies to implement complex tables in short- term statistics Francesco Rizzo
Planning and Writing Your Documents Chapter 6. Start of the Project Start the project by knowing the software you will write about, but you should try.
1 XML as a preservation strategy Experiences with the DiVA document format Eva Müller, Uwe Klosa Electronic Publishing Centre Uppsala University Library,
WEB DESIGN SOLUTIONS. 2 Presentation by JAVANET SYSTEMS 1st Floor, ROFRA House, Suite 4, Kansanga, Gaba Road P.O Box 31586, Kampala, Uganda Tel: +256(0) ,
Development of metadata in the National Statistical Institute of Spain Work Session on Statistical Metadata Genève, 6-8 May-2013 Ana Isabel Sánchez-Luengo.
Publishing Metadata with Data - XML based dissemination process of statistical information (CoSSI) Harri Lehtinen
CASE STUDY: STATISTICS NORWAY (SSB) Jenny Linnerud and Anne Gro Hustoft Joint UNECE/Eurostat/OECD work session on statistical metadata (METIS) Luxembourg.
Statistics Sweden Results from operations in 2006: 146 publications 356 press releases commissions 3,7 million visitors at
IB ITGS Case Study. Introduction: Serving thousands of clients, it is method of environment-friendly green ticketing. User friendly system which minimizes.
Current and Future Applications of the Generic Statistical Business Process Model at Statistics Canada Laurie Reedman and Claude Julien May 5, 2010.
ISO 9001:2008 to ISO 9001:2015 Summary of Changes
CountrySTAT Regional Basic Administrator Training for ECO Member States Friday, October 23, 2015 EVENT Foundations of CountrySTAT E-learning.
Metadata driven application for data processing – from local toward global solution Rudi Seljak Statistical Office of the Republic of Slovenia.
Revision Project of the Business Register (BR) and Business Statistics in September 2013 Tuula Viitaharju.
South Africa Case Study Update Matile Malimabe Executive Manager: Standards Acting Executive Manager: Data Management & Technology.
Implementation Experiences METIS – April 2006 Russell Penlington & Lars Thygesen - OECD v 1.0.
Developing Statistical Information Systems and XML Information Technologies - Possibilities and Practicable Solutions Geneva,
6.1 © 2010 by Prentice Hall 6 Chapter Foundations of Business Intelligence: Databases and Information Management.
1 1 Developing a framework for standardisation High-Level Seminar on Streamlining Statistical production Zlatibor, Serbia 6-7 July 2011 Rune Gløersen IT.
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
Metadata Working Group Jean HELLER EUROSTAT Directorate A: Statistical Information System Unit A-3: Reference data bases.
Implementing the GSIM Statistical Classification model – the Finnish way Essi Kaukonen / Statistics Finland UNECE Workshop on International Collaboration.
Corporate Data Vault Data Warehousing Workshop Sept Data Warehousing Workshop Sept
STRATEGY FOR DEVELOPMENT OF ISIS AND IT STRATEGY IN THE NSI-BULGARIA Main principles, components, requirements.
MetaPlus Klas Blomqvist Statistics Sweden Research and Development – Central Methods
NEFIS (WP5) Evaluation Meeting, November 2004 Evaluation Metadata Aljoscha Requardt, University of Hamburg Response rate: 93% (14 of 15 partners.
1 Statistical business registers as a prerequisite for integrated economic statistics. By Olav Ljones Deputy Director General Statistics Norway
CONCEPTUAL MODELLING OF STATISTICAL METADATA AND METADATA DATA MODEL IN CoSSI Geneva, 3-4 April 2006 Heikki Rouhuvirta, Statistical.
Joint UNECE/Eurostat/OECD work session on statistical metadata (METIS) APRIL 2006Mar Blanco Frías STATISTICAL METADATA MODEL DEVELOPED IN SPAIN:CURRENT.
The business process models and quality issues at the Hungarian Central Statistical Office (HCSO) Mr. Csaba Ábry, HCSO, Methodological Department Geneva,
Jean-Yves Le Meur - CERN Geneva Switzerland - GL'99 Conference 1.
Manage your projects efficiently and on a high level PROJECT MANAGEMENT SYSTEM Enovatio Projects Efficient project management Creating project plans Increasing.
Internet Made Easy! Make sure all your information is always up to date and instantly available to all your clients.
Figure 9.8 User Evaluation Form
Algorithms II Software Development Life-Cycle.
30 September 2010 Sami Saarikivi
Prepared by: Galya STATEVA, Chief expert
Towards connecting geospatial information and statistical standards in statistical production: two cases from Statistics Finland Workshop on Integrating.
Streamlining the Statistical Production in TurkStat Metadata Studies in TURKSTAT High Level Seminar for Eastern Europe, Caucasus and Central Asia Countries.
Variables documentation system in Statistics Norway
Software Documentation
GSIM Implementation at Statistics Finland Session 1: ModernStats World - Where to begin with standards based modernisation? UNECE ModernStats World Workshop.
SDMX Information Model
C.U.SHAH COLLEGE OF ENG. & TECH.
Metadata in the modernization of statistical production at Statistics Canada Carmen Greenough June 2, 2014.
30 September 2010 Sami Saarikivi
National Update - Finland
Information and software architecture for statistical dissemination
Data validation in Statistical Office of the Republic of Serbia
Energy Statistics Compilers Manual
Prepared by Peter Boško, Luxembourg June 2012
GSBPM AND ISO AS QUALITY MANAGEMENT SYSTEM TOOLS: AZERBAIJAN EXPERIENCE Yusif Yusifov, Deputy Chairman of the State Statistical Committee of the Republic.
Work Session on Statistical Metadata (Geneva, Switzerland May 2013)
Module 1.1 Overview of Master Facility Lists in Nigeria
Palestinian Central Bureau of Statistics
Presentation transcript:

Metadata projects and tasks at Statistics Finland METIS 2010 Saija Ylönen

Organizational chart 11/03/20102Saija Ylönen

Co-operating parties of the metadata tasks: organizational units IT Management situated in the Secretariat of the Director General co-ordinates the general information architecture, of which metadata tasks form one element Classification and Metadata Services situated in the IT and Statistical Methods department operational unit active role in developing of metadata Dissemination Services situated in the IT and Statistical Methods department develops the metadata connected with the dissemination 11/03/20103Saija Ylönen

Metadata Co-ordination Group Originally a co-operation group for persons working with metadata issues in the support function departments of SF The objective at present is to intensify the co-operation between the statistics departments and the parties responsible for general metadata work Comprised of members working on metadata and permanent members from all statistics department Goal is to widen knowledge about metadata and metadata systems and to give an opportunity to the statistics departments to discuss their metadata needs with metadata specialists 11/03/20104Saija Ylönen

CoSSI Steering Group and CoSSI model Foundation for the metadata system Modular, xml-based model for describing statistical tables, classifications, concepts, variables, general information on statistical documents, and quality, etc. Expandable CoSSI Steering Group is in charge of mastering and developing the model according to user needs in a manner that will not expose its main structure to risk 11/03/20105Saija Ylönen

Definition of metadata 1) Statistical metadata variable and data descriptions classifications, concepts 2) Statistical data quality quality reports statistical method descriptions 3) Metadata of statistical documents or products producers publication information field or subject area 11/03/20106Saija Ylönen

Definition of metadata II 4) Process metadata a) technical metadata technical metadata guide the workflow of data production, makes it possible to follow data production and documents the working process. b) conceptual process metadata technical information of data and variables which are used in producing data. E.g. minimum or maximum values, various calculation rules or use of certain classification values 11/03/20107Saija Ylönen

Metadata systems at Statistics Finland 11/03/20108Saija Ylönen

Metadata systems: present situation We are in a transitional phase from relational databases to an xml-based environment Relational databases: classifications, concepts and definitions, archiving database Xml database eXist: publications, classifications, concepts, data descriptions 11/03/20109Saija Ylönen

Relational databases Built in the 1990’s Used in statistics production but not in all statistical processes or all statistics Classifications in the relational databases are used in SAS and Superstar Archiving database is in use in the archiving process Classifications and concepts are generated from the relational databases to the web pages 11/03/201010Saija Ylönen

XML database At the moment, the xml database is used mostly in the creation of publications with an Arbortext word processor Classifications and concepts are copied to the xml database from the relational databases and are ready to use Tools for utilising metadata objects from the xml database are being constructed The first metadata tool linked to the xml database is the variable editor 11/03/201011Saija Ylönen

Variable editor For creating and maintaining the descriptions of statistical data and variables At the testing phase Implementation begins in 2010 Descriptions are saved as xml documents conforming to the CoSSI model in the eXist/xml database 11/03/201012Saija Ylönen

Content and functions of the variable editor Data descriptions are comprised of a general description of the data, a list of variables and information about an individual variable General data description includes descriptive information on the entire data document Variable list interleaf allows management of the list of variables in the data description and selection of the variable whose description needs editing. 11/03/201013Saija Ylönen

11/03/201014Saija Ylönen Variable list interleaf

Variable metadata 11/03/201015Saija Ylönen Field nameDescription short nameShort identifying name of variable long nameName of variable in natural language concept definitionBasic conceptual description of variable operational definitionVerbal description of the formation of the variable deduction ruleE.g. programming instructions, mathematical formula, etc. classification IDIdentifier of classification. Refers to a classification in the classification database. unit of measureMeasurement unit of variable variable modifiedDate of creation or modification of variable (yyyy-mm-dd) start of validityStart date of validity of variable (yyyy-mm-dd) end of validityEnd date of validity of variable (yyyy-mm-dd) statusStage of editing of variable: draft, ready, validated variable groupName of group to which variable belongs. Makes working with long variable lists easier. work commentFree text field. Contains information only for the use of the maintainer of a description.

Results from the variable editor project the development of a consistent information architecture the construction of production applications in which metadata need not be separately produced or manually added to data when publishing or archiving statistics information service where excessive time need not be spent on searching for metadata, or on actual reproduction of metadata for special compilation assignments a system from which table column and row headings can in tabulation applications be retrieved in multiple languages for all statistics using the same methods. 11/03/201016Saija Ylönen In addition to actual variable editor application the project also created preconditions for:

Experiences gained during the variable editor project Various questions concerning standardisation had to be addressed in the project although they were not originally in the projects’ scope of task – they had to be done and they took a lot of time Because the variable editor project was the first leg in the revision of the metadata system it was subjected to a diversity of expectations Project was a good test run for the CoSSI model – the data content of the model proved to be exhaustive 11/03/201017Saija Ylönen

The planning and building of a classification editor Reasons for the renewing of the classification system: the present way of maintaining classifications has been viewed as inflexible by statistics renunciation of the Sybase relational databases ICT strategy: in the next few years the agency will introduce a common statistical metadata system based on the CoSSI model Classification editor project ) definition stage 2) construction stage 11/03/201018Saija Ylönen

Goals of the classification editor project Analyse the service needs required from a centralised classification system Create maintenance tools for classifications in connection with the CoSSI/eXist metadata store so that the basic maintenance needs of classifications of individual statistics are met in a user-oriented manner which also allows further development of the classification system Produce the solutions with which the interoperability of the Sybase classification database and the eXist metadatabase can be ensured Compile user instructions for the editor Pilot test the editor 11/03/201019Saija Ylönen

Benefits of the new classification system A classification system which serves well will encourage centralised and structured maintenance of classification The documentation of classifications will improve, making them easy to find for use in-house and for the provision of information service The new classification system will support smooth movement between data descriptions, variable descriptions and maintenance of classifications and thus improve the efficiency of the maintenance and use of classifications in statistics 11/03/201020Saija Ylönen

General benefits of the common classification system A centralised classification system eases the workload needed to maintain classifications because classifications are only maintained in one place Reduces the possibility of errors because classifications are documented in the system consistently so that they are accessible to everybody and easy to find Improves the efficiency of time use because working hours need not be spent on looking for classifications and trying to find their background information Makes the classifications used in different statistics visible to everybody and thus creates possibilities for their harmonisation 11/03/201021Saija Ylönen

In conclusion: Why do some statistics departments still have their own metadata systems instead of using the centralized system? Centralised metadata work progresses too slowly from the perspective of individual statistics – We should rethink our construction and implementation strategy Common attitude still regards the process of an individual set of statistics as unique, and therefore incapable of exploiting systems that are meant for all statistics – We have to get quick results to prove the benefits of the system Commitment by the Management and their support to the work is crucial – We have to convince them 11/03/201022Saija Ylönen

THANK YOU FOR YOUR ATTENTION! 11/03/201023Saija Ylönen