Italian National Institute of Statistics - Istat

Slides:



Advertisements
Similar presentations
A Stepwise Modeling Approach for Individual Media Semantics Annett Mitschick, Klaus Meißner TU Dresden, Department of Computer Science, Multimedia Technology.
Advertisements

Database Systems: Design, Implementation, and Management Tenth Edition
STANDARD ERRORS PRESENTATION AND DISEMINATION AT THE STATISTICAL OFFICE OF THE REPUBLIC OF SLOVENIA Rudi Seljak Statistical Office of the Republic of Slovenia.
Enhancing Data Quality of Distributive Trade Statistics Workshop for African countries on the Implementation of International Recommendations for Distributive.
Building and Analyzing Social Networks Web Data and Semantics in Social Network Applications Dr. Bhavani Thuraisingham February 15, 2013.
Chapter 6 Methodology Conceptual Databases Design Transparencies © Pearson Education Limited 1995, 2005.
1 Draft of a Matchmaking Service Chuang liu. 2 Matchmaking Service Matchmaking Service is a service to help service providers to advertising their service.
Lecture Fourteen Methodology - Conceptual Database Design
Modeling & Designing the Database
Foundations This chapter lays down the fundamental ideas and choices on which our approach is based. First, it identifies the needs of architects in the.
FRE 2672 Urban Ontologies : the Towntology prototype towards case studies Chantal BERDIER (EDU), Catherine ROUSSEY (LIRIS)
WP.5 - DDI-SDMX Integration
Chapter 3 The Relational Model Transparencies Last Updated: Pebruari 2011 By M. Arief
WP.5 - DDI-SDMX Integration E.S.S. cross-cutting project on Information Models and Standards Marco Pellegrino, Denis Grofils Eurostat METIS Work Session6-8.
GEM/IRDR Social Vulnerability and Resilience Information System and Metadata Portal IRDR Scientific Board Meeting Chengdu 03/11/2012.
ONTOLOGICAL MODEL OF THE KNOWLEDGE IN FOLKLORE DIGITAL LIBRARY Desislava Paneva Institute of Mathematics and Informatics – Bulgarian Academy of Sciences.
1 Introduction to Database Systems. 2 Database and Database System / A database is a shared collection of logically related data designed to meet the.
Methodology - Conceptual Database Design Transparencies
Software School of Hunan University Database Systems Design Part III Section 5 Design Methodology.
Methodology Conceptual Databases Design
Vincenzo Del Vecchio Banca d’Italia Statistics Collection and Processing Department 2012 ESSnet Workshop – 4 December.
1 Chapter 15 Methodology Conceptual Databases Design Transparencies Last Updated: April 2011 By M. Arief
Nancy Lawler U.S. Department of Defense ISO/IEC Part 2: Classification Schemes Metadata Registries — Part 2: Classification Schemes The revision.
Entity Framework Overview. Entity Framework A set of technologies in ADO.NET that support the development of data-oriented software applications A component.
GSIM implementation in the Istat Metadata System: focus on structural metadata and on the joint use of GSIM and SDMX Mauro Scanu
Statistics Portugal/ Metadata Unit Monica Isfan « Joint UNECE/ EUROSTAT/ OECD Work Session on Statistical Metadata.
Methodology - Conceptual Database Design. 2 Design Methodology u Structured approach that uses procedures, techniques, tools, and documentation aids to.
Methodology: Conceptual Databases Design
Dimitrios Skoutas Alkis Simitsis
Methodology - Conceptual Database Design
Knowledge Representation of Statistic Domain For CBR Application Supervisor : Dr. Aslina Saad Dr. Mashitoh Hashim PM Dr. Nor Hasbiah Ubaidullah.
Environment Change Information Request Change Definition has subtype of Business Case based upon ConceptPopulation Gives context for Statistical Program.
44220: Database Design & Implementation Modelling the ‘Real’ World Ian Perry Room: C41C Ext.: 7287
EUROPEAN COMMISSION - EUROSTAT ESSnet on Consistency of Concepts and Methods of business-related Statistics 2010 project on statistical units n°
Part4 Methodology of Database Design Chapter 07- Overview of Conceptual Database Design Lu Wei College of Software and Microelectronics Northwestern Polytechnical.
Database Environment Chapter 2. Data Independence Sometimes the way data are physically organized depends on the requirements of the application. Result:
Metadata Common Vocabulary a journey from a glossary to an ontology of statistical metadata, and back Sérgio Bacelar
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Extending the MDR for Semantic Web November 20, 2008 SC32/WG32 Interim Meeting Vilamoura, Portugal - Procedure for the Specification of Web Ontology -
Metadata Framework for a Statistical Data Warehouse
1 Chapter 2 Database Environment Pearson Education © 2009.
OWL Web Ontology Language Summary IHan HSIAO (Sharon)
Department of Mathematics Computer and Information Science1 CS 351: Database Management Systems Christopher I. G. Lanclos Chapter 4.
ROMA 23 GIUGNO 2016 MODERNISATION LAB - FOCUSSING ON MODERNISATION STRATEGIES IN EUROPE: SOME NSIS’ EXPERIENCES Insert the presentation title Modernisation.
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
Supporting the use of administrative data in official statistics.
Highlighting the added value of Statistical Linked Open Data
DATA MODELS.
Databases and Database Management Systems Chapter 9
Web Ontology Language for Service (OWL-S)
The Generic Statistical Information Model (GSIM) and the Sistema Unitario dei Metadati (SUM): state of application of the standard Cecilia Casagrande –
Advanced Database Models
Chapter 2 Database Environment Pearson Education © 2009.
Analyzing and Securing Social Networks
Chapter 2 Database Environment.
Entity Relationship Diagrams
OBJECT-ORIENTED APPROACH TO OFFICIAL STATISTICS
Chapter 4 Entity Relationship (ER) Modeling
Database Environment Transparencies
Monica Scannapieco Division "Information and Application Architecture“
2. An overview of SDMX (What is SDMX? Part I)
2. An overview of SDMX (What is SDMX? Part I)
Data Model.
Metadata Framework as the basis for Metadata-driven Architecture
Resolution concerning statistics of Work, Employment & labour underutilization
Concepts of industry, occupation and status in employment - Overview
Functional geographies through the package LabourMarketAreas
The ArchiMate modelling structures
Presentation transcript:

Italian National Institute of Statistics - Istat The Italian Integrated System of Statistical Registers On the Design of an Ontology-based Data Integration Architecture R. Radini (radini@istat.it), M. Scannapieco (scannapi@istat.it) , G.Garofalo (garofalo@istat.it) Italian National Institute of Statistics - Istat Monica Scannapieco – Brussels, NTTS, 14-16 March 2017

Outline Introduction to ISSR OBDM and examples Data architecture Correspondence with EARF DV vs DW Conclusions Monica Scannapieco – Brussels, NTTS, 14-16 March 2017

ISSR – Italian Integrated System of Statistical Registries Istat engaged a modernization programme aimed at a significant revision of the statistical production One of the main pillars of this revision is the design of production processes based on an Integrated System of Statistical Registers Single logical environment to support the consistency of statistical production processes in Istat, in particular consistency in “identification” and “estimation” for the whole integrated system of units and variables Monica Scannapieco – Brussels, NTTS, 14-16 March 2017

ISSR: Types of Registers RSE (Extended registers) extends the information of a specific RSB on a specific RSB’s population RST (Thematic registers) supports more statistical processes through a consistent and shared treatment on some topics RSB (Base registers) contains several statistical populations and the minimum set of variables useful to characterize stat units Monica Scannapieco – Brussels, NTTS, 14-16 March 2017

OBDM Ontology Based Data Management System Ontology (or computational ontology): conceptual data representation expressed through «computational» languages In mathematical logic: assiomatic first order theory expressable in description logic OBDM is an integration system where the usual ER global schema is replaced by the conceptual model of the application domain formulated as an ontology Monica Scannapieco – Brussels, NTTS, 14-16 March 2017

OBDM Architecture Main features Data source transparency property (called data virtualization by IT platform) Global view Consistency Ontology Mapping Data source 1 Data source 3 Data source 2 Three-level architecture: Ontology, Sources, Mapping Monica Scannapieco – Brussels, NTTS, 14-16 March 2017

Excerpt of the Ontology of the Working Relationships Employee Self-employee Worker Monica Scannapieco – Brussels, NTTS, 14-16 March 2017

Excerpt of the Population Ontology Family registry Common law family Family Individual Monica Scannapieco – Brussels, NTTS, 14-16 March 2017

Data Integration: same concept Individual (Population Ontology) Individual (Working relationships ontology) Monica Scannapieco – Brussels, NTTS, 14-16 March 2017

Querying over the ontology Query: We would like to query for people that have the residence in a certain region and classify them by age, educational degree and employment condition We don’t have to know how information are stored in the sources! Monica Scannapieco – Brussels, NTTS, 14-16 March 2017

by employment condition Query Ontology Mapping Mapping Query rewritten over the sources RS of Individuals RS of Labour people that have residence in a certain region classified by age and educational degree by employment condition Monica Scannapieco – Brussels, NTTS, 14-16 March 2017

High expressive power It is possible to give different definition of a concept dependending on the istance It is possible to express different constraints related to each definition CorporationManager- Labour Force Employee Self-employee Corporation Manager NationalAccount CorporationManager has a different semantics according to the domain Monica Scannapieco – Brussels, NTTS, 14-16 March 2017

Data architecture Compliance to EARF (Enterprise Architecture Reference Framework) Metadata Management Primary Data Storage Quality Assessment Unitary Metadata System Logical centralization of ISSR Data consistency ODBM Monica Scannapieco – Brussels, NTTS, 14-16 March 2017

Data architecture: IT View Features DV DW Storage of Historical Data NO YES Capture Every Change in Production Data (requires integration with CDC) Multi-Dimensional Data Structures Data Pre-Aggregation Query performance on large amounts of data SLOW (relative to DW) FAST (relative to DV) Data Integration on Demand Operational Cost LOW HIGH Time-To-Market Easy to Make Changes Dependence on IT Monica Scannapieco – Brussels, NTTS, 14-16 March 2017

Conclusions EA approach for ISSR design and implementation ISSR Data Architecture: Hybrid solution with DV and DW E.g. DV-based data architecture with DW for historical data and dissemination Next steps: Prototypes of RSB Individual, Families and Cohabitations and RST Working Relationships Guidelines for the Management of the Integrated System of Statistical Registers Monica Scannapieco – Brussels, NTTS, 14-16 March 2017