Evolution Management for Preservation PRELIDA Consolidation Workshop 17.10.2014 Giorgos Flouris (FORTH)

Slides:



Advertisements
Similar presentations
EBankII Workshop 1 Making Scientific Data Openly Available Simon Coles School of Chemistry, University of Southampton.
Advertisements

Midterm Workshop, Catania, April 2014 D3.1 State of the art assessment on Linked Data and Digital Preservation René van Horik, Data Archiving & Networked.
Ilias Tachmazidis 1,2, Grigoris Antoniou 1,2,3, Giorgos Flouris 2, Spyros Kotoulas 4 1 University of Crete 2 Foundation for Research and Technology, Hellas.
1 S. Tallam, R. Gupta, and X. Zhang PACT 2005 Extended Whole Program Paths Sriraman Tallam Rajiv Gupta Xiangyu Zhang University of Arizona.
W3C Invited Talk 16/09/2009Giorgos Flouris1 High-Level Change Detection in the Semantic Web Institute of Computer Science Foundation for Research and Technology.
Xyleme A Dynamic Warehouse for XML Data of the Web.
Sharing Knowledge in Adaptive Learning Systems Miloš Kravčík Dragan Gašević Fraunhofer FIT, GermanySimon Fraser University, Canada
6/17/20151 Table Structure Understanding by Sibling Page Comparison Cui Tao Data Extraction Group Department of Computer Science Brigham Young University.
An almost linear fully dynamic reachability algorithm.
Generating Application Ontologies from Reference Ontologies Marianne Shaw Todd Detwiler Jim Brinkley Dan Suciu University of Washington.
Liam Roditty Reachability in Directed Graphs. Connectivity in undirected graphs Given two vertices decide whether they are in the same component. Reachability.
FREMA: e-Learning Framework Reference Model for Assessment Design Patterns for Wrapping Similar Legacy Systems with Common Service Interfaces Yvonne Howard.
Week 2 Lecture 2 Structure of a database. External Schema Conceptual Schema Internal Schema Physical Schema.
Advanced Metering Infrastructure
By: Shawn Li. OUTLINE XML Definition HTML vs. XML Advantage of XML Facts Utilization SAX Definition DOM Definition History Comparison between SAX and.
TAPP-09 23/02/2009Giorgos Flouris1 On Explicit Provenance Management in RDF/S Graphs Institute of Computer Science Foundation for Research and Technology.
OMAP: An Implemented Framework for Automatically Aligning OWL Ontologies SWAP, December, 2005 Raphaël Troncy, Umberto Straccia ISTI-CNR
Artificial Chemistries Autonomic Computer Systems University of Basel Yvonne Mathis.
Robert Sharpe, Tessella PRELIDA Workshop 2013 ENSURE Linked Data Registry.
1/151/15 ENT Metamodel Implementation & Applications ENT metamodel, prototype implementation Component substitutability checking, ENT based component comparison.
 Copyright 2005 Digital Enterprise Research Institute. All rights reserved. Towards Translating between XML and WSML based on mappings between.
Archival Integration with Neo4j Mike Bryant Centre for e-Research King’s College London.
A knowledge-based Assistant for real-time Planning and Execution of PSS Engineering Change Processes Michael Abramovici, Youssef Aidi IT in Mechanical.
University of Maryland Bug Driven Bug Finding Chadd Williams.
FORTH-ICS, With some help from: Irini Fundulaki, Vassilis Papakonstantinou Linked Open Data Giorgos Flouris 20/03/14.
TELEFÓNICA I+D Date: 25th October 2007 Sergio Garcí á Gómez © 2007 Telefónica Investigación y Desarrollo, S.A. Unipersonal SPIDERS Semantic.
Open Data Protocol * Han Wang 11/30/2012 *
IDB, SNU Dong-Hyuk Im Efficient Computing Deltas between RDF Models using RDFS Entailment Rules (working title)
B REAKOUT S ESSION - B IG B ENCH - 3rd Workshop on Big Data Benchmarking July Xi‘an, China.
Samad Paydar Web Technology Lab. Ferdowsi University of Mashhad 10 th August 2011.
Knowledge Modeling, use of information sources in the study of domains and inter-domain relationships - A Learning Paradigm by Sanjeev Thacker.
Pavan Reddiavri (Ebiquity Labs) “R ♫ P” RDF Access control Policies.
Linking Social, Open, and Enterprise Data Tope Omitola, J. Davies, A. Duke, H. Glaser, N. Shadbolt.
STASIS Technical Innovations - Simplifying e-Business Collaboration by providing a Semantic Mapping Platform - Dr. Sven Abels - TIE -
Antoine Isaac 1 st PRELIDA Workshop Pisa, June 26, 2013.
Component 4: Introduction to Information and Computer Science Unit 6a Databases and SQL.
Efficient RDF Storage and Retrieval in Jena2 Written by: Kevin Wilkinson, Craig Sayers, Harumi Kuno, Dave Reynolds Presented by: Umer Fareed 파리드.
Chapter 4: SQL Complex Queries Complex Queries Views Views Modification of the Database Modification of the Database Joined Relations Joined Relations.
A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud Ali Hasnain et. al Insight Center for Data Analytics National University of Ireland,
Web Information Systems Modeling Luxembourg, June VisAVis: An Approach to an Intermediate Layer between Ontologies and Relational Database Contents.
PHS / Department of General Practice Royal College of Surgeons in Ireland Coláiste Ríoga na Máinleá in Éirinn Knowledge representation in TRANSFoRm AMIA.
Ontology engineering Lab #8 – October 20, 2014.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Shridhar Bhalerao CMSC 601 Finding Implicit Relations in the Semantic Web.
Logics for Data and Knowledge Representation Web Ontology Language (OWL) -- Exercises Feroz Farazi.
Managing Enterprise GIS Geodatabases
Understanding Data Intensive Systems Using Dynamic Analysis and Visualization Nesrine NOUGHI.
Interface for Glyco Vault Functionality and requirements. Initial proposal. Maciej Janik.
Final project Morocco Holiday Center Group number 6 Members: Alexanders Weiss Andrei Agape Darius Micu Gabriel Razvan.
SICoP Presentation A story about communication Michael Lang BEARevelytix April 25, 2007.
Synchronise work on DEXs and reference data between PLCS pilots and OASIS/PLCS Workshop #3 10 – 11 November 2004.
1 A Medical Information Management System Using the Semantic Web Technology Networked Computing and Advanced INFORMATION MANAGEMENT, NCM '08. Fourth.
Prizms for Data Publication and Management Katie Chastain May 9, 2014.
1 Copyright © 2009, Oracle. All rights reserved. Oracle Business Intelligence Enterprise Edition: Overview.
SALUS Semantic Middleware SALUS Advisory Board Meeting - January 17, 2013.
WonderWeb. Ontology Infrastructure for the Semantic Web. IST WP4: Ontology Engineering Heiner Stuckenschmidt, Michel Klein Vrije Universiteit.
Linked Open Data Dataset from Related Documents Petya Osenova and Kiril Simov IICT-BAS LDL-2016, LREC, Portoroz.
Preservation Through Evolution Management: The DIACHRON Approach DIACHRON Final Dissemination Workshop Giorgos Flouris (FORTH)
2nd International Workshop on Preservation of Evolving Big Data (DIACHRON 2016) 15 March 2015, Co-located with EDBT 2016, Bordeaux, France
Upgrade from 2013 to SDL Web 8 Road Map for Up-gradation.
EvoGen: a Generator for Synthetic Versioned RDF Marios Meimaris Institute for the Management of Information Systems Research Center “Athena”
1 RDF Storage and Retrieval Systems Jan Pettersen Nytun, UiA.
Visualization for Ontology Evolution
SQL Relational Database Project
Add and subtract complex numbers
UMBC AN HONORS UNIVERSITY IN MARYLAND
An ontology for e-Research
Schema Used in Examples
BPaaS Evaluation Environment Research Prototype
BPaaS Evaluation Research Prototype
Presentation transcript:

Evolution Management for Preservation PRELIDA Consolidation Workshop Giorgos Flouris (FORTH)

Evolution Management Problem Preservation ↔ Evolution

Change Detection Change detection for evolution management – Identifying changes between versions Challenges (in DIACHRON) 1.Diverse data models 2.Dynamic datasets 3.Recoverable versions 4.Changes as first-class citizens 5.Cross-snapshot queries

Evolution in DIACHRON Pilot datasetDIACHRON Version 1 Pilot datasetDIACHRON Version 2

Change Types: Motivation What a naïve diff will report Add (Rec, diachron:subject, EFO_001927) Add (Rec, diachron:hasRecordAttribute, rAtt1) Add (rAtt1, diachron:predicate, rdfs:subClassOf) Add (rAtt1, diachron:object, ObsoleteClass) What the pilot expects Add_SuperClass (EFO_001927, ObsoleteClass)

Change Hierarchy: Low-level (1/3) Low-level changes – DIACHRON model, for internal use – Fixed: Add, Delete – Just additions and deletions of triples – Simple set difference

Change Hierarchy: Simple (2/3) Pilot terminology: – Add_SuperClass Add_Dimension Fixed, pre-defined Comprising of low-level changes Partitioning is perfect – Complete and unambiguous

Change Hierarchy: Complex (3/3) Pilot terminology: – Add_Synonym, Mark_As_Obsolete Totally custom, pilot-specific (defined at run-time)

Using Changes for Evolution Management DIACHRON data model contains all versions Detection based on SPARQL queries – Provided at deployment time (for simple) – Generated at creation time (for complex) Recoverability – Allows moving back and forth between versions

Representation Requirements Interesting queries – Return the simple changes that dataset X underwent between versions V1 and V2 – Return the changes that resource X underwent in the first semester of 2014 – Give me all resources of type X that underwent change Y – Return all countries for which the unemployment rate of their capital city increased at a rate higher than the average increase of the country as a whole, between versions V1 and V2 Access to both the changes and the data is required – Changes are first-class citizens – Allowing preservation

Data Changes Ontology C1C1 Add_SuperClass V1V1 V2V2 asc_p1 asc_p2 Simple_Change Change prov:Activity Data level Schema level EFO_ ObsoleteClass old_version new_version diachron:Entity Add_Synonym Complex_Change … … … …

Conclusion Main DIACHRON message – (Linked) data preservation is related to evolution management DIACHRON challenges 1.Diverse data models 2.Dynamic datasets 3.Recoverable versions 4.Changes as first-class citizens 5.Cross-snapshot queries Solutions – DIACHRON data model (#1) – Appropriate change definition and detection (#2, #3) – Changes and data represented at the same level (#4, #5)