Relational Databases to RDF (a.k.a. RDB2RDF)
Juan F. Sequeda
Dept. of Computer Science, University of Texas at Austin


I want RDF… but my data is in RDB!

Why RDB2RDF? Semantic Web
–Deep Web is 500 times bigger than Static Web (2008)
–Where do you think that the majority of the data is stored?
–If we want a Semantic Web, we need data to be on the web as RDF and interlinked! Where do you think this data is going to come from?

RDB

RDB2RDF

Why RDB2RDF? Data Integration
–Do you know why RDF is cool? Because it's a graph!
–How do you link/integrate two different graphs? Add edges between nodes or merge nodes!

Real world scenario
Boss: Find me the clients based in cities with a population of less than 1 million.
You: ???
Clients
id  Name      c_id
10  ACME Inc  20
11  Foo Bars  21
Locations
c_id  city    state
20    Austin  TX
21    Dallas  TX

Real world scenario
You: I found the population information… but it's in a different database. Can you add a column to the Location table in order to insert the new data?
DBA: NO!
Location
id  city    state  pop
1   Austin  TX
    Dallas  TX
Clients
id  Name      c_id
10  ACME Inc  20
11  Foo Bars  21
Locations
c_id  city    state
20    Austin  TX
21    Dallas  TX

[Figure: the Location, Clients and Locations tables drawn as two separate RDF graphs, so Austin and Dallas each appear twice. Node labels: client nodes (one label truncated to "ient10"), ACME Inc, Foo Bars, Austin, Dallas, TX. Edge labels: rdf:type (to ex:Client), ex:name, ex:basedIn, ex:city, ex:state, ex:pop.]

[Figure: the same tables with the graphs joined into a single graph, so Austin and Dallas each appear once. Node labels: client nodes ("ient10", client11), ACME Inc, Foo Bars, Austin, Dallas, TX. Edge labels: rdf:type (to ex:Client), ex:name, ex:basedIn, ex:city, ex:state, ex:pop.]
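The integration step in this scenario can be sketched in a few lines of Python. This is a toy illustration, not a real RDF library: each database is a set of (subject, predicate, object) tuples, and merging is a set union that works because both sides use the same identifier for a city. The identifiers `ex:client10`, `ex:Austin` and so on are assumed names, and the population figures are placeholders chosen only to make the query return something, since the slides do not preserve the real values.

```python
# Toy illustration: a graph is a set of (subject, predicate, object) tuples.

# Triples from the Clients/Locations database (names and cities from the slides;
# the ex:... identifiers are assumptions).
clients_graph = {
    ("ex:client10", "rdf:type", "ex:Client"),
    ("ex:client10", "ex:name", "ACME Inc"),
    ("ex:client10", "ex:basedIn", "ex:Austin"),
    ("ex:client11", "rdf:type", "ex:Client"),
    ("ex:client11", "ex:name", "Foo Bars"),
    ("ex:client11", "ex:basedIn", "ex:Dallas"),
    ("ex:Austin", "ex:state", "TX"),
    ("ex:Dallas", "ex:state", "TX"),
}

# Triples from the second database. PLACEHOLDER populations, not real figures.
population_graph = {
    ("ex:Austin", "ex:pop", 900_000),
    ("ex:Dallas", "ex:pop", 1_200_000),
}

# Merging two graphs: the shared city nodes line up because they use the same identifier.
merged = clients_graph | population_graph


def clients_in_small_cities(graph, limit):
    """Return the names of clients based in cities with a population below `limit`."""
    small_cities = {s for (s, p, o) in graph if p == "ex:pop" and o < limit}
    clients = {s for (s, p, o) in graph if p == "ex:basedIn" and o in small_cities}
    return sorted(o for (s, p, o) in graph if p == "ex:name" and s in clients)


answer = clients_in_small_cities(merged, 1_000_000)
print(answer)
```

No table had to be altered to answer the boss's question: the new facts were simply added as extra edges on nodes that already existed.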

A bit of history
Relational Databases on the Web. TimBL, 1998
W3C Workshop on RDF Access to Relational Databases, October 2007
–Report: W3C RDB2RDF Incubator Group,
–Survey: Report.pdf
W3C RDB2RDF Working Group, 2009 – today
–R2RML: RDB to RDF Mapping Language
–A Direct Mapping of Relational Data to RDF

RDB and the Semantic Web
RDF, RDFS, OWL, RIF

RDB and the Semantic Web
RELATIONAL MODEL, TABLE DEFINITION, CONSTRAINTS, TRIGGERS

RDB and the Semantic Web
RELATIONAL MODEL, TABLE DEFINITION, CONSTRAINTS, TRIGGERS
RDF, RDFS, OWL, RIF

Overview

R2RML: RDB to RDF Mapping Language
Language for expressing customized mappings from relational databases to RDF datasets
Give precise control to the developer
–You create the structure you want
–You choose the target vocabulary
No RDFS/OWL is created from the schema
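To make "you create the structure you want, you choose the target vocabulary" concrete, here is a minimal Python sketch of a hand-written mapping. It is not R2RML syntax: the dictionary is a hypothetical, simplified stand-in for a single mapping over the Clients table, and the `http://example.com/...` IRIs are assumptions. The choice of `foaf:name` as the target predicate is only an example of picking an existing vocabulary.

```python
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"

# Hypothetical, simplified stand-in for a hand-written mapping (NOT R2RML syntax).
CLIENT_MAPPING = {
    "table": "Clients",
    # The developer decides what subject IRIs look like.
    "subject_template": "http://example.com/client/{id}",
    # The developer decides which class the rows belong to.
    "class": "http://example.com/ns#Client",
    # The developer decides which columns are exposed, and under which predicate.
    "predicate_object_maps": [
        ("http://xmlns.com/foaf/0.1/name", "Name"),
    ],
}


def apply_mapping(mapping, rows):
    """Generate triples for `rows` (dicts keyed by column name) according to `mapping`."""
    triples = []
    for row in rows:
        subject = mapping["subject_template"].format(**row)
        triples.append((subject, RDF_TYPE, mapping["class"]))
        for predicate, column in mapping["predicate_object_maps"]:
            value = row[column]
            if value is not None:  # a NULL column yields no triple
                triples.append((subject, predicate, value))
    return triples


rows = [
    {"id": 10, "Name": "ACME Inc", "c_id": 20},
    {"id": 11, "Name": "Foo Bars", "c_id": 21},
]
triples = apply_mapping(CLIENT_MAPPING, rows)
for triple in triples:
    print(triple)
```

Only what the mapping names ends up in the output: the `c_id` column is not mapped, so it produces no triples, and nothing about the schema itself is emitted.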

[Diagram: R2RML Mapping. RDB to RDF through a manually written R2RML file.]

Direct Mapping
Automatic transformation from Relational Database to RDF
–Click a button… Voilà!
Generate RDFS/OWL of the database schema
If this doesn't get you where you want… use existing languages for mapping
–RDF to RDF with RIF or SPARQL CONSTRUCT (Semantic Web community)
–Create SQL Views and directly map those (Database community)
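The "create SQL views and directly map those" route can be sketched with Python's built-in `sqlite3` module. This is an illustration under assumptions, not the W3C algorithm: the base IRI is made up, the IRI shapes (`Table/key=value` for rows, `Table#column` for predicates) only loosely follow the style of the Direct Mapping, and the view name `ClientCity` is hypothetical. The point is that the same automatic function works on a base table and on a view that reshapes the data in SQL first.

```python
import sqlite3

BASE = "http://example.com/db/"  # hypothetical base IRI


def direct_map(conn, relation, key):
    """Turn every row of a table or view into triples without a hand-written mapping."""
    cursor = conn.execute(f'SELECT * FROM "{relation}"')
    columns = [description[0] for description in cursor.description]
    triples = []
    for values in cursor:
        row = dict(zip(columns, values))
        subject = f"{BASE}{relation}/{key}={row[key]}"
        triples.append((subject, "rdf:type", f"{BASE}{relation}"))
        for column, value in row.items():
            if value is not None:  # skip NULLs
                triples.append((subject, f"{BASE}{relation}#{column}", value))
    return triples


conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE Clients (id INTEGER PRIMARY KEY, Name TEXT, c_id INTEGER);
    CREATE TABLE Locations (c_id INTEGER PRIMARY KEY, city TEXT, state TEXT);
    INSERT INTO Clients VALUES (10, 'ACME Inc', 20), (11, 'Foo Bars', 21);
    INSERT INTO Locations VALUES (20, 'Austin', 'TX'), (21, 'Dallas', 'TX');
    -- The view does the reshaping in SQL, before any mapping happens.
    CREATE VIEW ClientCity AS
        SELECT Clients.id AS id, Clients.Name AS name, Locations.city AS city
        FROM Clients JOIN Locations ON Clients.c_id = Locations.c_id;
""")

table_triples = direct_map(conn, "Clients", "id")
view_triples = direct_map(conn, "ClientCity", "id")
for triple in view_triples:
    print(triple)
```

The alternative named on the slide, rewriting the automatically generated RDF with RIF or SPARQL CONSTRUCT, moves the same reshaping step to the RDF side instead.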

[Diagram: Direct Mapping. Labels: RDB, Direct Mapping (automatic), RDF, RIF/SPARQL Construct, RDF; and RDB, SQL Views, Direct Mapping, RDF.]

Hybrid
Instead of starting from a blank R2RML file…
1) Direct Mapping
2) Manual Editing

[Diagram: Hybrid Mapping. Labels: RDB, Direct Mapping, Direct Mapping in R2RML, Modify, R2RML, RDF.]

Materialize Triples
Data is not dynamic
Dump RDB into RDF and then insert into triplestore
RDF dump may not be consistent with RDB
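The consistency caveat is easy to reproduce in a small sketch. Below, an in-memory SQLite table stands in for the RDB and a plain Python set stands in for the triplestore; the `ex:` identifiers and the renamed value `ACME Corp` are assumptions made for the illustration.

```python
import sqlite3


def dump_triples(conn):
    """One-off export of the Clients table as (subject, predicate, object) triples."""
    rows = conn.execute("SELECT id, Name FROM Clients ORDER BY id")
    return {(f"ex:client{client_id}", "ex:name", name) for client_id, name in rows}


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Clients (id INTEGER PRIMARY KEY, Name TEXT)")
conn.execute("INSERT INTO Clients VALUES (10, 'ACME Inc'), (11, 'Foo Bars')")

# Materialize: dump once and load the result into the "triplestore" (here, a set).
triplestore = dump_triples(conn)

# The relational data changes after the dump (hypothetical update).
conn.execute("UPDATE Clients SET Name = 'ACME Corp' WHERE id = 10")

# The materialized copy still holds the old value until the dump is repeated.
is_stale = ("ex:client10", "ex:name", "ACME Inc") in triplestore
print("triplestore is stale:", is_stale)
```

This is why the slide restricts the approach to data that is not dynamic: every change in the RDB requires a new dump (or an incremental update) before SPARQL queries see it.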

[Diagram: Materialized Triples. Labels: RDB, Dump, RDF, SPARQL.]

Virtual Triples
Data is dynamic
Need to query RDB with SPARQL
Translate SPARQL to SQL
–"Comparing the overall performance […] of the fastest rewriter with the fastest relational database shows an overhead for query rewriting of 106%. This is an indicator that there is still room for improving the rewriting algorithms" [Bizer and Schultz 2009]
–"Current rdb2rdf systems are not capable of providing the query execution performance required [...] it is likely that with more work on query translation, suitable mechanisms for translating queries could be developed. These mechanisms should focus on exploiting the underlying database system's capabilities to optimize queries and process large quantities of structure data" [Gray et al. 2009]
–Ultrawrap solves this
RDF data is consistent with RDB data
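A very small sketch of the "translate SPARQL to SQL" idea, using Python's built-in `sqlite3`. It handles only a single triple pattern of the form `?s <predicate> ?o` (optionally with a fixed object), driven by a hypothetical predicate-to-column table. It is not how Ultrawrap or any of the cited systems work; it only shows why answers stay consistent with the RDB when no triples are stored.

```python
import sqlite3

# Hypothetical mapping: predicate -> (table, key column, value column).
PREDICATE_TO_SQL = {
    "ex:name": ("Clients", "id", "Name"),
    "ex:city": ("Locations", "c_id", "city"),
}


def translate(predicate, obj=None):
    """Translate the pattern `?s <predicate> ?o` (or `?s <predicate> obj`) into SQL."""
    table, key, column = PREDICATE_TO_SQL[predicate]
    sql = f"SELECT {key}, {column} FROM {table} WHERE {column} IS NOT NULL"
    params = ()
    if obj is not None:
        sql += f" AND {column} = ?"
        params = (obj,)
    sql += f" ORDER BY {key}"
    return sql, params


def query(conn, predicate, obj=None):
    """Answer the triple pattern by running the translated SQL on the live database."""
    sql, params = translate(predicate, obj)
    return conn.execute(sql, params).fetchall()


conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE Clients (id INTEGER PRIMARY KEY, Name TEXT, c_id INTEGER);
    CREATE TABLE Locations (c_id INTEGER PRIMARY KEY, city TEXT, state TEXT);
    INSERT INTO Clients VALUES (10, 'ACME Inc', 20), (11, 'Foo Bars', 21);
    INSERT INTO Locations VALUES (20, 'Austin', 'TX'), (21, 'Dallas', 'TX');
""")

before = query(conn, "ex:name")
conn.execute("UPDATE Clients SET Name = 'ACME Corp' WHERE id = 10")  # hypothetical update
after = query(conn, "ex:name")  # no re-dump needed: the query runs against current data
print(before, after)
```

Real SPARQL queries combine many patterns, which become joins in SQL; the quoted benchmarks are about how well the relational optimizer copes with the SQL that such translations produce.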

[Diagram: Virtual Triples. Labels: RDB, Mapping, RDF, SPARQL.]

RDB2RDF Space
[Diagram labels: Materialized Triples, Virtual Triples, Direct Mapping, Custom Mapping, Hybrid.]

Tuples to Triples
SID  NAME   AGE
1    Alice  25
2    Bob    26
SUBJECT  PREDICATE  OBJECT
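The slide's table of tuples can be turned into SUBJECT / PREDICATE / OBJECT rows with a short Python sketch. The table name `People` is hypothetical (the slide does not name the table), and the way subjects and predicates are spelled here is just one plausible convention: one subject per row, built from the key, and one predicate per column.

```python
TABLE = "People"  # hypothetical name: the slide does not name the table
COLUMNS = ["SID", "NAME", "AGE"]
ROWS = [(1, "Alice", 25), (2, "Bob", 26)]


def tuples_to_triples(table, columns, rows, key):
    """Each row becomes one subject; each column value becomes one triple."""
    key_index = columns.index(key)
    triples = []
    for row in rows:
        subject = f"{table}/{key}={row[key_index]}"
        for column, value in zip(columns, row):
            triples.append((subject, f"{table}#{column}", value))
    return triples


triples = tuples_to_triples(TABLE, COLUMNS, ROWS, key="SID")

print(f"{'SUBJECT':<14} {'PREDICATE':<13} OBJECT")
for subject, predicate, obj in triples:
    print(f"{subject:<14} {predicate:<13} {obj}")
```

A table with n columns therefore yields n triples per row (fewer if NULLs are skipped), all sharing the row's subject.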

Current Status of W3C RDB2RDF WG
R2RML: RDB to RDF Mapping Language
–Working Draft
A Direct Mapping of Relational Data to RDF
–Working Draft
Last Call: Sept 1 (hopefully)

Implementations
Ultrawrap
–SPARQL and semantically equivalent SQL have equal execution time
–Commercial databases
Spyder
–Oracle and HSQLDB
Other non-standard RDB2RDF
–D2R Server, Virtuoso, Triplify, …

Publicity
International Semantic Web Conference
–Oct 23 – 27 in Bonn, Germany
Posters and Demos
–August 15
Consuming Linked Data Workshop
–August 15
Outrageous Ideas Track
–Sept 5
Semantic Web Challenge
–Sept 30
2nd Linked Data-a-thon
–Oct
Join the Facebook group
SSSW2011

Thank you
Acknowledgments:
- UT Austin
- W3C RDB2RDF WG members
- David McNeil
- Revelytix