Download presentation
Presentation is loading. Please wait.
Published byMadeline Brown Modified over 9 years ago
1
Relational Databases to RDF (a.k.a RDB2RDF) Juan F. Sequeda Dept of Computer Science University of Texas at Austin
2
I want RDF… but my data is in RDB! 2
3
Why RDB2RDF? Semantic Web –Deep Web is 500 times bigger than Static Web (2008) –Where do you think that the majority of the data is stored? –If we want a Semantic Web, we need data to be on the web as RDF and interlinked! Where do you think this data is going to come from?
4
RDB
5
RDB2RDF
6
Why RDB2RDF? Data Integration –Do you know why RDF is cool? because it’s a graph! –How do link/integrate two different graphs? add edges between nodes or merge nodes!
7
Boss: Find me clients that are based in cities who have a population less than 1 million? You: ??? idNamec_id 10ACME Inc20 11Foo Bars21 c_idcitystate 20AustinTX 21DallasTX Clients Locations Real world scenario
8
You: I found the population information… but it’s in a different database. Can you add a column to the Location table in order to insert the new data? DBA: NO! idcitystatepop 1AustinTX790390 2DallasTX1197816 Location Real world scenario idNamec_id 10ACME Inc20 11Foo Bars21 c_idcitystate 20AustinTX 21DallasTX Clients Locations
9
idcitystatepop 1AustinTX790390 2DallasTX1197816 Location http://db1/cl ient10 http://db1/ client11 http://db1/loc20 http://db1/loc21 ACME Inc Foo Bars Austin TX Dallas TX 790390 1197816 ex:Client ex:basedIn ex:pop ex:state ex:city ex:state ex:name rdf:type http://db2/loc1 Austin TX ex:state ex:city http://db2/loc2 Dallas TX ex:state ex:city ex:pop idNamec_id 10ACME Inc20 11Foo Bars21 c_idcitystate 20AustinTX 21DallasTX Clients Locations
10
idcitystatepop 1AustinTX790390 2DallasTX1197816 Location http://db1/cl ient10 http://db1/ client11 ACME Inc Foo Bars 790390 1197816 ex:Client ex:basedIn ex:pop ex:name rdf:type http://db2/loc1 Austin TX ex:state ex:city http://db2/loc2 Dallas TX ex:state ex:city ex:pop idNamec_id 10ACME Inc20 11Foo Bars21 c_idcitystate 20AustinTX 21DallasTX Clients Locations
11
A bit of history Relational Databases on the Web. TimBL, 1998 W3C Workshop on RDF Access to Relational Databases, October 2007 –Report: http://www.w3.org/2007/03/RdfRDB/reporthttp://www.w3.org/2007/03/RdfRDB/report W3C RDB2RDF Incubator Group, 2008-2009 –Survey: http://www.w3.org/2005/Incubator/rdb2rdf/RDB2RDF_Survey Report.pdf http://www.w3.org/2005/Incubator/rdb2rdf/RDB2RDF_Survey Report.pdf W3C RDB2RDF Working Group, 2009 – today –R2RML: RDB to RDF Mapping Language –A Direct Mapping of Relational Data to RDF
12
RDB and the Semantic Web 12 RDF RDFS OWL RIF
13
RDB and the Semantic Web 13 RELATIONAL MODEL TABLE DEFINITION CONSTRAINTS TRIGGERS
14
RDB and the Semantic Web 14 RELATIONAL MODEL TABLE DEFINITION CONSTRAINTS TRIGGERS RDF RDFS OWL RIF
15
Overview
16
R2RML: RDB to RDF Mapping Language Language for expressing customized mappings from relational databases to RDF datasets Give precise control to the developer –You create the structure you want –You choose the target vocabulary No RDFS/OWL is created from the schema 16
17
RDB RDF R2RML manual R2RML Mapping
18
Direct Mapping Automatic transformation from Relational Database to RDF –Click a button… Voila! Generate RDFS/OWL of the database schema If this doesn’t get you where you want…use existing languages for mapping –RDF to RDF with RIF or SPARQL Construct Semantic Web community –Create SQL Views and directly map those Database community 18
19
RDB Direct Mapping RDF RIF/ SPARQL Construct automatic RDF Direct Mapping SQL Views
20
Hybrid Instead of starting from a blank R2RML file… 1) Direct Mapping 2) Manual Editing 20
21
RDB RDF Direct Mapping in R2RML R2RML Direct Mapping Modify Hybrid Mapping
22
Materialize Triples Data is not dynamic Dump RDB into RDF and then insert into triplestore RDF dump may not be consistent with RDB 22
23
RDB RDF Dump SPARQL Materialized Triples
24
Virtual Triples Data is dynamic Need to query RDB with SPARQL Translate SPARQL to SQL –Comparing the overall performance […] of the fastest rewriter with the fastest relational database shows an overhead for query rewriting of 106%. This is an indicator that there is still room for improving the rewriting algorithms [Bizer and Schultz 2009] –Current rdb2rdf systems are not capable of providing the query execution performance required [...] it is likely that with more work on query translation, suitable mechanisms for translating queries could be developed. These mechanisms should focus on exploiting the underlying database system’s capabilities to optimize queries and process large quantities of structure data [Gray et al. 2009] –Ultrawrap solves this RDF data is consistent with RDB data 24
25
RDB Mapping SPARQL Virtual Triples RDF
26
Materialized Triples Virtual Triples Direct Mapping Custom Mapping RDB2RDF Space Hybrid
27
Tuples to Triples SIDNAMEAGE 1Alice25 2Bob26 SUBJECT PREDICATE OBJECT http://ex.com/person1 25 http://ex.com/age
28
Current Status of W3C RDB2RDF WG R2RML: RDB to RDF Mapping Language Working Draft http://www.w3.org/TR/r2rml/ http://www.w3.org/TR/r2rml/ A Direct Mapping of Relational Data to RDF Working Draft http://www.w3.org/TR/rdb-direct-mapping/ http://www.w3.org/TR/rdb-direct-mapping/ Last Call: Sept 1 (hopefully) 28
29
Implementations Ultrawrap –SPARQL and semantically equivalent SQL have equal execution time –Commercial databases –http://ribs.csres.utexas.edu/ultrawraphttp://ribs.csres.utexas.edu/ultrawrap Spyder –Oracle and HSQLDB –http://www.revelytix.com/content/spyderhttp://www.revelytix.com/content/spyder Other non-standard RDB2RDF –D2R Server, Virtuoso, Triplify, … 29
30
Publicity International Semantic Web Conference –Oct 23 – 27 in Bonn, Germany Posters and Demos –August 15 Consuming Linked Data Workshop –August 15 Outrageous Ideas Track –Sept 5 Semantic Web Challenge –Sept 30 2 nd Linked Data-a-thon –Oct 1 30 http://iswc2011.semanticweb.org/ Join the Facebook group SSSW2011
31
Thank You @juansequeda Acknowledgments: - RiBS @ UT Austin - W3C RDB2RDF WG members - David McNeil - Revelytix
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.