© 2009 OpenLink Software, All rights reserved. Mapping Relational Databases to RDF with OpenLink Virtuoso Orri Erling - Program Manager, Virtuoso.

Slides:



Advertisements
Similar presentations
On The Evolution of Terms
Advertisements

© 2007 OpenLink Software, All rights reserved OpenLink Virtuoso - SQL & RDF RDF Views of SQL Data (Exposing SQL Data as RDF) Orri Erling Program Manager.
1 © 2008 OpenLink Software, All rights reserved. SPARQL for Business Intelligence Orri Erling - Program Manager, Virtuoso 1.
Virtuoso Product Family
© 2006 IBM Corporation Features of an Enterprise-ready Triple Store Ben Szekely June, 2006.
RDF and RDB 1 Some slides adapted from a presentation by Ivan Herman at the Semantic Technology & Business Conference, 2012.
Michael Pizzo Software Architect Data Programmability Microsoft Corporation.
Lukas Blunschi Claudio Jossen Donald Kossmann Magdalini Mori Kurt Stockinger.
Store RDF Triples In A Scalable Way Liu Long & Liu Chunqiu.
RDF and RDB 1 Some slides adapted from a presentation by Ivan Herman at the Semantic Technology & Business Conference, 2012.
Building and Analyzing Social Networks Web Data and Semantics in Social Network Applications Dr. Bhavani Thuraisingham February 15, 2013.
Michael Povolotsky CMSC491s/691s. What is Virtuoso? Virtuoso, known as Virtuoso Universal Server, is a multi-protocol RDBMS Includes an object-relational.
© 2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice RDF and SOA David Booth, Ph.D. HP.
Distributed Database Management Systems. Reading Textbook: Ch. 4 Textbook: Ch. 4 FarkasCSCE Spring
RIZWAN REHMAN, CCS, DU. Advantages of ORDBMSs  The main advantages of extending the relational data model come from reuse and sharing.  Reuse comes.
Cloud based linked data platform for Structural Engineering Experiment Xiaohui Zhang
Chapter 9: Creating Database Conventions & Standards MCITP Administrator: Microsoft SQL Server 2005 Database Server Infrastructure Design Study Guide (70-443)
DartGrid Browser-based mapping tool of SQL to RDF Point Template Zhejiang University & OpenLink Software.
Data Integration in Service Oriented Architectures Rahul Patel Sr. Director R & D, BEA Systems Liquid Data – XML-based data access and integration for.
Semantic Interoperability Jérôme Euzenat INRIA & LIG France Natasha Noy Stanford University USA.
Managing Large RDF Graphs (Infinite Graph) Vaibhav Khadilkar Department of Computer Science, The University of Texas at Dallas FEARLESS engineering.
Managing & Integrating Enterprise Data with Semantic Technologies Susie Stephens Principal Product Manager, Oracle
Berlin SPARQL Benchmark (BSBM) Presented by: Nikhil Rajguru Christian Bizer and Andreas Schultz.
Database System Concepts and Architecture Lecture # 3 22 June 2012 National University of Computer and Emerging Sciences.
Rajashree Deka Tetherless World Constellation Rensselaer Polytechnic Institute.
SPARQL All slides are adapted from the W3C Recommendation SPARQL Query Language for RDF Web link:
Lecture 2 The Relational Model. Objectives Terminology of relational model. How tables are used to represent data. Connection between mathematical relations.
Implemented Systems Presenter: Manos Karpathiotakis Extended Semantic Web Conference 2012.
Copyright © 2003 by Prentice Hall Module 4 Database Management Systems 1.What is a database? Data hierarchy and data organization Field, record, file,
Systems analysis and design, 6th edition Dennis, wixom, and roth
Copyright © 2003 by Prentice Hall Computers: Tools for an Information Age Chapter 13 Database Management Systems: Getting Data Together.
Introduction to SQL Steve Perry
1 © 2012 OpenLink Software, All rights reserved. Virtuoso - Column Store, Adaptive Techniques for RDF Orri Erling Program Manager, Virtuoso Openlink Software.
Ultrawrap: SPARQL Execution on Relational Data Juan F. Sequeda, Daniel P. Miranker University of Texas - Austin ISWC 2009 Seoul National University Internet.
Master Informatique 1 Semantic Technologies Part 11Direct Mapping Werner Nutt.
Databases From A to Boyce Codd. What is a database? It depends on your point of view. For Manovich, a database is a means of structuring information in.
Querying with SPARQL Tuesday, October 28, 2014 Technical Exchange on Network Management Interoperability Andrea Westerinen JHU-APL/Nine Points Solutions.
Steven Seida D2RQ Blog Integration Lab. Data to RDF Integration Approaches* 1 of 2 *Summarized from Ch 9 of Semantiic Web Programming, 2009, by Hebeler.
9/7/2012ISC329 Isabelle Bichindaritz1 The Relational Database Model.
Database Design and Management CPTG /23/2015Chapter 12 of 38 Functions of a Database Store data Store data School: student records, class schedules,
5-1 McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved.
MANAGING DATA RESOURCES ~ pertemuan 7 ~ Oleh: Ir. Abdul Hayat, MTI.
Database Systems Design, Implementation, and Management Coronel | Morris 11e ©2015 Cengage Learning. All Rights Reserved. May not be scanned, copied or.
Efficient RDF Storage and Retrieval in Jena2 Written by: Kevin Wilkinson, Craig Sayers, Harumi Kuno, Dave Reynolds Presented by: Umer Fareed 파리드.
1 SPARQL A. Emrah Sanön. 2 RDF RDF is quite committed to Semantic Web. Data model Serialization by means of XML Formal semantics Still something is missing!
C-Store: RDF Data Management Using Column Stores Jianlin Feng School of Software SUN YAT-SEN UNIVERSITY Apr. 24, 2009.
Fall 2013, Databases, Exam 2 Questions for the second exam…
Indexes and Views Unit 7.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
RDF Access to Relational Databases Ashok Malhotra Oracle Corporation.
Semantic Web Final Exam Review. Topics for Final Exam First exam material (~30%) Design Patterns and Map/Reduce (~20%) Inference / Restrictions (~10%)
Steven Seida How Does an RDF Knowledge Store Compare to an RDBMS?
CS621 : Artificial Intelligence Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 12 RDF, OWL, Minimax.
Session 1 Module 1: Introduction to Data Integrity
RDF David R Newman 15 May 2009.
23 Copyright © 2009, Oracle. All rights reserved. Oracle Business Intelligence Answers: Advanced Features.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Chapter 04 Semantic Web Application Architecture 23 November 2015 A Team 오혜성, 조형헌, 권윤, 신동준, 이인용.
Linked Open Data for European Earth Observation Products Carlo Matteo Scalzo CTO, Epistematica epistematica.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
BBY 464 Semantic Information Management (Spring 2016) Semantic Query Languages Yaşar Tonta & Orçun Madran [yasartonta, Hacettepe.
What’s new in SQL Server 2017 for BI?
Cloud based linked data platform for Structural Engineering Experiment
RDF and RDB 1 Some slides adapted from a presentation by Ivan Herman at the Semantic Technology & Business Conference, 2012.
Oracle Analytic Views Enhance BI Applications and Simplify Development
OpenLink Virtuoso - SQL & RDF
MANAGING DATA RESOURCES
UMBC AN HONORS UNIVERSITY IN MARYLAND
Enhance BI Applications and Simplify Development
Presentation transcript:

© 2009 OpenLink Software, All rights reserved. Mapping Relational Databases to RDF with OpenLink Virtuoso Orri Erling - Program Manager, Virtuoso

© 2009 OpenLink Software, All rights reserved. Who Wants to Map? Semantic Web Scalers ¡Expose whatever there is as RDF, the next guy will unify terms, make search and apps Data Warehouse Keepers ¡Data is spread out, has implicit semantics, complex schemas, heterogeneous sources, ambiguous terms but we must make it join and aggregate cleanly

© 2009 OpenLink Software, All rights reserved. Present State SPARQL to SQL exists but still, complex integrations are data warehouses We'd really like to map, but... Can it be otherwise?

© 2009 OpenLink Software, All rights reserved. Why RDF Data Warehouse? Pros  Even query performance across all data  Possibility of forward-chaining inference  Some SPARQL features may be better supported, e.g. Unspecified predicates Cons  Keeping data up-to-date  Complex set up, needs dedicated servers: you don't build them on a whim

© 2009 OpenLink Software, All rights reserved. Why Map? No copying, no timeliness issues RDBMS outperforms RDF for analytics workloads Agile reconfiguration without reloading data

© 2009 OpenLink Software, All rights reserved. Virtuoso Mapping of SPARQL to SQL against any existing schema - whether stored in Virtuoso or elsewhere Physical Quad-store Federated/local RDBMS

© 2009 OpenLink Software, All rights reserved. For Mapping to Deliver... Tackle any SQL analytics workload in SPARQL without extra cost Deal with arbitrary SQL schemata Produce single SQL statements, for target RDBMS to optimize Have intelligence for cases where one RDF entity can come from many relational sources

© 2009 OpenLink Software, All rights reserved. The Cases of Integration Bring similar but heterogeneous schemas into a unified ontology - Union View Translate FKs of one schema to PKs in another - Distributed Join Hide differences in normalization - Views for hiding joins Unit/Terminology conversions

© 2009 OpenLink Software, All rights reserved. Defining a Mapping Define URI formats and their subclass relations Define which key-column-value combinations make a triple Arbitrary SQL is allowed for mapping values and filtering A single RDF node can be a composite of many columns, e.g. multipart key Use SPARQL/SQL to:

© 2009 OpenLink Software, All rights reserved. The TPC-H Case The 22 queries as extended SPARQL Each generates a single SQL statement, executable by Virtuoso, Oracle, others Next make several TPC-H databases on different servers and run the queries against the union

© 2009 OpenLink Software, All rights reserved. Where Problems Begin In OpenLink Data Spaces - 6 Collaborative apps all mapped to SIOC: Trivially becomes a union of everything, lines of SQL Intelligently Becomes a Union of : select * from where {?s ?p ?o. ?s has_comment ?c. ?c has_author } select post.* from post, comment, user where c_post = p_id and c_author = u_id and u_name = f ('xxx') Once per app

© 2009 OpenLink Software, All rights reserved. What One Must Know Mapping for integration is not trivial Be careful when mapping multiple tables/columns to one class/property Make URI schemes which encode type and source, so that senseless joins are not attempted if types not specified in query Understand what the mapping logic can and cannot optimize Understand what SQL can and cannot optimize View resulting SQL for sanity check

© 2009 OpenLink Software, All rights reserved. SQL Extensions in Virtuoso Mapping must work against any RDBMS/Schema, as is But there is Virtuoso SQL between the mapping and target RDBMS(s) Location and latency - conscious distributed cost model Breakup for making a wide result set into a row per property Inverse functions

© 2009 OpenLink Software, All rights reserved. Use Cases OpenLink Data Spaces - Blog, Wiki, News, Social Network, &c OpenLink's own MIS - “total information awareness”: URI for any CRM Object, Account, Product, Support Case, , &c Musicbrainz PHP BB, Drupal, MediaWiki, &c

© 2009 OpenLink Software, All rights reserved. OpenLink Software Thank You!