Managing & Integrating Enterprise Data with Semantic Technologies Susie Stephens Principal Product Manager, Oracle
The Semantic Web is Maturing Mature standards Approx. 5m RDF files; 15k OWL files Increasing adoption by enterprises Many open source tools Commercial tools
Achieving the Vision Source: J Web Semantics (in press)
A Hybrid World Relational, XML, and RDF data Triple stores and on the fly mappings to RDF
Triple Stores Many open source triple stores One enterprise commercial triple store
Oracle RDF Data Model RDF Triples: {S 1, P 1, O 1 } {S 1, P 2, O 2 } {S 2, P 2, O 2 } S1S1 O1O1 O2O2 S2S2 P2P2 P2P2 P1P1 Support for RDF and RDFS Object-relational implementation Subjects and objects are re-used Links represent complete RDF triples
SPARQL-like Query Capability A table function allows a graph query to be embedded in a SQL query Searches for an arbitrary pattern against the RDF data Includes inferencing based on RDF, RDFS, and user-defined rules Automatically resolve multiple representations of the same point in value space, e.g. 1 vs. 1.00
Scalability: Real Application Clusters (RAC) Security: Encryption, Authentication, Data Guard Performant: multi-threaded, parallel processing, memory management, indexed, etc. Multi-Media: Images, Text, etc. Enterprise Functionality Source: VLDB Performance testing with UniProt (units in seconds)
Map relationships to terms using RDF triples -‘Mandible’, sameAs’, ‘Jaw’ -‘Molar’, ‘partOf’, ‘Jaw’ Image Search “Find me all DICOM images that contain the term ‘Jaw’”
Text Search “Find me all papers that contain the term ‘Jaw’” Map relationships to terms using RDF triples -‘Mandible’, sameAs’, ‘Jaw’ -‘Molar’, ‘partOf’, ‘Jaw’
Data Integration SQL / RDBMS – Concise, efficient transactions – Transaction metadata is embedded or implicit in the application or database schema XQuery / XML – Transaction across organizational boundaries – XML wraps the metadata about the transaction around the data SPARQL / RDF – Information sharing with ultimate flexibility – Enables semantics as well as syntax to be embedded in documents
Mappings to RDF Relational XML
Adapting SQL Databases Source: Tim Berners-Lee, Bio-IT World 2005
SQL Component Table EMPLOYEE Column EMPLOYEE. EMPID Row with EMPID=110 NAME cell in that row URI relative to database EMPLOYEE/schema#EMPLOYEE EMPLOYEE/schema#empid EMPLOYEE/rowBy/empid/110 EMPLOYEE/rowBy/empid/110#name Approach for RDB to RDF Mappings EMPIDNAMESITEIDROLE 110S Jones1Development 120B Brown2Marketing 130K Smith1Development Acme Employee Table
XML to RDF Mappings Source: Tim Berners-Lee, Bio-IT World 2005
Situation Awareness Architecture Source: University of Texas Health Science Center
Semantic Web Stack Source: Tim Berners-Lee, Bio-IT World 2005
A little semantics goes a long way – Jim Hendler, University of Maryland
Enterprise Applications Drug target discovery Knowledge discovery in translational medicine Government geo-spatial data management Resource optimization in intelligence Emergency response planning Medical record management Web-based knowledge discovery Web publishing Automotive design
Drug Safety Determination Image Source: IEEE Intelligent Systems, 2006
Drug Safety Determination Image Source: IEEE Intelligent Systems
Initiatives: W3C’s HCLSIG
Summary Adoption of Semantic Web technologies is accelerating The Semantic Web offers heterogeneous data integration using explicit semantics It will continue to be hybrid world Legacy data can be incorporated into the Semantic Web Oracle has a scalable, secure, highly-available RDF Data Model