1 Integrating Databases into the Semantic Web through an Ontology-based Framework Dejing Dou, Paea LePendu, Shiwoong Kim Computer and Information Science,

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Schema Matching and Query Rewriting in Ontology-based Data Integration Zdeňka Linková ICS AS CR Advisor: Július Štuller.
The 20th International Conference on Software Engineering and Knowledge Engineering (SEKE2008) Department of Electrical and Computer Engineering
Ontology Alignment, Matching and Translation. In the old days People have been building knowledge based systems for ~40 years There was not much interest.
Analyzing Minerva1 AUTORI: Antonello Ercoli Alessandro Pezzullo CORSO: Seminari di Ingegneria del SW DOCENTE: Prof. Giuseppe De Giacomo.
1 CSIT600f: Introduction to Semantic Web Conclusion and Outlook Dickson K.W. Chiu PhD, SMIEEE Text: Antoniou & van Harmelen: A Semantic Web PrimerA Semantic.
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
Searching the Semantic Web. Introduction  Research Focuses: IE Ontologies (creating, languages, merging, storing, querying)  Next Sep: Using the Semantic.
CS652 Spring 2004 Summary. Course Objectives  Learn how to extract, structure, and integrate Web information  Learn what the Semantic Web is  Learn.
1 CIS607, Fall 2005 Semantic Information Integration Instructor: Dejing Dou Week 2 (Oct. 5)
1 CIS607, Fall 2005 Semantic Information Integration Instructor/Organizer: Dejing Dou Week 1 (Sept. 28)
1 CIS607, Fall 2006 Semantic Information Integration Instructor: Dejing Dou Week 10 (Nov. 29)
How can Computer Science contribute to Research Publishing?
1 Lecture 13: Database Heterogeneity Debriefing Project Phase 2.
1 CIS607, Fall 2004 Semantic Information Integration Attendees: Vikash Agarwal, Julian M Catchen Kevin A Huck, Kushal M Koolwal, Paea J Le Pendu Xiangkui.
1 CIS607, Fall 2005 Semantic Information Integration Presentation by Zebin Chen Week 7 (Nov. 9)
Infomaster: An information Integration Tool O. M. Duschka and M. R. Genesereth Presentation by Cui Tao.
11/8/20051 Ontology Translation on the Semantic Web D. Dou, D. McDermott, P. Qi Computer Science, Yale University Presented by Z. Chen CIS 607 SII, Week.
The Neural ElectroMagnetic Ontology (NEMO) System: Design & Implementation of a Sharable EEG/MEG Database with ERP ontologies G. A. Frishkoff 1,3 D. Dou.
Ontology translation: two approaches Xiangkui Yao OntoMorph: A Translation System for Symbolic Knowledge By: Hans Chalupsky Ontology Translation on the.
1 Information Integration and Source Wrapping Jose Luis Ambite, USC/ISI.
OIL: An Ontology Infrastructure for the Semantic Web D. Fensel, F. van Harmelen, I. Horrocks, D. L. McGuinness, P. F. Patel-Schneider Presenter: Cristina.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Introduction to DBMS Purpose of Database Systems View of Data
Knowledge Mediation in the WWW based on Labelled DAGs with Attached Constraints Jutta Eusterbrock WebTechnology GmbH.
Amarnath Gupta Univ. of California San Diego. An Abstract Question There is no concrete answer …but …
OMAP: An Implemented Framework for Automatically Aligning OWL Ontologies SWAP, December, 2005 Raphaël Troncy, Umberto Straccia ISTI-CNR
Managing Large RDF Graphs (Infinite Graph) Vaibhav Khadilkar Department of Computer Science, The University of Texas at Dallas FEARLESS engineering.
Break Out Session on Infrastructure and Technology: A Report Vipul Kashyap AOS Workshop, Rome, 15 November 2001
Ontology Alignment/Matching Prafulla Palwe. Agenda ► Introduction  Being serious about the semantic web  Living with heterogeneity  Heterogeneity problem.
A Model-Driven Semantic Web David Frankel (David Frankel Consulting) Pat Hayes ( Institute for Human & Machine Cognition, University of West Florida) Elisa.
An Introduction to Description Logics. What Are Description Logics? A family of logic based Knowledge Representation formalisms –Descendants of semantic.
EXCS Sept Knowledge Engineering Meets Software Engineering Hele-Mai Haav Institute of Cybernetics at TUT Software department.
Peer-to-Peer Data Integration Using Distributed Bridges Neal Arthorne B. Eng. Computer Systems (2002) Supervisor: Babak Esfandiari April 12, 2005 Candidate.
Logics for Data and Knowledge Representation
1 Introduction to Database Systems. 2 Database and Database System / A database is a shared collection of logically related data designed to meet the.
Database Support for Semantic Web Masoud Taghinezhad Omran Sharif University of Technology Computer Engineering Department Fall.
SQL Databases are a Moving Target Juan F. Sequeda – Syed Hamid Tirmizi –
RELATIONAL FAULT TOLERANT INTERFACE TO HETEROGENEOUS DISTRIBUTED DATABASES Prof. Osama Abulnaja Afraa Khalifah
1 Ontology-based Semantic Annotatoin of Process Template for Reuse Yun Lin, Darijus Strasunskas Depart. Of Computer and Information Science Norwegian Univ.
1 Lessons from the TSIMMIS Project Yannis Papakonstantinou Department of Computer Science & Engineering University of California, San Diego.
Methodology - Conceptual Database Design. 2 Design Methodology u Structured approach that uses procedures, techniques, tools, and documentation aids to.
Dimitrios Skoutas Alkis Simitsis
Methodology - Conceptual Database Design
Semantic Web - an introduction By Daniel Wu (danielwujr)
Advanced topics in software engineering (Semantic web)
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Artificial Intelligence 2004 Ontology
Working with Ontologies Introduction to DOGMA and related research.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
The Semantic Web Riccardo Rosati Dottorato in Ingegneria Informatica Sapienza Università di Roma a.a. 2006/07.
CS621 : Artificial Intelligence Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 12 RDF, OWL, Minimax.
Copy right 2004 Adam Pease permission to copy granted so long as slides and this notice are not altered Ontology Overview Introduction.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Enable Semantic Interoperability for Decision Support and Risk Management Presented by Dr. David Li Key Contributors: Dr. Ruixin Yang and Dr. John Qu.
Presented by Kyumars Sheykh Esmaili Description Logics for Data Bases (DLHB,Chapter 16) Semantic Web Seminar.
Welcome to CPSC 534B: Information Integration Laks V.S. Lakshmanan Rm. 315.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
Chapter 8A Semantic Web Primer 1 Chapter 8 Conclusion and Outlook Grigoris Antoniou Frank van Harmelen.
Mechanisms for Requirements Driven Component Selection and Design Automation 최경석.
OWL (Ontology Web Language and Applications) Maw-Sheng Horng Department of Mathematics and Information Education National Taipei University of Education.
Building Trustworthy Semantic Webs
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
Ontology.
Ontology.
Semantic Markup for Semantic Web Tools:
ONTOMERGE Ontology translations by merging ontologies Paper: Ontology Translation on the Semantic Web by Dejing Dou, Drew McDermott and Peishen Qi 2003.
Presentation transcript:

1 Integrating Databases into the Semantic Web through an Ontology-based Framework Dejing Dou, Paea LePendu, Shiwoong Kim Computer and Information Science, University of Oregon, USA Peishen Qi Computer Science Department, Yale University, USA April, SWDB’06

2 Outline Introduction – The status of the Semantic Web – Realizing SW needs existing databases OntoGrate: An Ontology-based Information Integration Framework – Some previous work – Modules in OntoGrate Architecture Case Study for integrating Databases into SW – Without an existing domain ontology – With an existing domain ontology Conclusion and Future Work

3 The Semantic Web One major goal of the Semantic Web is that web-based agents can process and “understand” data [Berners-Lee etal01]. Ontologies formally describe the semantics of data and web-based agents can take SW documents (e.g. in RDF/OWL) as a set of assertions (true statements) and draw inferences from them. human SW Web-based agents

4 What we have now? DAML+OIL  OWL (Web ontology language) More and more domain ontologies are defined in DAML+OIL/OWL, even for some specific domains (e.g., GO) We are developing some tools, agents, services See

5 Two things are important Real Data for sharing – relational databases (may be the biggest resource) – Other kinds of databases – WWW/XML data – Some knowledge bases Better Semantic Web Services/Agents

6 Semantic Annotation for Data? It is good for small size data resources It is not that good for large size data resources (relational databases) – “Redundant” copies – Time consuming for query answering. E.g. it currently works as loading OWL data into a knowledge base then answering queries with DL ABox reasoning. (Can it compete with existing DBMS which has well developed indexing and query optimization techniques?) It is better that relational databases can be accessed/queried directly by SW agents/services

7 The difficulties The Semantic WebThe Relational DBs Ontologies define the semantics of data Schemas define the structure and integrity constraints

8 A more general question How can we make databases, SW resources, WWW/XML data, KBs work together? The problem is similar – SW resources and KBs are defined by ontologies, which are more expressive and focus on semantics – Databases and XML documents are defined by schemas, which focus on structure – Syntax difference (e.g., OWL vs. SQL)

9 OntoGrate: An Ontology-based Information Integration System

10 Some Previous Work Schemas (e.g., stores7 DB in IBM informix),

11 Some Previous Work Schemas, Ontologies and Web-PDDL Relation  Type/Class Attribute  Predicate/Property Integrity Constrain  Axiom/Rule Primary Key  Fact/Instance

12 Some Previous Work Merging Ontologies with Bridging Axioms

13 Some Previous Work The Bridge Axiom/mapping on customerfname/customerlname vs. customercontactname : (forall (c f l (if (and c f) c l)) c f l))))

14 Some Previous Work The Bridge Axiom/mapping on customerregion vs. customerstatecode/statename/statecode : (forall (x y (if x y) (exists (z t (and x t) z y) z t)))))

15 Some Previous Work Inferential Data Integration with OntoEngine – Data Translation: View data as true statements, e.g., (statecode S#28 “OR”) (M s_t ;  s ) D  t only if (M s_t ;  s ) ╞  t (M s_t ;  s ) D  t  (M s_t ;  s ) ├  t  (M s_t ;  s ) ╞  t – Query Translation: (M s_t ;  s ) Q  t only if (M s_t ;  (  t )) ╞  (  s ) (M s_t ;  s ) Q  t  (M s_t ;  (  t )) ├  (  s )  (M s_t ;  (  t )) ╞  (  s )

16 OntoGrate Architecture Revisited

17 Modules in OntoGrate Architecture The Syntax Translators (Wrappers) – e.g., PDDSQL (SQL  Web-PDDL), PDDOWL(OWL  Web-PDDL) The Matching (correspondence) Generation – e.g., name, structure (tree, graph) similarity,synonyms and is-a (part of) relationships using thesauri and dictionary, such as Wordnet The Data Mining Module The Machine Learning Module The Inference Engine (OntoEngine) The User Interface

18 Learning the mappings from domain experts (forall (x (if x) (and x 6) x 3))))

19 Mining the mappings from large datasets For example, two Medical databases in the same hospital: DB1 list blood pressure of patients with nominal values, such as low, normal, at risk, and high, while the other DB2 may record the exact numerical values for systolic and diastolic pressure. By association rule mining, we may get the rule/mapping  140  90 = `High‘ (support = 40%, confidence = 90%)

20 Case Study in Two Scenarios Integrating DBs into SW without an existing domain ontology Integrating DBs into SW with an existing domain ontology

21 Without an existing domain ontology

22 Generating OWL ontologies from DB Schemas SQL schema  Web-PDDL (by using PDDSQL) Web-PDDL  OWL (by using PDDOWL) – E.g., Stores7.sql  Stores7.pddl  Stores7.owl... <rdfs:subClassOf rdf:resource=“

23 An OWL-QL query based on Stores7.owl …

24 The corresponding Web-PDDL and SQL queries (and (customercity ?C - Customer "Eugene") (customerfname ?C - Customer ?x - String) (customerlname ?C - Customer ?y - String)) PDDSQL SELECT C.customerfname, C.customerlname FROM Customer C WHERE C.customercity = "Eugene" PDDOWL

25 Getting Answers from Stores7 DB {?x/Paea, ?y/LePendu} {?x/Dejing, ?y/Dou} {?x/Shiwoong, ?y/Kim} PDDOWL PDDSQL customerfnamecustomerlname PaeaLePendu DejingDou ShiwoongKim <owl-ql:answerBundle xmlns:owl-ql=" owl-ql-syntax#"...> (1000 bindings/3 secs) (1000/100,000/3secs)

26 With an existing domain ontology Order ontology:

27 An OWL-QL query based on order.owl …

28 The Bridging Axioms/Mappings between Stores7.pddl and (forall (P A z - String) (if (and P A) A z)) P z))) (forall (C z - String) (if P z) (exists (A (and P A) A z)))))

29 The Bridging Axioms/Mappings between Stores7.pddl and (forall (C x - String) (iff C x) C x))) (forall (C y - String) (iff C y) C y)))

30 The Query Translation between Stores7 and Order (and (hasAddress ?C - Person ?A - Address) (City ?A "Eugene") (FirstName ?C - Person ?x - String) (LastName ?C - Person ?y - String)) OntoEngine ( < 1 sec) (and (customercity ?C - Customer "Eugene") (customerfname ?C - Customer ?x - String) (customerlname ?C - Customer ?y - String)) PDDOWL Bridging Axioms OWL-QL query in order.owl

31 Final Answers in the order ontology (customerfname C1 Paea) (customerlname C2 LePendu) (customerfname C1 Dejing) … PDDOWL (10,000 facts/11 secs) PDDSQL customerfnamecustomerlname PaeaLePendu DejingDou ShiwoongKim … OntoEngine (40,000facts/30 secs) Bridging Axioms (FirstName C1 Paea) (LastName C2 LePendu) (FirstName C1 Dejing) …

32 Some related work Semantic Annotation – [Stojanovic maps relational model to frame logic/RDF. – DOGMA[Verheyden translates a ontology query to SQL Schema and Ontology mapping – Similarity matching, machine learning… useful for generating candidate matchings – Semi-automatic tool (Clio) Data integration and query answering – Federated databases[Sheth&Larson 90], data warehouse, peer to peer management [Halevy MiniCon uses query rewriteing at GLV Logic and Databases – Reiter’s reconstruction of relational model in FOL. – Carnot, SIMS, Information Manifold by using a global ontology, DL or Datalog

33 Conclusion and Future work We applied OntoGrate, an ontology-based information integration framework, to integrate relational databases with the Semantic Web. The testing result based on two scenarios is promising. We are developing other modules (e.g., learning/mapping/UI) in OntoGrate. The scalability and efficiency need to be investigated in larger- size data resources. Extending the current work to integrate XML (with/without XML schemas or DTD) and the Semantic Web.

34 Thank you for your attention !