2015-11-28 ISWC2007, Nov. 14. Discovering simple mappings between Relational database schemas and ontologies Wei Hu, Yuzhong Qu {whu,

Slides:



Advertisements
Similar presentations
Three-Step Database Design
Advertisements

Schema Matching and Query Rewriting in Ontology-based Data Integration Zdeňka Linková ICS AS CR Advisor: Július Štuller.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
S-Match: an Algorithm and an Implementation of Semantic Matching Pavel Shvaiko 1 st European Semantic Web Symposium, 11 May 2004, Crete, Greece paper with.
Database Systems: Design, Implementation, and Management Tenth Edition
Of 27 lecture 7: owl - introduction. of 27 ece 627, winter ‘132 OWL a glimpse OWL – Web Ontology Language describes classes, properties and relations.
Using the Semantic Web to Construct an Ontology- Based Repository for Software Patterns Scott Henninger Computer Science and Engineering University of.
A Framework for Ontology-Based Knowledge Management System
Software Testing and Quality Assurance
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 3 The Basic (Flat) Relational Model.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 7 Data Modeling Using the Entity- Relationship (ER) Model.
Advanced Topics COMP163: Database Management Systems University of the Pacific December 9, 2008.
Queensland University of Technology An Ontology-based Mining Approach for User Search Intent Discovery Yan Shen, Yuefeng Li, Yue Xu, Renato Iannella, Abdulmohsen.
Towards Semantic Web: An Attribute- Driven Algorithm to Identifying an Ontology Associated with a Given Web Page Dan Su Department of Computer Science.
Methodology Conceptual Database Design
CONSTRAINTS AND UPDATES CHAPTER 3 (6/E) CHAPTER 5 (5/E) 1.
BIS310: Week 7 BIS310: Structured Analysis and Design Data Modeling and Database Design.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
OMAP: An Implemented Framework for Automatically Aligning OWL Ontologies SWAP, December, 2005 Raphaël Troncy, Umberto Straccia ISTI-CNR
CSE314 Database Systems Data Modeling Using the Entity- Relationship (ER) Model Doç. Dr. Mehmet Göktürk src: Elmasri & Navanthe 6E Pearson Ed Slide Set.
Semantic Matching Pavel Shvaiko Stanford University, October 31, 2003 Paper with Fausto Giunchiglia Research group (alphabetically ordered): Fausto Giunchiglia,
Exploring Personal CoreSpace For DataSpace Management Li Yukun and Xiaofeng Meng WAMDM Lab Renmin University of China.
Ontology Development Kenneth Baclawski Northeastern University Harvard Medical School.
Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher Laura Po and Sonia Bergamaschi DII, University of Modena and Reggio Emilia, Italy.
Funded by: European Commission – 6th Framework Project Reference: IST WP 2: Learning Web-service Domain Ontologies Miha Grčar Jožef Stefan.
Intelligent Database Systems Lab Presenter: WU, JHEN-WEI Authors: Rodrigo RizziStarr, Jose´ Maria Parente de Oliveira IS Concept maps as the first.
Semantic Matching Fausto Giunchiglia work in collaboration with Pavel Shvaiko The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, 2003.
CROSSMARC Web Pages Collection: Crawling and Spidering Components Vangelis Karkaletsis Institute of Informatics & Telecommunications NCSR “Demokritos”
Theory and Application of Database Systems A Hybrid Approach for Extending Ontology from Text He Wei.
RCDL Conference, Petrozavodsk, Russia Context-Based Retrieval in Digital Libraries: Approach and Technological Framework Kurt Sandkuhl, Alexander Smirnov,
Minor Thesis A scalable schema matching framework for relational databases Student: Ahmed Saimon Adam ID: Award: MSc (Computer & Information.
CSCI 3140 Module 3 – Logical Database Design for the Relational Model Theodore Chiasson Dalhousie University.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Dimitrios Skoutas Alkis Simitsis
ISBN Chapter 3 Describing Semantics -Attribute Grammars -Dynamic Semantics.
RRXS Redundancy reducing XML storage in relations O. MERT ERKUŞ A. ONUR DOĞUÇ
A Classification of Schema-based Matching Approaches Pavel Shvaiko Meaning Coordination and Negotiation Workshop, ISWC 8 th November 2004, Hiroshima, Japan.
Keyword Query Routing.
Part4 Methodology of Database Design Chapter 07- Overview of Conceptual Database Design Lu Wei College of Software and Microelectronics Northwestern Polytechnical.
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 7 Data Modeling Using the Entity- Relationship (ER) Model.
Aligner automatiquement des ontologies avec Tuesday 23 rd of January, 2007 Rapha ë l Troncy.
Ontology Mapping in Pervasive Computing Environment C.Y. Kong, C.L. Wang, F.C.M. Lau The University of Hong Kong.
27/3/2008 1/16 A FRAMEWORK FOR REQUIREMENTS ENGINEERING PROCESS DEVELOPMENT (FRERE) Dr. Li Jiang School of Computer Science The.
CSE314 Database Systems Lecture 3 The Relational Data Model and Relational Database Constraints Doç. Dr. Mehmet Göktürk src: Elmasri & Navanthe 6E Pearson.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Inference-based Semantic Mediation and Enrichment for the Semantic Web AAAI SSS-09: Social Semantic Web: Where Web 2.0 Meets Web 3.0 March 25, 2009 Dan.
1 Open Ontology Repository initiative - Planning Meeting - Thu Co-conveners: PeterYim, LeoObrst & MikeDean ref.:
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 7 Data Modeling Using the Entity- Relationship (ER) Model.
Fundamentals, Design, and Implementation, 9/e Appendix B The Semantic Object Model.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Screen Readers Cannot See (Ontology Based Semantic Annotation for Visually impaired Web users) Yeliz Yesilada, Simon Harper, Carole Goble and Robert Stevens.
OWL Web Ontology Language Summary IHan HSIAO (Sharon)
Enable Semantic Interoperability for Decision Support and Risk Management Presented by Dr. David Li Key Contributors: Dr. Ruixin Yang and Dr. John Qu.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
WonderWeb. Ontology Infrastructure for the Semantic Web. IST WP4: Ontology Engineering Heiner Stuckenschmidt, Michel Klein Vrije Universiteit.
Of 24 lecture 11: ontology – mediation, merging & aligning.
Data Modeling Using the Entity- Relationship (ER) Model
COP Introduction to Database Structures
Logical Database Design and the Rational Model
Entity- Relationship (ER) Model
Cross-Ontological Relationships
Chapter 7: Entity-Relationship Model
Associative Query Answering via Query Feature Similarity
ece 720 intelligent web: ontology and beyond
Topic Oriented Semi-supervised Document Clustering
Property consolidation for entity browsing
Block Matching for Ontologies
Information Networks: State of the Art
Presentation transcript:

ISWC2007, Nov. 14. Discovering simple mappings between Relational database schemas and ontologies Wei Hu, Yuzhong Qu {whu, Institute of Web Science School of Computer Science and Engineering Southeast University, China

ISWC2007, Nov. 14. Outline  Introduction  Our approach  Evaluation  Related work  Summary and future work

ISWC2007, Nov. 14. Introduction  The popularity of ontologies is rapidly growing since the emergence of the Semantic Web.  Swoogle collected more than 10,000 ontologies so far.  Falcons indexed more than 2 million classes/properties.  However, most of the world’s data today are still locked in data stores, and are not published as an open Web of inter- referring resources. [Ref.4. Creating a science of the Web. 2006]  About 77.3% data on the current Web are stored in relational databases. [Ref.6. SIGMOD Record. 33(3) (2004)]  So, it is necessary to establish interoperability between (Semantic) Web applications using relational databases and ontologies for creating a Web of data.

ISWC2007, Nov. 14. Introduction – By an example  Left part: relations, attributes, primary keys, foreign keys.  Right part: classes, properties (data valued or object properties)

ISWC2007, Nov. 14. Introduction (cont’d)  Manually discovering such simple mappings is tedious and improbable at the Web scale.  So (semi-) automatic approaches have been proposed.  Not well consider the characteristics of relational data models and ontology model The mappings are not accurate enough.  Most of the present approaches cannot construct semantic mappings The (missed) semantic mappings are useful in various practical applications.

ISWC2007, Nov. 14. Introduction – the contribution  We propose a new approach to discovering simple mappings  It constructs virtual documents for the entities To discover mappings by comparing virtual documents.  It validates mapping consistency To eliminate certain incorrect mappings.  It explores contextual mappings Can be transformed directly to view-based mappings with selection conditions. Be useful for applications in real world domains. [Ref. 5. Putting context into schema matching. VLDB'06]

ISWC2007, Nov. 14. Introduction – Terminology  R denotes a relation, and A denotes an attribute.  type(A): the domain name of A;  rel(A): the relation which specifies A;  pk(R): the attributes appeared as the primary keys of R;  ref(A): the attributes referenced by A;  C represents a class, and P represents a property. P D denotes a data valued property and P O denotes an object property.  d(P): the domain(s) of P;  r(P): its range(s) of P.

ISWC2007, Nov. 14. Introduction – Terminology (cont’d)  A mapping m is a 5-tuple:, where:  id is a unique identifier;  u is an entity in {R} ∪ {A}, and v is an entity in {C} ∪ {P};  t is a relationship, e.g. equivalence and subsumption, holding between u and v;  f is a confidence measure in the [0, 1] range.  Examples 

ISWC2007, Nov. 14. Outline  Introduction  Our approach  Evaluation  Related work  Summary and future work

ISWC2007, Nov. 14. Overview of the approach  Phase 1: Classifying entity types (A preprocess step)  Heuristically classifies entities into different groups, coordinates different characteristics.  Phase 2: Discovering simple mappings  Constructs virtual documents for entities, calculating confidence measure via TF/IDF model.  Phase 3: Validating mapping consistency  Use mappings to validate the consistency of ;  Also, the comparability between the data types of attributes and data valued properties.  Phase 4: Constructing contextual mappings  + sample instances  contextual mappings.

ISWC2007, Nov. 14. Phase 1: Classifying entity types  Relation: strong entity relation (SER), weak entity relation (WER), regular relationship relation (RRR), specific relationship relation (SRR).  Attribute: foreign key attribute (FKA), non-foreign key attribute (NFKA). [Ref.9. Data & Knowledge Engineering. 12 (1994)]  Group 1: {{SER} ∪ {WER}}×{C}; Group 2: {{RRR} ∪ {SRR}}×{P O }; Group 3: {FKA}×{P O }; Group 4: {NFKA}×{{P D } ∪ {P O }}.  Coordinate different characteristics  Reifying n-arity relationship (n>2)  Others.

ISWC2007, Nov. 14. Phase 2: Discovering simple mappings  We construct virtual documents for the entities in both the relational schema and the ontology to capture their structural information.  A virtual document represents a collection of weighted tokens, which are derived not only from the description of the entity itself, but also from the descriptions of its neighbors. The weights of the tokens indicate their importance, and could then be viewed as a vector in the TF/IDF model.  Rationality: the semantic information of a relational schema is characterized mainly by its ICs; an OWL ontology can be mapped to an RDF graph, which also indicates the semantic information in its structure.

ISWC2007, Nov. 14. Discovering simple mappings (cont’d.)  Relations and attributes:  Classes and properties:

ISWC2007, Nov. 14. Phase 3: Validating mapping consistency  Using mappings between to validate the consistency of mappings.  Attributes cannot stand alone without relations.  The restriction construct in an OWL ontology specifies local domain and range constraints on the classes.

ISWC2007, Nov. 14. Phase 4: Constructing contextual mappings  Focus on a special type of mappings – contextual mappings  Directly translated to conditional mappings or view-based mappings.

ISWC2007, Nov. 14. Constructing contextual mappings (cont’d.)

ISWC2007, Nov. 14. Outline (cont’d.)  Introduction  Our approach  Evaluation  Related work  Summary and future work

ISWC2007, Nov. 14. Evaluation – Data sets  Data sets: [Ref.1. MapOnto]  We implemented our approach in Java, called Marson.

ISWC2007, Nov. 14. Evaluation – Experimental methodology  Experiment 1. Discovering simple mappings:  Marson vs. Simple, VDoc, Valid, RONTO Simple: not constructing virtual documents, not checking mapping consistency; VDoc: constructing virtual documents, not validating mapping consistency; Valid: not constructing virtual documents, validating mapping consistency; RONTO: an existing prototype, distinguish the types of entities, using I-Sub.  F1-Measure: a combination of precision and recall.  Testing various thresholds for each approach, and selecting the best ones.  Experiment 2. Constructing contextual mappings  Collecting instances from the Web for the first three data sets: More than 50 instances for each relation and class.  Comparing with the mappings established by experienced volunteers.

ISWC2007, Nov. 14. Evaluation – Experiment 1  Under Intel Pentium IV 2.8GHz processor, 512MB DDR2 memory, Windows XP Professional, and Java SE 6, Marson takes about 5 seconds to complete all the five tests (including the parsing time).

ISWC2007, Nov. 14. Evaluation – Experiment 2  In Case 1, missing.  Not finding the mapping : Without background knowledge.

ISWC2007, Nov. 14. Evaluation – Experiment 2 (cont’d.)  In Case 2: finding  When the values of the attribute type in Event equals to “Research Session” or “Industrial Session”, the subsumption relationship between Event and Conference can be converted to the equivalence relationship.

ISWC2007, Nov. 14. Outline (cont’d.)  Introduction  Our approach  Evaluation  Related work  Summary and future work

ISWC2007, Nov. 14. Related work  Interested by both Database and Semantic Web communities.  At an early stage: visual toolkits, help users specify mappings manually.  At present: discovering mappings (semi-) automatically. For example, COMA, RONTO: –Not considering the structural differences in models; –Not validating the consistency between mappings.  Other research directions: Describing system framework, e.g., OntoGrate; Defining mapping expression language, e.g., R2O; Extending OWL with ICs; Inferring complex mappings, e.g., MapOnto.

ISWC2007, Nov. 14. Summary and future work  Summary  An approach to discovering simple mappings;  An algorithm to build contextual mappings;  Experiments to evaluate our approach.  Future work  Instance matching;  Machine learning techniques for mining semantic mappings;  Others.

ISWC2007, Nov. 14. Thanks for your attention! Any comments are welcome! Tools: Marson, Falcon-AO, OntoSum Services: Falcons (Searching the SW with CSpaces)