Semantic Matching Pavel Shvaiko Stanford University, October 31, 2003 Paper with Fausto Giunchiglia Research group (alphabetically ordered): Fausto Giunchiglia,

Slides:



Advertisements
Similar presentations
Discovering Missing Background Knowledge in Ontology Matching Pavel Shvaiko 17th European Conference on Artificial Intelligence (ECAI06) 30 August 2006,
Advertisements

S-Match: an Algorithm and an Implementation of Semantic Matching Fausto Giunchiglia July 2004, Hannover, Germany work in collaboration with Pavel Shvaiko.
S-Match: an Algorithm and an Implementation of Semantic Matching Pavel Shvaiko 1 st European Semantic Web Symposium, 11 May 2004, Crete, Greece paper with.
An Introduction to the Model Verifier verds Wenhui Zhang September 15 th, 2010.
Amit Shvarchenberg and Rafi Sayag. Based on a paper by: Robin Dhamankar, Yoonkyong Lee, AnHai Doan Department of Computer Science University of Illinois,
C-OWL: contextualizing ontologies Fausto Giunchiglia October 22, 2003 Paolo Bouquet, Fausto Giunchiglia, Frank van Harmelen, Luciano Serafini, and Heiner.
Provenance in Open Distributed Information Systems Syed Imran Jami PhD Candidate FAST-NU.
Distributed DBMS© M. T. Özsu & P. Valduriez Ch.4/1 Outline Introduction Background Distributed Database Design Database Integration ➡ Schema Matching ➡
Dynamic Ontology Matching Pavel Shvaiko OpenKnowledge meetings 9 February, 13 March, 2006 Trento, Italy.
Interactive Generation of Integrated Schemas Laura Chiticariu et al. Presented by: Meher Talat Shaikh.
NaLIX: A Generic Natural Language Search Environment for XML Data Presented by: Erik Mathisen 02/12/2008.
A Review of Ontology Mapping, Merging, and Integration Presenter: Yihong Ding.
24/1/20081 Architecture of multiple databases integration.
D2I Modena, 27 Aprile 2001 Methodologies and techniques for translating information from source to target data models Unità Responsabile: CS-RC Unità Coinvolte:
Generic Schema Matching with Cupid Jayant Madhavan Philip A. Bernstein Erhard Raham Proceedings of the 27 th VLDB Conference.
Slides adapted from A. Silberschatz et al. Database System Concepts, 5th Ed. Entity-Relationship Model Database Management Systems I Alex Coman, Winter.
QoM: Qualitative and Quantitative Measure of Schema Matching Naiyana Tansalarak and Kajal T. Claypool (Kajal Claypool - presenter) University of Massachusetts,
Sangam: A Transformation Modeling Framework Kajal T. Claypool (U Mass Lowell) and Elke A. Rundensteiner (WPI)
CIS607, Fall 2005 Semantic Information Integration Article Name: Clio Grows Up: From Research Prototype to Industrial Tool Name: DH(Dong Hwi) kwak Date:
BYU Data Extraction Group Funded by NSF1 Brigham Young University Li Xu Source Discovery and Schema Mapping for Data Integration.
Query Rewriting for Extracting Data Behind HTML Forms Xueqi Chen Department of Computer Science Brigham Young University March 31, 2004 Funded by National.
ONTOLOGY MATCHING Part III: Systems and evaluation.
Partners Using NLP Techniques for Meaning Negotiation Bernardo Magnini, Luciano Serafini and Manuela Speranza ITC-irst, via Sommarive 18, I Trento-Povo,
OMAP: An Implemented Framework for Automatically Aligning OWL Ontologies SWAP, December, 2005 Raphaël Troncy, Umberto Straccia ISTI-CNR
Semantic Interoperability Jérôme Euzenat INRIA & LIG France Natasha Noy Stanford University USA.
Ontology Matching Basics Ontology Matching by Jerome Euzenat and Pavel Shvaiko Parts I and II 11/6/2012Ontology Matching Basics - PL, CS 6521.
A survey of approaches to automatic schema matching Erhard Rahm, Universität für Informatik, Leipzig Philip A. Bernstein, Microsoft Research VLDB 2001.
Web Explanations for Semantic Heterogeneity Discovery Pavel Shvaiko 2 nd European Semantic Web Conference (ESWC), 1 June 2005, Crete, Greece work in collaboration.
XML-to-Relational Schema Mapping Algorithm ODTDMap Speaker: Artem Chebotko* Wayne State University Joint work with Mustafa Atay,
LDK R Logics for Data and Knowledge Representation Semantic Matching.
An Extension to XML Schema for Structured Data Processing Presented by: Jacky Ma Date: 10 April 2002.
Reasoning with context in the Semantic Web … or contextualizing ontologies Fausto Giunchiglia July 23, 2004.
SCHEMA-BASED SEMANTIC MATCHING Pavel Shvaiko joint work on “semantic matching” with Fausto Giunchiglia and Mikalai Yatskevich joint work on “ontology matching”
Ontology Alignment/Matching Prafulla Palwe. Agenda ► Introduction  Being serious about the semantic web  Living with heterogeneity  Heterogeneity problem.
Logics for Data and Knowledge Representation Semantic Matching.
BACKGROUND KNOWLEDGE IN ONTOLOGY MATCHING Pavel Shvaiko joint work with Fausto Giunchiglia and Mikalai Yatskevich INFINT 2007 Bertinoro Workshop on Information.
Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher Laura Po and Sonia Bergamaschi DII, University of Modena and Reggio Emilia, Italy.
12th of October, 2006KEG seminar1 Combining Ontology Mapping Methods Using Bayesian Networks Ontology Alignment Evaluation Initiative 'Conference'
PART IV: REPRESENTING, EXPLAINING, AND PROCESSING ALIGNMENTS & PART V: CONCLUSIONS Ontology Matching Jerome Euzenat and Pavel Shvaiko.
A SURVEY OF APPROACHES TO AUTOMATIC SCHEMA MATCHING Sushant Vemparala Gaurang Telang.
Semantic Matching Fausto Giunchiglia work in collaboration with Pavel Shvaiko The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, 2003.
Semantic Enrichment of Ontology Mappings: A Linguistic-based Approach Patrick Arnold, Erhard Rahm University of Leipzig, Germany 17th East-European Conference.
Theory and Application of Database Systems A Hybrid Approach for Extending Ontology from Text He Wei.
Minor Thesis A scalable schema matching framework for relational databases Student: Ahmed Saimon Adam ID: Award: MSc (Computer & Information.
September 5-7, Trento Deriving “sub-source” similarities from heterogeneous, semi-stuctured information sources D. Rosaci, G. Terracina, D. Ursino Dipartimento.
Dimitrios Skoutas Alkis Simitsis
RRXS Redundancy reducing XML storage in relations O. MERT ERKUŞ A. ONUR DOĞUÇ
CSE 636 Data Integration Schema Matching Cupid Fall 2006.
A Classification of Schema-based Matching Approaches Pavel Shvaiko Meaning Coordination and Negotiation Workshop, ISWC 8 th November 2004, Hiroshima, Japan.
P2P Concept Search Fausto Giunchiglia Uladzimir Kharkevich S.R.H Noori April 21st, 2009, Madrid, Spain.
Object Oriented Multi-Database Systems An Overview of Chapters 4 and 5.
A Survey of Approaches to Automatic Schema Matching (VLDB Journal, 2001) November 7, 2008 IDB SNU Presented by Kangpyo Lee.
ISWC2007, Nov. 14. Discovering simple mappings between Relational database schemas and ontologies Wei Hu, Yuzhong Qu {whu,
Ontology Mapping in Pervasive Computing Environment C.Y. Kong, C.L. Wang, F.C.M. Lau The University of Hong Kong.
Element Level Semantic Matching Pavel Shvaiko Meaning Coordination and Negotiation Workshop, ISWC 8 th November 2004, Hiroshima, Japan Paper by Fausto.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
University of the Aegean AI – LAB ESWC 2008 From Conceptual to Instance Matching George A. Vouros AI Lab Department of Information and Communication Systems.
Ferdowsi University of Mashhad 1 Automatic Semantic Web Service Composition based on owl-s Research Proposal presented by : Toktam ghafarian.
LDK R Logics for Data and Knowledge Representation Propositional Logic Originally by Alessandro Agostini and Fausto Giunchiglia Modified by Fausto Giunchiglia,
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Sheet 1MDAFA2004 Linköping, June 2004 A Language for Model Transformations in the MOF Architecture Ivan Kurtev, Klaas van den Berg University of Twente,
Logics for Data and Knowledge Representation ClassL (part 1): syntax and semantics.
Of 24 lecture 11: ontology – mediation, merging & aligning.
SERVICE ANNOTATION WITH LEXICON-BASED ALIGNMENT Service Ontology Construction Ontology of a given web service, service ontology, is constructed from service.
Element Level Semantic Matching
Logics for Data and Knowledge Representation
Logics for Data and Knowledge Representation
Logics for Data and Knowledge Representation
MOMA - A Mapping-based Object Matching System
Presentation transcript:

Semantic Matching Pavel Shvaiko Stanford University, October 31, 2003 Paper with Fausto Giunchiglia Research group (alphabetically ordered): Fausto Giunchiglia, Pavel Shvaiko, Mikalai Yatskevich, Ilya Zaihrayeu

Stanford University, October 31, Outline Matching Syntactic Matching Semantic Matching On Implementing Semantic Matching Conclusions

Stanford University, October 31, MATCHING

Stanford University, October 31, Application Domains Generic Model Management Schema integration Data warehouses E-commerce Data Coordination in P2P systems, Semantic Web

Stanford University, October 31, Example of Matching Arts Organizations Art History Music Baroque History Organizations Arts&Humanities Art History Design Art Baroque Architecture History S c =0.9 S r ={  } S c =1.0 S r ={  } S r ={  }

Stanford University, October 31, Matching Match is an operator that takes two graph-like structures (e.g., database schemas or ontologies) and produces a mapping between elements of the two graphs that correspond semantically to each other

Stanford University, October 31, Matching The problem of matching can be decomposed in two steps: Extract graphs from the data and conceptual models Match the resulting graphs (generic matching)

Stanford University, October 31, Matching Mapping element is a 4-tuple m ID is a unique identifier of the given mapping element; N i 1 is the i-th node of the first graph, N j 2 is the j-th node of the second graph, R specifies a similarity relation of the given nodes Mapping is a set of mapping elements Matching is the process of discovering mappings between two graphs through the application of a matching algorithm

Stanford University, October 31, Matching Semantic Matching Syntactic Matching R is computed between labels at nodes R = {x  [0,1]} R is computed between concepts at nodes R = { =, , , ,  } Matching: Syntactic AND Semantic

Stanford University, October 31, SYNTACTIC MATCHING

Stanford University, October 31, Syntactic Matching Mapping element is a 4-tuple, where L i 1 is the label at the i-th node of the first graph; L j 2 is the label at the j-th node of the second graph; R specifies a similarity relation in the form of a coefficient, which measures the similarity between the labels of the given nodes Example: R is a similarity coefficient in [0,1] R =

Stanford University, October 31, The State of the Art Cupid … is a hybrid matching prototype. It exploits linguistic and structural schema matching heuristics, and computes similarity coefficients between nodes of the trees. Similarity Flooding … is a hybrid matching prototype. It uses fix-point computation to determine correspondences between nodes of the graphs. COMA …is a composite matching prototype. It provides an extensible library of different matchers which manipulate DAGs and supports various ways of combining final results. As far as we know, so far only syntactic matching…

Stanford University, October 31, SEMANTIC MATCHING

Stanford University, October 31, Semantic Matching Mapping element is a 4-tuple, where C i 1 is the concept of the i-th node of the first graph; C j 2 is the concept of the j-th node of the second graph; R specifies a similarity relation in the form of a semantic relation between the extensions of concepts at the given nodes Possible R’s: equality {=}, overlapping {  }, mismatch {  }, more general/specific { ,  } Example: R =

Stanford University, October 31, Examples: Analysis of Ancestors. Case 1 Suppose that we want to match nodes 5 1 and 1 2 Cupid does not find a similarity coefficient between the nodes under consideration, due to the significant differences in structure of the given graphs Semantic matching: The concept denoted by the label at node 5 1 is C C 1, while the concept at node 5 1 is C 5 1 = C A 1  C C 1. The concept at node 1 2 is C 1 2 = C C 2. Thus, C 5 1  C 1 2

Stanford University, October 31, Examples: Analysis of Ancestors. Case 2 Suppose that we want to match nodes 5 1 and 5 2 Cupid: R= 0,86. This is because of the identity of labels A 1 =A 2, C 1 =C 2 Semantic matching: The concept at node 5 1 is C 5 1 = C A 1  C C 1 ; while the concept at node 5 2 is C 5 2 = C A 2  *  C C 2. Since we have that C A 1 =C A 2 and C C 1 =C C 2, then C 5 2  C 5 1

Stanford University, October 31, ON IMPLEMENTING SEMANTIC MATCHING

Stanford University, October 31, On Implementation Semantic Matching Structure - level Element - level Weak Semantics Techniques Strong Semantics Techniques

Stanford University, October 31, Element-level Semantic Matching Weak Semantics Techniques Analysis of strings {=} Analysis of data types {=, , , ,  } Analysis of soundex {=} Strong Semantics Techniques Precompiled thesaurus syn key WordNet, where #1 … sense number 1 of the word Art according to WordNet

Stanford University, October 31, Element-level Semantic Matching (cont.) Semantic Relations via WordNet Equality: one concept is equal to another if there is at least one sense of the first concept, which is a synonym of the second Overlapping: one concept is overlapped with the other if there are some senses in common Mismatch: two concepts are mismatched if they have no sense in common More general: one concept is more general then the other iff there exists at least one sense of the first concept that has a sense of the other as a hyponym or meronym Less general: one concept is less general than the other iff there exists at least one sense of the first concept that has a sense of the other concept as hypernym or as a holonym

Stanford University, October 31, Structure-level Semantic Matching We translate the matching problem, namely the two graphs (in particular, the pair of nodes submitted to matching) into a propositional formula and then check for its validity We check for validity using SAT

Stanford University, October 31, Semantic Matching Algorithm 1. Extract the two graphs 2. Compute element-level semantic matching 3. Compute concepts at nodes 4. Construct the propositional formula 5. Run SAT 6. Perform iterations

Stanford University, October 31, Semantic Matching Algorithm: Example – (1) Extract the two graphs In the case of RDB, XML and OODB schemas, it is necessary to extract useful semantic information, for instance in the form of ontologies

Stanford University, October 31, Semantic Matching Algorithm: Example – (2) Element-level semantic matching. For each node, compute semantic relations holding among all the concepts denoted by labels at nodes under consideration C A 1 = C A 2 C B 1 = C B 2 C C 1 = C C 2 C D 1 = C D 2 C E 1 = C E 2

Stanford University, October 31, Semantic Matching Algorithm: Example – (3) Compute concepts at nodes. Suppose, we want to find a semantic relation between nodes 5 1 and 1 2 ? C 1 1 = C A 1 C 5 1 = C A 1  C C 2 C 1 2 = C C 2

Stanford University, October 31, Semantic Matching Algorithm: Example – (4) Construct the propositional formula. We translate all the semantic relations computed in step 2 into propositional formulas under the following rules:  C A 1  C A 2  C A 2  C A 1  C A 1  C A 2  C A 1  C A 2  C A 1 = C A 2  C A 1  C A 2  C A 1  C A 2   (C A 1  C A 2 ) ? From step 2 we have: C C 1  C C 2. We want to prove that C 5 1  C 1 2 ( we guess relation between nodes at this stage) (CA1  CC1)  CC2(CA1  CC1)  CC2 (C C 1  C C 2 )  ((C A 1  C C 1 )  C C 2 ) …

Stanford University, October 31, Semantic Matching Algorithm: Example – (5) Run SAT In order to prove that (C C 1  C C 2 )  ((C A 1  C C 1 )  C C 2 ) is valid, we prove that its negation is unsatisfiabile (C C 1  C C 2 )  ((C A 1  C C 1 )  C C 2 ) SAT returns FALSE Thus, C 5 1  C 1 2

Stanford University, October 31, Example: Cupid vs. Semantic Matching Arts Organizations Art History Music Baroque History Organizations Arts&Humanities Art History Design Art Baroque Architecture History {}{} {}{} {}{} {}{} {}{} {}{}

Stanford University, October 31, Conclusions We have made a rational reconstruction of the major matching problems and articulated them in terms of the more generic problem of matching graphs We have identified semantic matching as a new approach for performing generic matching We have proposed an implementation of semantic matching using SAT

Stanford University, October 31, Future Work Extend to a full graph matcher How to extract semantics from schemas Study how to take into account attributes and instances Develop an efficient implementation of the system Do a thorough testing of the system

Stanford University, October 31, References Project website: F. Giunchiglia, P.Shvaiko “Semantic Matching”. Technical Report #DIT Also to appear in The Knowledge Engineering Review journal. Short version in proceedings of Semantic Integration workshop at ISWC’03. F. Giunchiglia, I. Zaihrayeu “Making peer databases interact – a vision for an architecture supporting data coordination” In Proc. Of the Conference of Information Agents (CIA 2002), Madrid, 2002.