Element Level Semantic Matching Pavel Shvaiko Meaning Coordination and Negotiation Workshop, ISWC 8 th November 2004, Hiroshima, Japan Paper by Fausto.

Slides:



Advertisements
Similar presentations
Discovering Missing Background Knowledge in Ontology Matching Pavel Shvaiko 17th European Conference on Artificial Intelligence (ECAI06) 30 August 2006,
Advertisements

Schema Matching and Query Rewriting in Ontology-based Data Integration Zdeňka Linková ICS AS CR Advisor: Július Štuller.
S-Match: an Algorithm and an Implementation of Semantic Matching Fausto Giunchiglia July 2004, Hannover, Germany work in collaboration with Pavel Shvaiko.
S-Match: an Algorithm and an Implementation of Semantic Matching Pavel Shvaiko 1 st European Semantic Web Symposium, 11 May 2004, Crete, Greece paper with.
Improved TF-IDF Ranker
Fast Algorithms For Hierarchical Range Histogram Constructions
1 Extended Gloss Overlaps as a Measure of Semantic Relatedness Satanjeev Banerjee Ted Pedersen Carnegie Mellon University University of Minnesota Duluth.
By Murat Şensoy Ontology Alignment by Murat Şensoy
Maurice Hermans.  Ontologies  Ontology Mapping  Research Question  String Similarities  Winkler Extension  Proposed Extension  Evaluation  Results.
CMPT 354, Simon Fraser University, Fall 2008, Martin Ester 52 Database Systems I Relational Algebra.
Designing clustering methods for ontology building: The Mo’K workbench Authors: Gilles Bisson, Claire Nédellec and Dolores Cañamero Presenter: Ovidiu Fortu.
Learning syntactic patterns for automatic hypernym discovery Rion Snow, Daniel Jurafsky and Andrew Y. Ng Prepared by Ang Sun
1 Relational Algebra and Calculus Yanlei Diao UMass Amherst Feb 1, 2007 Slides Courtesy of R. Ramakrishnan and J. Gehrke.
Partners Using NLP Techniques for Meaning Negotiation Bernardo Magnini, Luciano Serafini and Manuela Speranza ITC-irst, via Sommarive 18, I Trento-Povo,
Semantic Matching Pavel Shvaiko Stanford University, October 31, 2003 Paper with Fausto Giunchiglia Research group (alphabetically ordered): Fausto Giunchiglia,
Web Explanations for Semantic Heterogeneity Discovery Pavel Shvaiko 2 nd European Semantic Web Conference (ESWC), 1 June 2005, Crete, Greece work in collaboration.
2.2 A Simple Syntax-Directed Translator Syntax-Directed Translation 2.4 Parsing 2.5 A Translator for Simple Expressions 2.6 Lexical Analysis.
LDK R Logics for Data and Knowledge Representation Semantic Matching.
Reasoning with context in the Semantic Web … or contextualizing ontologies Fausto Giunchiglia July 23, 2004.
SCHEMA-BASED SEMANTIC MATCHING Pavel Shvaiko joint work on “semantic matching” with Fausto Giunchiglia and Mikalai Yatskevich joint work on “ontology matching”
Ontology Alignment/Matching Prafulla Palwe. Agenda ► Introduction  Being serious about the semantic web  Living with heterogeneity  Heterogeneity problem.
Logics for Data and Knowledge Representation Semantic Matching.
An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.
Logics for Data and Knowledge Representation
BACKGROUND KNOWLEDGE IN ONTOLOGY MATCHING Pavel Shvaiko joint work with Fausto Giunchiglia and Mikalai Yatskevich INFINT 2007 Bertinoro Workshop on Information.
Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher Laura Po and Sonia Bergamaschi DII, University of Modena and Reggio Emilia, Italy.
12th of October, 2006KEG seminar1 Combining Ontology Mapping Methods Using Bayesian Networks Ontology Alignment Evaluation Initiative 'Conference'
Chapter 10: Compilers and Language Translation Invitation to Computer Science, Java Version, Third Edition.
PART IV: REPRESENTING, EXPLAINING, AND PROCESSING ALIGNMENTS & PART V: CONCLUSIONS Ontology Matching Jerome Euzenat and Pavel Shvaiko.
LDK R Logics for Data and Knowledge Representation Lightweight Ontologies.
Semantic Matching Fausto Giunchiglia work in collaboration with Pavel Shvaiko The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, 2003.
LDK R Logics for Data and Knowledge Representation Modeling First version by Alessandro Agostini and Fausto Giunchiglia Second version by Fausto Giunchiglia.
LDK R Logics for Data and Knowledge Representation Application of (Ground) ClassL.
1 University of Palestine Topics In CIS ITBS 3202 Ms. Eman Alajrami 2 nd Semester
RCDL Conference, Petrozavodsk, Russia Context-Based Retrieval in Digital Libraries: Approach and Technological Framework Kurt Sandkuhl, Alexander Smirnov,
Prepared by: Mahmoud Rafeek Al-Farra College of Science & Technology Dep. Of Computer Science & IT BCs of Information Technology Data Mining
RRXS Redundancy reducing XML storage in relations O. MERT ERKUŞ A. ONUR DOĞUÇ
Logics for Data and Knowledge Representation Applications of ClassL: Lightweight Ontologies.
A Classification of Schema-based Matching Approaches Pavel Shvaiko Meaning Coordination and Negotiation Workshop, ISWC 8 th November 2004, Hiroshima, Japan.
P2P Concept Search Fausto Giunchiglia Uladzimir Kharkevich S.R.H Noori April 21st, 2009, Madrid, Spain.
Semantic web course – Computer Engineering Department – Sharif Univ. of Technology – Fall Knowledge Representation Semantic Web - Fall 2005 Computer.
Logics for Data and Knowledge Representation
LDK R Logics for Data and Knowledge Representation PL of Classes.
XML Access Control Koukis Dimitris Padeleris Pashalis.
Logical Agents Chapter 7. Outline Knowledge-based agents Logic in general Propositional (Boolean) logic Equivalence, validity, satisfiability.
1 Masters Thesis Presentation By Debotosh Dey AUTOMATIC CONSTRUCTION OF HASHTAGS HIERARCHIES UNIVERSITAT ROVIRA I VIRGILI Tarragona, June 2015 Supervised.
Answer Mining by Combining Extraction Techniques with Abductive Reasoning Sanda Harabagiu, Dan Moldovan, Christine Clark, Mitchell Bowden, Jown Williams.
Syntax and Semantics Form and Meaning of Programming Languages Copyright © by Curt Hill.
Concepts and Realization of a Diagram Editor Generator Based on Hypergraph Transformation Author: Mark Minas Presenter: Song Gu.
Programming Languages and Design Lecture 2 Syntax Specifications of Programming Languages Instructor: Li Ma Department of Computer Science Texas Southern.
LDK R Logics for Data and Knowledge Representation Propositional Logic Originally by Alessandro Agostini and Fausto Giunchiglia Modified by Fausto Giunchiglia,
First-Order Logic Semantics Reading: Chapter 8, , FOL Syntax and Semantics read: FOL Knowledge Engineering read: FOL.
Presented by Kyumars Sheykh Esmaili Description Logics for Data Bases (DLHB,Chapter 16) Semantic Web Seminar.
Overview of Compilation Prepared by Manuel E. Bermúdez, Ph.D. Associate Professor University of Florida Programming Language Principles Lecture 2.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Artificial Intelligence Knowledge Representation.
Logics for Data and Knowledge Representation ClassL (part 1): syntax and semantics.
Of 24 lecture 11: ontology – mediation, merging & aligning.
Mapping the NCI Thesaurus and the Collaborative Inter-Lingual Index Amanda Hicks University of Florida HealthInsight Workshop, Oslo, Norway.
SERVICE ANNOTATION WITH LEXICON-BASED ALIGNMENT Service Ontology Construction Ontology of a given web service, service ontology, is constructed from service.
A Simple Syntax-Directed Translator
Overview of Compilation The Compiler Front End
Overview of Compilation The Compiler Front End
Element Level Semantic Matching
Logics for Data and Knowledge Representation
Logics for Data and Knowledge Representation
Logics for Data and Knowledge Representation
Semantic Similarity Methods in WordNet and their Application to Information Retrieval on the Web Yizhe Ge.
Chapter 10: Compilers and Language Translation
Lec00-outline May 18, 2019 Compiler Design CS416 Compiler Design.
Presentation transcript:

Element Level Semantic Matching Pavel Shvaiko Meaning Coordination and Negotiation Workshop, ISWC 8 th November 2004, Hiroshima, Japan Paper by Fausto Giunchiglia and Mikalai Yatskevich

MCN workshop, ISWC, 8 th November 2004, Hiroshima, Japan 2 Outline Semantic Matching The S-Match Algorithm Element Level Semantic Matchers Conclusions and Future Work

MCN workshop, ISWC, 8 th November 2004, Hiroshima, Japan 3 Semantic Matching

MCN workshop, ISWC, 8 th November 2004, Hiroshima, Japan 4 Matching Semantic Matching Syntactic Matching Relations are computed between labels at nodes R = {x  [0,1]} Relations are computed between concepts at nodes R = { =,,, , } Note: all previous systems in the above sense are syntactic… Note: First implementation CTXmatch [Bouquet et al ] Matching: given two graph-like structures (e.g., concept hierarchies or ontologies), produce a mapping between the nodes of the graphs that semantically correspond to each other

MCN workshop, ISWC, 8 th November 2004, Hiroshima, Japan 5 Concept of a Label Wine and Cheese Italy Europe Austria Pictures The idea: Labels in classification hierarchies are used to define the set of documents one would like to classify under the node holding the label A label has an intended meaning, which is what this label means in the world Concept of a label is the set of documents that are about what the label means in the world

MCN workshop, ISWC, 8 th November 2004, Hiroshima, Japan 6 Concept of a Node Wine and Cheese Italy Europe Austria Pictures Observations: the semantics of a label are the real world semantics the semantics of the concept of a label are in the space of documents the relation being that the documents in the extension of the concept of a label are about what the label means in the real world Concept of a node is the set of documents that we would classify under this node, given it has a certain label and it is positioned in a certain place in the tree The idea: Trees add structure which allows us to perform the classification of documents more effectively

MCN workshop, ISWC, 8 th November 2004, Hiroshima, Japan 7 Mapping element is a 4-tuple, where ID ij is a unique identifier of the given mapping element; n 1 i is the i-th node of the first graph; n 2 j is the j-th node of the second graph; R specifies a semantic relation between the concepts at the given nodes. Semantic Matching Semantic Matching: Given two graphs G1 and G2, for any node n 1 i  G1, find the strongest semantic relation R’ holding with node n 2 j  G2 Computed R’s, listed in the decreasing binding strength order: equivalence { = }; more general/specific {, }; mismatch {  }; overlapping { }.

MCN workshop, ISWC, 8 th November 2004, Hiroshima, Japan 8 The S-Match Algorithm

MCN workshop, ISWC, 8 th November 2004, Hiroshima, Japan 9 1. For all labels in T A and T B compute concepts at labels 2. For all nodes in T A and T B compute concepts at nodes 3. For all pairs of labels in T A and T B compute relations between concepts at labels 4. For all pairs of nodes in T A and T B compute relations between concepts at nodes Steps 1 and 2 constitute the preprocessing phase, and are executed once and each time after the tree is changed (OFF- LINE part) Steps 3 and 4 constitute the matching phase, and are executed every time the two trees are to be matched (ON - LINE part) Four Macro Steps Given two labeled trees T A and T B, do:

MCN workshop, ISWC, 8 th November 2004, Hiroshima, Japan 10 Step 1: compute concepts at labels The idea: Translate natural language expressions into internal formal language Compute concepts based on possible senses of words in a label and their interrelations Preprocessing: Tokenization. Labels (according to punctuation, spaces, etc.) are parsed into tokens. E.g., Wine and Cheese  ; Lemmatization. Tokens are morphologically analyzed in order to find all their possible basic forms. E.g., images  image; Building atomic concepts. An oracle (WordNet) is used to extract senses of lemmatized tokens. E.g., Image has 8 senses, 7 as a noun and 1 as a verb; Building complex concepts. Prepositions, conjunctions, etc. are translated into logical connectives and used to build complex concepts out of the atomic concepts E.g., C Wine and Cheese =, where U is a union of the senses that WordNet attaches to lemmatized tokens

MCN workshop, ISWC, 8 th November 2004, Hiroshima, Japan 11 Step 2: compute concepts at nodes The idea: extend concepts at labels by capturing the knowledge residing in a structure of a graph in order to define a context in which the given concept at a label occurs Computation: Concept at a node for some node n is computed as a conjunction of concepts at labels located above the given node, including the node itself Wine and Cheese Italy Europe Austria Pictures C 4 = C Europe C Pictures C Italy

MCN workshop, ISWC, 8 th November 2004, Hiroshima, Japan 12 Step 3: compute relations between concepts at labels The idea: Exploit a priori knowledge, e.g., lexical, domain knowledge with the help of element level semantic matchers Results of step 3: Italy Europe Wine and Cheese Austria Pictures Europe Italy Austria Images TATA TBTB T2  = C Italy =  C Austria = C Europe = C Images C Austria C Italy C Pictures C Europe T1 C Wine C Cheese

MCN workshop, ISWC, 8 th November 2004, Hiroshima, Japan 13 Step 4: compute relations between concepts at nodes axioms  rel (Context A i, Context B j ) A propositional formula is valid iff its negation is unsatisfiable The idea: Reduce the matching problem to a validity problem We take the relations between concepts at labels computed in step 3 as axioms for reasoning about relations between concepts at nodes rel = { =,,, , }. Construct the propositional formula (Context A i in Tree1 and Context B j in Tree2)

MCN workshop, ISWC, 8 th November 2004, Hiroshima, Japan 14 Element Level Semantic Matchers

MCN workshop, ISWC, 8 th November 2004, Hiroshima, Japan 15 Element level Semantic Matchers String based matchers have two labels as input and produce semantic relations exploiting string comparison techniques Sense based matchers have two WordNet senses in input and produce semantic relations exploiting structural properties of WordNet hierarchies Gloss based matchers have two WordNet senses as input and produce relations exploiting gloss comparison techniques

MCN workshop, ISWC, 8 th November 2004, Hiroshima, Japan 16 Element level Semantic Matchers: Overview

MCN workshop, ISWC, 8 th November 2004, Hiroshima, Japan 17 String based matchers: Prefix Prefix: checks whether one input label starts with the other. It returns an equivalence relation in this case, and Idk otherwise Prefix is efficient in matching cognate words and similar acronyms (e.g., RDF and RDFS) but often syntactic similarity does not imply semantic relatedness The matcher returns equality for hot and hotel which is wrong but it recognizes the right relations in the case of the pairs net, network and cat, core

MCN workshop, ISWC, 8 th November 2004, Hiroshima, Japan 18 String based matchers: Edit distance Edit Distance: calculates the edit distance measure between two labels. The calculation includes counting the number of the simple editing operations i.e., delete, insert and replace needed to convert one label into another. It is normalized by the max number of characters in the two labels If the value exceeds a given threshold the equivalence relation is returned, otherwise, Idk is produced

MCN workshop, ISWC, 8 th November 2004, Hiroshima, Japan 19 Sense based matchers: WordNet WordNet: returns a lexical relation which holds in WordNet between two input labels Lexical relations provided by WordNet are converted to semantic relations according to the following rules: A  B if A is a hyponym or meronym of B; A  B if A is a hypernym or holonym of B; A = B if they are connected by synonym relation or they belong to one synset; A  B if they are connected by antonymy relation or they are the siblings in the part of hierarchy

MCN workshop, ISWC, 8 th November 2004, Hiroshima, Japan 20 Sense based matchers: Hierarchy distance Hierarchy distance: returns the equivalence relation if the distance between two input senses in a WordNet hierarchy is less than a given threshold value and Idk otherwise There is no direct relation between red and pink in WordNet. However, the distance between these concepts is 2 (1 more general link and 1 less general). Thus, we can infer that red and pink are close in their meaning and return the equivalence relation

MCN workshop, ISWC, 8 th November 2004, Hiroshima, Japan 21 Gloss based matchers: WordNet gloss WordNet gloss: compares the labels of the first input sense with the WordNet gloss of the second one Hound is any of several breeds of dog used for hunting typically having large drooping ears. Hound is described through the specification of the more general concept dog

MCN workshop, ISWC, 8 th November 2004, Hiroshima, Japan 22 Gloss based matchers: Gloss comparison Gloss comparison: The number of the same words occurring in the two input glosses increases the similarity value The equivalence relation is returned if the resulting similarity value exceeds a given threshold Maltese dog is a breed of toy dogs having a long straight silky white coat Afghan hound is a tall graceful breed of hound with a long silky coat; native to the Near East

MCN workshop, ISWC, 8 th November 2004, Hiroshima, Japan 23 Conclusions and Future Work Library of element level semantic matchers The result returned by a matcher is always a semantic relation or idk Evaluate element level semantic matchers Identify the element level matchers better suited for particular matching tasks

MCN workshop, ISWC, 8 th November 2004, Hiroshima, Japan 24 Thank you!