SCHEMA-BASED SEMANTIC MATCHING Pavel Shvaiko joint work on “semantic matching” with Fausto Giunchiglia and Mikalai Yatskevich joint work on “ontology matching”

Slides:



Advertisements
Similar presentations
Three-Step Database Design
Advertisements

Discovering Missing Background Knowledge in Ontology Matching Pavel Shvaiko 17th European Conference on Artificial Intelligence (ECAI06) 30 August 2006,
Schema Matching and Query Rewriting in Ontology-based Data Integration Zdeňka Linková ICS AS CR Advisor: Július Štuller.
S-Match: an Algorithm and an Implementation of Semantic Matching Fausto Giunchiglia July 2004, Hannover, Germany work in collaboration with Pavel Shvaiko.
S-Match: an Algorithm and an Implementation of Semantic Matching Pavel Shvaiko 1 st European Semantic Web Symposium, 11 May 2004, Crete, Greece paper with.
Distributed DBMS© M. T. Özsu & P. Valduriez Ch.4/1 Outline Introduction Background Distributed Database Design Database Integration ➡ Schema Matching ➡
Reducing the Cost of Validating Mapping Compositions by Exploiting Semantic Relationships Eduard C. Dragut Ramon Lawrence Eduard C. Dragut Ramon Lawrence.
Dynamic Ontology Matching Pavel Shvaiko OpenKnowledge meetings 9 February, 13 March, 2006 Trento, Italy.
Xyleme A Dynamic Warehouse for XML Data of the Web.
NaLIX: A Generic Natural Language Search Environment for XML Data Presented by: Erik Mathisen 02/12/2008.
Generic Schema Matching with Cupid Jayant Madhavan Philip A. Bernstein Erhard Raham Proceedings of the 27 th VLDB Conference.
QoM: Qualitative and Quantitative Measure of Schema Matching Naiyana Tansalarak and Kajal T. Claypool (Kajal Claypool - presenter) University of Massachusetts,
CIS607, Fall 2005 Semantic Information Integration Article Name: Clio Grows Up: From Research Prototype to Industrial Tool Name: DH(Dong Hwi) kwak Date:
SemanTic Interoperability To access Cultural Heritage Frank van Harmelen Henk Matthezing Peter Wittenburg Marjolein van Gendt Antoine Isaac Lourens van.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Partners Using NLP Techniques for Meaning Negotiation Bernardo Magnini, Luciano Serafini and Manuela Speranza ITC-irst, via Sommarive 18, I Trento-Povo,
Knowledge Mediation in the WWW based on Labelled DAGs with Attached Constraints Jutta Eusterbrock WebTechnology GmbH.
OMAP: An Implemented Framework for Automatically Aligning OWL Ontologies SWAP, December, 2005 Raphaël Troncy, Umberto Straccia ISTI-CNR
Semantic Interoperability Jérôme Euzenat INRIA & LIG France Natasha Noy Stanford University USA.
Ontology Matching Basics Ontology Matching by Jerome Euzenat and Pavel Shvaiko Parts I and II 11/6/2012Ontology Matching Basics - PL, CS 6521.
Semantic Matching Pavel Shvaiko Stanford University, October 31, 2003 Paper with Fausto Giunchiglia Research group (alphabetically ordered): Fausto Giunchiglia,
Web Explanations for Semantic Heterogeneity Discovery Pavel Shvaiko 2 nd European Semantic Web Conference (ESWC), 1 June 2005, Crete, Greece work in collaboration.
XML-to-Relational Schema Mapping Algorithm ODTDMap Speaker: Artem Chebotko* Wayne State University Joint work with Mustafa Atay,
LDK R Logics for Data and Knowledge Representation Semantic Matching.
Reasoning with context in the Semantic Web … or contextualizing ontologies Fausto Giunchiglia July 23, 2004.
Ontology Alignment/Matching Prafulla Palwe. Agenda ► Introduction  Being serious about the semantic web  Living with heterogeneity  Heterogeneity problem.
Logics for Data and Knowledge Representation Semantic Matching.
An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.
BACKGROUND KNOWLEDGE IN ONTOLOGY MATCHING Pavel Shvaiko joint work with Fausto Giunchiglia and Mikalai Yatskevich INFINT 2007 Bertinoro Workshop on Information.
Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher Laura Po and Sonia Bergamaschi DII, University of Modena and Reggio Emilia, Italy.
Machine Learning Approach for Ontology Mapping using Multiple Concept Similarity Measures IEEE/ACIS International Conference on Computer and Information.
12th of October, 2006KEG seminar1 Combining Ontology Mapping Methods Using Bayesian Networks Ontology Alignment Evaluation Initiative 'Conference'
PART IV: REPRESENTING, EXPLAINING, AND PROCESSING ALIGNMENTS & PART V: CONCLUSIONS Ontology Matching Jerome Euzenat and Pavel Shvaiko.
Semantic Matching Fausto Giunchiglia work in collaboration with Pavel Shvaiko The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, 2003.
Of 33 lecture 10: ontology – evolution. of 33 ece 720, winter ‘122 ontology evolution introduction - ontologies enable knowledge to be made explicit and.
Semantic Enrichment of Ontology Mappings: A Linguistic-based Approach Patrick Arnold, Erhard Rahm University of Leipzig, Germany 17th East-European Conference.
10/18/20151 Business Process Management and Semantic Technologies B. Ramamurthy.
Dimitrios Skoutas Alkis Simitsis
Ontology Alignment. Ontologies in biomedical research many biomedical ontologies e.g. GO, OBO, SNOMED-CT practical use of biomedical ontologies e.g. databases.
CSE 636 Data Integration Schema Matching Cupid Fall 2006.
Interoperable Visualization Framework towards enhancing mapping and integration of official statistics Haitham Zeidan Palestinian Central.
Logics for Data and Knowledge Representation Applications of ClassL: Lightweight Ontologies.
Advanced topics in software engineering (Semantic web)
A Classification of Schema-based Matching Approaches Pavel Shvaiko Meaning Coordination and Negotiation Workshop, ISWC 8 th November 2004, Hiroshima, Japan.
P2P Concept Search Fausto Giunchiglia Uladzimir Kharkevich S.R.H Noori April 21st, 2009, Madrid, Spain.
Logics for Data and Knowledge Representation
Object-Oriented Modeling: Static Models. Object-Oriented Modeling Model the system as interacting objects Model the system as interacting objects Match.
ISWC2007, Nov. 14. Discovering simple mappings between Relational database schemas and ontologies Wei Hu, Yuzhong Qu {whu,
Element Level Semantic Matching Pavel Shvaiko Meaning Coordination and Negotiation Workshop, ISWC 8 th November 2004, Hiroshima, Japan Paper by Fausto.
Mining Document Collections to Facilitate Accurate Approximate Entity Matching Presented By Harshda Vabale.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
University of the Aegean AI – LAB ESWC 2008 From Conceptual to Instance Matching George A. Vouros AI Lab Department of Information and Communication Systems.
272: Software Engineering Fall 2012 Instructor: Tevfik Bultan Lecture 9: Test Generation from Models.
32nd International Conference on Very Large Data Bases September , 2006 Seoul, Korea Efficient Detection of Empty Result Queries Gang Luo IBM T.J.
Presented by Kyumars Sheykh Esmaili Description Logics for Data Bases (DLHB,Chapter 16) Semantic Web Seminar.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
Of 24 lecture 11: ontology – mediation, merging & aligning.
SERVICE ANNOTATION WITH LEXICON-BASED ALIGNMENT Service Ontology Construction Ontology of a given web service, service ontology, is constructed from service.
1 Representing and Reasoning on XML Documents: A Description Logic Approach D. Calvanese, G. D. Giacomo, M. Lenzerini Presented by Daisy Yutao Guo University.
An Efficient Method for Computing Alignment Diagnoses
Cross-Ontological Relationships
Element Level Semantic Matching
Result of Ontology Alignment with RiMOM at OAEI’06
Logics for Data and Knowledge Representation
SAT-Based Area Recovery in Technology Mapping
ece 627 intelligent web: ontology and beyond
[jws13] Evaluation of instance matching tools: The experience of OAEI
Business Process Management and Semantic Technologies
Presentation transcript:

SCHEMA-BASED SEMANTIC MATCHING Pavel Shvaiko joint work on “semantic matching” with Fausto Giunchiglia and Mikalai Yatskevich joint work on “ontology matching” with Jérôme Euzenat 1st European Semantic Technology Conference (ESTC’07), Semantic Web Technology Showcase 31 May 2007, Vienna, Austria

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 2 Outline Part I: The matching problem Part II: State of the art in ontology matching Part III: Schema-based semantic matching Part IV: Evaluation (technology showcase) Part V: Conclusions

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 3 Outline Part I: The matching problem Problem statement Applications Part II: State of the art in ontology matching Part III: Schema-based semantic matching Part IV: Evaluation (technology showcase) Part V: Conclusions

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 4 Matching operation Matching operation takes as input ontologies, each consisting of a set of discrete entities (e.g., tables, XML elements, classes, properties) and determines as output the relationships (e.g., equivalence, subsumption) holding between these entities

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 5 Example: two XML schemas Equivalence Generality Disjointness

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 6 Example: two ontologies Equivalence Generality Disjointness.year =

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 7 Statement of the problem Scope Reducing heterogeneity can be performed in two steps: Match, thereby determine the alignment Process the alignment (merge, transform, translate...)

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 8 Statement of the problem Correspondence is a 5-tuple id is a unique identifier of the given correspondence e1 and e2 are entities (XML elements, classes,...) R is a relation (equivalence, more general, disjointness,...) n is a confidence measure, typically in the [0,1] range Alignment (A) is a set of correspondences with some cardinality: 1-1, 1-n,... some other properties (complete)

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 9 Statement of the problem Alignment A p (weights,.. ) r (WordNet, …) Matching Alignment A’ Ontology O1 Ontology O2 Matching process

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 10 Applications Traditional Ontology evolution Schema integration Catalog integration Data integration Emergent P2P information sharing Web service composition Agent communication Query answering on the web

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 11 Common Ontology Local Ontology 1 wrapper 1 Local Ontology n wrapper n Q: find an article about Ontology Matching A: “Discovering missing background knowledge in ontology matching” by F. Giunchiglia, P. Shvaiko, M. Yatskevich. In Proceedings of ECAI, 2006 Matcher Alignment n Applications: Information integration Matcher Alignment 1

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 12 Applications: summary Applicationoperation Ontology evolution  transformation Schema integration  merging Catalog integration  data translation Data integration  query answering P2P information sharing  query answering Web service composition  data mediation Multi-agent communication  data translation Query answering  query reformulation instances run time automatic correctcomplete

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 13 Outline Part I: The matching problem Part II: State of the art in ontology matching Classification of matching techniques Overview of matching systems Part III: Schema-based semantic matching Part IV: Evaluation (technology showcase) Part V: Conclusions

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 14 Classification of basic techniques Three layers The upper layer Granularity of match Interpretation of the input information The middle layer represents classes of elementary (basic) matching techniques The lower layer is based on the kind of input which is used by elementary matching techniques

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 15 Classification of techniques (simplified) Element level Structure level Syntactic External Semantic External String- based Graph- based Constraint- based Linguistic resource Repository of structures Model- based Language- based Upper, domain specific formal ontologies - Names Datatypes Thesauri Taxonomic strcuture Strcuture’s metadata - SAT - DL based - Tokenization FMA -... Terminological Semantic Structural LinguisticInternal Relational

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 16 Basic techniques String-based Edit distance It takes as input two strings and calculates the number of insertions, deletions, and substitutions of characters required to transform one string into another, normalized by max(length(string1), length(string2)) EditDistance(NKN,Nikon) = 0.4

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 17 Basic techniques (cont’d) Linguistic resources: WordNet It computes relations between ontology entities by using (lexical) relationships of WordNet A  B if A is a hyponym or meronym of B Brand  Name A  B if A is a hypernym or holonym of B Europe  Greece A = B if they are synonyms Quantity = Amount A  B if they are antonyms or siblings in part of hierarchy Microprocessors  PC Board

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 18 Systems: analytical comparison SFArtemisCupidCOMAPromptOLAS-Match string-based, data types, key properties domain compatibility, language- based string-based, language-based, data types, key properties string-based language-based, data types string-based, domains and ranges string-based, data types, language-based string-based, language- based - common thesaurus (CT) auxiliary dictionary -WordNet iterative fix-point computation matching of neighbors via CT tree matching weighted by leaves DAG (tree) matching with a bias towards leaf or children structures bounded path matching (arbitrary links, is-a links) iterative fix-point computation, matching of neighbors SAT ~50 matching systems exist, …we consider some of them Element-level Structure-level Syntactic External Semantic

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 19 Outline Part I: The matching problem Part II: State of the art in ontology matching Part III: Schema-based semantic matching Semantic matching Iterative semantic matching Part IV: Evaluation (technology showcase) Part V: Conclusions

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 20 Generic matching Information sources (classifications, XML schemas, …) can be viewed as graph-like structures containing terms and their inter-relationships Matching takes two graph-like structures and produces correspondences between the nodes of the graphs that are supposed to correspond to each other

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 21 Semantic matching in a nutshell Semantic matching: Given two graphs G1 and G2, for any node n 1 i  G1, find the strongest semantic relation R’ holding with node n 2 j  G2 Computed R’s, listed in the decreasing binding strength order: equivalence { = } more general/specific {, } disjointness {  } I don’t know {idk} We compute semantic relations by analyzing the meaning (concepts, not labels) which is codified in the elements and the structures of ontologies Technically, labels at nodes written in natural language are translated into propositional logical formulas which explicitly codify the labels’ intended meaning. This allows us to codify the matching problem into a propositional validity problem

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 22 Concept of a label & concept at a node Concept of a label is the propositional formula which stands for the set of documents that one would classify under a label it encodes Concept at a node is the propositional formula which represents the set of documents which one would classify under a node, given that it has a certain label and that it is in a certain position in a tree Cameras and Photo PC board Electronics Digital Cameras PC

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 23 1.For all labels in T1 and T2 compute concepts at labels 2.For all nodes in T1 and T2 compute concepts at nodes 3.For all pairs of labels in T1 and T2 compute relations between concepts at labels (background knowledge) 4.For all pairs of nodes in T1 and T2 compute relations between concepts at nodes Steps 1 and 2 constitute the preprocessing phase, and are executed once and each time after the ontology is changed (OFF- LINE part) Steps 3 and 4 constitute the matching phase, and are executed every time two ontologies are to be matched (ON - LINE part) Four macro steps Given two labeled trees T1 and T2, do:

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 24 Step 1: compute concepts at labels The idea Translate labels at nodes written in natural language into propositional logical formulas which explicitly codify the labels’ intended meaning Preprocessing Tokenization. Labels (according to punctuation, spaces, etc.) are parsed into tokens. E.g., Photo and Cameras  Lemmatization. Tokens are morphologically analyzed in order to find all their possible basic forms. E.g., Cameras  Camera Building atomic concepts. An oracle (WordNet) is used to extract senses of lemmas. E.g., Camera has 2 senses Building complex concepts. Prepositions, conjunctions are translated into logical connectives and used to build complex concepts out of the atomic concepts E.g., C Cameras_and_Photo =

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 25 Step 2: compute concepts at nodes The idea Extend concepts at labels by capturing the knowledge residing in a structure of a tree in order to define a context in which the given concept at a label occurs Computation Concept at a node for some node n is computed as a conjunction of concepts at labels located above the given node, including the node itself C 2 = C Electronics C PC Conjunctive Disjunctive C 4 = C Electronics (C Cameras C Photo ) C Digital Cameras Two types of concepts of nodes Cameras and Photo Electronics Digital Cameras PC

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 26 Step 3: compute relations between (atomic) concepts at labels The idea Exploit a priori knowledge, e.g., lexical, domain knowledge, with the help of element level semantic matchers O1O2 Cameras 2 Photo 2 Digital_Cameras 2 Photo 1 idk= Cameras 1 =idk cLabsMatrix (result of Step 3)

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 27 Step 3: Element level semantic matchers Sense-based matchers have two WordNet senses in input and produce semantic relations exploiting (direct) lexical relations of WordNet String-based matchers have two labels in input and produce semantic relations exploiting string comparison techniques Matcher name Execution order Approximation level Matcher type Schema info WordNet11Sense-basedWordNet senses Prefix22String-basedLabels Suffix32String-basedLabels Edit distance42String-basedLabels Ngram52String-basedLabels

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 28 Step 4: compute relations between concepts at nodes The idea Decompose the tree matching problem into the set of node matching problems Translate each node matching problem, namely pairs of nodes with possible relations between them, into a propositional formula Check the propositional formula for validity

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 29 Step 4: Example of a node matching task Axioms  rel(context 1, context 2 ) O1O2 ? (Electronics 1  Electronics 2 )  (Personal_Computers 1  PC 2 )  (Electronics 1  Personal_Computers 1 )  (Electronics 2  PC 2 ) Axioms context 1 context 2

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 30 Step 4: Efficient semantic matching Conjunctive concepts at nodes Matching formula is Horn Satisfiability can be determined in linear time SAT solver requires quadratic time We developed ad hoc linear time reasoning procedure Avoid conversion to propositional formula Reason on the axioms matrix Disjunctive concepts at nodes Matching formula is not in CNF by construction Most SAT solvers require the input formula to be in CNF Conversion to CNF may lead to exponential space explosion Exploit structure preserving transformation Size of formula in CNF is linear with respect to original formula

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 31 Outline Part I: The matching problem Part II: State of the art in ontology matching Part III: Schema-based semantic matching Semantic matching Iterative semantic matching Part IV: Evaluation (technology showcase) Part V: Conclusions

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 32 Motivation: Problem of low recall (incompletness) - I Facts Matching (usually) has two components: element level matching and structure level matching Contrarily to many other systems, the semantic matching structure level algorithm is correct and complete Still, the quality of results is not very good Why?... the problem of lack of knowledge recall

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 33 Motivation: Problem of low recall (incompletness) - II Preliminary (analytical) evaluation Matching tasks #nodesmax depth#labels per tree Google vs Looksmart 706/108111/161048/1715 Google vs Yahoo 561/66511/11722/945 Yahoo vs Looksmart 74/1408/10101/222 Dataset [P. Avesani et al., ISWC’05] E

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 34 On increasing the recall: an overview Multiple strategies Strengthen element level matchers Reuse of previous match results from the same domain of interest PO = Purchase Order Use general knowledge sources (unlikely to help) WWW Use, if available (!), domain specific sources of knowledge UMLS, FMA

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 35 Iterative semantic matching (ISM) The idea Repeat Step 3 and Step 4 of the matching algorithm for some critical (hard) matching tasks ISM macro steps Discover critical points in the matching process Generate candidate missing axiom(s) Re-run SAT solver on a critical task taking into account the new axiom(s) If SAT returns false, save the newly discovered axiom(s) for future reuse

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 36 ISM: Discovering critical points - example Google (T1)Looksmart (T2) cLabsMatrix (result of Step 3) cNodesMatrix (result of Step 4) TOP 1 Games 1 Board_Games 1 TOP 2 = idk Entertainment 2 idk Games 2 idk = C1 1 C1 2 C1 3 C1 4 C1 9 C1 10 C1 11 C2 1 = C2 3 idk

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 37 ISM: Generating candidate axioms Sense-based matchers have two WordNet senses in input and produce semantic relations exploiting structural properties of WordNet hierarchies Hierarchy Distance (HD) Gloss-based matchers have two WordNet senses as input and produce relations exploiting gloss comparison techniques WordNet Gloss (WNG) Extended WordNet Gloss (EWNG) Gloss Comparison (GC)

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 38 ISM: generating candidate axioms Hierarchy Distance Hierarchy distance returns the equivalence relation if the distance between two input senses in WordNet hierarchy is less than a given threshold value (e.g., 3) and idk otherwise Distance between these concepts is 2 (1 more general link and 1 less general). Thus, we can conclude that games and entertainment are close in their meaning and return the equivalence relation diversion entertainment games There is no direct relation between games and entertainment in WordNet

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 39 Outline Part I: The matching problem Part II: State of the art in ontology matching Part III: Schema-based semantic matching Part IV: Evaluation (technology showcase) Evaluation setup Evaluation results Part V: Conclusions

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 40 True negatives (TN) Reference alignment Alignment True positives (TP) False positives (FP) False negatives (FN) Complete set of correspondences Evaluation (quality) measures

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 41 Test cases #Matching task#nodesmax depth#labels per tree 1 Images vs Europe4/52/26/5 2Product schemas 13/144/414/15 3Yahoo Finance vs Standard 10/162/222/45 4Cornell vs Washington 34/393/362/64 5CIDX vs Excel 34/393/356/58 6Google vs Looksmart 706/108111/161048/1715 7Google vs Yahoo 561/66511/11722/945 8Yahoo vs Looksmart 74/1408/10101/222 9Iconclass vs Aria 999/5539/32688/835

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 42 Matching systems Schema-based systems S-Match Cupid COMA Similarity Flooding as implemented in Rondo OAEI-2005 and OAEI-2006 participants Systems were used in default configurations PC: PIV 1,7Ghz; 512Mb. RAM; Win XP

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 43 Outline Part I: The matching problem Part II: State of the art in ontology matching Part III: Schema-based semantic matching Part IV: Evaluation (technology showcase) Evaluation setup Evaluation results Part V: Conclusions

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 44 Experimental results, test case #4

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 45 Experimental results, test case #5

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 46 Experimental results, #3,6,7,8: efficiency Yahoo-Standard Looksmart -Yahoo Google-Yahoo Google-Looksmart

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 47 Experimental results, #6,7,8: incompleteness OAEI-2005

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 48 Experimental results, #6,7,8: incompleteness (OAEI-2006 comparison)

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 49 Preliminary results, test case #9 Precision, %Recall, %F-measure, % S-Match44,826,4511,29 Iterative S-Match 47,696,611,59 Observations The dataset is hard and challenging Why do we have such a low recall? Gloss-like labels Aria: Top>Accessories>Jewelry Iconclass: Top>Nature>earth, world as celestial body>rock types; minerals and metals; soil types>rock types>precious and semiprecious stones>precious and semiprecious stones (with NAME)>precious and semiprecious stones: emerald

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 50 Outline Thesis contributions Part I: The matching problem Part II: State of the art in ontology matching Part III: Schema-based semantic matching Part IV: Evaluation (technology showcase) Part V: Conclusions

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 51 Summary Ontology matching applications and their requirements Overview of the state of the art, including classification of matching techniques and systems Semantic matching approach, including algorithms for basic, efficient and iterative semantic matching Evaluation of the approach on various data sets with encouraging results

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 52 Automated reasoning techniques (e.g., SAT) provide good performance for industrial-strength matching tasks The issue is not efficiency but rather missing domain knowledge This problem on the industrial size matching tasks is very hard We have investigated it by examples of light weight ontologies, such as Google and Yahoo Partial solution by applying semantic matching iteratively Summary (cont’d)

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 53 Future challenges Missing background knowledge Interactive approaches Explanations of matching results Social and collaborative ontology matching Large-scale evaluation Infrastructures...

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 54 Future challenges: scalability of visualization

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 55 References Project website - KNOWDIVE: Ontology Matching website: F. Giunchiglia, M. Yatskevich, P. Shvaiko: Semantic matching: algorithms and implementation. Journal on Data Semantics, IX, F. Giunchiglia, P. Shvaiko, M. Yatskevich: Discovering missing background knowledge in ontology matching. In Proceedings of ECAI'06. P. Shvaiko and J. Euzenat: A survey of schema-based matching approaches. Journal on Data Semantics, IV, P. Shvaiko, J. Euzenat, N. Noy, H. Stuckenschmidt, R. Benjamins, M. Uschold. Proceedings of the ISWC International Workshop on Ontology Matching, P. Avesani, F. Giunchiglia, M. Yatskevich: A large scale taxonomy mapping evaluation. In Proceedings of ISWC'05. B. Magnini, M. Speranza, C. Girardi. A semantic-based approach to interoperability of classification hierarchies: Evaluation of linguistic techniques. In Proceedings of COLING'04. P. Bouquet, L. Serafini, S. Zanobini: Semantic coordination: a new approach and an application. In Proceedings of ISWC'03. C. Ghidini, F. Giunchiglia: Local models semantics, or contextual reasoning = locality + compatibility. Artificial Intelligence Journal, 127(3), 2001.

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 56 Ontology ISWC’07+ASWC’07 Ontology Alignment Evaluation Initiative OAEI–2007 campaign

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 57

Semantic Web Technololgy Show Case at ESTC’07, Vienna, Austria 58 Thank you for your attention and interest!