Machine Learning Approach for Ontology Mapping using Multiple Concept Similarity Measures IEEE/ACIS International Conference on Computer and Information.

Slides:



Advertisements
Similar presentations
Hoai-Viet To1, Ryutaro Ichise2, and Hoai-Bac Le1
Advertisements

Schema Matching and Query Rewriting in Ontology-based Data Integration Zdeňka Linková ICS AS CR Advisor: Július Štuller.
GridVine: Building Internet-Scale Semantic Overlay Networks By Lan Tian.
Maurice Hermans.  Ontologies  Ontology Mapping  Research Question  String Similarities  Winkler Extension  Proposed Extension  Evaluation  Results.
Query Dependent Pseudo-Relevance Feedback based on Wikipedia SIGIR ‘09 Advisor: Dr. Koh Jia-Ling Speaker: Lin, Yi-Jhen Date: 2010/01/24 1.
Distributed DBMS© M. T. Özsu & P. Valduriez Ch.4/1 Outline Introduction Background Distributed Database Design Database Integration ➡ Schema Matching ➡
An Efficient IP Address Lookup Algorithm Using a Priority Trie Authors: Hyesook Lim and Ju Hyoung Mun Presenter: Yi-Sheng, Lin ( 林意勝 ) Date: Mar. 11, 2008.
A Framework for Ontology-Based Knowledge Management System
Sensemaking and Ground Truth Ontology Development Chinua Umoja William M. Pottenger Jason Perry Christopher Janneck.
Semantic text features from small world graphs Jure Leskovec, IJS + CMU John Shawe-Taylor, Southampton.
June 19-21, 2006WMS'06, Chania, Crete1 Design and Evaluation of Semantic Similarity Measures for Concepts Stemming from the Same or Different Ontologies.
Learning to Match Ontologies on the Semantic Web AnHai Doan Jayant Madhavan Robin Dhamankar Pedro Domingos Alon Halevy.
Toward Making Online Biological Data Machine Understandable Cui Tao Data Extraction Research Group Department of Computer Science, Brigham Young University,
Mapping Techniques and Visualization of Statistical Indicators Haitham Zeidan Palestinian Central Bureau of Statistics IAOS 2014 Conference.
Xiaomeng Su & Jon Atle Gulla Dept. of Computer and Information Science Norwegian University of Science and Technology Trondheim Norway June 2004 Semantic.
Query Rewriting for Extracting Data Behind HTML Forms Xueqi Chen Department of Computer Science Brigham Young University March 31, 2004 Funded by National.
Introduction and Review : Educational Technology 1
1 Prototype Hierarchy Based Clustering for the Categorization and Navigation of Web Collections Zhao-Yan Ming, Kai Wang and Tat-Seng Chua School of Computing,
Learning Table Extraction from Examples Ashwin Tengli, Yiming Yang and Nian Li Ma School of Computer Science Carnegie Mellon University Coling 04.
OMAP: An Implemented Framework for Automatically Aligning OWL Ontologies SWAP, December, 2005 Raphaël Troncy, Umberto Straccia ISTI-CNR
Ontology Matching Basics Ontology Matching by Jerome Euzenat and Pavel Shvaiko Parts I and II 11/6/2012Ontology Matching Basics - PL, CS 6521.
Semantic Matching Pavel Shvaiko Stanford University, October 31, 2003 Paper with Fausto Giunchiglia Research group (alphabetically ordered): Fausto Giunchiglia,
Web Explanations for Semantic Heterogeneity Discovery Pavel Shvaiko 2 nd European Semantic Web Conference (ESWC), 1 June 2005, Crete, Greece work in collaboration.
Retrieval Effectiveness of an Ontology-based Model for Information Selection Khan, L., McLeod, D. & Hovy, E. Presented by Danielle Lee.
Evaluating the Contribution of EuroWordNet and Word Sense Disambiguation to Cross-Language Information Retrieval Paul Clough 1 and Mark Stevenson 2 Department.
Computational Methods to Vocalize Arabic Texts H. Safadi*, O. Al Dakkak** & N. Ghneim**
Personalisation Seminar on Unlocking the Secrets of the Past: Text Mining for Historical Documents Sven Steudter.
Ontology Alignment/Matching Prafulla Palwe. Agenda ► Introduction  Being serious about the semantic web  Living with heterogeneity  Heterogeneity problem.
TM CG Notes Latest: Outline Big Picture architecture Topic Maps – CG – Notio Notes – Amine Notes – CharGer Notes – Prolog – Analogy –
Tag Clouds Revisited Date : 2011/12/12 Source : CIKM’11 Speaker : I- Chih Chiu Advisor : Dr. Koh. Jia-ling 1.
Internet Search Methods 1.01 Understand Internet search tools and methods.
CMPS 3223 Theory of Computation Automata, Computability, & Complexity by Elaine Rich ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Slides provided.
Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher Laura Po and Sonia Bergamaschi DII, University of Modena and Reggio Emilia, Italy.
12th of October, 2006KEG seminar1 Combining Ontology Mapping Methods Using Bayesian Networks Ontology Alignment Evaluation Initiative 'Conference'
CIKM’09 Date:2010/8/24 Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen 1.
Query Processing In Multimedia Databases Dheeraj Kumar Mekala Devarasetty Bhanu Kiran.
Minor Thesis A scalable schema matching framework for relational databases Student: Ahmed Saimon Adam ID: Award: MSc (Computer & Information.
Retrieval Models for Question and Answer Archives Xiaobing Xue, Jiwoon Jeon, W. Bruce Croft Computer Science Department University of Massachusetts, Google,
Date: 2013/8/27 Author: Shinya Tanaka, Adam Jatowt, Makoto P. Kato, Katsumi Tanaka Source: WSDM’13 Advisor: Jia-ling Koh Speaker: Chen-Yu Huang Estimating.
JM - 1 Introduction to Bioinformatics: Lecture III Genome Assembly and String Matching Jarek Meller Jarek Meller Division of Biomedical.
A Collaborative and Semantic Data Management Framework for Ubiquitous Computing Environment International Conference of Embedded and Ubiquitous Computing.
Interoperable Visualization Framework towards enhancing mapping and integration of official statistics Haitham Zeidan Palestinian Central.
Semantics-Based News Recommendation with SF-IDF+ International Conference on Web Intelligence, Mining, and Semantics (WIMS 2013) June 13, 2013 Marnix Moerland.
CISC Machine Learning for Solving Systems Problems Presented by: Ashwani Rao Dept of Computer & Information Sciences University of Delaware Learning.
LOGO 1 Corroborate and Learn Facts from the Web Advisor : Dr. Koh Jia-Ling Speaker : Tu Yi-Lang Date : Shubin Zhao, Jonathan Betz (KDD '07 )
A Logical Framework for Web Service Discovery The Third International Semantic Web Conference Hiroshima, Japan, Michael Kifer 1, Rubén Lara.
Compact Encodings for All Local Path Information in Web Taxonomies with Application to WordNet Svetlana Strunjaš-Yoshikawa Joint with Fred Annexstein and.
1 Masters Thesis Presentation By Debotosh Dey AUTOMATIC CONSTRUCTION OF HASHTAGS HIERARCHIES UNIVERSITAT ROVIRA I VIRGILI Tarragona, June 2015 Supervised.
A Classification-based Approach to Question Answering in Discussion Boards Liangjie Hong, Brian D. Davison Lehigh University (SIGIR ’ 09) Speaker: Cho,
1 Measuring the Semantic Similarity of Texts Author : Courtney Corley and Rada Mihalcea Source : ACL-2005 Reporter : Yong-Xiang Chen.
Learning Taxonomic Relations from Heterogeneous Evidence Philipp Cimiano Aleksander Pivk Lars Schmidt-Thieme Steffen Staab (ECAI 2004)
26/01/20161Gianluca Demartini Ranking Categories for Faceted Search Gianluca Demartini L3S Research Seminars Hannover, 09 June 2006.
Using Wikipedia for Hierarchical Finer Categorization of Named Entities Aasish Pappu Language Technologies Institute Carnegie Mellon University PACLIC.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
INVITATION TO Computer Science 1 11 Chapter 2 The Algorithmic Foundations of Computer Science.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Of 24 lecture 11: ontology – mediation, merging & aligning.
Short Text Similarity with Word Embedding Date: 2016/03/28 Author: Tom Kenter, Maarten de Rijke Source: CIKM’15 Advisor: Jia-Ling Koh Speaker: Chih-Hsuan.
Ontology Engineering and Feature Construction for Predicting Friendship Links in the Live Journal Social Network Author:Vikas Bahirwani 、 Doina Caragea.
Ontology Evaluation Outline Motivation Evaluation Criteria Evaluation Measures Evaluation Approaches.
Do-Gil Lee1*, Ilhwan Kim1 and Seok Kee Lee2
Result of Ontology Alignment with RiMOM at OAEI’06
Statistical Optimal Hash-based Longest Prefix Match
CSc4730/6730 Scientific Visualization
An Empirical Study of Property Collocation on Large Scale of Knowledge Base 龚赛赛
Property consolidation for entity browsing
A Graph-Based Approach to Learn Semantic Descriptions of Data Sources
Block Matching for Ontologies
Actively Learning Ontology Matching via User Interaction
A Semantic Peer-to-Peer Overlay for Web Services Discovery
Presentation transcript:

Machine Learning Approach for Ontology Mapping using Multiple Concept Similarity Measures IEEE/ACIS International Conference on Computer and Information Science Ryutaro Ichise Principles of Informatics Research Division, National Institute of Informatics Speaker : Shun-hong Sie Date : 2009/10/19

Outline Introduction Ontology Mapping Problem Ontology Mapping as a Machine Learning Problem Experimental Conclusions 2

Introduction People use the internet to collect information as a decision mark tool. – To make vacation plans, user conduct research on the internet for suitable lodging, routers, and sightseeing spots. Internet sites are operated by individual enterprise, which mean that we are required to check the sites manually in order to collect information. 3

Introduction In order to solve this problem, the Semantic Web is expected to become a next generation web standard that will be capable of connecting different data resources. The semantics of the data are provided by ontologies for interoperability of the resources. 4

The problem Since ontologies cover a particular domain or use, it is necessary to develop a method to map multiple ontologies in order to increase the coverage of different domain or users. 5

Ontology Mapping Problem Defined – The ontology O contains a set of concepts,, that are organized into a hierarchy. Each concept is labeled by strings and can contain instances. 6

Ontology Mapping Problem How to find a concept in ontology B that corresponds to the concept in ontology A? Concept Instance 7

Ontology Mapping Problem To solve this problem, we think about the combination of concepts among different ontologies. – Defining the value of the combination pair. – Defining the value of pairs of concepts in a concept pair matrix. 8

Ontology Mapping Problem The value is 1 when the two concepts can be mapped and 0 when the two concepts cannot be mapped. In this paper, we assume that the matrix value is binary. 9

Ontology Mapping as a Machine Learning Problem What type of information is available to compose the matrix? – Using a string-matching method, such as concept name matching, and other methods. – Single similarity measure is insufficient for determining the matrix. 10

Ontology Mapping as a Machine Learning Problem Diversity of ontologies. financebank constructionbank 11

Ontology Mapping as a Machine Learning Problem Define matrix values by using multiple similarity values of the concepts. Therefore, we can convert the ontology mapping problem into a machine learning problem by using this framework. 12

Concept Similarity Measures String-based similarity Graph-based similarity Instance classification similarity Knowledge-based similarity 13

Concept Similarity Measures In this paper – Word similarity – Word list similarity – Concept hierarchy similarity – Structure similarity 14

Word similarity Prefix – Eng. vs. England Suffix – Phone vs. telephone Edit distance – Calculate the similarity as a count of the string substitutions, deletions and additions. N-gram – The word is divided into n number of strings, and the similarity is calculated by the number of same string sets. 15

Word similarity Knowledge-based similarity – Use WordNet( as the knowledge resource for calculating the similarity. – WordNet is organized with synsets. Synset Wu & Palmer description Lin 16

WordNet search result 17

WordNet search result 18

Knowledge-based similarity Synset – To calculate the shortest path of the different word pairs using synsets. Wu & Palmer – Use the depth and the least common super- concept (LCS) of words. 19

Knowledge-based similarity description – Utilizes the description of a concept in WordNet. The similarity is calculated as the square of the common word length in both descriptions of the words. Lin 20

Word List Similarity Word similarity measures is not applicable to a word list such as “Food_Wine.” We define two types of similarities: – maximum word similarity – word edit distance. Pyramid Pyramid, Theory Social, Sci. Social, Science Edit distance=1 Edit distance=? 21

Concept Hierarchy Similarity TopSocial_Sci TopSocial_Science Ontology A Ontology B 22

Experimental Evaluation Ontology Alignment Evaluation Initiative (OAEI) 2007 data set. – Google 、 Yahoo 、 Look-smart Includes 4639 pairs of ontologies written in OWL format, with 2265 pairs of the 4639 pairs are correctly matching answers, which are positive examples, and 2374 pairs are incorrectly matching answers, which are negative examples. Use 10-fold cross-validations. – Implemented system called Malform-SVM (Machine learning framework for Ontology Matching using SVM) 23

Experimental Results 24

Experimental Results 25

Conclusions Use multiple similarity measures. Need to investigate the trade-off between the man-hours required for making mapping examples with the performance improvement system in the future. 26