Ontology Alignment for Linked Open Data – ISWC2010 research track Prateek Jain Pascal Hitzler Amit Sheth Kno.e.sis Center Wright State University, Dayton,

Slides:



Advertisements
Similar presentations
eClassifier: Tool for Taxonomies
Advertisements

The 20th International Conference on Software Engineering and Knowledge Engineering (SEKE2008) Department of Electrical and Computer Engineering
S-Match: an Algorithm and an Implementation of Semantic Matching Pavel Shvaiko 1 st European Semantic Web Symposium, 11 May 2004, Crete, Greece paper with.
Chapter 9: Ontology Management Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley, 2005.
GENERATING AUTOMATIC SEMANTIC ANNOTATIONS FOR RESEARCH DATASETS AYUSH SINGHAL AND JAIDEEP SRIVASTAVA CS DEPT., UNIVERSITY OF MINNESOTA, MN, USA.
A Framework for Ontology-Based Knowledge Management System
Linked Sensor Data Harshal Patni, Cory Henson, Amit P. Sheth Ohio Center of Excellence in Knowledge enabled Computing (Kno.e.sis) Wright State University,
April 22, Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Doerre, Peter Gerstl, Roland Seiffert IBM Germany, August 1999 Presenter:
Aki Hecht Seminar in Databases (236826) January 2009
11/8/20051 Ontology Translation on the Semantic Web D. Dou, D. McDermott, P. Qi Computer Science, Yale University Presented by Z. Chen CIS 607 SII, Week.
Department of Computer Science, University of Maryland, College Park 1 Sharath Srinivas - CMSC 818Z, Spring 2007 Semantic Web and Knowledge Representation.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Huimin Ye.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Drew DeHaas.
Semantic Web Technology Evaluation Ontology (SWETO): A test bed for evaluating tools and benchmarking semantic applications WWW2004 (New York, May 22,
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Query Planning for Searching Inter- Dependent Deep-Web Databases Fan Wang 1, Gagan Agrawal 1, Ruoming Jin 2 1 Department of Computer.
Vocabulary Services “Huuh - what is it good for…” (in WDTS anyway…) 4 th September 2009 Jonathan Yu CSIRO Land and Water.
Predicting Missing Provenance Using Semantic Associations in Reservoir Engineering Jing Zhao University of Southern California Sep 19 th,
A Statistical and Schema Independent Approach to Identify Equivalent Properties on Linked Data † Kno.e.sis Center Wright State University Dayton OH, USA.
OMAP: An Implemented Framework for Automatically Aligning OWL Ontologies SWAP, December, 2005 Raphaël Troncy, Umberto Straccia ISTI-CNR
Copyright © 2010 Accenture All Rights Reserved. Accenture, its logo, and High Performance Delivered are trademarks of Accenture. Multiple Ontologies in.
Semantic Interoperability Jérôme Euzenat INRIA & LIG France Natasha Noy Stanford University USA.
Semantic Matching Pavel Shvaiko Stanford University, October 31, 2003 Paper with Fausto Giunchiglia Research group (alphabetically ordered): Fausto Giunchiglia,
Web Explanations for Semantic Heterogeneity Discovery Pavel Shvaiko 2 nd European Semantic Web Conference (ESWC), 1 June 2005, Crete, Greece work in collaboration.
In The Name Of God. Jhaleh Narimisaei By Guide: Dr. Shadgar Implementation of Web Ontology and Semantic Application for Electronic Journal Citation System.
Reasoning with context in the Semantic Web … or contextualizing ontologies Fausto Giunchiglia July 23, 2004.
Ontology Alignment/Matching Prafulla Palwe. Agenda ► Introduction  Being serious about the semantic web  Living with heterogeneity  Heterogeneity problem.
An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.
Krishnaprasad Thirunarayan, Pramod Anantharam, Cory A. Henson, and Amit P. Sheth Kno.e.sis Center, Ohio Center of Excellence on Knowledge-enabled Computing,
Exploiting Wikipedia as External Knowledge for Document Clustering Sakyasingha Dasgupta, Pradeep Ghosh Data Mining and Exploration-Presentation School.
BACKGROUND KNOWLEDGE IN ONTOLOGY MATCHING Pavel Shvaiko joint work with Fausto Giunchiglia and Mikalai Yatskevich INFINT 2007 Bertinoro Workshop on Information.
Environmental Terminology Research in China HE Keqing, HE Yangfan, WANG Chong State Key Lab. Of Software Engineering
Graph Data Management Lab, School of Computer Science gdm.fudan.edu.cn XMLSnippet: A Coding Assistant for XML Configuration Snippet.
INF 384 C, Spring 2009 Ontologies Knowledge representation to support computer reasoning.
Ontologies for the Integration of Geospatial Data Michael Lutz Workshop: Semantics and Ontologies for GI Services, 2006 Paper: Lutz et al., Overcoming.
Knowledge Enabled Information and Services Science Extending SPARQL to Support Spatially and Temporally Related Information Prateek Jain, Amit Sheth Peter.
Knowledge Enabled Information and Services Science SPARQL Query Re-writing for Spatial Datasets Using Partonomy Based Transformation Rules Prateek Jain,
© Copyright 2008 STI INNSBRUCK Media Meets Semantic Web – How the BBC Uses DBpedia and Linked Data to Make Connections.
A Simple Unsupervised Query Categorizer for Web Search Engines Prashant Ullegaddi and Vasudeva Varma Search and Information Extraction Lab Language Technologies.
SWETO: Large-Scale Semantic Web Test-bed Ontology In Action Workshop (Banff Alberta, Canada June 21 st 2004) Boanerges Aleman-MezaBoanerges Aleman-Meza,
Knowledge Representation and Indexing Using the Unified Medical Language System Kenneth Baclawski* Joseph “Jay” Cigna* Mieczyslaw M. Kokar* Peter Major.
Intelligent Database Systems Lab Presenter : WU, MIN-CONG Authors : Jorge Villalon and Rafael A. Calvo 2011, EST Concept Maps as Cognitive Visualizations.
ISURF -An Interoperability Service Utility for Collaborative Supply Chain Planning across Multiple Domains Prof. Dr. Asuman Dogac METU-SRDC Turkey METU.
Web Usage Mining for Semantic Web Personalization جینی شیره شعاعی زهرا.
Logics for Data and Knowledge Representation Applications of ClassL: Lightweight Ontologies.
A Classification of Schema-based Matching Approaches Pavel Shvaiko Meaning Coordination and Negotiation Workshop, ISWC 8 th November 2004, Hiroshima, Japan.
Semantic (Web) Technology in Action - today The Semantic Web – Scientific American article considered harmful? WWW2003 Panel (PN2), Budapest, May 21, 2003.
Logics for Data and Knowledge Representation
Element Level Semantic Matching Pavel Shvaiko Meaning Coordination and Negotiation Workshop, ISWC 8 th November 2004, Hiroshima, Japan Paper by Fausto.
Harvesting Social Knowledge from Folksonomies Harris Wu, Mohammad Zubair, Kurt Maly, Harvesting social knowledge from folksonomies, Proceedings of the.
Named Entity Disambiguation on an Ontology Enriched by Wikipedia Hien Thanh Nguyen 1, Tru Hoang Cao 2 1 Ton Duc Thang University, Vietnam 2 Ho Chi Minh.
Semantic web Bootstrapping & Annotation Hassan Sayyadi Semantic web research laboratory Computer department Sharif university of.
ISO/IEC JTC 1/SC 32 Plenary and WGs Meetings Jeju, Korea, June 25, 2009 Jeong-Dong Kim, Doo-Kwon Baik, Dongwon Jeong {kjd4u,
Knowledge Representation. Keywordsquick way for agents to locate potentially useful information Thesaurimore structured approach than keywords, arranging.
Acquisition of Categorized Named Entities for Web Search Marius Pasca Google Inc. from Conference on Information and Knowledge Management (CIKM) ’04.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Corpus Exploitation from Wikipedia for Ontology Construction Gaoying Cui, Qin Lu, Wenjie Li, Yirong Chen The Department of Computing The Hong Kong Polytechnic.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
Of 24 lecture 11: ontology – mediation, merging & aligning.
Big Data: Every Word Managing Data Data Mining TerminologyData Collection CrowdsourcingSecurity & Validation Universal Translation Monolingual Dictionaries.
Cross-Ontological Relationships
Query Rewriting Framework for Spatial Data
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Restrict Range of Data Collection for Topic Trend Detection
Extracting Semantic Concept Relations
Service-Oriented Computing: Semantics, Processes, Agents
ece 627 intelligent web: ontology and beyond
Property consolidation for entity browsing
[jws13] Evaluation of instance matching tools: The experience of OAEI
Service-Oriented Computing: Semantics, Processes, Agents
Presentation transcript:

Ontology Alignment for Linked Open Data – ISWC2010 research track Prateek Jain Pascal Hitzler Amit Sheth Kno.e.sis Center Wright State University, Dayton, OH TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA A A Kunal Verma Peter Z. Yeh Accenture Technology Labs San Jose, CA

2 Linked Open Data

3 Outline Introduction Motivation Existing Approaches BLOOMS Approach Evaluation Applications Conclusion & Future Work References

4 BLOOMS

5

6 When I was 6 years old…

7 11 years later… Image from Scientific American Website

8 In 2006 Web of Data

9 Is it really mainstream Semantic Web? What is the relationship between the models whose instances are being linked? How to do querying on LOD without knowing individual datasets? How to perform schema level reasoning over LOD cloud?

10 What can be done? Relationships are at the heart of Semantics. LOD captures instance level relationships, but lacks class level relationships. –Superclass –Subclass –Equivalence How to find these relationships? –Perform a matching of the LOD Ontology’s using state of the art ontology matching tools. Desirable –Considering the size of LOD, at least have results which a human can curate.

11 Outline Introduction Motivation Existing Approaches BLOOMS Approach Evaluation Applications Conclusion & Future Work References

12 Existing Approaches A survey of approaches to automatic Ontology matching by Erhard Rahm, Philip A. Bernstein in the VLDB Journal 10: 334–350 (2001)

13 LOD Ontology Alignment Existing systems have difficulty in matching LOD Ontologys!  Nation = Menstruation, Confidence=0.9 They perform extremely well on established benchmarks, but typically not in the wilds. LOD Ontology’s are of very different nature Created by community for community. Emphasis on number of instances, not number of meaningful relationships. Require solutions beyond syntactic and structural matching.

14 Outline Introduction Motivation Existing Approaches BLOOMS Approach Evaluation Applications Conclusion & Future Work References

15 Something else changed..

16

17 Our Approach Use knowledge contributed by users To improve Structured knowledge contributed by users

18 Rabbit out of a hat? Traditional auxiliary data sources like (WordNet, Upper Level Ontologies) have limited coverage and are insufficient for LOD datasets. LOD datasets have diverse domains Community generated data although noisy but is rich in Content Structure Has a “self healing property” Pro blems like Ontology Matching have a dimension of context associated with them. Since community generated data is created by diverse set of people, hence captures diverse context.

19 Wikipedia The English version alone contains more than 2.9 million articles. It is continually expanded by approximately 100,000 active volunteer editors world-wide. Allows multiple points of view to be mentioned with their proper contexts. Article creation/correction is an ongoing activity with no down time.

20 Ontology Matching on LOD using Wikipedia Categorization On Wikipedia, categories are used to organize the entire project. Wikipedia's category system consists of overlapping trees. Simple rules for categorization –“If logical membership of one category implies logical membership of a second, then the first category should be made a subcategory” –“Pages are not placed directly into every possible category, only into the most specific one in any branch” –“Every Wikipedia article should belong to at least one category.”

21 BLOOMS Approach – Step 1 Pre-process the input ontology Remove property restrictions Remove individuals, properties Tokenize the class names Remove underscores, hyphens and other delimiters Breakdown complex class names –example: SemanticWeb => Semantic Web

22 BLOOMS Approach – Step 2 For each concept name processed in the previous step –Identify article in Wikipedia corresponding to the concept. –Each article related to the concept indicates a sense of the usage of the word. For each article found in the previous step –Identify the Wikipedia category to which it belongs. –For each category found, find its parent categories till level 4. Once the “BLOOMS tree” for each of the sense of the source concept is created (T s ), utilize it for comparison with the “BLOOMS tree” of the target concepts (T t ). –BLOOMS trees are created for individual senses of the concepts.

23 BLOOMS Approach – Step 3 In the tree T s, remove all nodes for which the parent node which occurs in T t to create T s ’. –All leaves of Ts are of level 4 or occur in Tt. –The pruned nodes do not contribute any additional new knowledge. Compute overlap O s between the source and target tree. –Os= n/(k-1) –n = |z|, z ε Ts’ Π Tt –k= |s|, s ε Ts’ The decision of alignment is made as follows. –For Ts ε Tc and Tt ε Td, we have Ts=Tt, then C=D. –If min{o(Ts,Tt),o(Tt,Ts)} ≥ x, then set C rdfs:subClassOf D if o(Ts,Tt) ≤ o(Tt, Ts), and set D rdfs:subClassOf C if o(Ts, Tt) ≥ o(Tt, Ts).

24 Example

25 Outline Introduction Motivation Existing Approaches BLOOMS Approach Evaluation Applications Conclusion & Future Work References

26 Evaluation Objectives Examine BLOOMS as a tool for the purpose of LOD ontology matching. Examine the ability of BLOOMS to serve as a general purpose ontology matching system.

27 BLOOMS

28 BLOOMS

29 Outline Introduction Motivation Existing Approaches BLOOMS Approach Evaluation Applications Conclusion & Future Work References

30 Potential Applications Schema level reasoning over LOD. Identification and rectification of contradictory/misleading assertions –Population of London is X (Geonames) / Population of London is Y (DBpedia), but geonames London is same as Dbpedia London. –Hollywood is a country. (Really?) Enabling intelligent federated querying of LOD –Beyond merely crawling. –Terminological difference can be resolved automatically.

31 Outline Introduction Motivation Existing Approaches BLOOMS Approach Evaluation Applications Conclusion & Future Work References

32 Conclusion State of the art tools fail to scale up to the requirements of LOD ontologies. There is plenty of knowledge presented in community generated data which can be harnessed for improving itself.

33 Future Work New ways for computing overlap –Penalize nodes which match at lower levels –Give priority to leftmost categories over rightmost categories. Context based matching –Harness implicit and explicit contextual information in matching. –Provide user with matches and the context of matching. Use “committee” of auxiliary data sources for matching. BLOOMS based smart federated querying framework of LOD.

34 References Prateek Jain, Pascal Hitzler, Amit P. Sheth, Kunal Verma, Peter Z. Yeh: Ontology Alignment for Linked Open Data. Proceedings of the 9th International Semantic Web Conference 2010, Shanghai, China, November 7th-11th, Pages Prateek Jain, Pascal Hitzler, Peter Z. Yeh, Kunal Verma, and Amit P.Sheth, Linked Data Is Merely More Data. In: Dan Brickley, Vinay K. Chaudhri, Harry Halpin, and Deborah McGuinness: Linked Data Meets Artificial Intelligence. Technical Report SS-10-07, AAAI Press, Menlo Park, California, 2010, pp ISBN Prateek Jain, Pascal Hitzler, Amit P. ShethFlexible Bootstrapping-Based Ontology AlignmentIn: P. Shvaiko, J. Euzenat, F. Giunchiglia, H. Stuckenschmidt, M. Mao, I. Cruz (eds.), Ontology Matching, OM Proceedings of the 5th International Workshop on Ontology Matching, at ISWC2010, Shanghai, China, November 2010, pp

Thank You! Questions?