Ontology Alignment/Matching Prafulla Palwe. Agenda ► Introduction  Being serious about the semantic web  Living with heterogeneity  Heterogeneity problem.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Schema Matching and Query Rewriting in Ontology-based Data Integration Zdeňka Linková ICS AS CR Advisor: Július Štuller.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
Chronos: A Tool for Handling Temporal Ontologies in Protégé
April 15, 2004SPIE1 Association in Level 2 Fusion Mieczyslaw M. Kokar Christopher J. Matheus Jerzy A. Letkowski Kenneth Baclawski Paul Kogut.
GridVine: Building Internet-Scale Semantic Overlay Networks By Lan Tian.
Semantic integration of data in database systems and ontologies
By Murat Şensoy Ontology Alignment by Murat Şensoy
UNCERTML - DESCRIBING AND COMMUNICATING UNCERTAINTY Matthew Williams
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
Ontology Notes are from:
Dynamic Ontology Matching Pavel Shvaiko OpenKnowledge meetings 9 February, 13 March, 2006 Trento, Italy.
Aki Hecht Seminar in Databases (236826) January 2009
Description Logics. Outline Knowledge Representation Knowledge Representation Ontology Language Ontology Language Description Logics Description Logics.
Semantics For the Semantic Web: The Implicit, the Formal and The Powerful Amit Sheth, Cartic Ramakrishnan, Christopher Thomas CS751 Spring 2005 Presenter:
Distributed Database Management Systems. Reading Textbook: Ch. 4 Textbook: Ch. 4 FarkasCSCE Spring
QoM: Qualitative and Quantitative Measure of Schema Matching Naiyana Tansalarak and Kajal T. Claypool (Kajal Claypool - presenter) University of Massachusetts,
CIS607, Fall 2005 Semantic Information Integration Article Name: Clio Grows Up: From Research Prototype to Industrial Tool Name: DH(Dong Hwi) kwak Date:
SemanTic Interoperability To access Cultural Heritage Frank van Harmelen Henk Matthezing Peter Wittenburg Marjolein van Gendt Antoine Isaac Lourens van.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Knowledge Mediation in the WWW based on Labelled DAGs with Attached Constraints Jutta Eusterbrock WebTechnology GmbH.
Amarnath Gupta Univ. of California San Diego. An Abstract Question There is no concrete answer …but …
OMAP: An Implemented Framework for Automatically Aligning OWL Ontologies SWAP, December, 2005 Raphaël Troncy, Umberto Straccia ISTI-CNR
Semantic Interoperability Jérôme Euzenat INRIA & LIG France Natasha Noy Stanford University USA.
Ontology Matching Basics Ontology Matching by Jerome Euzenat and Pavel Shvaiko Parts I and II 11/6/2012Ontology Matching Basics - PL, CS 6521.
RDF (Resource Description Framework) Why?. XML XML is a metalanguage that allows users to define markup XML separates content and structure from formatting.
LDK R Logics for Data and Knowledge Representation Semantic Matching.
Ontology Development Kenneth Baclawski Northeastern University Harvard Medical School.
Logics for Data and Knowledge Representation Semantic Matching.
An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.
BACKGROUND KNOWLEDGE IN ONTOLOGY MATCHING Pavel Shvaiko joint work with Fausto Giunchiglia and Mikalai Yatskevich INFINT 2007 Bertinoro Workshop on Information.
An Introduction to Description Logics. What Are Description Logics? A family of logic based Knowledge Representation formalisms –Descendants of semantic.
PART IV: REPRESENTING, EXPLAINING, AND PROCESSING ALIGNMENTS & PART V: CONCLUSIONS Ontology Matching Jerome Euzenat and Pavel Shvaiko.
Interpreting Dictionary Definitions Dan Tecuci May 2002.
RDF and OWL Developing Semantic Web Services by H. Peter Alesso and Craig F. Smith CMPT 455/826 - Week 6, Day Sept-Dec 2009 – w6d21.
Building an Ontology of Semantic Web Techniques Utilizing RDF Schema and OWL 2.0 in Protégé 4.0 Presented by: Naveed Javed Nimat Umar Syed.
A view-based approach for semantic service descriptions Carsten Jacob, Heiko Pfeffer, Stephan Steglich, Li Yan, and Ma Qifeng
Querying Structured Text in an XML Database By Xuemei Luo.
10/18/20151 Business Process Management and Semantic Technologies B. Ramamurthy.
Minor Thesis A scalable schema matching framework for relational databases Student: Ahmed Saimon Adam ID: Award: MSc (Computer & Information.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Dimitrios Skoutas Alkis Simitsis
updated CmpE 583 Fall 2008 Ontology Integration- 1 CmpE 583- Web Semantics: Theory and Practice ONTOLOGY INTEGRATION Atilla ELÇİ Computer.
Definition of a taxonomy “System for naming and organizing things into groups that share similar characteristics” Taxonomy Architectures Applications.
RRXS Redundancy reducing XML storage in relations O. MERT ERKUŞ A. ONUR DOĞUÇ
Interoperable Visualization Framework towards enhancing mapping and integration of official statistics Haitham Zeidan Palestinian Central.
Coastal Atlas Interoperability - Ontologies (Advanced topics that we did not get to in detail) Luis Bermudez Stephanie Watson Marine Metadata Interoperability.
Semantic Web - an introduction By Daniel Wu (danielwujr)
UNCERTML - DESCRIBING AND COMMUNICATING UNCERTAINTY WITHIN THE (SEMANTIC) WEB Matthew Williams
©Ferenc Vajda 1 Semantic Grid Ferenc Vajda Computer and Automation Research Institute Hungarian Academy of Sciences.
A Classification of Schema-based Matching Approaches Pavel Shvaiko Meaning Coordination and Negotiation Workshop, ISWC 8 th November 2004, Hiroshima, Japan.
GREGORY SILVER KUSHEL RIA BELLPADY JOHN MILLER KRYS KOCHUT WILLIAM YORK Supporting Interoperability Using the Discrete-event Modeling Ontology (DeMO)
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Some questions -What is metadata? -Data about data.
Element Level Semantic Matching Pavel Shvaiko Meaning Coordination and Negotiation Workshop, ISWC 8 th November 2004, Hiroshima, Japan Paper by Fausto.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
THEORY OF COMPUTATION Komate AMPHAWAN 1. 2.
Working with XML. Markup Languages Text-based languages based on SGML Text-based languages based on SGML SGML = Standard Generalized Markup Language SGML.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Selecting Relevant Documents Assume: –we already have a corpus of documents defined. –goal is to return a subset of those documents. –Individual documents.
Ccs.  Ontologies are used to capture knowledge about some domain of interest. ◦ An ontology describes the concepts in the domain and also the relationships.
Of 24 lecture 11: ontology – mediation, merging & aligning.
1 Representing and Reasoning on XML Documents: A Description Logic Approach D. Calvanese, G. D. Giacomo, M. Lenzerini Presented by Daisy Yutao Guo University.
Describing Syntax and Semantics
Cross-Ontological Relationships
Overview of Compilation The Compiler Front End
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
Element Level Semantic Matching
A Semantic Peer-to-Peer Overlay for Web Services Discovery
Presentation transcript:

Ontology Alignment/Matching Prafulla Palwe

Agenda ► Introduction  Being serious about the semantic web  Living with heterogeneity  Heterogeneity problem  I have a plan for you  I have a plan for you ► Matching Problem  Matching Operation  Motivation  Schema Matching Vs Ontology Matching  Correspondence  Alignment ► Matching Process  Sequential composition  Parallel composition ► Application Domains  Traditional  Emergent ► Classification  Matching Dimensions ► Basic Techniques  Element Level  Structure Level ► Summary and Challenges

Introduction ► ► Being serious about the semantic web -   It is not one guy's ontology   It is not several guys' common ontology   It is many guys and girls' many ontologies   So it is a mess, but a meaningful mess

Introduction ► ► Living with heterogeneity -   The semantic web will be: ► ► Huge ► ► Dynamic ► ► Heterogeneous   These are not bugs, they are features.   We must learn to live with them.

Introduction ► ► Heterogeneity problem –   Resources being expressed in different ways must be reconciled before being used.   Mismatch between formalized knowledge can occur when: ► ► different languages are used; ► ► different terminologies are used; ► ► different modeling is used.

Introduction ► I have a plan for you – Reconciliation

Matching Problem ► Matching Operation  Definition – Matching operation takes as input ontologies, each consisting of a set of discrete entities (e.g., tables, XML elements, classes, properties) and determines as output the relationships (e.g., equivalence, subsumption) holding between these entities

Matching Problem ► Motivation –  2 XML Schemas  2 Ontologies

Matching Problem

► Schema mapping Vs ontology mapping  Differences - ► Schemas often do not provide explicit semantics for their data  Relational schemas provide no generalization ► Ontologies are logical systems that constrain the meaning  Ontology definition as set of logical axioms  Commonalities - ► Schemas and ontologies provide a vocabulary of terms that describes the domain of interest ► Schemas and ontologies constrain the meaning of terms used in the vocabulary.

Matching Problem ► Correspondence  Definition –  Given 2 ontologies O and O’, a correspondence between M between O and O’ is a 5-uple : such that: ► id is a unique identifier of the correspondence. ► e and e’ are entities of O and O’ (e.g. XML Elements, classes) ► R is a relation (e.g. equivalence (=), disjointness (_|_)) ► n is a confidence measure in some mathematical structure (typically in the [0,1] range)

Matching Problem ► Alignment  Definition –  Given 2 ontologies O and O’, an alignment A between O and O’: ► Is a set of correspondence on O and O’ ► With some cardinality: 1-1, 1-* etc. ► Some additional metadata (method, date, properties etc)

Matching Process

► General Basic Matching Process

Matching Process ► Sequential Composition

Matching Process ► Parallel composition

Matching Process ► Similarity Filter, alignment extractor and alignment filter –

Matching Process ► Aggregation Operations –  There are many different ways to aggregate matcher results, usually depending on confidence/similarity: ► Triangular norms (min, weighted products) useful for selecting only the best results ► Multidimensional distances (Eudidean distance, weighted sum) useful for taking into account all dimensions ► Fuzzy aggregation (min, weighted average) useful for aggregating competing algorithms and averaging their results ► Other specific measures (e.g., ordered weighted average)

Application Domains ► Traditional -  Ontology evolution  Schema integration  Catalog integration  Data integration

Application Domains ► Ontology Evolution

Application Domains ► Catalog Integration

Application Domains ► Emergent  P2P information sharing  Agent communication  Web service composition  Query answering on the web

Application Domains ► P2P information sharing

Application Domains ► Web Service Composition

Application Domains ► Agent communication

Classifications ► Matching Dimensions  Input Dimensions ► Underlying models (e.g. XML, OWL) ► Schema Level Vs Instance Level  Process Dimensions ► Approximate Vs Exact ► Interpretation of the input  Output Dimensions ► Cardinality ► Equivalence Vs Diverse relations ► Graded Vs Absolute Confidence

Classifications ► Three Layers  Upper Layer ► Granularity of match ► Interpretation of the input information  Middle Layer ► Represents classes of elementary (basic) matching techniques  Lower Layer ► Based on the kind of input which is used by elementary matching techniques

Classifications ► Classification of schema based techniques

Basic Techniques ► Element Level Techniques  String based –  Prefix - ► Takes an input 2 strings and checks whether the first string starts with the second ► e.g. net = network but also hot = hotel  Suffix – ► Takes an input 2 strings and checks whether the first string ends with the second ► e.g. ID = PID but also word = sword  Edit Distance – ► Takes as input 2 strings and calculates the number of edit operations (insertion,deletion,substitution) of characters required to transform one string into other normalized by length of the max string. ► editDistance(NKN, Nikon) = 0.4

Basic Techniques  Language based –  Tokenization – ► Parses names into tokens by recognizing punctuation, cases ► Hands-Free_Kits  ► Hands-Free_Kits   Lemmatization – ► Analyses morphologically tokens in order to find all their possible basic forms ► Kits  Kit  Elimination – ► Discards empty tokens that are articles, prepositions, conjuctions ► a, the, by, type of, their, from

Basic Techniques ► Structure Level Techniques  Ontologies are viewed as graph-like structure containing terms and their inter-relationships.  Taxonomy based ► Bounded path matching  These take 2 paths with links between classes defined by the hierarchical relations, compare terms and their positions along these paths and identify similar terms. ► Super(sub)-concept rules  If super concepts are the same, the actual concepts are similar to each other

Basic Techniques  Tree based  Children ► 2 non leaf schema elements are structurally similar if their immediate children sets are highly similar  Leaves ► 2 non leaf schema elements are structurally similar if their leaf sets are highly similar, even if their immediate children are not.

Basic Techniques

Summary and Challenges ► Summary  Ontology Matching and alignment is the process of developing the common or most common structure/semantic terms out of 2 or more different ontologies/structures/schemas.  Different efficient and complex algorithms using basic techniques of matching process, can be developed for matching and alignment generation. ► Challenges  Developing generic and highly efficient matching and alignment generation algorithms.

Thank You