Ontology Alignment. Ontologies in biomedical research many biomedical ontologies e.g. GO, OBO, SNOMED-CT practical use of biomedical ontologies e.g. databases.

Slides:



Advertisements
Similar presentations
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Advertisements

Project 2 Ontology alignment. SIGNAL-ONTOLOGY (SigO) Immune Response i- Allergic Response i- Antigen Processing and Presentation i- B Cell Activation.
The Jikitou Biomedical Question Answering System: Using High-Performance Computing to Preprocess Possible Answers Michael A. Bauer 1,2, Daniel Berleant.
By Murat Şensoy Ontology Alignment by Murat Şensoy
Ontology mapping: a way out of the medical tower of Babel?
Ontology alignment Patrick Lambrix Linköpings universitet.
Distributed DBMS© M. T. Özsu & P. Valduriez Ch.4/1 Outline Introduction Background Distributed Database Design Database Integration ➡ Schema Matching ➡
Who am I Gianluca Correndo PhD student (end of PhD) Work in the group of medical informatics (Paolo Terenziani) PhD thesis on contextualization techniques.
Ontology Notes are from:
A Review of Ontology Mapping, Merging, and Integration Presenter: Yihong Ding.
Biological Data Extraction and Integration A Research Area Background Study Cui Tao Department of Computer Science Brigham Young University.
CMPT-825 (Natural Language Processing) Presentation on Zipf’s Law & Edit distance with extensions Presented by: Kaustav Mukherjee School of Computing Science,
GO Ontology Editing Workshop: Using Protege and OWL Hinxton Jan 2012.
Ontology Learning and Population from Text: Algorithms, Evaluation and Applications Chapters Presented by Sole.
Evaluating Ontology-Mapping Tools: Requirements and Experience Natalya F. Noy Mark A. Musen Stanford Medical Informatics Stanford University.
State of the Art Ontology Mapping By Justin Martineau.
ONTOLOGY MATCHING Part III: Systems and evaluation.
Amarnath Gupta Univ. of California San Diego. An Abstract Question There is no concrete answer …but …
OMAP: An Implemented Framework for Automatically Aligning OWL Ontologies SWAP, December, 2005 Raphaël Troncy, Umberto Straccia ISTI-CNR
Semantic Interoperability Jérôme Euzenat INRIA & LIG France Natasha Noy Stanford University USA.
Ontology Matching Basics Ontology Matching by Jerome Euzenat and Pavel Shvaiko Parts I and II 11/6/2012Ontology Matching Basics - PL, CS 6521.
Semantic Matching Pavel Shvaiko Stanford University, October 31, 2003 Paper with Fausto Giunchiglia Research group (alphabetically ordered): Fausto Giunchiglia,
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Advisor : Dr. Hsu Student : Sheng-Hsuan Wang Department.
Ontology Alignment Patrick Lambrix Linköpings universitet.
COMP423: Intelligent Agent Text Representation. Menu – Bag of words – Phrase – Semantics – Bag of concepts – Semantic distance between two words.
Ontology Alignment/Matching Prafulla Palwe. Agenda ► Introduction  Being serious about the semantic web  Living with heterogeneity  Heterogeneity problem.
Semantic Similarity over Gene Ontology for Multi-label Protein Subcellular Localization Shibiao WAN and Man-Wai MAK The Hong Kong Polytechnic University.
The dynamic nature of the proteome
Patrick Lambrix Linköpings universitet
Machine Learning Approach for Ontology Mapping using Multiple Concept Similarity Measures IEEE/ACIS International Conference on Computer and Information.
A Semi-automatic Ontology Acquisition Method for the Semantic Web Man Li, Xiaoyong Du, Shan Wang Renmin University of China, Beijing WAIM May 2012.
Of 39 lecture 2: ontology - basics. of 39 ontology a branch of metaphysics relating to the nature and relations of being a particular theory about the.
PART IV: REPRESENTING, EXPLAINING, AND PROCESSING ALIGNMENTS & PART V: CONCLUSIONS Ontology Matching Jerome Euzenat and Pavel Shvaiko.
Open Biomedical Ontologies. Open Biomedical Ontologies (OBO) An umbrella project for grouping different ontologies in biological/medical field –a repository.
Semantic Matching Fausto Giunchiglia work in collaboration with Pavel Shvaiko The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, 2003.
Of 33 lecture 10: ontology – evolution. of 33 ece 720, winter ‘122 ontology evolution introduction - ontologies enable knowledge to be made explicit and.
The Gene Ontology: a real-life ontology, progress and future. Jane Lomax EMBL-EBI.
Relevance Detection Approach to Gene Annotation Aid to automatic annotation of databases Annotation flow –Extraction of molecular function of a gene from.
1 What is an Ontology? n No exact definition n A tool to help organize knowledge n Or a way to convey a theory on how to represent a class of things n.
Logics for Data and Knowledge Representation Applications of ClassL: Lightweight Ontologies.
A Classification of Schema-based Matching Approaches Pavel Shvaiko Meaning Coordination and Negotiation Workshop, ISWC 8 th November 2004, Hiroshima, Japan.
Object Oriented Multi-Database Systems An Overview of Chapters 4 and 5.
Logics for Data and Knowledge Representation
GO terms implicitly refer to other term cysteine biosynthesis myoblast fusion hydrogen ion transporter activity snoRNA catabolism wing disc pattern formation.
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Ontology Mapping in Pervasive Computing Environment C.Y. Kong, C.L. Wang, F.C.M. Lau The University of Hong Kong.
To Boldly GO… Amelia Ireland GO Curator EBI, Hinxton, UK.
Ontology Alignment state of the art and an application in literature search Patrick Lambrix Linköpings universitet.
Element Level Semantic Matching Pavel Shvaiko Meaning Coordination and Negotiation Workshop, ISWC 8 th November 2004, Hiroshima, Japan Paper by Fausto.
Mining the Biomedical Research Literature Ken Baclawski.
Number Sense Disambiguation Stuart Moore Supervised by: Anna Korhonen (Computer Lab)‏ Sabine Buchholz (Toshiba CRL)‏
Knowledge Representation. Keywordsquick way for agents to locate potentially useful information Thesaurimore structured approach than keywords, arranging.
Ontology Alignment. Ontology alignment Ontology alignment Ontology alignment strategies Evaluation of ontology alignment strategies Ontology alignment.
Chapter 7 Complex Similarity Topix. About this chapter Extends previous discussed methods The reader can choose to read about only specific methods, depending.
CS621 : Artificial Intelligence Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 12 RDF, OWL, Minimax.
Investigating semantic similarity measures across the Gene Ontology: the relationship between sequence and annotation Bioinformatics, July 2003 P.W.Load,
Automatically Identifying Candidate Treatments from Existing Medical Literature Catherine Blake Information & Computer Science University.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Automatic Document Indexing in Large Medical Collections.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Semantic search-based image annotation Petra Budíková, FI MU CEMI meeting, Plzeň,
1 Integrating Databases into the Semantic Web through an Ontology-based Framework Dejing Dou, Paea LePendu, Shiwoong Kim Computer and Information Science,
Joined up ontologies: incorporating the Gene Ontology into the UMLS.
Of 24 lecture 11: ontology – mediation, merging & aligning.
COMP423: Intelligent Agent Text Representation. Menu – Bag of words – Phrase – Semantics Semantic distance between two words.
A session-based approach for aligning large ontologies Patrick Lambrix, Rajaram Kaliyaperumal Linköping University.
UNIFIED MEDICAL LANGUAGE SYSTEMS (UMLS)
Using Partial Reference Alignments to Align Ontologies
Stanford Medical Informatics
Element Level Semantic Matching
Patrick Lambrix Linköpings universitet
Presentation transcript:

Ontology Alignment

Ontologies in biomedical research many biomedical ontologies e.g. GO, OBO, SNOMED-CT practical use of biomedical ontologies e.g. databases annotated with GO GENE ONTOLOGY (GO) immune response i- acute-phase response i- anaphylaxis i- antigen presentation i- antigen processing i- cellular defense response i- cytokine metabolism i- cytokine biosynthesis synonym cytokine production … p- regulation of cytokine biosynthesis … i- B-cell activation i- B-cell differentiation i- B-cell proliferation i- cellular defense response … i- T-cell activation i- activation of natural killer cell activity …

Ontologies with overlapping information SIGNAL-ONTOLOGY (SigO) Immune Response i- Allergic Response i- Antigen Processing and Presentation i- B Cell Activation i- B Cell Development i- Complement Signaling synonym complement activation i- Cytokine Response i- Immune Suppression i- Inflammation i- Intestinal Immunity i- Leukotriene Response i- Leukotriene Metabolism i- Natural Killer Cell Response i- T Cell Activation i- T Cell Development i- T Cell Selection in Thymus GENE ONTOLOGY (GO) immune response i- acute-phase response i- anaphylaxis i- antigen presentation i- antigen processing i- cellular defense response i- cytokine metabolism i- cytokine biosynthesis synonym cytokine production … p- regulation of cytokine biosynthesis … i- B-cell activation i- B-cell differentiation i- B-cell proliferation i- cellular defense response … i- T-cell activation i- activation of natural killer cell activity …

Ontologies with overlapping information Use of multiple ontologies e.g. custom-specific ontology + standard ontology Bottom-up creation of ontologies experts can focus on their domain of expertise  important to know the inter-ontology relationships

SIGNAL-ONTOLOGY (SigO) Immune Response i- Allergic Response i- Antigen Processing and Presentation i- B Cell Activation i- B Cell Development i- Complement Signaling synonym complement activation i- Cytokine Response i- Immune Suppression i- Inflammation i- Intestinal Immunity i- Leukotriene Response i- Leukotriene Metabolism i- Natural Killer Cell Response i- T Cell Activation i- T Cell Development i- T Cell Selection in Thymus GENE ONTOLOGY (GO) immune response i- acute-phase response i- anaphylaxis i- antigen presentation i- antigen processing i- cellular defense response i- cytokine metabolism i- cytokine biosynthesis synonym cytokine production … p- regulation of cytokine biosynthesis … i- B-cell activation i- B-cell differentiation i- B-cell proliferation i- cellular defense response … i- T-cell activation i- activation of natural killer cell activity …

Ontology Alignment equivalent concepts equivalent relations is-a relation SIGNAL-ONTOLOGY (SigO) Immune Response i- Allergic Response i- Antigen Processing and Presentation i- B Cell Activation i- B Cell Development i- Complement Signaling synonym complement activation i- Cytokine Response i- Immune Suppression i- Inflammation i- Intestinal Immunity i- Leukotriene Response i- Leukotriene Metabolism i- Natural Killer Cell Response i- T Cell Activation i- T Cell Development i- T Cell Selection in Thymus GENE ONTOLOGY (GO) immune response i- acute-phase response i- anaphylaxis i- antigen presentation i- antigen processing i- cellular defense response i- cytokine metabolism i- cytokine biosynthesis synonym cytokine production … p- regulation of cytokine biosynthesis … i- B-cell activation i- B-cell differentiation i- B-cell proliferation i- cellular defense response … i- T-cell activation i- activation of natural killer cell activity … Defining the relations between the terms in different ontologies

Many experimental systems: Prompt (Stanford SMI) Anchor-Prompt (Stanford SMI) Chimerae (Stanford KSL) Rondo (Stanford U./ULeipzig) MoA (ETRI) Cupid (Microsoft research) Glue (Uof Washington) FCA-merge (UKarlsruhe) IF-Map Artemis (UMilano) T-tree (INRIA Rhone-Alpes) S-MATCH (UTrento) Coma (ULeipzig) Buster (UBremen) MULTIKAT (INRIA S.A.) ASCO (INRIA S.A.) OLA (INRIA R.A.) Dogma's Methodology ArtGen (Stanford U.) Alimo (ITI-CERTH) Bibster (UKarlruhe) QOM (UKarlsruhe) KILT (INRIA LORRAINE)

According to input  KR: OWL, UML, EER, XML, RDF, …  components: concepts, relations, instance, axioms According to process  What information is used and how? According to output  1-1, m-n  Similarity vs explicit relations (equivalence, is-a)  confidence Classification

Matchers

Strategies based on linguistic matching Structure-based strategies Constraint-based approaches Instance-based strategies Use of auxiliary information Matcher Strategies Strategies based on linguistic matching Strategies based on linguistic matching SigO: complement signaling synonym complement activation GO: Complement Activation

Example matchers Edit distance  Number of deletions, insertions, substitutions required to transform one string into another  aaaa  baab: edit distance 2 N-gram  N-gram : N consecutive characters in a string  Similarity based on set comparison of n-grams  aaaa : {aa, aa, aa}; baab : {ba, aa, ab}

Matcher Strategies Strategies based on linguistic matching Structure-based strategies Structure-based strategies Constraint-based approaches Instance-based strategies Use of auxiliary information

Example matchers Propagation of similarity values Anchored matching

Example matchers Propagation of similarity values Anchored matching

Example matchers Propagation of similarity values Anchored matching

Matcher Strategies Strategies based on linguistic matching Structure-based strategies Constraint-based approaches Constraint-based approaches Instance-based strategies Use of auxiliary information O1O1 O2O2 Bird Mammal Flying Animal

Matcher Strategies Strategies based on linguistic matching Structure-based strategies Constraint-based approaches Constraint-based approaches Instance-based strategies Use of auxiliary information O1O1 O2O2 Bird Mammal Stone

Example matchers Similarities between data types Similarities based on cardinalities

Matcher Strategies Strategies based on linguistic matching Structure-based strategies Constraint-based approaches Instance-based strategies Instance-based strategies Use of auxiliary information Ontology instance corpus

Example matchers Instance-based Use life science literature as instances Structure-based extensions

Learning matchers – instance- based strategies Basic intuition A similarity measure between concepts can be computed based on the probability that documents about one concept are also about the other concept and vice versa. Intuition for structure-based extensions Documents about a concept are also about their super-concepts. (No requirement for previous alignment results.)

Learning matchers - steps Generate corpora  Use concept as query term in PubMed  Retrieve most recent PubMed abstracts Generate text classifiers  One classifier per ontology / One classifier per concept Classification  Abstracts related to one ontology are classified by the other ontology’s classifier(s) and vice versa Calculate similarities

Basic Naïve Bayes matcher Generate corpora Generate classifiers  Naive Bayes classifiers, one per ontology Classification  Abstracts related to one ontology are classified to the concept in the other ontology with highest posterior probability P(C|d) Calculate similarities

Structural extension ‘Cl’ Generate classifiers  Take (is-a) structure of the ontologies into account when building the classifiers  Extend the set of abstracts associated to a concept by adding the abstracts related to the sub-concepts C1 C3 C4 C2

Structural extension ‘Sim’ Calculate similarities  Take structure of the ontologies into account when calculating similarities  Similarity is computed based on the classifiers applied to the concepts and their sub-concepts

Matcher Strategies Strategies based linguistic matching Structure-based strategies Constraint-based approaches Instance-based strategies Use of auxiliary information Use of auxiliary information thesauri alignment strategies dictionary intermediate ontology

Example matchers Use of WordNet  Use WordNet to find synonyms  Use WordNet to find ancestors and descendants in the is- a hierarchy Use of Unified Medical Language System (UMLS)  Includes many ontologies  Includes many alignments (not complete)  Use UMLS alignments in the computation of the similarity values

Ontology Alignment and Mergning Systems