Ontology-based search and knowledge sharing using domain ontologies

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst Information Semantics Command & Control Center July 17, 2007 Ontologies Can't Help Records Management Or Can They?
Advertisements

The Application of Machine Translation in CADAL Huang Chen, Chen Haiying Zhejiang University Libraries, Hangzhou, China
Office of SA to CNS GeoIntelligence Introduction Data Mining vs Image Mining Image Mining - Issues and Challenges CBIR Image Mining Process Ontology.
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Who am I Gianluca Correndo PhD student (end of PhD) Work in the group of medical informatics (Paolo Terenziani) PhD thesis on contextualization techniques.
Ontology Notes are from:
A Framework for Ontology-Based Knowledge Management System
CS652 Spring 2004 Summary. Course Objectives  Learn how to extract, structure, and integrate Web information  Learn what the Semantic Web is  Learn.
1 An Ontology of Relations for Biomedical Informatics Barry Smith 10 January 2005.
The Role of Foundational Relations in the Alignment of Biomedical Ontologies Barry Smith and Cornelius Rosse.
FMA: a domain reference ontology Comments on Cornelius Rosse’s talk Anita Burgun WG6 meeting, Rome 29 Apr- 2 May 2005.
June 19-21, 2006WMS'06, Chania, Crete1 Design and Evaluation of Semantic Similarity Measures for Concepts Stemming from the Same or Different Ontologies.
Bootstrapping an Ontology-based Information Extraction System Alexander Maedche, Günter Neumann, Steffen Staab (presented by D. Lonsdale) CS 652 – June.
Annotating Documents for the Semantic Web Using Data-Extraction Ontologies Dissertation Proposal Yihong Ding.
Developing a Biomedical Ethics Ontology (BMEO) Robert Arp, Ph.D. Ontology Research Group (ORG) National Center for Biomedical Ontology.
XML on Semantic Web. Outline The Semantic Web Ontology XML Probabilistic DTD References.
Resources Primary resources – Lexicons, structured vocabularies – Grammars (in widest sense) – Corpora – Treebanks Secondary resources – Designed for a.
BioText Infrastructure Ariel Schwartz Gaurav Bhalotia 10/07/2002.
Toward Making Online Biological Data Machine Understandable Cui Tao Data Extraction Research Group Department of Computer Science, Brigham Young University,
1 Information Integration and Source Wrapping Jose Luis Ambite, USC/ISI.
 Copyright 2009 Digital Enterprise Research Institute. All rights reserved Digital Enterprise Research Institute Ontologies & Natural Language.
B IOMEDICAL T EXT M INING AND ITS A PPLICATION IN C ANCER R ESEARCH Henry Ikediego
Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.
ANSWERING CONTROLLED NATURAL LANGUAGE QUERIES USING ANSWER SET PROGRAMMING Syeed Ibn Faiz.
Knowledge Science & Engineering Institute, Beijing Normal University, Analyzing Transcripts of Online Asynchronous.
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
Automated Explanation of Gene-Gene Relationships Wacek Kuśnierczyk.
Semantic Relations in the Environmental Domain Gerhard Budin.
CIG Conference Norwich September 2006 AUTINDEX 1 AUTINDEX: Automatic Indexing and Classification of Texts Catherine Pease & Paul Schmidt IAI, Saarbrücken.
Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher Laura Po and Sonia Bergamaschi DII, University of Modena and Reggio Emilia, Italy.
Learning Object Metadata Mining Masoud Makrehchi Supervisor: Prof. Mohamed Kamel.
GO and OBO: an introduction. Jane Lomax EMBL-EBI What is the Gene Ontology? What is OBO? OBO-Edit demo & practical What is the Gene Ontology? What is.
Computational Linguistics WTLAB ( Web Technology Laboratory ) Mohsen Kamyar.
Annual reports and feedback from UMLS licensees Kin Wah Fung MD, MSc, MA The UMLS Team National Library of Medicine Workshop on the Future of the UMLS.
Development of Front End Tools for Semantic Grid Services Dr.S.Thamarai Selvi, Professor & Head, Dept. of Information Technology, Madras Institute of Technology,
WebODE and its Ontology Management APIs. April 8th © Ontology Engineering Group WebODE and its Ontology Management APIs Ontology Engineering Group.
Computer Science 101 Database Concepts. Database Collection of related data Models real world “universe” Reflects changes Specific purposes and audience.
ICS-FORTH January 11, Thesaurus Mapping Martin Doerr Foundation for Research and Technology - Hellas Institute of Computer Science Bath, UK, January.
Ontology Summit2007 Survey Response Analysis -- Issues Ken Baclawski Northeastern University.
SemSearch: A Search Engine for the Semantic Web Yuangui Lei, Victoria Uren, Enrico Motta Knowledge Media Institute The Open University EKAW 2006 Presented.
Literature Mining and Ontology BMI/IBGP 730 Autumn, 2010 Yang Xiang, Ph.D. in Computer Science Department of Biomedical.
Combining terminology resources and statistical methods for entity recognition: an evaluation Angus Roberts, Robert Gaizauskas, Mark Hepple, Yikun Guo.
10/22/2015ACM WIDM'20051 Semantic Similarity Methods in WordNet and Their Application to Information Retrieval on the Web Giannis Varelas Epimenidis Voutsakis.
Mining fuzzy domain ontology based on concept Vector from wikipedia category network.
BAA - Big Mechanism using SIRA Technology Chuck Rehberg CTO at Trigent Software and Chief Scientist at Semantic Insights™
ADVANCED DB SYSTEMS BIOMEDICAL ENGINEERING. Index INTRODUCTION  BIOMEDICAL ENGINEERING  B.E. DATASETS APPLICATIONS  DATA MINING ON FDA DATABASE  ONTOLOGY-BASED.
Sharing Ontologies in the Biomedical Domain Alexa T. McCray National Library of Medicine National Institutes of Health Department of Health & Human Services.
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
The Role of Architecture and Ontology for Interoperability EFMI Special Topic Conference 2010 June Reykjavik, Iceland Bernd Blobel eHealth Competence.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Iana Atanassova Research: – Information retrieval in scientific publications exploiting semantic annotations and linguistic knowledge bases – Ranking algorithms.
Semantic web Bootstrapping & Annotation Hassan Sayyadi Semantic web research laboratory Computer department Sharif university of.
INFORMATION RETRIEVAL PROJECT Creation of clusters of concepts that represent a domain corpus.
Knowledge Representation. Keywordsquick way for agents to locate potentially useful information Thesaurimore structured approach than keywords, arranging.
CS621 : Artificial Intelligence Pushpak Bhattacharyya CSE Dept., IIT Bombay Lecture 12 RDF, OWL, Minimax.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
2/10/2016Semantic Similarity1 Semantic Similarity Methods in WordNet and Their Application to Information Retrieval on the Web Giannis Varelas Epimenidis.
Multilingual Information Retrieval using GHSOM Hsin-Chang Yang Associate Professor Department of Information Management National University of Kaohsiung.
A Multilingual Hierarchy Mapping Method Based on GHSOM Hsin-Chang Yang Associate Professor Department of Information Management National University of.
Tools in Bioinformatics Ontologies and pathways. Why are ontologies needed? A free text is the best way to describe what a protein does to a human reader.
Selected Semantic Web UMBC CoBrA – Context Broker Architecture  Using OWL to define ontologies for context modeling and reasoning  Taking.
Integrated Departmental Information Service IDIS provides integration in three aspects Integrate relational querying and text retrieval Integrate search.
An Ontology-based Automatic Semantic Annotation Approach for Patent Document Retrieval in Product Innovation Design Feng Wang, Lanfen Lin, Zhou Yang College.
Semantic Graph Mining for Biomedical Network Analysis: A Case Study in Traditional Chinese Medicine Tong Yu HCLS
Introduction to PubChem BioAssay
Einat Minkov University of Haifa, Israel CL course, U
Independent Study of Ontologies
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Introduction.
Ontologies: Introduction and Some Uses
Presentation transcript:

Ontology-based search and knowledge sharing using domain ontologies Sine Zambach, PhD student, Roskilde University GERPS ‘08 Presentation of myself: bioinformatician with the ontology niche.

Outline 1. Why Domain Ontologies? 2. Ontology-based search 3. Domain analysis: Relations in ontologies 4. How does this gain value for the organisation?

Why Domain Ontologies? Knowledge sharing for common understanding in e.g. software development and translations Background for domain specific information retrieval My part is actually a corner of the conference which is not explicitely ERP. However in next generation the tools I work with can actually be an integral part of these systems in the progress of development of common understanding (knowledge sharing) and domain specific information retrieval (eg through research notes in the company database). I am not a ERP-person which is why I hope to get your view (a top down) on our research.

Example of a domain ontology

Ongoing example substance process Glucose uptake Insulin isa isa activates

Ontology-based search Ontology background for information retrieval: Broaden search wrt synonyms, ontological similarity, relations, etc. Can potentially be used by organisations to search through all kinds of texts

Ongoing example substance process Glucose uptake = Glycose transport isa isa activate Glucose uptake = Glycose transport Insulin = INS activate New unknown substance

Ontology based search in biomedical texts Siabo project Computer scientists computational linguists, domain experts, terminologists Develops Background ontology Text preprocessing tools Knowledge extraction tools Implementation on the texts

The SIABO-project Computational Linguists (CL) Knowledge Engineers (K) Computer Scientists (CS) Terminologists (T) Domain experts (D) Ontology based search application Knowledge extraction Search implementation Text pattern rule development on NP’s (CL, KI, D) Interface (CS) Search functions (CS, K, D) Similarity measures (CS) Text preprossesing Domain ontology modelling Grammatical parsing/ POS-tagging/ (CL) Grabbing/ontological tagging fragments using ontotypes (K) Mapping into ontology (CS) Indexing (CS) Start from UMLS (T,D) Modeller in a suitable tool (T,D) Put into relational database (CS)

Relations Semantic glue between concepts (the idea behind words) General and domain specific relations Represented by e.g. verbs and can be identified in various ways Parallel to concepts that are represented by terms

Relations as semantic ”glue” Insulin activates glucose uptake Pancreas activates organ (odd) Substance activates substance Substance activates process

Domain specific relations To be og to have er ranket højt I alle tekster

OBO-ontologi Table 3 Some properties of the relations in the OBO Relation Ontology RelationTransitive Symmetric Reflexive Antisymmetric is_a + - + + part_of + - + + located_in + - + - contained_in - - - - adjacent_to - - - - transformation_of + - - - derives_ from + - - - preceded_by + - - - has_participant- - - - has_agent - - - - Smith et al. Genome Biology 2005 6:R46

Domain specific relations Inhibition and activation Domain specific Bio-relations Has interesting properties through a path of relations of that types. The relation of ”activation” is transitive, where ”inhibition” is more complex and is dependent of the stimulation-relation

Example: positive relation –> transitivity? A activates B B activates C -> A activates C Fig 1_ equilibrium A B C A B C A B C

Example: inhibits and stimulate -> complex property A inhibits B B inhibits C -> A activates C A B C A B C A B C

Verb frequences in the 4 corpora:

Background

Relations in an enterprise ontology Discovering of weird words = domain specific concepts and relations Similarity measure in information retrieval Information fishing of new concepts 1) Domain specifik plug in Looking at the word frequences of the most common verbs in BNC (probably) shows no significant difference Domain specific verbs do show difference Problem: How do we ”discover” those domain specific relations? 2) Similarity measures A simple similarity measure is counting edges between concepts. The edges could have different weights according to how much properties each. Inhibit and activate both facilitates ”inheritance” for their concepts. Inheritance of properties shows similarity Used in e.g. business intelligence 3) Information fishing Using an ontology from the known area for Fishing Compounds (relations used as hooks)

Ongoing example substance process Glucose uptake = Glycose transport isa isa activate Glucose uptake = Glycose transport Insulin = INS activate New unknown substance