MANAGING, QUERYING AND EXTRACTING BIOMEDICAL KNOWLEDGE Trabajo de Investigación Extracción de Conocimiento para la Web Semántica (1241119) Sistemas Informáticos.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

Oyster, Edinburgh, May 2006 AIFB OYSTER - Sharing and Re-using Ontologies in a Peer-to-Peer Community Raul Palma 2, Peter Haase 1 1) Institute AIFB, University.
GMD German National Research Center for Information Technology Darmstadt University of Technology Perspectives and Priorities for Digital Libraries Research.
Using Several Ontologies for Describing Audio-Visual Documents: A Case Study in the Medical Domain Sunday 29 th of May, 2005 Antoine Isaac 1 & Raphaël.
A Stepwise Modeling Approach for Individual Media Semantics Annett Mitschick, Klaus Meißner TU Dresden, Department of Computer Science, Multimedia Technology.
The Ontology of the Radiographic Image: From RadLex to RadiO.
Logics for Data and Knowledge Representation Projects and thesis introduction.
REPORT ON STICA‘06 1st International Workshop on Semantic Technologies in Collaborative Applications Chairman: Robert.
Who am I Gianluca Correndo PhD student (end of PhD) Work in the group of medical informatics (Paolo Terenziani) PhD thesis on contextualization techniques.
OntoBlog: Informal Knowledge Management by Semantic Blogging Aman Shakya 1, Vilas Wuwongse 2, Hideaki Takeda 1, Ikki Ohmukai 1 1 National Institute of.
Ontology Notes are from:
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Storing and Retrieving Biological Instances with the Instance Store Daniele Turi, Phillip Lord, Michael Bada, Robert Stevens.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
ELSE (eLearning for Software Engineering) S. Stojanov ECL, University of Plovdiv.
Editing Description Logic Ontologies with the Protege OWL Plugin.
B IOMEDICAL T EXT M INING AND ITS A PPLICATION IN C ANCER R ESEARCH Henry Ikediego
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
This chapter is extracted from Sommerville’s slides. Text book chapter
Srihari-CSE730-Spring 2003 CSE 730 Information Retrieval of Biomedical Text and Data Inroduction.
Break Out Session on Infrastructure and Technology: A Report Vipul Kashyap AOS Workshop, Rome, 15 November 2001
Knowledge based Learning Experience Management on the Semantic Web Feng (Barry) TAO, Hugh Davis Learning Society Lab University of Southampton.
Session II: Scientific Publishing and Semantic Web W3C Semantic Web for Life Sciences Workshop October 27, 2004 Moderator: Alan R. Aronson.
University of Dublin Trinity College Localisation and Personalisation: Dynamic Retrieval & Adaptation of Multi-lingual Multimedia Content Prof Vincent.
A Case Study of ICD-11 Anatomy Value Set Extraction from SNOMED CT Guoqian Jiang, PhD ©2011 MFMER | slide-1 Division of Biomedical Statistics & Informatics,
Building an Ontology of Semantic Web Techniques Utilizing RDF Schema and OWL 2.0 in Protégé 4.0 Presented by: Naveed Javed Nimat Umar Syed.
1 st June 2006 St. George’s University of LondonSlide 1 Using UMLS to map from a Library to a Clinical Classification: Improving the Functionality of a.
FI-CORE Data Context Media Management Chapter Release 4.1 & Sprint Review.
Ontologies and Lexical Semantic Networks, Their Editing and Browsing Pavel Smrž and Martin Povolný Faculty of Informatics,
RCDL Conference, Petrozavodsk, Russia Context-Based Retrieval in Digital Libraries: Approach and Technological Framework Kurt Sandkuhl, Alexander Smirnov,
Dimitrios Skoutas Alkis Simitsis
A School of Information Science, Federal University of Minas Gerais, Brazil b Medical University of Graz, Austria, c University Medical Center Freiburg,
An Introduction to Description Logics (chapter 2 of DLHB)
Knowledge Representation of Statistic Domain For CBR Application Supervisor : Dr. Aslina Saad Dr. Mashitoh Hashim PM Dr. Nor Hasbiah Ubaidullah.
A Context Model based on Ontological Languages: a Proposal for Information Visualization School of Informatics Castilla-La Mancha University Ramón Hervás.
Semantic based P2P System for local e-Government Fernando Ortiz-Rodriguez 1, Raúl Palma de León 2 and Boris Villazón-Terrazas 2 1 1Universidad Tamaulipeca.
Using Several Ontologies for Describing Audio-Visual Documents: A Case Study in the Medical Domain Sunday 29 th of May, 2005 Antoine Isaac 1 & Raphaël.
Sharing Ontologies in the Biomedical Domain Alexa T. McCray National Library of Medicine National Institutes of Health Department of Health & Human Services.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
Management Information Systems, 4 th Edition 1 Chapter 8 Data and Knowledge Management.
Text Mining & NLP based Algorithm to populate ontology with A-Box individuals and object properties Alexandre Kouznetsov and Christopher J. O. Baker, University.
Using Domain Ontologies to Improve Information Retrieval in Scientific Publications Engineering Informatics Lab at Stanford.
A LexWiki-based Representation and Harmonization Framework for caDSR Common Data Elements Guoqian Jiang, Ph.D. Robert Freimuth, Ph.D. Harold Solbrig Mayo.
Metadata Common Vocabulary a journey from a glossary to an ontology of statistical metadata, and back Sérgio Bacelar
A View-based Methodology for Collaborative Ontology Engineering (VIMethCOE) Ernesto Jiménez Ruiz Rafael Berlanga Llavorí Temporal Knowledge Bases Group.
1 MedAT: Medical Resources Annotation Tool Monika Žáková *, Olga Štěpánková *, Taťána Maříková * Department of Cybernetics, CTU Prague Institute of Biology.
1 Ontolog OOR-BioPortal Comparative Analysis Todd Schneider 15 October 2009.
1 Chapter 12 Configuration management This chapter is extracted from Sommerville’s slides. Text book chapter 29 1.
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
HeC PMT Meeting Tallinn 10 July 2008 Brainstorming session: what interest can there be in thinking about a possible new HeC 2 project to follow-up the.
Ontology-Based Interoperability Service for HL7 Interfaces Implementation Carolina González, Bernd Blobel and Diego López eHealth Competence Center, Regensurg.
Semantic Data Extraction for B2B Integration Syntactic-to-Semantic Middleware Bruno Silva 1, Jorge Cardoso 2 1 2
WonderWeb. Ontology Infrastructure for the Semantic Web. IST Project Review Meeting, 11 th March, WP2: Tools Raphael Volz Universität.
Knowledge Support for Modeling and Simulation Michal Ševčenko Czech Technical University in Prague.
WonderWeb. Ontology Infrastructure for the Semantic Web. IST WP4: Ontology Engineering Heiner Stuckenschmidt, Michel Klein Vrije Universiteit.
Ontology Technology applied to Catalogues Paul Kopp.
Multi-disciplinary Approach for Industrial Phases in Space Projects Evolution of classic SE into MBSE Harald EisenmannAstrium Satellites Joachim Fuchs.
Mechanisms for Requirements Driven Component Selection and Design Automation 최경석.
UNIFIED MEDICAL LANGUAGE SYSTEMS (UMLS)
The UMLS and the Semantic Web
CCNT Lab of Zhejiang University
Development of the Amphibian Anatomical Ontology
Stanford Medical Informatics
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
Methontology: From Ontological art to Ontological Engineering
Knowledge Based Workflow Building Architecture
Rafael Almeida, Inês Percheiro, César Pardo, Miguel Mira da Silva
Presented by: Prof. Ali Jaoua
Metadata Framework as the basis for Metadata-driven Architecture
CSE 635 Multimedia Information Retrieval
Presentation transcript:

MANAGING, QUERYING AND EXTRACTING BIOMEDICAL KNOWLEDGE Trabajo de Investigación Extracción de Conocimiento para la Web Semántica ( ) Sistemas Informáticos Avanzados Ernesto Jiménez-Ruiz Supervisor: Rafael Berlanga

PhD Research Work - Ernesto Jiménez Ruiz - TKBG Group2 Outline Context and Motivation  Application of Ontologies  Biomedical issues and Health-e-Project Proposed Methodologies Ontology Management System Conclusions and Future Work

PhD Research Work - Ernesto Jiménez Ruiz - TKBG Group3 Application of Ontologies «An ontology is a formal specification of a shared conceptualization» (Borst (1997)) Applications:  E-Commerce  Web Browsers  Digital Libraries  Biomedicine  etc… Context and Motivation

PhD Research Work - Ernesto Jiménez Ruiz - TKBG Group4 Health-e-Child Project Aims to develop an integrated healthcare platform for European pediatrics, achieving a comprehensive view of children’s health Grid Architecture Main Upper Level Applications: KDS, DSS Our tasks: Integration of biomedical data, information, and knowledge. Context and Motivation

PhD Research Work - Ernesto Jiménez Ruiz - TKBG Group5 Health-e-Child Project The biomedical information sources will cover six distinct levels (vertical levels):  Molecular  Cellular  Tissue  Organ  Individual  Population And will focus on three representative diseases (inside paediatrics):  Heart diseases  Inflammatory diseases  Brain tumours. Context and Motivation

PhD Research Work - Ernesto Jiménez Ruiz - TKBG Group6 Application of current Ontologies in HeC HeC vertical levels expressed by Ontologies Available several large biomedical ontologies and taxonomies, e.g: GO, GALEN, FMA, NCI-Thesurus, Tambis, BioPax, etc. Difficult too apply in concrete applications like HeC:  Scalability in reasoning.  Specificity: local view of the domain  Visualization and treatment Context and Motivation

PhD Research Work - Ernesto Jiménez Ruiz - TKBG Group7 View Mechanism Operation through views or used defined fragments/modules. Working on the development of OntoPath Future: to formalize the extracted views Context and Motivation

PhD Research Work - Ernesto Jiménez Ruiz - TKBG Group8 Outline Context and Motivation Proposed Methodologies  Development of Ontologies in a Collaborative Way  From Domain Ontologies to Application Views Ontology Management System Conclusions and Future Work

PhD Research Work - Ernesto Jiménez Ruiz - TKBG Group9 Development Requirements Development Methodologies with new dimensions:  Dynamism  distribution of team structure  and partially controlled development. Proposed Requirements:  Modularization/Particionamiento  Local Adaptation  Knowledge Abstraction  User-defined modules (Views)  Argumentation and Consensus Development of Ontologies in a Collaborative Way

PhD Research Work - Ernesto Jiménez Ruiz - TKBG Group10 Development Phases We distinguish 5 different phases:  Requirements  Development  Publication and Argumentation  Evaluation and Maintenance  Application Development of Ontologies in a Collaborative Way

PhD Research Work - Ernesto Jiménez Ruiz - TKBG Group11 Knowledge Spaces Development of Ontologies in a Collaborative Way

PhD Research Work - Ernesto Jiménez Ruiz - TKBG Group12 View Derivation Hierarchy Development of Ontologies in a Collaborative Way

PhD Research Work - Ernesto Jiménez Ruiz - TKBG Group13 Outline Context and Motivation Proposed Methodologies  Development of Ontologies in a Collaborative Way  From Domain Ontologies to Application Views Ontology Management System Conclusions and Future Work

PhD Research Work - Ernesto Jiménez Ruiz - TKBG Group14 ‘Current’ Biomedical Sources Creation of large and important Biomedical Ontologies and Taxonomies:  GALEN, FMA, NCI-Thesurus, Tambis, BioPax, etc  Open Biomedical Ontologies (OBO) Metathesaurus, Dictionaries and Lexicons:  Unified Medical Language System (UMLS)  MesH (Medical Subject Headings)  SNOMED Bioinformatics public databases (OMIM, UNIPROT, DrugBank, etc.) Hospital Resources (databases, texts, forms, images, etc.) From Domain Ontologies to Applications

PhD Research Work - Ernesto Jiménez Ruiz - TKBG Group15 New Issues and Challenges Many domain ontologies in Biomedicine do not cover completely the requirements of specific applications. Concepts may involve different abstraction levels (e.g. molecular, organ, disease, etc.) that can be in the same or in different domain ontologies. Domain ontologies are normally rather large:  Users find them hard to use for annotating and querying information sources  Only a subset of those is used by system applications. From Domain Ontologies to Applications

PhD Research Work - Ernesto Jiménez Ruiz - TKBG Group16 New Issues and Challenges The work to be presented mainly focuses on this issues:  Do not cover requirements Integration and Enrichment  Involve different abstraction levels Integration, enrichment and definition of user-defined modules (Views)  Are rather large (hard to use and only a subset are used) User-defined modules (Views) From Domain Ontologies to Applications

PhD Research Work - Ernesto Jiménez Ruiz - TKBG Group17 From Ontologies to Applications From Domain Ontologies to Applications

PhD Research Work - Ernesto Jiménez Ruiz - TKBG Group18 Enrichment from Textual Sources Automatic Instance Generation (Danger (2004))  Look for aggregation paths (i.e.: concept-relation-concept…) in texts.  Necessity of a well created and consistent ontology, UMLS-based Biomedical Entity Recognizer  A treated version of UMLS as a dictionary  Entity relation  co-occurrences  Problem: How to discover named relations between entities.  Technical Report: Jimeno-Yepes and Jiménez-Ruiz et al. (2007) From Domain Ontologies to Applications

PhD Research Work - Ernesto Jiménez Ruiz - TKBG Group19 Outline Context and Motivation Proposed Methodologies Ontology Management System  Parser and Storage in G database  OntoPath and Builder  Plugin-Protégé Conclusions and Future Work

PhD Research Work - Ernesto Jiménez Ruiz - TKBG Group20 System Architecture Ontology Management System

PhD Research Work - Ernesto Jiménez Ruiz - TKBG Group21 OWL Parser Greater flexibility in the OWL treatment and storage capabilities (e.g. indexes) The OWL parser creates from the OWL file a set of structures for classes, properties, nominal and individuals. These structures will be stored in the graph- based database G. Ontology Management System

PhD Research Work - Ernesto Jiménez Ruiz - TKBG Group22 G Semi-structured Database Backend to store, index and retrieve the OWL ontologies as graphs. Four database object types are needed: ontology, property, concept, and enumeration (nominals)  O=ontology(name=’Simple.owl’, rootConcept=C1, rootProperty=P1)  C1=concept(name=’Thing’)  P1= property (name=’PropertyThing’)  C2=concept(name=’Person’, subClassOf=C1)  P2= property(name=’hasFriend’, range=C2, domain=C2, subPropertyOf=P1) Ontology Management System

PhD Research Work - Ernesto Jiménez Ruiz - TKBG Group23 OWL Parser: DL Treatment (I) Inferred parents: C ≡C1 ⊓ … ⊓ Cn  C.subClassOf=C1,…,Cn Inferred children: C ≡C1 ⊔ … ⊔ Cn  C1.subClassOf.append(C), …, Cn.subClassOf.append(C) Inferred domains: C ⊓  R.D  i-property(name=R, domain=C, range=D). Ontology Management System

PhD Research Work - Ernesto Jiménez Ruiz - TKBG Group24 OWL Parser: DL Treatment (II) Creation of new classes:  C ⊑  R.(D ⊓  S.E)  C and D named classes  D’ is created, with D’.subClassOf=D.  It is also created: i-property(name=R, domain=C, range=D’).  D’.name=D_with_S_E. Nominals: (C ⊓  R.{i1, i2…, in})…  i-property(name=R, domain=C, enumeration=E). (C ⊓  R.{ i1})…  i-property(name=R, domain=C, enumeration=E). E=enumeration(name=R-nominals, list-of-values=i1,…, in) Ontology Management System

PhD Research Work - Ernesto Jiménez Ruiz - TKBG Group25 OWL Parser: DL Treatment (III) ResistanceToInsulin   isWithReferenceTo (presence ⊓  isPresenceAbsenceOf.Insulin)  C = concept(name=ResistanceToInsulin)  P = concept(name=presence, …)  I = concept(name=insulin,…)  A = concept(name=presence_with_isPresenceAbsenceOf_Insulin, subClassOf=P)  P1 = i-property(name=isWithReferenceTo, domain=C, range=A, …)  P2 = i-property(name=isPresenceAbsenceOf, domain=A, range=I,…) Ontology Management System

PhD Research Work - Ernesto Jiménez Ruiz - TKBG Group26 OWL Parser: DL Treatment (IV) Not properly stored (and queried) :  Some Union cases  Negation Expressivity  a subset of SHIF(D),  The closest DL underlying OWL-Lite.  OWL-DL Ontologies uses a subset of the DL SHOIN(D) And it will produce approximate answers to OntoPath queries. Ontology Management System

PhD Research Work - Ernesto Jiménez Ruiz - TKBG Group27 OntoPath Query Language To retrieve consistent fragments (personalized modules or views) from domain ontologies. Syntax simple like XPath. Results as a new OWL Ontology Example:  Disease / related_to / Rheumatoid_Factor Ontology Management System

PhD Research Work - Ernesto Jiménez Ruiz - TKBG Group28 Protégé Extensions Storing Ontologies Retrieving full ontologies or fragments Representation in a definition hierarchy Connection with Python codes Ontology Management System

PhD Research Work - Ernesto Jiménez Ruiz - TKBG Group29 Storing Ontologies Ontology Management System OWL File Selection References to other Ontologies (Views) Biomedical (HeC) Coverage

PhD Research Work - Ernesto Jiménez Ruiz - TKBG Group30 Retrieving full ontologies or fragments Ontology Management System Several Fragments Source Ontology Set of OntoPath Queries Metadata

PhD Research Work - Ernesto Jiménez Ruiz - TKBG Group31 Representation in a definition hierarchy Ontology Management System Organization of Views in a Definition Hierarchy Classification by Biomedical Level New Tab Created

PhD Research Work - Ernesto Jiménez Ruiz - TKBG Group32 Conclusions and Future Work The system is still work in progress  Adaptation to new versions of Protégé  Further tests to the OntoPath language  Formalizations of connections between fragments and source knowledge. e-connections?  Manchester  Enrichment by text mining techniques Work at EBI: from text to ontologies  Apply the ontologies in HeC: evaluation and validation

PhD Research Work - Ernesto Jiménez Ruiz - TKBG Group33 Questions and Feedback Thank you very much!!! Ernesto Jiménez-Ruiz   