ODLS 2016 Workshop on Ontologies and Data in Life Sciences, Sep 29-30, Halle (Saale), Germany Ontological interpretation of biomedical database annotations.

Slides:



Advertisements
Similar presentations
Upper Ontology Summit Tuesday March 14 The BFO perspective Barry Smith Department of Philosophy, University at Buffalo National Center.
Advertisements

AIFB Denny Vrandečić – AIFB, Universität Karlsruhe (TH) 1 Mind the Web! Valentin Zacharias, Andreas Abecker, Imen.
Charting the Potential of Description Logic for the Generation of Referring Expression SELLC, Guangzhou, Dec Yuan Ren, Kees van Deemter and Jeff.
Using Several Ontologies for Describing Audio-Visual Documents: A Case Study in the Medical Domain Sunday 29 th of May, 2005 Antoine Isaac 1 & Raphaël.
1 Institute of Medical Biometry und Medical Informatics, University Medical Center Freiburg, Germany 2 AVERBIS GmbH, Freiburg, Germany 3 Paediatric Hematology.
Logics for Data and Knowledge Representation Projects and thesis introduction.
Catalina Martínez-Costa, Stefan Schulz: Ontology-based reinterpretation of the SNOMED CT context model Ontology-based reinterpretation of the SNOMED CT.
SNOMED CT’s Ontological Commitment Stefan Schulz University Medical Center, Freiburg, Germany Ronald Cornet Academic Medical Center, Amsterdam, The Netherlands.
1 CBioC: Collaborative Bio- Curation Chitta Baral Department of Computer Science and Engineering Arizona State University.
Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at
Information Technology, Informatics, & Information Science How do they relate to each other? to each other?
Extracted from “Lmo-2 interacts with Elf-2” On the Meaning of Common Statements in Biomedical Literature Stefan Schulz Department of Medical Informatics,
Framework for Model Creation and Generation of Representations DDI Lifecycle Moving Forward.
Improving Data Discovery in Metadata Repositories through Semantic Search Chad Berkley 1, Shawn Bowers 2, Matt Jones 1, Mark Schildhauer 1, Josh Madin.
Integrating bio-ontologies with a workflow/Petri Net model to qualitatively represent and simulate biological systems Mor Peleg, Irene Gbashvili, and Russ.
Role of the Computer System Current Proposed Ontology Evolution Using Belief Change and the AGM Theory Giorgos Flouris, Dimitris Plexousakis, Grigoris.
Managing Information Quality in e-Science using Semantic Web technology Alun Preece, Binling Jin, Edoardo Pignotti Department of Computing Science, University.
The Foundational Model of Anatomy and its Ontological Commitment(s) Stefan Schulz University Medical Center, Freiburg, Germany FMA in OWL meeting November.
Component 4: Introduction to Information and Computer Science Unit 6: Databases and SQL Lecture 2 This material was developed by Oregon Health & Science.
A service-oriented middleware for building context-aware services Center for E-Business Technology Seoul National University Seoul, Korea Tao Gu, Hung.
MPEG-7 Interoperability Use Case. Motivation MPEG-7: set of standardized tools for describing multimedia content at different abstraction levels Implemented.
1 Issues in Assessment in Higher Education: Science Higher Education Forum on Scientific Competencies Medellin-Colombia Nov 2-4, 2005 Dr Hans Wagemaker.
© DATAMAT S.p.A. – Giuseppe Avellino, Stefano Beco, Barbara Cantalupo, Andrea Cavallini A Semantic Workflow Authoring Tool for Programming Grids.
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
The Gene Ontology project Jane Lomax. Ontology (for our purposes) “an explicit specification of some topic” – Stanford Knowledge Systems Lab Includes:
A School of Information Science, Federal University of Minas Gerais, Brazil b Medical University of Graz, Austria, c University Medical Center Freiburg,
WHO – IHTSDO: SNOMED CT – ICD-11 coordination: Conditions vs. Situations Stefan Schulz IHTSDO conference Copenhagen, Apr 24, 2012.
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
Knowledge Representation of Statistic Domain For CBR Application Supervisor : Dr. Aslina Saad Dr. Mashitoh Hashim PM Dr. Nor Hasbiah Ubaidullah.
Quality views: capturing and exploiting the user perspective on data quality Paolo Missier, Suzanne Embury, Mark Greenwood School of Computer Science University.
Efficient RDF Storage and Retrieval in Jena2 Written by: Kevin Wilkinson, Craig Sayers, Harumi Kuno, Dave Reynolds Presented by: Umer Fareed 파리드.
Using Several Ontologies for Describing Audio-Visual Documents: A Case Study in the Medical Domain Sunday 29 th of May, 2005 Antoine Isaac 1 & Raphaël.
Data Integration and Management A PDB Perspective.
Automata Based Method for Domain Specific Languages Definition Ulyana Tikhonova PhD student at St. Petersburg State Politechnical University, supervised.
Web Information Systems Modeling Luxembourg, June VisAVis: An Approach to an Intermediate Layer between Ontologies and Relational Database Contents.
Master headline RDFizing the EBI Gene Expression Atlas James Malone, Electra Tapanari
Introduction to the GO: a user’s guide NCSU GO Workshop 29 October 2009.
Towards a Top-Domain Ontology for Linking Biomedical Ontologies Holger Stenzhorn a,b Elena Beißwanger c Stefan Schulz a a Department of Medical Informatics,
Kaiser Permanente Convergent Medical Terminology (CMT) Using Oxford RDFox and SNOMED for Quality Measures.
EBI is an Outstation of the European Molecular Biology Laboratory. UniProtKB Sandra Orchard.
ece 627 intelligent web: ontology and beyond
1 An Introduction to Ontology for Scientists Barry Smith University at Buffalo
A Tutorial of the PrePPI Database Presenters: Gabriel Leis and Katrina Sherbina Loyola Marymount University Departments of Biology and Computer Science.
APPLICATION OF ONTOLOGIES IN CANCER NANOTECHNOLOGY RESEARCH Faculty of Engineering in Foreign Languages 1 Student: Andreea Buga Group: 1241E – FILS Coordinating.
Higgs bosons, mars missions, and unicorn delusions: How to deal with terms of dubious reference in scientific ontologies Stefan Schulz a,b Mathias Brochhausen.
Enable Semantic Interoperability for Decision Support and Risk Management Presented by Dr. David Li Key Contributors: Dr. Ruixin Yang and Dr. John Qu.
Ontology Driven Data Collection for EuPathDB Jie Zheng, Omar Harb, Chris Stoeckert Center for Bioinformatics, University of Pennsylvania.
Distributed Instance Retrieval over Heterogeneous Ontologies Andrei Tamilin (1,2) & Luciano Serafini (1) (1) ITC-IRST (2) DIT - University of Trento Trento,
Mechanisms for Requirements Driven Component Selection and Design Automation 최경석.
NEDA ALIPANAH, MARIA ADELA GRANDO DBMI 11/19/2012.
Introduction to Ontology Introductions Alan Ruttenberg Science Commons.
Large Scale Semantic Data Integration and Analytics through Cloud: A Case Study in Bioinformatics Tat Thang Parallel and Distributed Computing Centre,
1 Representing and Reasoning on XML Documents: A Description Logic Approach D. Calvanese, G. D. Giacomo, M. Lenzerini Presented by Daisy Yutao Guo University.
Integrating SysML with OWL (or other logic based formalisms)
Kaiser Permanente Convergent Medical Terminology (CMT)
Formal ontologies vs. triple based KR gap or convergence?
Can SNOMED CT be harmonized with an upper-level ontology?
Ontology authoring and assessment
Ontology.
Annotation: linking literature to gene products
Methontology: From Ontological art to Ontological Engineering
Harmonizing SNOMED CT with BioTopLite
FOUST II – 2nd Workshop on Foundational Ontologies
Rafael Almeida, Inês Percheiro, César Pardo, Miguel Mira da Silva
CCO: concept & current status
Data Model.
Ontology-Based Approaches to Data Integration
Ontology.
Stefan Schulz – Medical University of Graz (Austria) with input from
Representations & Reasoning Systems (RRS) (2.2)
Presentation transcript:

ODLS 2016 Workshop on Ontologies and Data in Life Sciences, Sep 29-30, Halle (Saale), Germany Ontological interpretation of biomedical database annotations Filipe Santana da Silva1, Ludger Jansen2, Fred Freitas1 , Stefan Schulz3 1Centro de Informática (CIn), Universidade Federal de Pernambuco (UFPE), Recife, Brazil 2Institut für Philosophie, Universität Rostock, Germany 3Institut für Medizinische Informatik, Statistik und Dokumentation, Medizinische Universität Graz, Austria

Biological Databases and Bio-Ontologies Two worlds: Bio-DBs Bio-ontologies How are they related to each other? Can their content be expressed by a unified model of meaning? Is database content of ontological nature? Can OWL be used as a language to express bio-ontology content bio-database structure bio-database content Which is the added value?

Biological Databases and Bio-Ontologies Two worlds: Bio-DBs Bio-ontologies Store summarized results of laboratory experiments Classical database structure Values: Numeric Textual Symbolic (codes from ontologies) Provide definitions Provide axioms that are universally true Obey formal semantics Main use case: annotation of biological database entries

Example – tabular Bio-DB structure

Example – tabular Bio-DB structure ‘is included in’ ‘is included in ’ ‘is included in’ ‘is included in’ ‘is participant in’ ‘is part of’ ‘includes’

Example – tabular Bio-DB structure more abstract: a database record informs about experimental evidence that: Proteins of the type Prot1 participate in Processes of type BProc1… BProck within organisms of type Org1 are active in cellular components of type CComp1 or CComp2 or … CCompx within organisms of type Org1 participate in Processes that have small molecules of type Mol1, Mol2…Moly as outcome within organisms of type Org1 – if dysfunctional – Org1 has dispositions to develop the phenotypes (disorders) Phen1…Phenz This information is not explicitly contained in the database – it is implicitly shared by database users and curators

Ontological framework Information entities Domain entities Classes (T-Box) Bio-DB Database record Data item Homo sapiens Megaloblastic anemia Cobalamin binding Methionin synthase (…) Individuals (A-Box) Uniprot Ensembl Database record about Methionin Synthase in Homo Sapiens Data item, such as "cobalamin binding" in this record John Doe, of which tissue is stored in a biobank and analysed in a lab John's megaloblatic anemia A cobalamin binding process observed in the lab within a tissue sample from John a dysfunctional Methionin synthase protein molecule in John's tissue

Denotation Information entities Domain entities Classes (T-Box) Bio-DB Database record Data item Homo sapiens Megaloblastic anemia Cobalamin binding Methionin synthase (…) Individuals (A-Box) Uniprot Ensembl Database record about Methionin Synthase in Homo Sapiens Data item, such as "cobalamin binding" in this record John Doe, of which tissue is stored in a biobank and analysed in a lab John's megaloblatic anemia A cobalamin binding process observed in the lab within a tissue sample from John a dysfunctional Methionin synthase protein molecule in John's tissue individual to class individual to individual

Case 1: database entry represents individuals Information entities Domain entities Classes (T-Box) Bio-DB Database record Data item Homo sapiens Megaloblastic anemia Cobalamin binding Methionin synthase (…) Individuals (A-Box) Uniprot Ensembl Database record about Methionin Synthase in Homo Sapiens Data item, such as "cobalamin binding" in this record John Doe, of which tissue is stored in a biobank and analysed in a lab John's megaloblatic anemia A cobalamin binding process observed in the lab within a tissue sample from John a dysfunctional Methionin synthase protein molecule in John's tissue R1 P1 O1 C1 Pt1 Pr1 individual to class o1 pt pt1 btl2:represents r p1 pr1 c c1

Case 2: database entry represents classes Multiple defined subclasses Case 2: database entry represents classes Information entities Domain entities Classes (T-Box) Bio-DB Database record Data item Homo sapiens Megaloblastic anemia Cobalamin bnding Methionin synthase (…) Individuals (A-Box) Uniprot Ensembl Database record about Methionin Synthase in Homo Sapiens Data item, such as "cobalamin binding" in this record John Doe, of which tissue is stored in a biobank and analysed in a lab John's megaloblatic anemia A cobalamin binding process observed in the lab within a tissue sample from John a dysfunctional Methionin synthase protein molecule in John's tissue P1 O1 C1 Pt1 Pr1 P1 O1 C1 Pt1 Pr1 P1 O1 C1 Pt1 Pr1 P1a O1a C1a Pt1a Pr1a or btl2:represents rdf:Type only r

Multiple defined subclasses

Querying: A-box query for individuals Information entities Domain entities Classes (T-Box) Bio-DB Database record Data item Homo sapiens Megaloblastic anemia Cobalamin binding Methionin synthase (…) Individuals (A-Box) Uniprot Ensembl Database record about Methionin Synthase in Homo Sapiens Data item, such as "cobalamin binding" in this record John Doe, of which tissue is stored in a biobank and analysed in a lab John's megaloblatic anemia A cobalamin binding process observed in the lab within a tissue sample from John a dysfunctional Methionin synthase protein molecule in John's tissue R1 P1 O1 C1 Pt1 Pr1 individual to class o1 pt pt1 btl2:represents r p1 pr1 c c1

Querying: T-box query for subclasses Information entities Domain entities Classes Bio-DB Database record Data item Homo sapiens Megaloblastic anemia Cobalamin bnding Methionin synthase (…) Individuals Uniprot Ensembl Database record about Methionin Synthase in Homo Sapiens Data item, such as "cobalamin binding" in this record John Doe, of which tissue is stored in a biobank and analysed in a lab John's megaloblatic anemia A cobalamin binding process observed in the lab within a tissue sample from John a dysfunctional Methionin synthase protein molecule in John's tissue P1 O1 C1 Pt1 Pr1 P1 O1 C1 Pt1 Pr1 P1 O1 C1 Pt1 Pr1 P1a O1a C1a Pt1a Pr1a or btl2:represents rdf:Type only r

Querying: T-box query for subclasses Information entities Domain entities Classes Bio-DB Database record Data item Homo sapiens Megaloblastic anemia Cobalamin bnding Methionin synthase (…) Individuals Uniprot Ensembl Database record about Methionin Synthase in Homo Sapiens Data item, such as "cobalamin binding" in this record John Doe, of which tissue is stored in a biobank and analysed in a lab John's megaloblatic anemia A cobalamin binding process observed in the lab within a tissue sample from John a dysfunctional Methionin synthase protein molecule in John's tissue P1 O1 C1 Pt1 Pr1 P1 O1 C1 Pt1 Pr1 P1 O1 C1 Pt1 Pr1 P1a O1a C1a Pt1a Pr1a Two step DL query: Does subclass exist? if not: no database entry else: determine superclass from bioontology

Competency questions (Q1) Which biological processes have proteins of the kind Proti as participants? BProc1 and (‘has participant’ some Proti) (Q2) In which cellular locations is Proti active in organisms of the type Org1? ‘Cellular component’ and (‘is included in’ some Org1) and (includes some Proti) (Q3) Which proteins are involved in processes of the type BProc in organisms of the type Org1? Protein and (‘ is participant in’ some BProc1 ) and (‘is included in’ some Org1)

Evaluation (one database record) Model Q1 Q2 Q3 Classes Individuals Axioms/ Assertions A- Box bp1001, bp2001, bp3001 cc1001, cc2001, cc3001 p1004 24 51 207 T- BProc1 CComp1 Proti 68 149

Discussion (I) Both modelling solutions: highly productive scaling problems to be expected A-Box solution (prototypical individuals): A-box reasoning more costly Makes existential assumptions T-Box solution (multiple subclasses) Theoretically allows non-referential entries Simplified model: EL++

Discussion (II) Do biological database refer to ontological content? No "real" universal statements on biological entities Even no existential assumption Dispositional statements to be discussed (see paper) Exercise best described as ontological representation of referring individuals Possible use case: non-disruptive querying of Bio-DBs where axioms of the annotation ontologies need to be explored

Conclusion Four ontological approaches - IND, SUBC, DISP and HYB Structure and content of BIO-DBs Solution: Expressiveness, DB retrieval and retrieval based on DL queries Interpretation: Denoted entities as prototypical individuals Creation of defined subclasses Database content as reporting dispositions DB retrieval in the sense that some of these queries can be implemented by means of ontology-based data access (OBDA) approaches. There is a difference regarding what can be done with OBDA and reasoning, the first can be used to target ALC queries, but does not enable subsumption-based reasoning (as with ontologies).

Funding: This work was funded by Conselho Nacional de Aperfeiçoamento de Pessoal de Nível Superior (CAPES) 3914/2014-03 and Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) 140698/2012-4.