Towards Interoperability of Biomedical Ontologies Workshop on Biological Upper Ontologies BioTop and Chemistry Ontology Stefan Schulz, Elena Beisswanger,

Slides:



Advertisements
Similar presentations
CH. 2 Chemistry of life Section 1 Nature of matter
Advertisements

ELAINE N. MARIEB EIGHTH EDITION 3 Copyright © 2006 Pearson Education, Inc., publishing as Benjamin Cummings PowerPoint ® Lecture Slide Presentation by.
Anatomical boundaries and immaterial objects Stefan Schulz Department of Medical Informatics University Hospital Freiburg (Germany)
Schulz S, Boeker M – BioTopLite – An Upper Level Ontology for the Life Sciences – ODLS 2013 BioTopLite : An Upper Level Ontology for the Life Sciences.
Ontologies - Design principles Cartic Ramakrishnan LSDIS Lab University of Georgia.
A Description Logics Approach to Clinical Guidelines and Protocols Stefan Schulz Department of Med. Informatics Freiburg University Hospital Germany Udo.
Genetica per Scienze Naturali a.a prof S. Presciuttini 1. Enzymes build everything Enzymes allow nutrients to be digested; they convert food into.
. Class 1: Introduction. The Tree of Life Source: Alberts et al.
RnaO.owl Colin Batchelor Royal Society of Chemistry, UK
FMA: a domain reference ontology Comments on Cornelius Rosse’s talk Anita Burgun WG6 meeting, Rome 29 Apr- 2 May 2005.
1 Logical Tools and Theories in Contemporary Bioinformatics Barry Smith
VT. From Basic Formal Ontology to Medicine Barry Smith and Anand Kumar.
Biological Ontologies Neocles Leontis April 20, 2005.
The RNA Ontology RNAO Colin Batchelor Neocles Leontis May 2009 Eckart, Colin and Jane In Cambridge.
1 BIOLOGICAL DOMAIN ONTOLOGIES & BASIC FORMAL ONTOLOGY Barry Smith.
Ontological Model for Colon Carcinoma: A Case Study for Knowledge Representation in Clinical Bioinformatics Kumar A 1,2, Yip L 3, Jaremek M 2, Scheib H.
Bio 1000 Human Biology for Non-Majors. Introduction to Biology and Chemistry Biology is the study of life.
The Science of Life Biology unifies much of natural science
Chapter 2 The Chemistry of Life.
Chapter 2 The Chemistry of Life.
Developing an OWL-DL Ontology for Research and Care of Intracranial Aneurysms – Challenges and Limitations Holger Stenzhorn, Martin Boeker, Stefan Schulz,
Standardization of Anatomy Parts and Wholes – From Function to Location Stefan Schulz Department of Medical Informatics University Hospital Freiburg (Germany)
Stefan Schulz Medical Informatics Research Group
Alignment of the UMLS Semantic Network with BioTop Methodology and Assessment Stefan Schulz University Medical Center, Freiburg, Germany Elena Beisswanger.
U.S. Patent and Trademark Office Technology Center 1600 Michael P. Woodward Unity of Invention: Biotech Examples.
Open Biomedical Ontologies. Open Biomedical Ontologies (OBO) An umbrella project for grouping different ontologies in biological/medical field –a repository.
The Chemistry of Life:  The nature of matter  Properties of water.
Towards a Top-Level Ontology for Molecular Biology Stefan Schulz 1, Elena Beisswanger 2, Udo Hahn 2, Joachim Wermter 2, 1 Freiburg University Hospital,
CHAPTER 2 CHEMISTRY OF LIFE. 2-1 The Nature of Matter.
Ch. 2: “Chemistry of Life”
Essentials of Human Anatomy & Physiology Copyright © 2003 Pearson Education, Inc. publishing as Benjamin Cummings Slides 3.38 – 3.54 Seventh Edition Elaine.
Brief Introduction to Biology and Genetics Ryan McConnell STAT 499 September 17, 2004.
Ontological Foundations of Biological Continuants Stefan Schulz, Udo Hahn Text Knowledge Engineering Lab University of Jena (Germany) Department of Medical.
Copyright © 2008 Pearson Education, Inc., publishing as Benjamin Cummings BIO Biochemistry.
From GENIA to BioTop – Towards a Top-Level Ontology for Biology Stefan Schulz, Elena Beisswanger, Udo Hahn, Joachim Wermter, Anand Kumar, Holger Stenzhorn.
Introduction to Chemistry Chapter 2. Introduction Matter - anything that has mass Made of elements (92 naturally occurring Element - substance that cannot.
What our bodies are made of Chemistry of Cells. Nature of Matter All matter is made of atoms. Atoms consist of electrons, protons and neutrons. Molecules.
Biochemistry An Introduction to the Chemistry of Life for Biology Students.
Basic Chemistry. Matter - Anything that has mass and takes up space *everything we see, touch, taste or smell.
A View-based Methodology for Collaborative Ontology Engineering (VIMethCOE) Ernesto Jiménez Ruiz Rafael Berlanga Llavorí Temporal Knowledge Bases Group.
Naïve Approach “ABCC5 ⊑  encodes.MRP5” Critique There may be ABCC5 (sensu nucleotide chain) instances that happen to never encode any instance of the.
Biology Rainier Jr/Sr High School Mr. Taylor. An atom is the smallest particle of an element. An element is made of only one kind of atom and cannot be.
Towards a Top-Domain Ontology for Linking Biomedical Ontologies Holger Stenzhorn a,b Elena Beißwanger c Stefan Schulz a a Department of Medical Informatics,
2/10/2014 to 2/14/2014. DNA structure In 1952, scientist Rosalind Franklin discovered that DNA is two chains of molecules in a spiral form. The actual.
General Chapter 6 Assessment answers. Section 1 1. What is chemistry? The study of matter 2. What is the definition of matter? Anything that has mass.
Basic Formal Ontology Barry Smith August 26, 2013.
Chapter 2 Chemistry of Life Section 1: Nature of Matter Section 2: Water and Solutions Section 3: Chemistry of Cells Section 4: Energy and Chemical Reactions.
Dive in! Chemistry Review and Properties of Water!
1 Ontological Foundations For SysML Henson Graves September 2010.
BASIC CHEMISTRY I thought this was biology? Objective: 1.To Understand what causes chemical bonds 2.To be able to read an equation 3.To Understand polarity.
ELAINE N. MARIEB EIGHTH EDITION 3 Copyright © 2006 Pearson Education, Inc., publishing as Benjamin Cummings PowerPoint ® Lecture Slide Presentation by.
Stefan Schulz, Martin Boeker, Holger Stenzhorn: How Granularity Issues Concern Biomedical Ontology Integration Stefan Schulz, Martin Boeker, Holger Stenzhorn.
Introduction to Ontology Introductions Alan Ruttenberg Science Commons.
Basic Biological Chemistry
Cells and Tissues.
DEVELOPING AN OWL-DL ONTOLOGY FOR RESEARCH AND CARE OF
Mid-Term Review Day 1.
AO1- development of science knowledge- KS3
A Description Logics Approach to Clinical Guidelines and Protocols
The Chemistry of Life Chapter 2.
Cells and Tissues.
Cells and Tissues.
Chemistry of Life.
Cells and Tissues.
Standardization of Anatomy Parts and Wholes – From Function to Location Stefan Schulz Department of Medical Informatics University Hospital Freiburg (Germany)
Cells and Tissues.
A Description Logics Approach to Clinical Guidelines and Protocols
Standardization of Anatomy Parts and Wholes – From Function to Location Stefan Schulz Department of Medical Informatics University Hospital Freiburg (Germany)
Presentation transcript:

Towards Interoperability of Biomedical Ontologies Workshop on Biological Upper Ontologies BioTop and Chemistry Ontology Stefan Schulz, Elena Beisswanger, Holger Stenzhorn Freiburg University, Jena University, Saarland University Germany Dagstuhl Seminar March 2007

Organization of the Session 1.Holger Stenzhorn: Overview of BioTop 2.Elena Beisswanger: Interfacing Ontologies 3.Stefan Schulz: Is BFO fit for BioTop and OBO?

Outline (1)  Situation: Many different bio-ontologies exist, covering related or even overlapping subdomains of (molecular) biology / biomedicine  Challenge: How to interlink ontologies and how to enable interoperability?

Outline (2)  Our solution: BioTop, an integrative top level for biomedical domain ontologies with BFO as its top layer  Not only integration and interlinkage but also cleansing / 'ontologicalization' of the OBOs by mapping to strict upper level

Upper Ontology / Top Ontology Domain Upper Ontology / Top Domain Ontology Domain Ontology OBO (Gene Ontology, Sequence Ontology, Cell Ontology, Mouse Anatomy, ChEBI, …) Ontological Layers

Upper Ontology / Top Ontology Domain Upper Ontology / Top Domain Ontology Domain Ontology Transcription DNA-dependent transcription antisense RNA transcription mRNA transcription rRNA transcription tRNA transcription … (from Gene Ontology) OBO (Gene Ontology, Sequence Ontology, Cell Ontology, Mouse Anatomy, ChEBI, …) Ontological Layers

Upper Ontology / Top Ontology Domain Upper Ontology / Top Domain Ontology Domain Ontology DOLCE BFO OBO (Gene Ontology, Sequence Ontology, Cell Ontology, Mouse Anatomy, ChEBI, …) Ontological Layers

Upper Ontology / Top Ontology Domain Upper Ontology / Top Domain Ontology Domain Ontology DOLCE BFO Entity Continuant Dependent Continuant Realizable Entity Function Role Independent Continuant Object Object Aggregate Occurrent (from BFO) OBO (Gene Ontology, Sequence Ontology, Cell Ontology, Mouse Anatomy, ChEBI, …) Ontological Layers

Domain Upper Ontology / Top Domain Ontology Upper Ontology / Top Ontology Domain Ontology DOLCE BFO OBO (Gene Ontology, Sequence Ontology, Cell Ontology, Mouse Anatomy, ChEBI, …) Ontological Layers

Upper Ontology / Top Ontology Domain Upper Ontology / Top Domain Ontology Domain Ontology DOLCE BFO OBO (Gene Ontology, Sequence Ontology, Cell Ontology, Mouse Anatomy, ChEBI, …) Ontological Layers

Upper Ontology / Top Ontology Domain Upper Ontology / Top Domain Ontology Domain Ontology DOLCE BFO Biology Domain: Organism Body Part Cell Cell Component Tissue Protein Nucleic Acid DNA RNA Biological Function Biological Process OBO (Gene Ontology, Sequence Ontology, Cell Ontology, Mouse Anatomy, ChEBI, …) Ontological Layers

Upper Ontology / Top Ontology Domain Upper Ontology / Top Domain Ontology Domain Ontology Simple Bio Upper Ontology GFO-Bio DOLCE BFO GENIA OBO (Gene Ontology, Sequence Ontology, Cell Ontology, Mouse Anatomy, ChEBI, …) Ontological Layers

Upper Ontology / Top Ontology Domain Upper Ontology / Top Domain Ontology Domain Ontology Simple Bio Upper Ontology GFO-Bio GENIA DOLCE BFO Ontological Layers BioTop OBO (Gene Ontology, Sequence Ontology, Cell Ontology, Mouse Anatomy, ChEBI, …)

BioTop Overview (1)  Covers fundamental classes and relations in the biomedical domain (with relations taken from RO but also extending the RO)  Provides verbal and formal definitions (OWL-DL class restrictions, domain/range restrictions)

BioTop Overview (2)  BFO as top level (might be other upper ontology, e.g. DOLCE)  BioTop design principles: single hierarchy (pre-inferred ontology), (full) definitions, defined relations, disjoint partitions

Development  Implemented in OWL-DL to allow for the use of reasoners to check consistency and classify the ontology  Developed using Protégé as editor and Pellet and Fact++ as reasoners  Using BFO and RO top-level classes and relations via OWL import facility

Ontology Facts  Class number  Total classes: 242  Primitive classes: 100  Defined classes: 142  Class restriction number  Total restrictions: 325  Existential restrictions : 194  Universal restrictions: 122

Ontology Classification Idea  Actual complex ontology structure „hidden“ in flat structure via restrictions  Need to employ an OWL-DL reasoner to classify the asserted version of the ontology  The inferred version has the proper stucture (e.g. structure the flat list of 'Aggregates')

Ontology – Before Classification

Ontology – After Classification

Further Information  Website:  Mailing list:  Comments are always highly appreciated!

Organization of the Session 1.Holger Stenzhorn: Overview of BioTop 2.Elena Beisswanger: Interfacing Ontologies 3.Stefan Schulz: Is BFO fit for BioTop and OBO?

Alignment Step  Find “correspondence” and “subclass” relations between domain ontology classes and BioTop classes  Inheritance of class restrictions and taxonomic links (precondition for consistency check)

Analysis Step (1)  Find overlaps  Candidate for true duplicates: ChEBI “proteins” and FMA “protein” (Ask ChEBI and FMA experts. If they agree, introduce cross-reference.)  Seeming duplicates: ChEBI “proteins” / FMA “protein” and SO “protein” (Ask SO experts. If they agree, rename SO “protein” as “protein sequence”)

Analysis Step (2)  Find inconsistencies  CO “spheroplast” („A cell, usually of bacteria or yeast, which has partially lost its cell wall.“). CO states:  “spheroplast” is-a “procaryotic cell” AND  “spheroplast” is-a “fungal cell” (Replace AND by OR)

Analysis Step (3)  Find gaps (e.g. classes and relations mentioned implicitly within the name of another class)  Missing intermediate classes: SO “chromosome structure variation” and SO “mutation affecting gene structure” (Introduce classes “variation” and “mutation”)  Missing relations: FMA “protein” and FMA “protein complex” (Introduce relation “protein complex” has- part “protein”)

Correction and Refinement Step  Correction  True duplicates: Introduce cross references  Seeming duplicates: Refine class names  Inconsistencies: Remodel  Refinement (define implicitly mentioned classes explicitly)  Introduce lacking intermediate classes  Add relations within and between ontologies

Integration via BioTop  Implicit mentions of protein  GO: molecular function (protein* function)  GO: biological process (process that has a protein* as agent)  GO: cellular component (cellular location of protein*)  Explicit mentions of protein  SO: protein (protein sequence)  ChEBI: proteins (protein molecule)  FMA: protein (protein molecule)  FMA: protein complex  GO: protein transport (... that has a protein as agent)  GO: protein binding (... by protein)  CO: protein secreting cell ...and others *More precise: gene product inheres-in has-agent has-part

Preliminary Conclusions (Integration of domain ontologies)  Cross validation between BioTop and domain ontologies  Alignment enforces clarification of actual meaning of domain ontology classes  Alignment motivates to introduce classes and relations explicitly which have previously been mentioned only implicitly (e.g. in class names)  Increase of completeness of the over-all representation of domain knowledge

Open Issues for BioTop (1)  Exact meaning of fundamental domain terms  “What is a protein?” „What is a cell?“  Cut-off to domain ontologies  Example: “protein”, “protein complex”, “protein subunit”, “glycoprotein”  Classes referred to in BioTop class definitions need to be in BioTop (Example: protein has-part polypeptide chain has-part amino acid has-part nitrate atom)  How “picky” must we be? (Example: Distinguish between amino acid and amino acid residue?)

Cut-off to domain ontologies  Have an overlap of the most upper class levels of the domain ontologies Upper Ontology / Top Ontology Domain Upper Ontology / Top Domain Ontology Domain Ontology overlap Upper Ontology / Top Ontology Domain Upper Ontology / Top Domain Ontology  Select the most important classes from the domain ontologies even if they are deeper down overlap

Open Issues for BioTop (2)  BioTop relations (OBO relation ontology not sufficient)  What is the relation linking “gene” to “protein”? (“encodes”)  What is the relation linking “cell” to “organism”? (“originates from”)

Open Issues in Alignment  Reasonable candidates for alignment  Top level classes of domain ontologies (but e.g. SO “consequences of mutation” on a higher level than SO “region”)  Selected fundamental classes of domain ontologies  Right place for new / refined classes and relation definitions  BioTop? The OBO and RO?

Organization of the Session 1.Holger Stenzhorn: Overview of BioTop 2.Elena Beisswanger: Interfacing Ontologies 3.Stefan Schulz: Is BFO fit for BioTop and OBO?

BFO BioTop Entity Continuant Dependent Continuant Realizable Entity Function Role Independent Continuant Object Object Aggregate Occurrent (from BFO) Is BFO fit for OBO and BioTop? Case Study on Anatomical and Chemical Entities

BFO: snap Ontology snap:Continuant snap:Independent continuant snap:Objectsnap:FiatObjectPart snap:ObjectAggregate Df: An independent continuant entity that is spatially extended, maximally self- connected and self-contained (the parts of a substance are not separated from each other by spatial gaps), and possesses an internal unity; the identity of substantial objects is independent of that of other entities and can be maintained through time and through loss and gain of parts and qualities. Examples: an organism, a chair, a cell, a lung, an apple. A independent continuant entity that is part of a object but is not demarcated by any physical discontinuities. Examples: upper and lower lobes of the left lung, the dorsal and ventral surfaces of the body, the east side of Copenhagen, the lower right portion of a human torso.

Object or FiatObjectPart ? Lung Surface vs. Pleural Space: Bona Fide Boundary Lung Hilus: Entry of Bronchi, veins and arteries. Fiat Boundary

Object or FiatObjectPart ? Nearly all seemingly well-delimited parts of biological organisms exhibit somewhere continuous spatial transitions into neighboring anatomical parts, which can be viewed in different ways by different disciplines. They have to be delineated partly by fiat before they can be spoken of in a well defined way. Schulz & Johansson: Continua in Biological Systems. Forthcoming in: The Monist, 2007

FiatObjectPart  BFO:“.…not demarcated by any physical discontinuities”.  How is the lung demarcated from the respiratory tract ?  Is there a physical discontinuity between the extrapulmonar and the intrapulmonar segments of tubular structures?  Other examples: Brain, Heart, Kidney, Liver, Hand, Fingernail, Cell Nucleus…  Which anatomical structures are true BFO:objects ?

BFO: snap Ontology snap:Continuant snap:Independent continuant snap:Object snap:FiatObjectPart snap:ObjectAggregate Df: An independent continuant entity that is spatially extended, maximally self- connected and self-contained (the parts of a substance are not separated from each other by spatial gaps), and possesses an internal unity; the identity of substantial objects is independent of that of other entities and can be maintained through time and through loss and gain of parts and qualities. Examples: an organism, a chair, a cell, a lung, an apple. Df: A independent continuant entity that is a mereological sum of separate objects. Examples: a heap of stones, a group of commuters on the subway, a collection of bacteria, a flock of geese, the patients in a hospital, a symphony orchestra.

An Object E.coli

An ObjectAggregate E.coli BFO: “A independent continuant entity that is a mereological sum of separate objects.”

An Object E.coli

An Object ? E.coli

An Object ?? H H H

H H H H H H H H H H H H H H

H H H H H H H H H H H 0,0001% nuclei 99,9999% empty space “filled” by probability distributions of electrons

BFO Critique  Every object appears – under a fine grained view as an aggregate of non connected elementary particles  Whether something is an object or an object aggregate, depends on the granularity of the view  Biologically relevant material continuants span over a huge range of granularity: From populations to atoms  A realist ontology claims to describe the world as it is, not as it is perceived  Granularity should or should not matter in a realist ontology ??

An ObjectAggregate E.coli BFO: “A independent continuant entity that is a mereological sum of separate objects.” separate separate ?

Seperatedness / Connectedness strength of connection mechanic adhesion electro- static adhesion Hydogen Bridges (surfaces) (chemicals) Van- der- Waals Metallic Non Polar Covalent Polar Covalent Ionic

Continuum between Types of Chemical Bonds

Seperatedness / Connectedness  Depends on  kinds of surfaces and mechanic adhesion and cohesion forces  types of chemical bonds  strengths of chemical bonds  number of chemical bonds  Is a continuous property, and therefore problematic as a criterion for the upper level distinction of ontological categories

Conclusion  Critique of domain level by applying the rigor of an upper level ontology but also…  Critique of the upper level ontology by assessing the appropriateness of upper level categories from the perspective of the domain level Ontology Intergration: Upper Level, Top Domain, Domain has two aspects:

BFO BioTop Domain Ontologies Top-down validation: Do domain ontologies comply with upper-level principles? Bottom-up validation: Do upper ontologies provide adequate means for formally and unambiguously describing the domain?

Towards Interoperability of Biomedical Ontologies Workshop on Biological Upper Ontologies BioTop and Chemistry Ontology Stefan Schulz, Elena Beisswanger, Holger Stenzhorn Freiburg University, Jena University, Saarland University Germany Dagstuhl Seminar March 2007