The Relation Ontology Barry Smith 1. Concepts, Types and Frames ConceptsFrames Types Relational Structures 2.

Slides:



Advertisements
Similar presentations
Species-Neutral vs. Multi-Species Ontologies Barry Smith.
Advertisements

1 How Philosophy of Science Can Help Biomedical Research Barry Smith
On the Future of the NeuroBehavior Ontology and Its Relation to the Mental Functioning Ontology Barry Smith
1 The Future of Biomedical Informatics Barry Smith University at Buffalo
Application of OBO Foundry Principles in GO Chris Mungall Lawrence Berkeley Labs NCBO GO Consortium.
1 Pax Terminologica Barry Smith Institute for Formal Ontology and Medical Information Science, Saarland University, Saarbrücken.
1 Introduction to Biomedical Ontology Barry Smith University at Buffalo
1 Doing Ontology Over Images Barry Smith. What ontologies are for.
Ontology and the Future of Biomedical Research Barry Smith
1 Intelligence Ontology: A Strategy for the Future Barry Smith University at Buffalo
1 An Ontology of Relations for Biomedical Informatics Barry Smith 10 January 2005.
The Role of Foundational Relations in the Alignment of Biomedical Ontologies Barry Smith and Cornelius Rosse.
1 Introduction to (Geo)Ontology Barry Smith
1 Ontology in 15 Minutes Barry Smith. 2 Main obstacle to integrating genetic and EHR data No facility for dealing with time and instances (particulars)
What is an ontology and Why should you care? Barry Smith with thanks to Jane Lomax, Gene Ontology Consortium 1.
1 The OBO Foundry 2 A prospective standard designed to guarantee interoperability of ontologies from the very start (contrast.
The Problem of Reusability of Biomedical Data OBO Foundry & HL7 RIM Barry Smith.
What is an ontology and Why should you care? Barry Smith with thanks to Jane Lomax, Gene Ontology Consortium 1.
Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at
Underlying Ontologies for Biomedical work - The Relation Ontology (RO) and Basic Formal Ontology (BFO) Thomas Bittner SUNY Buffalo
Using Ontologies to Represent Immunological Networks Lindsay G. Cowell, Anne Lieberman, Anna Maria Masci Duke University Center for Computational Immunology.
1 Logical Tools and Theories in Contemporary Bioinformatics Barry Smith
AN INTRODUCTION TO BIOMEDICAL ONTOLOGY Barry Smith University at Buffalo 1.
VT. From Basic Formal Ontology to Medicine Barry Smith and Anand Kumar.
Pathways and Networks for Realists Barry Smith 1.
Room for Lunch: Arlington Room Room for Evening Reception: Grand Prairie Room.
1 Ontologie als konkretisierte Darstellung der Wirklichkeit Barry Smith.
The RNA Ontology RNAO Colin Batchelor Neocles Leontis May 2009 Eckart, Colin and Jane In Cambridge.
1 BIOLOGICAL DOMAIN ONTOLOGIES & BASIC FORMAL ONTOLOGY Barry Smith.
1 A General Introduction to Biomedical Ontology Barry Smith
Anatomical Information Science Barry Smith
1 The OBO Relation Ontology Genome Biology 2005, 6:R46 based on the fundamental distinction between instances and universals takes instances and time into.
How to Organize the World of Ontologies Barry Smith 1.
New York State Center of Excellence in Bioinformatics & Life Sciences Biomedical Ontology in Buffalo Part I: The Gene Ontology Barry Smith and Werner Ceusters.
1 Part II. The Ontology of Biomedical Reality Some Terminological Proposals.
1 What an Ontology is For Barry Smith University at Buffalo Common Anatomy Reference Ontology Workshop.
1 How Ontologies Create Research Communities Barry Smith
1 Ontology (Science) Barry Smith University at Buffalo
1 The Future of Clinical Bioinformatics: Overcoming Obstacles to Information Integration Barry Smith Brussells, Eurorec Ontology Workshop, 25 November.
Limning the CTS Ontology Landscape Barry Smith 1.
Switch on Webex.
Ontology of Sensors: Some Examples from Biology
Ontological realism as a strategy for integrating ontologies Ontology Summit February 7, 2013 Barry Smith 1.
Intelligence Ontology A Strategy for the Future Barry Smith University at Buffalo
Ontological Engineering Barry Smith Computers and Information in Engineering Conference, Buffalo August 19,
Core 6 (University at Buffalo) Dissemination of Ontology Best Practices Barry Smith (PI) Fabian Neuhaus (Post-Doc) Werner.
Building Ontologies with Basic Formal Ontology Barry Smith May 27, 2015.
Ontology of Disease and the OBO Foundry Chris Mungall NCBO GO Nov 2006.
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
Ontology-based search and knowledge sharing using domain ontologies
How to integrate data Barry Smith. The problem: many, many silos DoD spends more than $6B annually developing a portfolio of more than 2,000 business.
2 3 where in the body ? where in the cell ?
About ontologies Melissa Haendel. And who am I that I am giving you this talk? Melissa Haendel Anatomist, developmental neuroscientist, molecular biologist,
Ontology and the Semantic Web Barry Smith August 26,
What is an ontology and Why should you care? Barry Smith 1.
Need for common standard upper ontology
Lecture 3 BFO: A Standard Upper Level Ontology. 2 The idea of ontological realism Before we build a data model we need to look at the reality we are trying.
Introduction to Biomedical Ontology for Imaging Informatics Barry Smith, PhD, FACMI University at Buffalo May 11, 2015.
1 An Introduction to Ontology for Scientists Barry Smith University at Buffalo
1 The OBO Relation Ontology: Preliminaries Barry Smith
1 Ontology (Science) vs. Ontology (Engineering) Barry Smith University at Buffalo
Tools in Bioinformatics Ontologies and pathways. Why are ontologies needed? A free text is the best way to describe what a protein does to a human reader.
Basic Formal Ontology Barry Smith August 26, 2013.
Building Ontologies with Basic Formal Ontology Barry Smith May 27, 2015.
Upper Ontology Summit The BFO perspective Barry Smith Department of Philosophy, University at Buffalo National Center for Ontological Research National.
What is an ontology and Why should you care?
Ontology in 15 Minutes Barry Smith.
Why do we need upper ontologies? What are their purported benefits?
Ontology in 15 Minutes Barry Smith.
OBO Foundry Update: April 2010
Presentation transcript:

The Relation Ontology Barry Smith 1

Concepts, Types and Frames ConceptsFrames Types Relational Structures 2

Concepts, Types and Frames ConceptsFrames Linguistic Approach Types Relational Structures Scientific Approach 3

4 has_lower_level_granularity TLR2-MyD88 binding TLR2 has_participant LTA binding has_disposition TIR domain has_part TLR2-TLR2 ligand binding TIR-TIR binding process preceded_by regulated_by has_output has_participant TLR2:MyD88 complex MyD88 has_participant TLR-2 signalling pathway

5 how to define relations such as this?

Uses of ‘ontology’ in PubMed abstracts 6

By far the most successful: The Gene Ontology

MKVSDRRKFEKANFDEFESALNNKNDLVHCPSITLFES IPTEVRSFYEDEKSGLIKVVKFRTGAMDRKRSFEKVVIS VMVGKNVKKFLTFVEDEPDFQGGPISKYLIPKKINLMVY TLFQVHTLKFNRKDYDTLSLFYLNRGYYNELSFRVLER CHEIASARPNDSSTMRTFTDFVSGAPIVRSLQKSTIRKY GYNLAPYMFLLLHVDELSIFSAYQASLPGEKKVDTERL KRDLCPRKPIEIKYFSQICNDMMNKKDRLGDILHIILRAC ALNFGAGPRGGAGDEEDRSITNEEPIIPSVDEHGLKVC KLRSPNTPRRLRKTLDAVKALLVSSCACTARDLDIFDD NNGVAMWKWIKILYHEVAQETTLKDSYRITLVPSSDGI SLLAFAGPQRNVYVDDTTRRIQLYTDYNKNGSSEPRLK TLDGLTSDYVFYFVTVLRQMQICALGNSYDAFNHDPW MDVVGFEDPNQVTNRDISRIVLYSYMFLNTAKGCLVEY ATFRQYMRELPKNAPQKLNFREMRQGLIALGRHCVGS RFETDLYESATSELMANHSVQTGRNIYGVDFSLTSVSG TTATLLQERASERWIQWLGLESDYHCSFSSTRNAEDV How to do biology across the genome?

MKVSDRRKFEKANFDEFESALNNKNDLVHCPSITLFESIPTEVRSFYEDEKSGLIKVVKFRTGAMDR KRSFEKVVISVMVGKNVKKFLTFVEDEPDFQGGPIPSKYLIPKKINLMVYTLFQVHTLKFNRKDYDTL SLFYLNRGYYNELSFRVLERCHEIASARPNDSSTMRTFTDFVSGAPIVRSLQKSTIRKYGYNLAPYM FLLLHVDELSIFSAYQASLPGEKKVDTERLKRDLCPRKPIEIKYFSQICNDMMNKKDRLGDILHIILRA CALNFGAGPRGGAGDEEDRSITNEEPIIPSVDEHGLKVCKLRSPNTPRRLRKTLDAVKALLVSSCAC TARDLDIFDDNNGVAMWKWIKILYHEVAQETTLKDSYRITLVPSSDGISLLAFAGPQRNVYVDDTTR RIQLYTDYNKNGSSEPRLKTLDGLTSDYVFYFVTVLRQMQICALGNSYDAFNHDPWMDVVGFEDP NQVTNRDISRIVLYSYMFLNTAKGCLVEYATFRQYMRELPKNAPQKLNFREMRQGLIALGRHCVGS RFETDLYESATSELMANHSVQTGRNIYGVDSFSLTSVSGTTATLLQERASERWIQWLGLESDYHCS FSSTRNAEDVVAGEAASSNHHQKISRVTRKRPREPKSTNDILVAGQKLFGSSFEFRDLHQLRLCYEI YMADTPSVAVQAPPGYGKTELFHLPLIALASKGDVEYVSFLFVPYTVLLANCMIRLGRRGCLNVAPV RNFIEEGYDGVTDLYVGIYDDLASTNFTDRIAAWENIVECTFRTNNVKLGYLIVDEFHNFETEVYRQS QFGGITNLDFDAFEKAIFLSGTAPEAVADAALQRIGLTGLAKKSMDINELKRSEDLSRGLSSYPTRMF NLIKEKSEVPLGHVHKIRKKVESQPEEALKLLLALFESEPESKAIVVASTTNEVEELACSWRKYFRVV WIHGKLGAAEKVSRTKEFVTDGSMQVLIGTKLVTEGIDIKQLMMVIMLDNRLNIIELIQGVGRLRDGG LCYLLSRKNSWAARNRKGELPPKEGCITEQVREFYGLESKKGKKGQHVGCCGSRTDLSADTVELIE RMDRLAEKQATASMSIVALPSSFQESNSSDRYRKYCSSDEDSNTCIHGSANASTNASTNAITTAST NVRTNATTNASTNATTNASTNASTNATTNASTNATTNSSTNATTTASTNVRTSATTTASINVRTSATT TESTNSSTNATTTESTNSSTNATTTESTNSNTSATTTASINVRTSATTTESTNSSTSATTTASINVRTS ATTTKSINSSTNATTTESTNSNTNATTTESTNSSTNATTTESTNSSTNATTTESTNSNTSAATTESTN SNTSATTTESTNASAKEDANKDGNAEDNRFHPVTDINKESYKRKGSQMVLLERKKLKAQFPNTSEN MNVLQFLGFRSDEIKHLFLYGIDIYFCPEGVFTQYGLCKGCQKMFELCVCWAGQKVSYRRIAWEAL AVERMLRNDEEYKEYLEDIEPYHGDPVGYLKYFSVKRREIYSQIQRNYAWYLAITRRRETISVLDSTR GKQGSQVFRMSGRQIKELYFKVWSNLRESKTEVLQYFLNWDEKKCQEEWEAKDDTVVVEALEKG GVFQRLRSMTSAGLQGPQYVKLQFSRHHRQLRSRYELSLGMHLRDQIALGVTPSKVPHWTAFLSM LIGLFYNKTFRQKLEYLLEQISEVWLLPHWLDLANVEVLAADDTRVPLYMLMVAVHKELDSDDVPDG RFDILLCRDSSREVGE 9

10 what cellular component? what molecular function? what biological process?

11 what cellular component? what molecular function? what biological process? GO used in curation of literature

and in integration of databases MouseEcotope GlyProt DiabetInGene GluChem sphingolipid transporter activity 12

The GO Idea MouseEcotope GlyProt DiabetInGene GluChem Holliday junction helicase complex 13

The GO Idea MouseEcotope GlyProt DiabetInGene GluChem sphingolipid transporter activity 14

Clark et al., 2005 part_of is_a 15 GO used in reasoning

How does the Gene Ontology work? with thanks to Jane Lomax, Gene Ontology Consortium 16

The methodology of annotations: tagging scientific literature with terms from GO Model organism databases (MODs) employ scientific curators, who use experimental observations reported in the biomedical literature to link gene products (such as proteins) with GO terms in annotations. International Society of Biocurators

GO provides a controlled system of representations for use in annotating data multi-species multi-disciplinary multi-granularity, from molecules to population 18

Gene products involved in cardiac muscle development in humans 19

$100 mill. invested in literature curation using GO over 11 million annotations relating gene products described in the UniProt, Ensembl and other databases to terms in the GO 20

GO allows a new kind of biological research based on analysis and comparison of the massive quantities of annotations linking GO terms to the gene products described in scientific literature and in scientific databases 21

GO is amazingly successful in overcoming data silo problems but it covers only – cellular components – molecular functions – biological processes 22

RELATION TO TIME GRANULARITY CONTINUANTOCCURRENT INDEPENDENTDEPENDENT ORGAN AND ORGANISM Organism (NCBI Taxonomy) Anatomical Entity (FMA, CARO) Organ Function (FMP, CPRO) Phenotypic Quality (PaTO) Biological Process (GO) CELL AND CELLULAR COMPONENT Cell (CL) Cellular Component (FMA, GO) Cellular Function (GO) MOLECULE Molecule (ChEBI, SO, RnaO, PrO) Molecular Function (GO) Molecular Process (GO) The Open Biomedical Ontologies (OBO) Foundry 23

The OBO Foundry – to extend the GO to enable intelligent integration of gigantic bodies of heterogeneous data across the entire domain of the life sciences, including clinical medicine – to create an evolving, map-like, computable representation of the entire domain of biological and medical reality 24

Initial Candidate Members –GO Gene Ontology –CL Cell Ontology –SO Sequence Ontology –ChEBI Chemical Ontology –PATO Phenotype (Quality) Ontology –FMA Foundational Model of Anatomy –ChEBI Chemical Entities of Biological Interest –CARO Common Anatomy Reference Ontology –PRO Protein Ontology 25 The OBO Foundry

Under development –Disease Ontology –Infectious Disease Ontology –Mammalian Phenotype Ontology –Plant Trait Ontology –Environment Ontology –Ontology for Biomedical Investigations –Behavior Ontology –RNA Ontology –RO Relation Ontology 26 The OBO Foundry

A success story in top-down information integration Ontologies configured as extensions of a single upper level ontology (BFO) Used by 100s of researchers to promote interoperability of experimental data in scores of high- throughput domains of biology and medicine via semantic annotation 27

The linguistic approach Bottoms-up, focused on linguistic properties manifested by the contents of a large corpus viewed from a cognitive perspective (mapping/modeling meanings or concepts rather than entities in reality) 28

Automatic mining of “assocations” from MEDLINE FACTA: Finding Associated Concepts with Text Analysis –What diseases are related to a particular chemical? –What proteins are related to a particular disease? 29

For the linguistic approach fiction may be no less important than fact English has no privileged status (the larger the corpus, the better) consistency (and thus additivity) of annotations is not important, because cognitive perspectives differ goal is automatic generation of semantic annotations via pattern- matching 30

For the scientific approach factual discourse alone important English is lingua franca regimentation is allowed goal of truth: to create a single computer-processable map of reality via painstaking Handarbeit truth is one  we strive for consistency of annotations 31

The linguistic approach is concerned with knowledge representation The scientific approach is concerned with reality representation 32

OBO Relation Ontology (RO 1.0) Foundationalis_a part_of Spatiallocated_in contained_in adjacent_to Temporaltransformation_of derives_from preceded_by Participationhas_participant has_agent 33

Relation Ontology supports consistent linkage of OBO Foundry ontologies through a common system of formally defined relations to enable reasoning both within and across ontologies, and thus also within and between the literature annotated in its terms 34

Relation Ontology instance_of is_a (= is a subtype of) depends_on part_of inheres_in has_input has_participant …. 35

Basic Formal Ontology (BFO) Continuant Occurrent (Process, Event) Independent Continuant Dependent Continuant 36

Fundamental Dichotomy Continuants preserve their identity through change Occurrents (aka processes) –have temporal parts –unfold themselves in successive phases –exist only in their phases –have all their parts of necessity 37

instance_of Continuant Occurrent process, event Independent Continuant thing Dependent Continuant quality types instances 38

types vs. instances compare OWL: T-box vs. A-box (terminology vs. assertions) 39

3 kinds of (binary) relations Between types human is_a mammal human heart part_of human Between an instance and a type this human instance_of the type human this human allergic_to the type tamiflu Between instances Mary’s heart part_of Mary Mary’s aorta connected_to Mary’s heart 40

depends_on Continuant Occurrent process, event Independent Continuant thing Dependent Continuant quality quality depends on bearer 41

Dependent continuants the whiteness quality of this cheese your role as lecturer the disposition of this peach to ripen 42

depends_on Continuant Occurrent process Independent Continuant thing Dependent Continuant quality temperature depends on bearer 43

depends_on Continuant Occurrent process, event Independent Continuant thing Dependent Continuant quality, … event depends on participant 44

Type-level relations presuppose the underlying instance-level relations A is_a B =def. A and B are types and all instances of A are instances of B A part_of B =def. All instances of A are instance-level-parts-of some instance of B 45

Rule for including relations in RO In every case we need to check, before we add a relation [A] R [B] to RO, that, for some set of A and B terms we have data about the As and data about the Bs which is such that all the instances of A stand in R to some instance of B e.g. all the instances of cell membrane stand in part_of to cell 46

The assertions linking terms in ontologies must hold universally Hence all type-level relations in RO are provided with All-Some definitions (For linguists, Some-Some relations are equally important) 47

Rule for including relations in RO Before including a new relation in RO ask yourself: can the relation can be easily defined in terms of existing relations e.g. define ‘synaptically_connected_to’ in terms of: is_connected_to, is_a and synapse 48

Including only All-Some relations means: All relations evaluable as 1.Transitive 2.Symmetric 3.Reflexive 4.Anti-Symmetric All relations support logical reasoning – as contrasted with: is_related_to, is_associated_with, is_narrower_in_meaning_than … 49

Reasoning should be able to cascade from one relational assertion (A R 1 B) to the next (B R 2 C). Find all DNA binding proteins should also Find all transcription factor proteins because –Transcription factor is_a DNA binding protein Only the All-Some structure guarantees such cascading of relational assertions 50

Why All-Some ? If you know A part_of B, and B part_of C then whichever A you choose, the instance of B of which it is a part will be included in some C, which will include as part also the A with which you began 51

A part_of B, B part_of C... The All-Some structure of the definitions in the RO allows cascading of inferences (i) within ontologies (ii) between ontologies (iii) between ontologies and repositories of instance-data 52

Organisms are continuants they are entities which endure through time through gain and loss of parts Processes are occurrents they are entities which unfold through time, and have all their parts as a matter of necessity 53

human testis part_of adult human being but not human being has_part human testis and not even male human being has_part human testis 54

part_of for continuant types A part_of B =def. For all x, t if x instance_of A at t then there is some y, y instance_of B at t and x instance_level_part_of y at t cell membrane part_of cell 55

part_of for occurrent types A part_of B =def. For all x, if x instance_of A then there is some y, y instance_of B and x instance_level_part_of y EVERY A IS PART OF SOME B 56

is_a (for continuants) A is_a B  For all x, t if x instance_of A at t then x instance_of B at t abnormal cell is_a cell adult human is_a human but not: adult is_a child 57

Lacks Instance-type level p lacks U with respect to r at time t =def. there is no instance u of U such that p stands in r to u at t. Type-type level C1 lacks C2 with respect to r =def. for all c,t, if c instance of C1 at t then c lacks C2 with respect to r at time t. Need a way to state on top of this: that C1s normally stand in r to some C2 58

transformation_of A transformation_of B =Def. Every instance of A was at some earlier time an instance of B –adult transformation_of child 59

transformation_of 60 c at t 1 C c at t C 1 time same instance pre-RNAmature RNA adultchild

C c at t C 1 c 1 at t 1 C' c' at t time instances zygote derives_from ovum sperm derives_from correction to original Genome Biology paper: derivation is never one-to-one 61

two continuants fuse to form a new continuant C c at t C 1 c 1 at t 1 C' c' at t fusion derives_from 62

one initial continuant is replaced by two successor continuants C c at t C 1 c 1 at t 1 C 2 c 1 at t 1 fission derives_from 63

one continuant detaches itself from an initial continuant, which itself continues to exist C c at t c at t 1 C 1 c 1 at t budding derives_from combined with transformation_of 64

one continuant absorbs a second continuant while itself continuing to exist C c at t c at t 1 C' c' at t capture derives_from combined with transformation_of 65

ISO “Concept logic” for mereology Toronto part_of Ontario brain part_of central nervous system ISO, “Guidelines for the Construction, Format, and Management of Monolingual Controlled Vocabularies” ANSI/NISO Z ) sees these as examples of the same part_of relation 66

Instances vs. types Instance-level relations and type-level relations have logically distinct properties Type relations are liftings of instance relations 67

What is symmetric on the level of instances need not be symmetric on the level of types adjacency on the instance level is always symmetric 68

Not however on the level of types: seminal vesicle adjacent_to urinary bladder Not: urinary bladder adjacent_to seminal vesicle 69

Similarly, on the level of types, while: nucleus adjacent_to cytoplasm it is not the case that cytoplasm adjacent_to nucleus 70

continuous_with on the instance level is always symmetric a continuous_with b on the instance level means: there is a fiat boundary between a and b if a continuous_with b, then b continuous_with a 71

72

continuous_with as a relation between types A continuous_with B =Def. for all x, if x instance-of A then there is some y such that y instance_of B and x continuous_with y 73

continuous_with is not symmetric Consider lymph node and lymphatic vessel Each lymph node is continuous with some lymphatic vessel, but there are lymphatic vessels (e.g. lymphs and lymphatic trunks) which are not continuous with any lymph nodes 74

3 kinds of binary relations Between types human is_a mammal cell nucleus part_of cell Between an instance and a type this human instance_of the type human this human allergic_to the type penicillin Between instances Mary’s heart part_of Mary Mary’s aorta connected_to Mary’s heart 75

Linguistic vs. scientific approach to semantic annotation Semantic annotation can provide support for logical reasoning across the content of scientific literature only if the distinctions between relations at the type level and relations at the instance level are taken account of. (Many?) linguistic accounts of relations do not take account of this distinction. 76

Why not? Because linguistic accounts (like dictionaries) focus on relations between meanings, not on instances in reality Because linguistic accounts focus on what is meaningfully combinable, rather than on what is logically inferrable Because linguistic accounts focus on relations captured grammatically, not on relations observed experimentally and captured in scientific theories 77

The Relation Ontology Barry Smith 78 Sophia Ananiadou UK National Centre for Text Mining

79 Do linguistics and biology truly ever meet?