Species and Classification in Biology Barry Smith

Slides:



Advertisements
Similar presentations
1 Five Steps to Interoperability (in the domain of scientific ontology) Barry Smith.
Advertisements

Upper Ontology Summit Tuesday March 14 The BFO perspective Barry Smith Department of Philosophy, University at Buffalo National Center.
Enhancing GO for the sake of clinical bionformatics Anand Kumar IFOMIS, University of Leipzig/Saarbrücken.
Ontology in Buffalo Barry Smith. 2 Ontology (phil.) The science of being Ontologies (tech.) Standardized classification systems which enable data from.
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U.
Introduction to Functional Analysis J.L. Mosquera and Alex Sanchez.
1 ANATOMY AND TIME Barry Smith. 2 SNAP AND SPAN 3 To understand relations between universals Reference to times and instances are important A derives.
1 Introduction to Ontology: Terminology Barry Smith with thanks to Werner Ceusters, Waclaw Kusnierczyk, Daniel Schober.
1 An Ontology of Relations for Biomedical Informatics Barry Smith 10 January 2005.
The Role of Foundational Relations in the Alignment of Biomedical Ontologies Barry Smith and Cornelius Rosse.
1 Introduction to (Geo)Ontology Barry Smith
1 Beyond Concepts Barry Smith
1 Ontology in 15 Minutes Barry Smith. 2 Main obstacle to integrating genetic and EHR data No facility for dealing with time and instances (particulars)
Thomas Bittner and Barry Smith IFOMIS (Saarbrücken) Normalizing Medical Ontologies Using Basic Formal Ontology.
STOP Barry Smith Smart Terminologies via Ontological Principles.
On the Application of Formal Principles to Life Science Data: A Case Study in the Gene Ontology Barry Smith * Jacob Köhler † Anand Kumar * *
1 Logical Tools and Theories in Contemporary Bioinformatics Barry Smith
1 Forms of Life Barry Smith 2.
AN INTRODUCTION TO BIOMEDICAL ONTOLOGY Barry Smith University at Buffalo 1.
VT. From Basic Formal Ontology to Medicine Barry Smith and Anand Kumar.
Pathways and Networks for Realists Barry Smith 1.
1 A General Introduction to Biomedical Ontology Barry Smith
1 The OBO Relation Ontology Genome Biology 2005, 6:R46 based on the fundamental distinction between instances and universals takes instances and time into.
What is an Ontology? AmphibiaTree 2006 Workshop Saturday 8:45–9:15 A. Maglia.
The Logic of Biological Classification Barry Smith March 2004.
Reference Ontologies, Application Ontologies, Terminology Ontologies Barry Smith
1 The Future of Clinical Bioinformatics: Overcoming Obstacles to Information Integration Barry Smith Brussells, Eurorec Ontology Workshop, 25 November.
Why, in the future, all sciences will be computer sciences Barry Smith.
Chapter 1 Reading Quiz What is the lowest level of matter?
BTN323: INTRODUCTION TO BIOLOGICAL DATABASES Day2: Specialized Databases Lecturer: Junaid Gamieldien, PhD
The Foundational Model of Anatomy and its Ontological Commitment(s) Stefan Schulz University Medical Center, Freiburg, Germany FMA in OWL meeting November.
GO and OBO: an introduction. Jane Lomax EMBL-EBI What is the Gene Ontology? What is OBO? OBO-Edit demo & practical What is the Gene Ontology? What is.
Introduction to Ontology Barry Smith August 11, 2012.
Core 6 (University at Buffalo) Dissemination of Ontology Best Practices Barry Smith (PI) Fabian Neuhaus (Post-Doc) Werner.
GENE ONTOLOGY FOR THE NEWBIES Suparna Mundodi, PhD The Arabidopsis Information Resources, Stanford, CA.
Studying Life Vodcast 1.3 Unit 1: Introduction to Biology.
The Gene Ontology project Jane Lomax. Ontology (for our purposes) “an explicit specification of some topic” – Stanford Knowledge Systems Lab Includes:
What is an Ontology? An ontology is a specification of a conceptualization that is designed for reuse across multiple applications and implementations.
Gene Ontology TM (GO) Consortium Jennifer I Clark EMBL Outstation - European Bioinformatics Institute (EBI), Hinxton, Cambridge CB10 1SD, UK Objectives:
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
Taken from Schulze-Kremer Steffen Ontologies - What, why and how? Cartic Ramakrishnan LSDIS lab University of Georgia.
Ontological Foundations of Biological Continuants Stefan Schulz, Udo Hahn Text Knowledge Engineering Lab University of Jena (Germany) Department of Medical.
What I SHOULD Have Learned in 7 th Grade. Energy Transfers Describe that thermal (heat) energy transfers from warmer objects to cooler ones until the.
Sharing Ontologies in the Biomedical Domain Alexa T. McCray National Library of Medicine National Institutes of Health Department of Health & Human Services.
To Boldly GO… Amelia Ireland GO Curator EBI, Hinxton, UK.
What is an Ontology? A representation of knowledge in a domain In theory Thomas Gruber (1993) “An ontology is a formal, explicit specification of a shared.
Ontologies Working Group Agenda MGED3 1.Goals for working group. 2.Primer on ontologies 3.Working group progress 4.Example sample descriptions from different.
Knowledge Representation. Keywordsquick way for agents to locate potentially useful information Thesaurimore structured approach than keywords, arranging.
1 A View of Life. 2 The Human Genome Project mapped the complete set of human genes Genomics analyses the DNA sequence of an organism While genomics will.
1 An Introduction to Ontology for Scientists Barry Smith University at Buffalo
1 How to build an ontology Barry Smith
1 The OBO Relation Ontology: Preliminaries Barry Smith
What I SHOULD Have Learned in Life Science Class
Life Science. Explain that cells are the basic unit of structures and function of living organisms. Cells are the basic unit of structures of living organisms.
Tools in Bioinformatics Ontologies and pathways. Why are ontologies needed? A free text is the best way to describe what a protein does to a human reader.
1 The Logic of Biological Classification Barry Smith
Ontologies COMP6028 Semantic Web Technologies Dr Nicholas Gibbins
“The importance of science is not that it answers questions, but that it continually discovers new questions to ask.” Dr. Frank Oppenheimer.
New York State Center of Excellence in Bioinformatics & Life Sciences R T U Discovery Seminar /UE 141 MMM – Spring 2008 Solving Crimes using Referent.
1 Standards and Ontology Barry Smith
Knowledge Representation Part I Ontology Jan Pettersen Nytun Knowledge Representation Part I, JPN, UiA1.
COMP6215 Semantic Web Technologies
Towards a Computational Paradigm for Biological Structure
Ontology From Wikipedia, the free encyclopedia
Ontological Foundations for Biomedical Sciences
Ontology in 15 Minutes Barry Smith.
Introduction to Applied and Theoretical Ontology Barry Smith
What is an Ontology An ontology is a set of terms, relationships and definitions that capture the knowledge of a certain domain. (common ontology ≠ common.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Ontology in 15 Minutes Barry Smith.
Presentation transcript:

Species and Classification in Biology Barry Smith

2

3 DNA m

ifomis.org4 DNA Protein Organelle Cell Tissue Organ Organism m m m

ifomis.org5 New golden age of classification* ~ 30 million species 30,000 genes in human 200,000 proteins 100s of cell types 100,000s of disease types 1,000,000s of biochemical pathways (including disease pathways) *… legacy of Human Genome Project

ifomis.org6 DNA Protein Organelle Cell Tissue Organ Organism m m m

ifomis.org7 FUNCTIONAL GENOMICS proteomics, reactomics, metabonomics, phenomics, behaviouromics, toxicopharmacogenomics …

ifomis.org8 The incompatibilities between different scientific cultures and terminologies immunology genetics cell biology

ifomis.org9 have resurrected the problem of the unity of science in a new guise: The logical positivist solution to this problem addressed a world in which sciences are associated with printed texts. What happens when sciences are associated with databases?

ifomis.org10 … when each (chemical, pathological, immunological, toxicological) information system uses its own classifications how can we overcome the incompatibilities which become apparent when data from distinct sources are combined?

ifomis.org11 Answer: “Ontology”

ifomis.org12 = building software artefacts standardized classification systems/ controlled vocabularies so that data from one source should be expressed in a language which makes it compatible with data from every other source

ifomis.org13 Google hits (in millions) ontology 52.4 ontology + philosophy 2.7 ontology + information science 6.0 ontology + database 7.8

ifomis.org14 A Linnaean Species Hierarchy

ifomis.org15 (Small) Disease Hierarchy

ifomis.org16 Combining hierarchies Organisms Diseases

ifomis.org17 via Dependence Relations Organisms Diseases

ifomis.org18 A Window on Reality

ifomis.org19 A Window on Reality Organisms Diseases

ifomis.org20 A Window on Reality

ifomis.org21 How to understand species (aka types, universals, kinds) Species are something like invariants in reality which can be studied by science Species have instances: this mouse, this cell, this cell membrane...

ifomis.org22 Entity =def anything which exists, including things and processes, functions and qualities, beliefs and actions, documents and software

ifomis.org23 Domain =def a portion of reality that forms the subject- matter of a single science or technology or mode of study; proteomics radiology viral infections in mouse

ifomis.org24 Representation =def an image, idea, map, picture, name or description... of some entity or entities.

ifomis.org25 Analogue representations

ifomis.org26 Representational units =def terms, icons, photographs, identifiers... which refer, or are intended to refer, to entities

ifomis.org27 Composite representation =def representation (1) built out of representational units which (2) form a structure that mirrors, or is intended to mirror, the entities in some domain

ifomis.org28 Periodic Table The Periodic Table

ifomis.org29 Ontologies are here

ifomis.org30 Ontologies are representational artifacts

ifomis.org31 What do ontologies represent?

ifomis.org32 A515287DC3300 Dust Collector Fan B521683Gilmer Belt C521682Motor Drive Belt

ifomis.org33 A515287DC3300 Dust Collector Fan B521683Gilmer Belt C521682Motor Drive Belt instances types

ifomis.org34 Two kinds of composite representational artifacts Databases, inventories: represent what is particular in reality = instances Ontologies, terminologies, catalogs: represent what is general in reality = types

ifomis.org35 What do ontologies represent?

ifomis.org36 Ontologies do not represent concepts in people’s heads

ifomis.org37 Ontology is a tool of science Scientists do not describe the concepts in scientists’ heads They describe the types in reality, as a step towards finding ways to reason about (and treat) instances of these types

ifomis.org38 The biologist has a cognitive representation which involves theoretical knowledge derived from textbooks

ifomis.org39 An ontology is like a scientific text; it is a representation of types in reality

ifomis.org40 Two kinds of composite representational artifacts Databases represent instances Ontologies represent types

ifomis.org41 Instances stand in similarity relations Frank and Bill are similar as humans, mammals, animals, etc. Human, mammal and animal are types at different levels of granularity

ifomis.org42 siamese mammal cat organism substance types animal instances frog

ifomis.org43 science needs to find uniform ways of representing types ontology =def a representational artifact whose representational units (which may be drawn from a natural or from some formalized language) are intended to represent 1. types in reality 2. those relations between these types which obtain universally (= for all instances) lung is_a anatomical structure lobe of lung part_of lung

ifomis.org44 is_a A is_a B =def For all x, if x instance_of A then x instance_of B cell division is_a biological process

ifomis.org45 Entities

ifomis.org46 Entities universals (species, types, taxa, …) particulars (individuals, tokens, instances)

ifomis.org47 Canonical instances within the realm of individuals = those individuals which 1. instantiate universals (entering into biological laws) 2. are prototypical  Canonical Anatomy: no Siamese twins, no six-fingered giants, no amputation stumps, …

ifomis.org48 Entities universals instances junk example of junk particulars: desk-mountain

ifomis.org49 Entities human Jane inst

ifomis.org50 Ontologies are More than Just Taxonomies

ifomis.org51 The Gene Ontology 7 million google hits a cross-species controlled vocabulary for annotations of genes and gene products deeper than Darwinianism

ifomis.org52 When a gene is identified three important types of questions need to be addressed: 1. Where is it located in the cell? 2. What functions does it have on the molecular level? 3. To what biological processes do these functions contribute?

ifomis.org53 GO has three ontologies molecular functions cellular components biological processes

ifomis.org54 GO astonishingly influential used by all major species genome projects used by all major pharmacological research groups used by all major bioinformatics research groups

ifomis.org55 GO part of the Open Biological Ontologies consortium Fungal Ontology Plant Ontology Yeast Ontology Disease Ontology Mouse Anatomy Ontology Cell Ontology Sequence Ontology Relations Ontology

ifomis.org56 Each of GO’s ontologies is organized in a graph-theoretical structure involving two sorts of links or edges: is-a (= is a subtype of ) (copulation is-a biological process) part-of (cell wall part-of cell)

ifomis.org57

ifomis.org58 The Gene Ontology a ‘controlled vocabulary’ designed to standardize annotation of genes and gene products used by over 20 genome database and many other groups in academia and industry and methodology much imitated

ifomis.org59 The Methodology of Annotations Scientific curators use experimental observations reported in the biomedical literature to link gene products with GO terms in annotations. The gene annotations taken together yield a slowly growing computer-interpretable map of biological reality, The process of annotating literature also leads to improvements and extensions of the ontology, which institutes a virtuous cycle of improvement in the quality and reach of future annotations and of the ontology itself. The Gene Ontology as Cartoon

ifomis.org60 cellular components molecular functions biological processes 1372 component terms 7271 function terms 8069 process terms

ifomis.org61 The Cellular Component Ontology (counterpart of anatomy) membrane nucleus

ifomis.org62 The Molecular Function Ontology protein stabilization The Molecular Function ontology is (roughly) an ontology of actions on the molecular level of granularity

ifomis.org63 Biological Process Ontology death An ontology of occurrents on the level of granularity of cells, organs and whole organisms

ifomis.org64 GO here an example a.of the sorts of problems confronting life science data integration b.of the degree to which formal methods are relevant to the solution of these problems

ifomis.org65 Each of GO’s ontologies is organized in a graph-theoretical data structure involving two sorts of links or edges: is-a (= is a subtype of ) (copulation is-a biological process) part-of (cell wall part-of cell)

ifomis.org66 Linnaeus

ifomis.org67

ifomis.org68 Entities

ifomis.org69 Entities universals (kinds, types, taxa, …) particulars (individuals, tokens, instances …) Axiom: Nothing is both a universal and a particular

ifomis.org70 Entities universals* *natural, biological, kinds

ifomis.org71 Entities universals instances

ifomis.org72 universals are natural kinds Instances are natural exemplars of natural kinds (problem of non-standard instances) Not all individuals are instances of universals

ifomis.org73 Entities universals instances penumbra of borderline cases

ifomis.org74 Entities universals instances junk example of junk: beachball-desk

ifomis.org75 Primitive relations: inst and part inst(Jane, human being) part(Jane’s heart, Jane’s body) A universal is anything that is instantiated An instance as anything (any individual) that instantiates some universal

ifomis.org76 Entities human Jane inst

ifomis.org77 A is_a B genus(B) species(A) instances

ifomis.org78 is-a D3* e is a f =def universal(e)  universal(f)   x (inst(x, e)  inst(x, f)). genus(A)=def universal(A)   B (B is a A  B  A) species(A)=def universal(A)   B (A is a B  B  A)

ifomis.org79 solve problem of false positives insist that A is_a B holds always as a matter of scientific law

ifomis.org80 nearest species nearestspecies(A, B)= def A is_a B &  C ((A is_a C & C is_a B)  (C = A or C = B) B A

ifomis.org81 Definitions highest genus lowest species instances

ifomis.org82 Lowest Species and Highest Genus lowestspecies(A)= def species(A) & not-genus(A) highestgenus(A)= def genus(A) & not-species(A) Theorem: universal(A)  (genus(A) or lowestspecies(A))

ifomis.org83 Axioms Every universal has at least one instance Distinct lowest species never share instances SINGLE INHERITANCE: Every species is the nearest species to exactly one genus

ifomis.org84 Axioms governing inst genus(A) & inst(x, A)   B nearestspecies(B, A) & inst(x, B) EVERY GENUS HAS AN INSTANTIATED SPECIES nearestspecies(A, B)  A’s instances are properly included in B’s instances EACH SPECIES HAS A SMALLER CLASS OF INSTANCES THAN ITS GENUS

ifomis.org85 Axioms nearestspecies(B, A)   C (nearestspecies(C, A) & B  C) EVERY GENUS HAS AT LEAST TWO CHILDREN nearestspecies(B, A) & nearestspecies(C, A) & B  C)  not-  x (inst(x, B) & inst(x, C)) SPECIES OF A COMMON GENUS NEVER SHARE INSTANCES

ifomis.org86 Theorems (genus(A) & inst(x, A))   B (lowestspecies(B) & B is_a A & inst(x, B)) EVERY INSTANCE IS ALSO AN INSTANCE OF SOME LOWEST SPECIES (genus(A) & lowestspecies(B) &  x(inst(x, A) & inst(x, B))  B is_a A) IF AN INSTANCE OF A LOWEST SPECIES IS AN INSTANCE OF A GENUS THEN THE LOWEST SPECIES IS A CHILD OF THE GENUS

ifomis.org87 Theorems universal(A) & universal(B)  (A = B or A is_a B or B is_a A or not-  x(inst(x, A) & inst(x, B))) DISTINCT UNIVERSALS EITHER STAND IN A PARENT-CHILD RELATIONSHIP OR THEY HAVE NO INSTANCES IN COMMON

ifomis.org88 Theorems A is_a B & A is_a C  (B = C or B is_a C or C is_a B) UNIVERSALS WHICH SHARE A CHILD IN COMMON ARE EITHER IDENTICAL OR ONE IS SUBORDINATED TO THE OTHER

ifomis.org89 Theorems (genus(A) & genus(B) &  x(inst(x, A) & inst(x, B)))   C(C is_a A & C is_a B) IF TWO GENERA HAVE A COMMON INSTANCE THEN THEY HAVE A COMMON CHILD

ifomis.org90 Expanding the theory Sexually reproducing organisms Organisms in general To take account of development (child, adult; larva, butterfly) Biological processes Biological functions -- at different levels of granularity

ifomis.org91 How to understand species (aka types, universals, kinds) Species are something like invariants in reality which can be studied by science Species have instances: this mouse, this cell, this cell membrane...

ifomis.org92 Universal, Classes, Sets A class is the extension of universal

ifomis.org93 Class =def a maximal collection of particulars determined by a general term (‘cell’, ‘mouse’, ‘Saarländer’) the class A = the collection of all particulars x for which ‘x is A’ is true

ifomis.org94 Universals and Classes vs. Sums The former are marked by granularity: they divide up the domain into whole units, whose interior parts are traced over. The universal human being is instantiated only by human beings as single, whole units. A mereological sum is not granular in this sense (molecules are parts of the mereological sum of human beings)

ifomis.org95 A bad solution Identify both universals and classes with sets in the mathematical sense Problem of false positives adult  child lion in Leipzig  lion animal owned by the Emporer  mammal mammal weighing less than 200 Kg  animal

ifomis.org96 Sets in the mathematical sense are marked by granularity Granularity = each class or set is laid across reality like a grid consisting (1) of a number of slots or pigeonholes each (2) occupied by some member. Each set is (1) associated with a specific number of slots, each of which (2) must be occupied by some specific member. A class survives the turnover in its instances: both (1) the number of slots and (2) the individuals occupying these slots may vary with time

ifomis.org97 But sets are timeless A set is an abstract structure, existing outside time and space. The set of human beings existing at t is (timelessly) a different entity from the set of human beings existing at t because of births and deaths. Biological classes exist in time Darwin: because the universals of which they are extensions exist in time