The Gene Ontology Barry Smith March 2004.

Slides:



Advertisements
Similar presentations
Enhancing GO for the sake of clinical bionformatics Anand Kumar IFOMIS, University of Leipzig/Saarbrücken.
Advertisements

ARCHITECTURES FOR ARTIFICIAL INTELLIGENCE SYSTEMS
Nucleus Control center of the cell –contains the “genetic library” encoded in the sequences of nucleotides in molecules of DNA code for the amino acid.
Knowledge Representation
What is Ontology? Dictionary:A branch of metaphysics concerned with the nature and relations of being. Barry Smith:The science of what is, of the kinds.
Genome organization Lesk, Ch 2 (Lesk, 2008). Genomes and proteomes Genome of a typical bacterium comes as a single DNA molecule of about 5 million characters.
Gene Ontology John Pinney
Chapter 1 What is a Cell? By Benjamin Lewin. 1.1 Introduction Cells arise only from preexisting cells. Every cell has genetic information whose expression.
3.1 An overview of genetic possesses 3.2 The basis of hereditary 3.3 DNA replication 3.4 RNA and protein synthesis 3.5 Gene expression.
The Role of Foundational Relations in the Alignment of Biomedical Ontologies Barry Smith and Cornelius Rosse.
1 Ontology in 15 Minutes Barry Smith. 2 Main obstacle to integrating genetic and EHR data No facility for dealing with time and instances (particulars)
Medical Ontologies: An Overview Barry Smith
Thomas Bittner and Barry Smith IFOMIS (Saarbrücken) Normalizing Medical Ontologies Using Basic Formal Ontology.
The Ontology of the Gene Ontology Barry Smith Jennifer Williams Steffen Schulze-Kremer
STOP Barry Smith Smart Terminologies via Ontological Principles.
On the Application of Formal Principles to Life Science Data: A Case Study in the Gene Ontology Barry Smith * Jacob Köhler † Anand Kumar * *
Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at
1 Logical Tools and Theories in Contemporary Bioinformatics Barry Smith
Internet tools for genomic analysis: part 2
ifomis.de 1 Outline Part 0: HL7 RIM Part 1: Survey of GO and its problems Part 2: Extending GO to make a full ontology Part 3: Conclusion.
Reference Ontologies, Application Ontologies, Terminology Ontologies Barry Smith
Chapter 7 Cell Structure and Function
Vocabulary Review Cells.
Meaghan Malcolm Madi THE NUCLEUS/NUCLEOLUS. THE NUCLEUS So what is the nucleus? The nucleus is essentially the brain of the cell. It’s job is to control.
Chapter 3 The Biological Basis of Life. Chapter Outline  The Cell  DNA Structure  DNA Replication  Protein Synthesis  What is a Gene?  Cell Division:
BASIC BIOCHEMISTRY MLAB Introduction. INTRODUCTION TO BASIC BIOCHEMISTRY Biochemistry can be defined as the science concerned with the chemical.
Chapter 6 System Engineering - Computer-based system - System engineering process - “Business process” engineering - Product engineering (Source: Pressman,
Unit 4 Vocabulary Review. Nucleic Acids Organic molecules that serve as the blueprint for proteins and, through the action of proteins, for all cellular.
AP Biology Control of Eukaryotic Genes.
Ontology Development Kenneth Baclawski Northeastern University Harvard Medical School.
Cell membranes, Membrane lipids, Membrane proteins.
GO and OBO: an introduction. Jane Lomax EMBL-EBI What is the Gene Ontology? What is OBO? OBO-Edit demo & practical What is the Gene Ontology? What is.
Learning Outcome B1. Analyze the functional inter-relationship of cell structures.
Introduction to the Cell Chapter 7.1. Introduction to the Cell Cell- smallest unit of matter that can carry on all of the processes of life Hooke and.
Raven - Johnson - Biology: 6th Ed. - All Rights Reserved - McGraw Hill Companies Control of Gene Expression Copyright © McGraw-Hill Companies Permission.
Studying Life Vodcast 1.3 Unit 1: Introduction to Biology.
Cells A Comparison of Eukaryotes and Prokaryotes.
LOGIC AND ONTOLOGY Both logic and ontology are important areas of philosophy covering large, diverse, and active research projects. These two areas overlap.
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
Ontological Foundations of Biological Continuants Stefan Schulz, Udo Hahn Text Knowledge Engineering Lab University of Jena (Germany) Department of Medical.
Cell Structure.
Sharing Ontologies in the Biomedical Domain Alexa T. McCray National Library of Medicine National Institutes of Health Department of Health & Human Services.
VR. Formal Principles for Biomedical Ontologies Barry Smith
Organelles Biology I. Plant Cell Cellular boundaries Cell membrane: thin flexible barrier around cells Cell Wall: An inflexible barrier protects the.
Central dogma: the story of life RNA DNA Protein.
Research! Look up each antibiotic from our lab and describe what it does to bacteria. As a group, draw a picture that you think best represents.
Vocabulary Review Cells. Smallest Unit of Life CELL.
Mining the Biomedical Research Literature Ken Baclawski.
Development and Genes Part 1. 2 Development is the process of timed genetic controlled changes that occurs in an organism’s life cycle. Mitosis Cell differentiation.
Copyright © 2002 Pearson Education, Inc., publishing as Benjamin Cummings Fig To summarize, at the replication fork, the leading stand is copied.
Formal Specification: a Roadmap Axel van Lamsweerde published on ICSE (International Conference on Software Engineering) Jing Ai 10/28/2003.
Cell Growth Does an animal get larger because each cell increases in size or because it produces more of them? Mature red blood cells are not capable of.
1 A View of Life. 2 The Human Genome Project mapped the complete set of human genes Genomics analyses the DNA sequence of an organism While genomics will.
1 An Introduction to Ontology for Scientists Barry Smith University at Buffalo
DNA and the Genetic Code 46 molecules of DNA are located in the nucleus of all cells in the human body except sperm and oocytes –23 molecules are inherited.
Life Science. Explain that cells are the basic unit of structures and function of living organisms. Cells are the basic unit of structures of living organisms.
Tools in Bioinformatics Ontologies and pathways. Why are ontologies needed? A free text is the best way to describe what a protein does to a human reader.
Protein. Protein and Roles 1: biological process unknown 1.1 Structural categories 1.2 organism categories 1.3 cellular component o unlocalized.
A Tour of the Cell Chapter 6. Objectives Be familiar with the regions of the cell and the overall role of each region Be familiar with the basic organelles.
Cells Organelles Specialization Communication. What is Alive? All living things are:  Made of cells  Obtain energy  Metabolize  Evolve  Respond 
Lesson 4- Gene Expression PART 2 - TRANSLATION. Warm-Up Name 10 differences between DNA replication and transcription.
+ Cell checkpoints and Cancer. + Introduction Catastrophic genetic damage can occur if cells progress to the next phase of the cell cycle before the previous.
Integrating SysML with OWL (or other logic based formalisms)
Chapter 11 and 12 Mitosis and Meiosis
Chromosomes and Cellular Reproduction
Ontology in 15 Minutes Barry Smith.
What is an Ontology An ontology is a set of terms, relationships and definitions that capture the knowledge of a certain domain. (common ontology ≠ common.
Ontology in 15 Minutes Barry Smith.
What is Ontology? s Dictionary:A branch of metaphysics concerned with the nature and relations of being. Barry Smith:The science of what is, of.
Chapter 11 and 12 Mitosis and Meiosis
Presentation transcript:

The Gene Ontology Barry Smith March 2004

ifomis.de 2 Complexity of biological structures About 30,000 genes in a human Probably ,000 proteins Individual variation in most genes 100s of cell types 100,000s of disease types 1,000,000s of biochemical pathways (including disease pathways)

ifomis.de 3 DNA Protein Organelle Cell Tissue Organ Organism m m Scales of anatomy m

ifomis.de 4 The Challenge Each (clinical, pathological, genetic, proteomic, pharmacological …) information system uses its own terminology and category system biomedical research demands the ability to navigate through all such information systems How can we overcome the incompatibilities which become apparent when data from distinct sources is combined?

ifomis.de 5 Answer: “Ontology”

ifomis.de 6 Three levels of ontology 1)formal (top-level) ontology dealing with categories employed in every domain: object, event, whole, part, instance, class 2) domain ontology, applies top-level system to a particular domain cell, gene, drug, disease, therapy 3) terminology-based ontology large, lower-level system Dupuytren’s disease of palm, nodules with no contracture

ifomis.de 7 Three levels of ontology 1)formal (top-level) ontology dealing with categories employed in every domain: object, event, whole, part, instance, class 2) domain ontology, applies top-level system to a particular domain cell, gene, drug, disease, therapy 3) terminology-based ontology large, lower-level system Dupuytren’s disease of palm, nodules with no contracture

ifomis.de 8 Three levels of ontology 1)formal (top-level) ontology dealing with categories employed in every domain: object, event, whole, part, instance, class 2) domain ontology, applies top-level system to a particular domain cell, gene, drug, disease, therapy 3) terminology-based ontology large, lower-level system Dupuytren’s disease of palm, nodules with no contracture

ifomis.de 9 Compare: 1)pure mathematics (re-usable theories of structures such as order, set, function, mapping) 2)applied mathematics, applications of these theories = re-using the same definitions, theorems, proofs in new application domains 3)physical chemistry, biophysics, etc. = adding detail

ifomis.de 10 Three levels of biomedical ontology 1)formal (top-level) ontology = biomedical ontology has nothing like the technology of re-usable definitions, theorems and proofs provided by pure mathematics 2) domain ontology = e.g. GO, the Gene Ontology 3) terminology-based ontologies = ICD-10, UMLS, SNOMED-CT, GALEN, FMA ?????

ifomis.de 11 Outline Part 1: Survey of GO and its problems Part 2: Extending GO to make a full ontology Part 3: Conclusion

ifomis.de 12 Part One Survey of GO

ifomis.de 13 GO is three large telephone directories of terms used in annotating genes and gene products ‘annotating’ = indexing GO is a ‘controlled vocabulary’ – proximate goal: to standardize reporting of biological results ultimate goal: to unify biology / bio-informatics

ifomis.de 14 GO an impressive achievement used by over 20 genome database and many other groups in academia and industry methodology much imitated now part of OBO (open biological ontologies) consortium

ifomis.de 15 GO here used as an example a.of the sorts of problems faced by current biomedical informatics b.of the degree to which philosophy and logic are relevant to the solution of these problems

ifomis.de 16 GO is three ontologies cellular components molecular functions biological processes December 16, 2003: 1372 component terms 7271 function terms 8069 process terms

ifomis.de 17 Michael Ashburner: GO’s philosophy from the beginning was ‘just in time’ - that is, we made no great attempt to ‘complete’ the ontologies …. If you try and ‘complete’ an ontology, or worse: try and ‘get it right,’ then you will fail …

ifomis.de 18 GO built by biologists Gene “Ontology” Gene “Statistic”

ifomis.de 19 When a gene is identified three important types of questions need to be addressed: 1. Where is it located in the cell? 2. What functions does it have on the molecular level? 3. To what biological processes do these functions contribute?

ifomis.de 20 GO’s three ontologies molecular functions cellular components biological processes

ifomis.de 21 GO confined to what annotations can be associated with genes and gene products (proteins …)

ifomis.de 22 The Cellular Component Ontology (counterpart of anatomy) flagellum chromosome membrane cell wall nucleus

ifomis.de 23 The Cellular Component Ontology (counterpart of anatomy) “Generally, a gene product is located in or is a subcomponent of a particular cellular component.” Cellular components are independent continuants (= they endure through time while undergoing changes of various sorts)

ifomis.de 24 The Molecular Function Ontology ice nucleation protein stabilization kinase activity binding The Molecular Function ontology is (roughly) an ontology of actions on the molecular level of granularity

ifomis.de 25 DNA Protein Organelle Cell Tissue Organ Organism m m Scales of anatomy m

ifomis.de 26 Molecular Function Definition: An activity or task performed by a gene product. It often corresponds to something (such as a catalytic activity) that can be measured in vitro. GO confuses function with functioning

ifomis.de 27 Biological Process Ontology Examples: glycolysis death adult walking behavior response to blue light = occurrents on the level of granularity of organs and whole organisms

ifomis.de 28 Biological Process Definition: A biological process is a biological goal that requires more than one function. Mutant phenotypes often reflect disruptions in biological processes.

ifomis.de 29 Each of GO’s ontologies is organized in a graph-theoretical structure involving two sorts of links or edges: is-a (= is a subtype of ) (copulation is-a biological process) part-of (cell wall part-of cell)

ifomis.de 30

ifomis.de 31 Primary aim not rigorous definition and principled classification but rather: to provide a practically useful framework for keeping track of the biological annotations that are applied to gene products

ifomis.de 32 GO’s graph-theoretic architecture designed to help human annotators to locate the designated terms for the features associated with specific genes

ifomis.de 33 GO is a ‘controlled vocabulary’ designed to ensure that the same terms are used by different research groups with the same meanings

ifomis.de 34 Principle of Univocity terms should have the same meanings (and thus point to the same referents) on every occasion of use

ifomis.de 35 Principle of Compositionality The meanings of compound terms should be determined 1. by the meanings of component terms together with 2. the rules governing syntax

ifomis.de 36 The story of ‘ / ’

ifomis.de 37 / GO: microtubule/kinetochore interaction =df Physical interaction between microtubules and chromatin via proteins making up the kinetochore complex

ifomis.de 38 / GO: ciliary/flagellar motility =df Locomotion due to movement of cilia or flagella.

ifomis.de 39 / GO: negative regulation of chromatin assembly/disassembly =df Any process that stops, prevents or reduces the rate of chromatin assembly and/or disassembly

ifomis.de 40 / GO: G1/S transition of mitotic cell cycle =df Progression from G1 phase to S phase of the standard mitotic cell cycle.

ifomis.de 41 / GO: interpretation of nuclear/cytoplasmic to regulate cell growth =df The process where the size of the nucleus with respect to its cytoplasm signals the cell to grow or stop growing.

ifomis.de 42 / GO: hexuronate (glucuronate/galacturonate) porter activity =df Catalysis of the reaction: hexuronate(out) + cation(out) = hexuronate(in) + cation(in)

ifomis.de 43 comma lactose, galactose: hydrogen symporter activity male courtship behavior (sensu Insecta), wing vibration

ifomis.de 44 Principle of Positivity Class names should be positive. Logical complements of classes are not themselves classes. (Terms such as ‘non-mammal’ or ‘non- membrane’ or ‘invertebrate’ or do not designate natural kinds.)

ifomis.de 45 Problems with negation GO has no way to express ‘not’ and no way to express ‘is localized at’) Holliday junction helicase complex is-a unlocalized

ifomis.de 46 GO: cellular component unknown cellular component unknown is-a cellular component

ifomis.de 47 Principle of Objectivity which classes exist is not a function of our biological knowledge. (Terms such as ‘unclassified’ or ‘unknown ligand’ or ‘not otherwise classified as peptides’ do not designate biological natural kinds, and nor do they designate differentia of biological natural kinds)

ifomis.de 48 Rabbit and copulation both designate natural kinds, but terms such as rabbit and copulation rabbit or copulation do not Cf. Lewis-Armstrong sparse theory of universals Veterinary proprietary drug and/or biological has 2532 children in SNOMED-CT

ifomis.de 49 Principle of Sparseness Which biological classes exist is not a matter of logic. (Biological combination is not reflected in a Boolean algebra)

ifomis.de 50 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, 2-oxoglutarate as one donor, and incorporation of one atom each of oxygen into both donors

ifomis.de 51 Is biological classification Linnaean?

ifomis.de Principle of Single Inheritance no class in a classificatory hierarchy should have more than one parent on the immediate higher level no diamonds:

ifomis.de Principle of Taxonomic Levels the terms in a classificatory hierarchy should be divided into predetermined levels (analogous to the levels of kingdom, phylum, class, order, etc., in traditional biology). ‘depth’ in GO’s hierarchies not determinate because of multiple inheritance

ifomis.de 54 Principle of Taxonomic Levels

ifomis.de 55 Principle of Exhaustiveness the classes on any given level should exhaust the domain of the classificatory hierarchy.

ifomis.de 56 Single Inheritance + Exhaustiveness = JEPD Exhaustiveness often difficult to satisfy in the realm of biological phenomena; but its acceptance as an ideal is presupposed as a goal by every scientist. Single inheritance accepted in all traditional (species-genus) classifications, now under threat because multiple inheritances is a computationally useful device (allows one to avoid certain kinds of combinatoric explosion).

ifomis.de 57 Problems with multiple inheritance B C is-a 1 is-a 2 A ‘is-a’ no longer univocal

ifomis.de 58 Problems with multiple inheritance B C is-a 1 is-a 2 A E D ‘sibling’ is no longer determinate

ifomis.de 59 ‘is-a’ is pressed into service to mean a variety of different things the resulting ambiguities make the rules for correct coding difficult to communicate to human curators they also serve as obstacles to integration with neighboring ontologies

ifomis.de 60 is-a GO’s definition: A is-a B =def every instance of A is an instance of B = standard definition of computer science (confusion of ‘class’ with ‘set’, failure to take time seriously) adult is-a child

ifomis.de 61 is-a (  ) there are times at which instances of A exist, and at all such times these instances are also instances of B animal-owned-by-the-emperor is-a animal- weighing-less-than-200-kgs

ifomis.de 62 is-a (  )A and B are natural kinds, and there are times at which instances of A exist, and at all such times these instances are also instances of B albino antelope is-a antelope susceptible to rabies

ifomis.de 63 is-a (  ) A and B are natural kinds, and there are times at which instances of A exist, and at all such times these instances are necessarily (of their very nature) also instances of B 1. eukaryotic cell is-a cell 2. terminal glycosylation is-a protein glycosylation

ifomis.de 64

ifomis.de 65 storage vacuole is-a vacuole a storage vacuole is not a special kind of vacuole a box used for storage is not a special kind of box

ifomis.de 66

ifomis.de 67 ‘within’ lytic vacuole within a protein storage vacuole lytic vacuole within a protein storage vacuole is-a protein storage vacuole time-out within a baseball game is-a baseball game embryo within a uterus is-a uterus

ifomis.de 68 Problems with Location is-located-at / is-located-in and similar relations need to be expressed in GO via some combination of ‘is-a’ and ‘part-of’ … is-a unlocalized … is-a site of … … within … … in …

ifomis.de 69 Problems with location extrinsic to membrane part-of membrane extrinsic to membrane Definition: Loosely bound, by ionic or covalent forces, to one or other surface of the cell membrane, but not integrated into the hydrophobic region.

ifomis.de 70 part-of not a mereological relation between individuals but a relation between classes

ifomis.de 71 Problems with GO’s part-of GO’s old definition of part-of: A part-of B =def A can be part of B asserted to be transitive

ifomis.de 72 Three meanings of ‘part-of ’ ‘part-of’ = ‘can be part of’ (flagellum part-of cell) ‘part-of’ = ‘is sometimes part of’ (replication fork part-of the nucleoplasm) ‘part-of’ = ‘is included as a sublist in’

ifomis.de 73 New definition of part-of There are four basic levels of restriction for a part_of relationship:

ifomis.de 74 New definition of part-of The first type has no restrictions. That is, no inferences can be made from the relationship between parent and child other than that the parent may or may not have the child as a part, and the the child may or may not be a part of the parent. The second type, 'necessarily is_part', means that wherever the child exists, it is as part of the parent: 'replication fork' is part_of 'chromosome', so whenever 'replication fork' occurs, it is as part_of 'chromosome', but 'chromosome' does not necessarily have part 'replication fork'.

ifomis.de 75 Type three, 'necessarily is_part', is the exact inverse of type two … The final type is a combination of both three and four, 'has_part' and 'is_part'.

ifomis.de 76 part-of = is necessarily part of The part_of relationship used in GO is usually type two, 'necessarily is_part'. Note that part_of types 1 and 3 are not used in GO

ifomis.de 77 Official definition term: part_of definition: Used for representing partonomies.

ifomis.de 78 Official definition term: derived_from definition: Any kind of temporal relationship, such as derived_from, translated_from

ifomis.de 79 Problems with GO’s definitions GO: : cell fate commitment Definition: The commitment of cells to specific cell fates and their capacity to differentiate into particular kinds of cells. x is a cell fate commitment =def x is a cell fate commitment and p

ifomis.de 80 rules for definitions intelligibility: the terms used in a definition should be simpler (more intelligible) than the term to be defined definitions: do not confuse definitions with the communication of new knowledge

ifomis.de 81 Principle of Substitutability in all extensional contexts a defined term should be substitutable by its definition in such a way that the result is both grammatically correct and has the same truth-value as the sentence with which we begin

ifomis.de 82 toxin transporter activity Definition: Enables the directed movement of a toxin into, out of, within or between cells. A toxin is a poisonous compound (typically a protein) that is produced by cells or organisms and that can cause disease when introduced into the body or tissues of an organism.

ifomis.de 83 fimbrium-specific chaperone activity Definition: Assists in the correct assembly of fimbria, extracellular organelles that are used to attach a bacterial cell to a surface, but is not a component of the fimbrium when performing its normal biological function.

ifomis.de 84 Genbank a gene is a DNA region of biological interest with a name and that carries a genetic trait or phenotype

ifomis.de 85 GO’s three ontologies are separate No links or edges defined between them molecular functions cellular components biological processes

ifomis.de 86 Occurrents Both molecular function and biological process terms refer to occurrents = entities which do not endure through time but rather unfold themselves in successive temporal phases. Occurrents can be segmented into parts along the temporal dimension. Continuants exist in toto in every instant at which they exist at all.

ifomis.de 87 Three granularities: Molecular (for ‘functions’) Cellular (for components) Whole organism (for processes)

ifomis.de 88 GO does not include molecules or organisms within any of its three ontologies The only continuant entities within the scope of GO are cellular components (including cells themselves)

ifomis.de 89 Are the relations between functions and processes a matter of granularity? Molecular activities are the building blocks of biological processes ? But they cannot be represented in GO as parts of biological processes

ifomis.de 90 GO does not recognize parthood relations between entities on its three distinct levels of granularity Compare: this wheel is part of the car this molecule is part of the car

ifomis.de 91 Functions ‘The functions of a gene product are the jobs it does or the “abilities” it has’

ifomis.de 92 Functions chaperone activity motor activity catalytic activity signal transducer activity structural molecule activity transporter activity binding antioxidant activity chaperone regulator activity enzyme regulator activity transcription regulator activity triplet codon-amino acid adaptor activity translation regulator activity nutrient reservoir activity

ifomis.de 93 Appending function terms with ‘activity’ In 2003 all GO molecular function terms were appended … with the word 'activity'. structural constituent of bone structural constituent of cuticle structural constituent of cytoskeleton structural constituent of epidermis structural constituent of eye lens structural constituent of muscle structural constituent of nuclear pore structural constituent of ribosome structural constituent of tooth enamel

ifomis.de 94 terms appended with ‘activity’ … because GO molecular functions are what philosophers would call 'occurrents', meaning events, processes or activities, rather than 'continuants' which are entities e.g. organisms, cells, or chromosomes. The word activity helps distinguish between the protein and the activity of that protein, for example, nuclease and nuclease activity. In fact, a molecular 'function' is distinct from a molecular 'activity'. A function is the potential to perform an activity, whereas an activity is the realisation, the occurrence of that function; so in fact, 'molecular function' might more properly be renamed 'molecular activity'. However, for reasons of consistency and stability, the string 'molecular function' endures.

ifomis.de 95

ifomis.de 96 Part Two Extending GO to make a full ontology

ifomis.de 97 toxin transporter activity Definition: Enables the directed movement of a toxin into, out of, within or between cells. A toxin is a poisonous compound (typically a protein) that is produced by cells or organisms and that can cause disease when introduced into the body or tissues of an organism.

ifomis.de 98 Some formal ontology Components are independent continuants Functions are dependent continuants (the function of an object exists continuously in time, just like the object which has the function; and it exists even when it is not being exercised) Processes are (dependent) occurrents

ifomis.de 99 GO must be linked with other, neighboring ontologies GO has: adult walking behavior but not adult GO has: eye pigmentation but not eye GO has: response to blue light but not light (or blue) 94% of words used in GO terms are not GO terms

ifomis.de 100 Principle of Dependence If an ontology recognizes a dependent entity then it (or a linked ontology) should recognize also the relevant class of bearers

ifomis.de 101 Linking to external ontologies can also help to link together GO’s own three separate parts

ifomis.de 102 GO’s three ontologies molecular functions cellular components biological processes  dependent   independent

ifomis.de 103 GO’s three ontologies molecular functions cellular components organism- level biological processes cellular processes

ifomis.de 104 ‘part-of’; ‘is dependent on’ molecular functions molecule complexe s cellular processes cellular components organism- level biological processes organisms

ifomis.de 105 part-of: is dependent on:

ifomis.de 106 molecular functions molecule complexe s cellular processes cellular components organism- level biological processes organisms

ifomis.de 107 molecule complexe s cellular component s molecular function s cellular functions organism- level biological functions organisms molecular processe s cellular processes organism- level biological processes

ifomis.de 108 molecule complexe s cellular component s molecular function s cellular functions organism- level biological functions organisms molecular processe s cellular processes organism- level biological processes functionings

ifomis.de 109 Human beings know what ‘walking’ means Human beings know that adults are older than embryos GO needs to be linked to ontology of development and in general to resources for reasoning about time and change

ifomis.de 110 but such linkages are possible only if GO itself has a coherent formal architecture

ifomis.de 111

ifomis.de 112 Is this all just philosophy ?

ifomis.de 113 Human consequences of inconsistent and/or indeterminate use of operators such as ‘/ ’ 29% of GO’s contain one or more problematic syntactic operators but these terms are used in only 14% of annotations Hypothesis: reflects the fact that poorly defined operators are not well understood by annotators, who thus avoid the corresponding terms

ifomis.de 114 Computational consequences of inconsistent and/or indeterminate use of operators The information captured by GO through its use of problematic syntactic operators is not available for purposes of information retrieval

ifomis.de 115 Problems caused by GO’s formal incoherence 1. Coding errors  constant updating 2. Need for expert knowledge (which computers do not have access to) 3. Obstacles to ontology integration

ifomis.de 116 Problems caused by GO’s formal incoherence 4. It is unclear what kinds of reasoning are permissible on the basis of GO’s hierarchies. 5. The rationale of GO’s subclassifications is unclear. 6. No procedures are offered by which GO can be validated.

ifomis.de 117 Quality assurance and ontology maintenance must be automated As GO increases in size and scope it will “be increasingly difficult to maintain the semantic consistency we desire without software tools that perform consistency checks and controlled updates”

ifomis.de 118 The End