Application of OBO Foundry Principles in GO Chris Mungall Lawrence Berkeley Labs NCBO GO Consortium.

Slides:



Advertisements
Similar presentations
Everything must be logical Terms should be in the right place with good definition and the correct relationships I need very specific terms to describe.
Advertisements

More than one way to dissect an animal Melissa Haendel ZFIN Scientific Curator.
GO : the Gene Ontology “because you know sometimes words have two meanings” Amelia Ireland GO Curator EBI, Cambridge, UK.
What is Ontology? Dictionary:A branch of metaphysics concerned with the nature and relations of being. Barry Smith:The science of what is, of the kinds.
Classification of developing structures Developing structures are typically classified by developmental biologists according to their fate. If such terms.
Weaving and untangling the GO is_a completeness ~9 slides granularity & BP ~3 slides Linking MF to BP ~15 slides Sensu ~13 slides –linguistic qualifiers.
On the Future of the NeuroBehavior Ontology and Its Relation to the Mental Functioning Ontology Barry Smith
Bridging Multiple Ontologies: Route to Representation of the Liver Immune Response Anna Maria Masci, Jeffrey Roach, Bernard de Bono, Pierre Grenon, Lindsay.
Automated tools to help construction of Trait Ontologies Chris Mungall Monarch Initiative Gene.
Ontology Notes are from:
Iowa State University Animal Science Department Bioinformatics & Computational Biology Program - 01/16/06 1 Overview of Animal Trait Ontology and PATO.
What is an ontology and Why should you care? Barry Smith with thanks to Jane Lomax, Gene Ontology Consortium 1.
Abstract The Cell Ontology (CL) is a candidate OBO Foundry 1 ontology for the representation of in vivo cell types. As part of our work in redeveloping.
1 The OBO Foundry 2 A prospective standard designed to guarantee interoperability of ontologies from the very start (contrast.
1 Logical Tools and Theories in Contemporary Bioinformatics Barry Smith
Phenotype annotation using ontologies Chris Mungall (+ BS) Berkeley Bioinformatics and Ontologies Project (BBOP) National Center for Biomedical Ontology.
Room for Lunch: Arlington Room Room for Evening Reception: Grand Prairie Room.
The RNA Ontology RNAO Colin Batchelor Neocles Leontis May 2009 Eckart, Colin and Jane In Cambridge.
1 BIOLOGICAL DOMAIN ONTOLOGIES & BASIC FORMAL ONTOLOGY Barry Smith.
Internet tools for genomic analysis: part 2
Ontological Model for Colon Carcinoma: A Case Study for Knowledge Representation in Clinical Bioinformatics Kumar A 1,2, Yip L 3, Jaremek M 2, Scheib H.
How to Organize the World of Ontologies Barry Smith 1.
The Core Infectious Disease Ontology. Purpose: To make infectious disease-relevant data deriving from different sources comparable and computable Across.
The OBO Foundry Chris Mungall Lawrence Berkeley Laboratory NCBO GO Consortium May 2007.
GO Ontology Editing Workshop: Using Protege and OWL Hinxton Jan 2012.
Automatic methods for functional annotation of sequences Petri Törönen.
PATO An ontology for phenotypes. The development of PATO is the work of George Gkoutos, supported by the NCBO, working in Cambridge.
Linking Multiple Ontologies: The OBO Foundry Approach Chris Mungall NIAID Cell Ontology Workshop May 2008.
The Plant Ontology: Linking Phenotypes and Genomics Across Plant Taxa Laurel D. Cooper* 1, Ramona L. Walls 2, Justin Elser 1, Justin Preece 1, Dennis W.
Building Biomedical Ontologies Jennifer Clark, GO Editorial Office.
GO : the Gene Ontology “because you know sometimes words have two meanings” Amelia Ireland GO Curator EBI, Cambridge, UK.
GO and OBO: an introduction. Jane Lomax EMBL-EBI What is the Gene Ontology? What is OBO? OBO-Edit demo & practical What is the Gene Ontology? What is.
The aims of the Gene Ontology project are threefold: - to compile vocabularies to describe components, functions and processes - to produce tools to query.
Amo amos amot amomus amotis amont. Happy birthday Swiss-Prot Fortaleza August 2006.
Imports, MIREOT Contributors: Carlo Torniai, Melanie Courtot, Chris Mungall, Allen Xiang.
Nancy Lawler U.S. Department of Defense ISO/IEC Part 2: Classification Schemes Metadata Registries — Part 2: Classification Schemes The revision.
Principles and Practice of Ontology Development: Making Definitions Computable Chris Mungall LBL.
Gene Ontology Consortium
The Gene Ontology: a real-life ontology, progress and future. Jane Lomax EMBL-EBI.
The Gene Ontology project Jane Lomax. Ontology (for our purposes) “an explicit specification of some topic” – Stanford Knowledge Systems Lab Includes:
Grup.bio.unipd.it CRIBI Genomics group Erika Feltrin PhD student in Biotechnology 6 months at EBI.
TermGenie – Granting Biocurators’ Wishes for the GeneOntology BioCurator Meeting 2013 Heiko Dietze – Lightning Talk.
Ontology of Disease and the OBO Foundry Chris Mungall NCBO GO Nov 2006.
Sunday, July 22, 2012 Plan Areas of coverage: high-level neurological system process, inc. sensory perception, sensory processing, cognition transmission.
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
Gene Onotology Part 1: what is the GO? Harold J Drabkin Senior Scientific Curator The Jackson Laboratory Mouse Genome Informatics.
The “über-ontology” (Uberon) Melissa Häendel, Chris Müngall, George Gkoütos Cell Ontology Workshop May, 2010.
Proposed formal definition of a stage A part of the lifetime event of an entity –Lifetime event of a cell (e.g. cell cycle) –Lifetime event of an organism.
GO terms implicitly refer to other term cysteine biosynthesis myoblast fusion hydrogen ion transporter activity snoRNA catabolism wing disc pattern formation.
The Plant Ontology: Development of a Reference Ontology for all Plants Plant Ontology Consortium Members and Curators*: Laurel D.
Other biological databases and ontologies. Biological systems Taxonomic data Literature Protein folding and 3D structure Small molecules Pathways and.
To Boldly GO… Amelia Ireland GO Curator EBI, Hinxton, UK.
2 3 where in the body ? where in the cell ?
What is an Ontology? A representation of knowledge in a domain In theory Thomas Gruber (1993) “An ontology is a formal, explicit specification of a shared.
About ontologies Melissa Haendel. And who am I that I am giving you this talk? Melissa Haendel Anatomist, developmental neuroscientist, molecular biologist,
Anatomy Ontology Community Melissa Haendel. The OBO Foundry More than just a website, it’s a community of ontology developers.
Expanding species-specific anatomy ontologies to include the cell ontology Melissa Haendel (1), Ceri Van Slyke (1), Chris Mungall (2), Peiran Song (1),
Scope of the Gene Ontology Vocabularies. Compile structured vocabularies describing aspects of molecular biology Describe gene products using vocabulary.
1 An Introduction to Ontology for Scientists Barry Smith University at Buffalo
Tools in Bioinformatics Ontologies and pathways. Why are ontologies needed? A free text is the best way to describe what a protein does to a human reader.
Neil A. Campbell Brad Williamson Robin J. Heyden Created by Roxanne Leitner.
Building Ontologies with Basic Formal Ontology Barry Smith May 27, 2015.
The Gene Ontology Project
Mental Functioning and the Gene Ontology
Many GO terms are implicitly composite
The Gene Ontology: an evolution
GO/PO interconnections
What is Ontology? s Dictionary:A branch of metaphysics concerned with the nature and relations of being. Barry Smith:The science of what is, of.
OBO Foundry Update: April 2010
Presentation transcript:

Application of OBO Foundry Principles in GO Chris Mungall Lawrence Berkeley Labs NCBO GO Consortium

The GO is 3 ontologies Molecular function (MF) Biological Process (BP) Cellular component (CC)

The GO is 3 orthogonal ontologies Molecular function –(a kind of dependent continuant) Biological Process –(a kind of occurrent) Cellular component –(a kind of independent continuant)

The GO is 3 orthogonal ontologies of canonical biology Molecular function –(a kind of dependent continuant) Biological Process –(a kind of occurrent) Cellular component –(a kind of independent continuant) oncogenesis X fin regeneration yes acquisition of nutrients from host yes

The GO is 3 orthogonal canonical species-neutral ontologies Molecular function –many core functions Biological Process –core shared processes ( e.g. transcription) –processes specific to organism types ( e.g. fin development, fly courtship behaviour) Cellular component –prokaryotes and eukaryotes

part of the part_of tree in GO

Granularity of the GO FunctionProcessContinuant MolecularGO-MF GO-BPGO-CC (Sub?)Cellul ar GO-BPGO-CC OrganismalGO-BP

The GO is an ontology, with rich terminological features GO ‘terms’ are actually representations of types (aka kinds, universals, classes) –The actual terms (i.e. the phrases used by biologists) are attached to the representations of types as names and synonyms synonyms have linguistic relations to the GO types –exact, broad, narrow, related distinct from ontological relations between GO types –is_a, part_of GO is moving to genus-differentia style definitions –Many definitions are still dictionary-style, terminological describe how the term is used

Genus differentia definitions central nervous system morphogenesis Genus: morphogenesis Differentia: has_outcome central nervous system The process by which the anatomical structure of the central nervous system is generated and organized

The GO is both reference and application ontology The same artefact (i.e. file) is used for both ontology editing and data annotation This has worked reasonably well until now We may encourage making a distinction –application views (aka GO slims) currently only used to present a very small subset of GO consider wider use for extracting most of ontology

Relations in GO (current) part_of –conforms to RO X part_of Y: all Xs are part_of some Y (for the entirety of the duration of the existence of the X) –e.g. nucleus part_of cell (all nuclei are always part_of a cell, not all cells have a nucleus as part) for both continuants and processes –no ordering for processes is_a –sort-of conforms to RO X is_a Y: all Xs are Ys (for the entirety of Xs existence) but there are issues with is_a in GO: –is_a incompleteness –is_a polyhierarchies

is a has issues Not all GO types have is_a parents –not a problem in MF –fixed in CC (July 2006) –being fixed in BP (Sept 2006: right now, here in Seattle) Still a contentious issue? –is_a completion requires new high-level types in ontology –perceived as being too abstract by biologists –simple solution: application ontology remove high level terms in annotation view

is_a polyhierarchies is_a diamonds cause problems –tangled DAGs, easy to make mistakes Source of problems typically due to multiple axes of classification –e.g. due to composite terms Solution: –Genus - differentia (aristotelian) definitions aka cross-products [Hill et al] –Always a single genus choose consistent axis of classification –Allow classifier/reasoner to provide different views of ontology

a tangled hierarchy in GO problem: mixes (at least) two axes of classification

biosynthesis is_a metabolism

cysteine is_a serine family amino acid is_a amino acid is_a amine

cysteine is_a serine family amino acid is_a amino acid is_a serine

The solution: separate the axes cysteine biosynthesis (GO) Genus: biosynthesis Differentia: has_outcome cysteine biosynthesis (GO) metabolism (GO) cysteine (ChEBI) serine family amino acid (ChEBI) computable genus-differentia definition

Compute the subsumption DAG from the definition cysteine biosynthesis (GO) Genus: biosynthesis Differentia: has_outcome cysteine cysteine metabolism (GO) serine family amino acid biosynthesis (GO) the DAG is required for applications such as annotation search

Pre- and post- composition References to types can be pre-composed in ontology, prior to annotation –Ontology editor creates term, with ID –Use reasoners to classify the DAG automatically References to types can be post-composed (created on the fly) at annotation time –No term with ID is created Computationally, it makes no difference –provided we adhere to the genus-differentia formalism

OBO Foundry practices and pre-composition Pre-composition of terms in the ontology is good as it creates a map of biological reality, linking foundry ontologies –within reason

Examples GO Biological Process x OBO Cell –neuron migration –cone cell fate specification –T cell homeostasis –erythrocte degranulation OBO Cell is species-neutral

Current status The ability to effectively created computationally visible genus-differentia definitions is new to most OBO ontologies Soon to be created: –SO (many terms now done) –GO-BP definitions referencing OBO-Cell –OBO Disease definitions referencing FMA –And more… Difficult: –GO-BP and ChEBI (chemical entities) –GO-BP and anatomy (we need CARO!)

development in GO (current) neural tube formation neural tube development neural plate formation neural plate development GO part_of neural plate morphogenesis (is_a not shown)

development in GO (future) presumptive spinal cord neural plate neural keel neural rod neural tube spinal cord transformation_of neural tube formation neural tube development neural plate formation neural plate development has_participant AO GO part_of neural tube formation Genus: tube formation Differentia: has_outcome neural tube neural plate morphogenesis