Domain and Family Ontology Building an ontology of protein domains and families to support GO classification https://www.ebi.ac.uk/panda/jira/browse/GO-144.

Slides:



Advertisements
Similar presentations
Using Ontology Reasoning to Classify Protein Phosphatases K.Wolstencroft, P.Lord, L.tabernero, A.brass, R.stevens University of Manchester.
Advertisements

Bridging GO, Uberon and multiple species specific anatomy ontologies.
FP7 meeting - Gent - Carlos Rodríguez - April 18 WP4: Conceptual Mining from Text for Knowledge Engineering State of the Art WP Coordinators: Alfonso Valencia.
+ OWL for annotators David Osumi-Sutherland. + What is OWL? Web Ontology Language Can express everything in OBO and more. Certified web standard Fast.
Building ontologies using Jenkins. Changing requirements for ontology engineering Original ontology build pipeline – What pipeline? – Life on the bleeding.
Higgs bosons, mars missions, and unicorn delusions: How to deal with terms of dubious reference in scientific ontologies Stefan Schulz a,b Mathias Brochhausen.
FADO: Family and Domain Ontology Building an ontology of protein domains and families to support GO classification
PRO-PO-GO Alexander Diehl The Gene Ontology Cellular Component – Subcellular structures, locations, and macromolecular complexes including protein.
+ From OBO to OWL and back again – a tutorial David Osumi-Sutherland, Virtual Fly Brain/FlyBase Chris Mungall – GO/LBL.
Application of OBO Foundry Principles in GO Chris Mungall Lawrence Berkeley Labs NCBO GO Consortium.
Automated tools to help construction of Trait Ontologies Chris Mungall Monarch Initiative Gene.
Real-life ontology development: lessons from the Gene Ontology.
The cell ontology covers multiple taxonomic contexts OntologyContext# of links to cl MAadult mouse1 FMAadult human658 XAOfrog63 AAOamphibian ZFAzebrafish391.
Today’s menu: -UniProt - SwissProt/TrEMBL -PROSITE -Pfam -Gene Onltology Protein and Function Databases Tutorial 7.
Today’s menu: -UniProt - SwissProt/TrEMBL -PROSITE -Pfam -Gene Onltology Protein and Function Databases Tutorial 7.
What is an Ontology? AmphibiaTree 2006 Workshop Saturday 8:45–9:15 A. Maglia.
BLOSUM Information Resources Algorithms in Computational Biology Spring 2006 Created by Itai Sharon.
GO Ontology Editing Workshop: Using Protege and OWL Hinxton Jan 2012.
Noctua Development 2015 Noctua: Express yourself. Clearly.
1 SRI International Bioinformatics Advanced PGDB Editing: Regulation GO Terms Ingrid M. Keseler Bioinformatics Research Group SRI International
Integration of PRO and UniProtKB Amherst, NY May 16, 2013 Cathy H. Wu, Ph.D. PRO-PO-GO Meeting.
Computational Biology and Informatics Laboratory Development of an Application Ontology for Beta Cell Genomics Based On the Ontology for Biomedical Investigations.
Managing Information Quality in e-Science using Semantic Web technology Alun Preece, Binling Jin, Edoardo Pignotti Department of Computing Science, University.
The Foundational Model of Anatomy and its Ontological Commitment(s) Stefan Schulz University Medical Center, Freiburg, Germany FMA in OWL meeting November.
Of 39 lecture 2: ontology - basics. of 39 ontology a branch of metaphysics relating to the nature and relations of being a particular theory about the.
CSE-291: Ontologies in Data & Process Integration Department of Computer Science & Engineering University of California, San Diego CSE-291: Ontologies.
Imports, MIREOT Contributors: Carlo Torniai, Melanie Courtot, Chris Mungall, Allen Xiang.
Open Biomedical Ontologies. Open Biomedical Ontologies (OBO) An umbrella project for grouping different ontologies in biological/medical field –a repository.
Principles and Practice of Ontology Development: Making Definitions Computable Chris Mungall LBL.
GO Galaxy. Enrichment Enrichment analysis is a ‘killer app’ for GO – Should be more central to what we do – Also other tools: e.g. function prediction.
Michael Eckert1CS590SW: Web Ontology Language (OWL) Web Ontology Language (OWL) CS590SW: Semantic Web (Winter Quarter 2003) Presentation: Michael Eckert.
Dimitrios Skoutas Alkis Simitsis
Classifying the Diversity of Life Targets: 17. State the goals of taxonomy. 18. Describe how evolutionary biology and molecular biology influence classification.
Ontology Alignment. Ontologies in biomedical research many biomedical ontologies e.g. GO, OBO, SNOMED-CT practical use of biomedical ontologies e.g. databases.
Sunday, July 22, 2012 Plan Areas of coverage: high-level neurological system process, inc. sensory perception, sensory processing, cognition transmission.
The “über-ontology” (Uberon) Melissa Häendel, Chris Müngall, George Gkoütos Cell Ontology Workshop May, 2010.
Delivering the Power of Net-Centric Data to Ensure Mission Success UCore Semantic Layer: A Logically Enhanced (OWL) Version of the UCore Taxonomy Session.
GO terms implicitly refer to other term cysteine biosynthesis myoblast fusion hydrogen ion transporter activity snoRNA catabolism wing disc pattern formation.
The Gene Ontology Categorizer C.A. Joslyn 1, S.M. Mniszewski 1, A. Fulmer 2 and G. Heaton 3 1 Computer and Computational Sciences, Los Alamos National.
Expanding species-specific anatomy ontologies to include the cell ontology Melissa Haendel (1), Ceri Van Slyke (1), Chris Mungall (2), Peiran Song (1),
Ontology Resource Discussion
+ From OBO to OWL and back again – a tutorial David Osumi-Sutherland, Virtual Fly Brain/FlyBase Chris Mungall – GO/LBL.
Knowledge Representation. Keywordsquick way for agents to locate potentially useful information Thesaurimore structured approach than keywords, arranging.
JSON exchange format. Current GO annotation download options Tab-separated – GAF – GPAD/GPI (not available yet) XML – Pseudo RDF/XML (circa 2001) Relational.
EBI is an Outstation of the European Molecular Biology Laboratory. UniProtKB Sandra Orchard.
CSE-291: Ontologies in Data Integration Department of Computer Science & Engineering University of California, San Diego CSE-291: Ontologies in Data Integration.
Ontology domain & modeling extensions. Modeling enhancements: overview Enhancements: – Increased expressivity in ontology – Increased expressivity in.
OBO Foundry Workshop 2009 Cell Ontology (CL) Preliminary review.
InterPro Sandra Orchard.
Higgs bosons, mars missions, and unicorn delusions: How to deal with terms of dubious reference in scientific ontologies Stefan Schulz a,b Mathias Brochhausen.
Advanced Gene Selection Algorithms Designed for Microarray Datasets Limitation of current feature selection methods: –Ignores gene/gene interaction: single.
Maintaining Ontologies as They Scale Across Multiple Species Darren A. Natale Protein Information Resource.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Ccs.  Ontologies are used to capture knowledge about some domain of interest. ◦ An ontology describes the concepts in the domain and also the relationships.
T cells 34 terms for types of T cell currently exist in the Cell Ontology. Broadly divided into immature and mature subsets, and into alpha-beta and gamma-
Syntax and semantics >AMYLASEE1 TGCATNGY A very simple FASTA file.
Annotation extension meeting summary
Formal ontologies vs. triple based KR gap or convergence?
Information Organization
Functional manual annotation including GO
MedDRA and Ontology Discussion of Strategy
Mental Functioning and the Gene Ontology
Partial Integration of GO with the Ingenuity Ontology
Biological Classification: The science of taxonomy
محاضرة عامة التقنيات الحيوية (هندسة الجينات .. مبادئ وتطبيقات)
Overview Gene Ontology Introduction Biological network data
The Gene Ontology: an evolution
Advanced PGDB Editing: Gene Ontology (GO) Terms
Animating the reference terminology – showing classifiers at work
FIGURE 3.2. Small uncharged molecules can pass through membranes by simple diffusion, but hydrophilic solute cannot.
Presentation transcript:

Domain and Family Ontology Building an ontology of protein domains and families to support GO classification logical-definitions

Why do we need a DOMO?  To provide logical definitions (Equivalence Axioms) for the GO protein binding branch  What this gives us:  Auto-classification of Ontology  Auto-classification of Gene Products assigned to generic ‘protein binding’ class

Background: Annotation deepening  Ontology (GO):  Neuron fate specification EquivalentTo ‘cell fate specification’ and results_in_specification_of some neuron  Ontology (CL):  ‘sensory neuron’ SubClassOf neuron  GAF:  Tbx1 annotated to GO: ‘cell fate specification’, extension column: results_in_specification_of(CL: ) [sensory neuron]  Elk (Reasoner)  deepening step  Tbx1 annotated to ‘neuron fate specification’

Can we do this for protein binding branch of GO?  Competency Question:  Given annotations to GO: ‘protein binding’ and WITH column indicating binding partner,  Can we deepen to specific subclasses?  Approach:  Create a classification of proteins by domain family  Analogous to any other external ontology we use in GO, e.g. CL  Create equivalence axioms  E.g. “X binding” = binding and has_input some X  Treat WITH column for protein binding as if it were c16  Classify using reasoner

Why not use an existing ontology?

Connexins in PRO is_a PR: ! protein is_a PR: ! gap junction alpha-10 protein *** is_a PR: ! gap junction alpha-3 protein *** is_a PR: ! gap junction alpha-4 protein *** is_a PR: ! gap junction alpha-5 protein *** is_a PR: ! gap junction alpha-8 protein *** is_a PR: ! gap junction alpha-9 protein *** is_a PR: ! gap junction beta-1 protein *** is_a PR: ! gap junction beta-2 protein *** is_a PR: ! gap junction beta-3 protein *** is_a PR: ! gap junction beta-4 protein *** is_a PR: ! gap junction beta-5 protein *** is_a PR: ! gap junction beta-6 protein *** is_a PR: ! gap junction beta-7 protein *** is_a PR: ! gap junction gamma-1 protein *** is_a PR: ! gap junction gamma-2 protein *** is_a PR: ! gap junction gamma-3 protein *** is_a PR: ! gap junction delta-2 protein *** is_a PR: ! gap junction delta-3 protein *** is_a PR: ! gap junction delta-4 protein *** is_a PR: ! gap junction alpha-1 protein *** is_a PR: ! gap junction alpha-6 protein *** is_a PR: ! gap junction epsilon-1 protein ***

Why not just use interpro? 1.Hierarchy in IPR is highly incomplete 2.Work needs to be done to align with GO protein binding hierarchy

Connexins in InterPro

Protein phosphatases in IPR

Hybrid Approach  Use curated GO binding IPR associations (Marijn’s work)  Mix with implicit protein dom/fam ontology extracted from GO  Mix with IPR classification  Auto-create logical definitions for GO

Examples

Results  [Heiko to fill in once we make the Jenkins job that uses domo]

Future Directions  We (GO) shouldn’t be doing this  But no one else was  Whose job?  IPR?  PRO?  Happy to hand over work  So long as group produces an OWL file with labels, synonyms, classification, etc we can use that