A marriage of chemistry and biology Aligning the Gene Ontology with CHEBI.

Slides:



Advertisements
Similar presentations
Chemical named entity recognition and literature mark-up Colin Batchelor Informatics Department Royal Society of Chemistry
Advertisements

AP Biology Essential Chemistry Packet: Questions 4 & 5
The MGED Ontology: Providing Descriptors for Microarray Data Trish Whetzel Department of Genetics Center for Bioinformatics University of Pennsylvania.
The Chemical Evolution The molecules we know today are descended from the first molecules that formed life on Earth. The behavior of today's biological.
SRI International Bioinformatics 1 The consistency Checker, or Overhauling a PGDB By Ron Caspi.
Application of OBO Foundry Principles in GO Chris Mungall Lawrence Berkeley Labs NCBO GO Consortium.
Extending to the GO model OBO open biology ontologies aka - extended go - (ego)
09 / 23 / Predicting Protein Function Using Machine-Learned Hierarchical Classifiers Roman Eisner Supervisors: Duane Szafron.
Many genes have unknown function 30% have unknown function only 9% are experimentally verified The Arabidopsis Genome Initiative, Nature 2000 of the 25,498.
The RNA Ontology RNAO Colin Batchelor Neocles Leontis May 2009 Eckart, Colin and Jane In Cambridge.
Toward Making Online Biological Data Machine Understandable Cui Tao Data Extraction Research Group Department of Computer Science, Brigham Young University,
Editing Description Logic Ontologies with the Protege OWL Plugin.
Topics Covered: Data preparation Data preparation Data capturing Data capturing Data verification and validation Data verification and validation Data.
1.Review- Name four groups of organic compounds found in living things Explain- Describe at least one function of each group of organic compounds Infer-
Protein 3D-structure analysis Exercises. Practicals Find update frequency for RCSB PDB: weekly. When was the last update? How many protein structures.
New data and tools at TAIR (The Arabidopsis Information Resource)
Editing the Gene Ontology Midori A. Harris GO Editorial Office EBI, Hinxton, UK.
Warm-Up: The shaded sequence of nucleotides is for a gene from DNA that is similar to what you might find from a living human (Living DNA). The rest are.
Enrolment Services – Class Scheduling Fall 2014 Course Combinations.
Community Ontology Development Lessons from the Gene Ontology.
Unit 4: Biochemistry Basic Chemistry.
Web Apollo and the VectorBase user community Gloria I. Giraldo-Calderón March 31, 2015.
EBI is an Outstation of the European Molecular Biology Laboratory. ChEBI: The story so far Paula de Matos.
Grup.bio.unipd.it CRIBI Genomics group Erika Feltrin PhD student in Biotechnology 6 months at EBI.
TermGenie – Granting Biocurators’ Wishes for the GeneOntology BioCurator Meeting 2013 Heiko Dietze – Lightning Talk.
Sunday, July 22, 2012 Plan Areas of coverage: high-level neurological system process, inc. sensory perception, sensory processing, cognition transmission.
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
The ‘regulates’ relationships Chris, David, Tanya.
ALMA Archive Operations Impact on the ARC Facilities.
10.11 Data Manipulation 2 Queries. You will need… Specimen 2007 Paper 2 Task C (a PDF) q 43 Read this question carefully before you start The database.
Honors Biology Ch 4 THE CHEMISTRY OF LIFE.  M1: Ecology  Study of large scale stuff  M2: Molecules to Organisms  Study of really small scale stuff.
Copyright OpenHelix. No use or reproduction without express written consent1.
BioInformatics Database of Primer Results In order to help predict the way proteins will act in an organism, biologists cross-examine sequences of amino.
Building the Process Ontology One Branch at a Time David Hill Tanya Berardini Rebecca Foulger Norberto de la Cruz.
Copyright © 2007 Pearson Education Inc., publishing as Pearson Benjamin Cummings Lectures by Chris C. Romero PowerPoint ® Lectures for Essential Biology,
To Boldly GO… Amelia Ireland GO Curator EBI, Hinxton, UK.
“ Good annotation practice ” for chemical data: ChEBI experience Kirill Degtyarenko European Patent Office.
ChEBI, text mining and ontological best practice Colin Batchelor Royal Society of Chemistry
EBI is an Outstation of the European Molecular Biology Laboratory. Rhea Annotated reactions database 17 December 2015.
WHAT IS A BIOCHEMICAL ENGINEER? Engineering I Fall 2014.
Life Science 8 th Grade Week 1 Mrs. Rubright. - Characteristics of Living Things - Concept of spontaneous generation - Characteristics of Living Things.
EBI is an Outstation of the European Molecular Biology Laboratory. Tutorial 5: ChEBI - On-line Submission and Curation.
Integration of Bioinformatics into Inquiry Based Learning by Kathleen Gabric.
Macromolecules. Objectives List the elements that make up living things. List the four kinds of macromolecules. Describe carbohydrates, lipids, fats and.
Protein databases Petri Törönen Shamelessly copied from material done by Eija Korpelainen and from CSC bio-opas
And natural products of plant origin ChEBI Janna Hastings.
Substances Redesign Project update for Content Committee February 11, 2015.
Copyright 2007, Paradigm Publishing Inc. BACKNEXTEND 8-1 LINKS TO OBJECTIVES Import data from another Access table Import data from another Access table.
251 st ACS National Meeting 15 th March 2016 The ChEBI Database and Ontology: a key resource for chemical biology and metabolomics Gareth Owen EMBL-EBI,
Classifying Chemistry: Current Efforts in Canada
What is organic chemistry?
The Gene Ontology Project
Chapter 1 – Biochemistry: An Introduction
Mental Functioning and the Gene Ontology
Electronic Data Processing
Properties of Matter Extensive properties depend on the amount of matter that is present. Mass Volume.
Organic Compounds.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Macromolecules September 16th/17th, 2008.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Biological Molecules -Biological molecules consist primarily of carbon, oxygen, hydrogen, and nitrogen. -These elements share valence electrons to form.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
MACROMOLECULES A very large molecule consisting of many smaller structural units linked together.
2-3 Carbon Compounds p45 Q: What elements does carbon bond with to make up life’s molecules? A:Carbon can bond with many elements, including hydrogen,
The Gene Ontology: an evolution
The ChEBI ontology Modelling chemical entities: current challenges
The molecules that form life.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Let's create your English Folder!.
Presentation transcript:

A marriage of chemistry and biology Aligning the Gene Ontology with CHEBI

ontologies: what are they really for, and why do I care?

systems of categorization, useful for handling large amounts of data

Gene Ontology arose from a need to classify functional data about genes and proteins in various databases

Parallel efforts grew up elsewhere

GO has always contained a chemical hierarchy

Roncaglia

biologists are not great chemists

Use CHEBI to make chemical hierarchy in GO

The story begins in 2009 when we decided to get started. We figured it would take a few months.

GO terms were ‘decomposed’ and text-matched to ChEBI terms

A few technical issues needed to be sorted out with the relationships in CHEBI first

Once that was done, we generated a list of mismatches between GO and CHEBI

One of the first things we realized was that we weren’t even consistent with our chemical classification with ourselves

Generated the implicit chemical ontology of GO - GOChe

Four of us sat in a room for a weekend with a large pot of coffee and edited GOChe (March 2010)

One of the things we consistently noticed was that ChEBI classified nucleotides as carbohydrates

Called another meeting with ChEBI. Who confirmed that nucleotides are indeed carbohydrates.

plurals e.g. pyridines v/s pyridine “pyridine-containing compound”

Acids and bases

Biologists are much more wooly and conflate these terms acids and bases often interconvert during biological processes

Union terms of acids and bases in GO: carboxylic acid OR carboxylic acid anion

Where are we now? Pretty much there Bridging files between GO and CHEBI ready Single update – use reasoner to ‘fix’ GO Editing will involve loading both GO + CHEBI and reasoning Paper ready for submission

What does this buy us? Consistency and accuracy Time savings Potential for cool queries and analysis that combine biology and chemistry

People David Hill Jane Lomax Harold Drabkin Chris Mungall Midori Harris Tanya Berardini Rebecca Foulger Paola Roncaglia Marcus Ennis Paula de Matos Janna Hastings Nico Adams Mike Bada Colin Bachelor