Using The Gene Ontology: Gene Product Annotation.

Slides:



Advertisements
Similar presentations
A Comparative mapping resource ONTOLOGY DEVELOPMENT AND INTEGRATION IN GRAMENE Pankaj Jaiswal Cornell University.
Advertisements

The Gene Ontology Project: Content for the Semantic Web.
1 Gene Ontology and Functional Annotation Donghui Li ASPB Plant Biology, June 29, 2008, Merida.
Annotation of Gene Function …and how thats useful to you.
Applications of GO. Goals of Gene Ontology Project.
25th June 2007 Jane Lomax Using the Gene Ontology (GO) for analysis of expression data Jane Lomax EMBL-EBI.
Annotating Gene Products to the GO Harold J Drabkin Senior Scientific Curator The Jackson Laboratory Mouse.
Gene Ontology John Pinney
Gene function analysis Stem Cell Network Microarray Course, Unit 5 May 2007.
CACAO - Remote training Gene Function and Gene Ontology Fall 2011
COG and GO tutorial.
Bioinformatics master course DNA/Protein structure-function analysis and prediction Lecture 13: Protein Function Centre for Integrative Bioinformatics.
CACAO - Remote training Gene Function and Gene Ontology Fall 2011
Sequence-Structure-Function Sequence Structure Function Threading Ab initio BLAST Folding: impossible but for the smallest structures Function prediction.
CACAO - Penn State Gene Function and Gene Ontology January 2011
Gene Ontology at WormBase: Making the Most of GO Annotations Kimberly Van Auken.
Methods for Creating GO Annotations Emily Dimmer European Bioinformatics Institute Wellcome Trust Genome Campus Cambridge UK.
PAT project Advanced bioinformatics tools for analyzing the Arabidopsis genome Proteins of Arabidopsis thaliana (PAT) & Gene Ontology (GO) Hongyu Zhang,
A Common Language for Annotation of Genes from Yeast, Flies and Mice The Gene Ontologies …and Plants and Worms …and Humans …and anything else!
Daniel Rico, PhD. Daniel Rico, PhD. ::: Introduction to Functional Analysis Course on Functional Analysis Bioinformatics Unit.
SPH 247 Statistical Analysis of Laboratory Data 1 May 12, 2015 SPH 247 Statistical Analysis of Laboratory Data.
Annotating Gene Products to the GO Harold J Drabkin Senior Scientific Curator The Jackson Laboratory Mouse.
The aims of the Gene Ontology project are threefold: - to compile vocabularies to describe components, functions and processes - to produce tools to query.
Gene Ontology Overview and Perspective Lung Development Ontology Workshop.
SPH 247 Statistical Analysis of Laboratory Data 1May 14, 2013SPH 247 Statistical Analysis of Laboratory Data.
Ontologies, data standards and controlled vocabularies.
GENE ONTOLOGY FOR THE NEWBIES Suparna Mundodi, PhD The Arabidopsis Information Resources, Stanford, CA.
Gene Ontology Consortium
The Gene Ontology: a real-life ontology, progress and future. Jane Lomax EMBL-EBI.
The Gene Ontology project Jane Lomax. Ontology (for our purposes) “an explicit specification of some topic” – Stanford Knowledge Systems Lab Includes:
Gene Ontology Project
Gene Ontology TM (GO) Consortium Jennifer I Clark EMBL Outstation - European Bioinformatics Institute (EBI), Hinxton, Cambridge CB10 1SD, UK Objectives:
Lecture Four: GO: The Gene Ontology ----Infrastructure for Systems Biology.
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
Monday, November 8, 2:30:07 PM  Ontology is the philosophical study of the nature of being, existence or reality as such, as well as the basic categories.
From Functional Genomics to Physiological Model: Using the Gene Ontology Fiona McCarthy, Shane Burgess, Susan Bridges The AgBase Databases, Institute of.
Manual GO annotation Evidence: Source AnnotationsProteins IEA:Total Manual: Total
Introduction to the GO: a user’s guide Iowa State Workshop 11 June 2009.
24th Feb 2006 Jane Lomax GO Further. 24th Feb 2006 Jane Lomax GO annotations Where do the links between genes and GO terms come from?
Gene Product Annotation using the GO ml Harold J Drabkin Senior Scientific Curator The Jackson Laboratory.
Part II GO-Vocabulary of Genome. S. cerevisiae D. melanogaster.
Alastair Kerr, Ph.D. WTCCB Bioinformatics Core An introduction to DNA and Protein Sequence Databases.
The Gene Ontology and its insertion into UMLS Jane Lomax.
Getting Started: a user’s guide to the GO GO Workshop 3-6 August 2010.
Functional Annotation and Functional Enrichment. Annotation Structural Annotation – defining the boundaries of features of interest (coding regions, regulatory.
1 Gene function annotation. 2 Outline  Functional annotation  Controlled vocabularies  Functional annotation at TAIR  Resources and tools at TAIR.
Other biological databases and ontologies. Biological systems Taxonomic data Literature Protein folding and 3D structure Small molecules Pathways and.
Getting Started: a user’s guide to the GO TAMU GO Workshop 17 May 2010.
A Common Language for Annotation of Genes from Yeast, Flies and Mice The Gene Ontologies …and Plants and Worms …and Humans …and anything else!
Rice Proteins Data acquisition Curation Resources Development and integration of controlled vocabulary Gene Ontology Trait Ontology Plant Ontology
Gene Ontology Consortium
Introduction to the GO: a user’s guide NCSU GO Workshop 29 October 2009.
Scope of the Gene Ontology Vocabularies. Compile structured vocabularies describing aspects of molecular biology Describe gene products using vocabulary.
Update Susan Bridges, Fiona McCarthy, Shane Burgess NRI
1 Annotation EPP 245/298 Statistical Analysis of Laboratory Data.
An example of GO annotation from a primary paper Rebecca E. Foulger (UniProt Curator) GO Annotation Camp, June 2005 PMID:
An example of GO annotation from a primary paper GO Annotation Camp, July 2006 PMID:
Gene Ontology TM (GO) Consortium
Joined up ontologies: incorporating the Gene Ontology into the UMLS.
Module 1: Gene Lists 1 Canadian Bioinformatics Workshops
Canadian Bioinformatics Workshops
Sequence-Structure-Function Sequence Structure Function Threading Ab initio BLAST Folding: impossible but for the smallest structures Function prediction.
What’s new in GO?. Priorities Annotation outreach Reference genomes User advocacy Ontology development Software.
Gene Annotation & Gene Ontology
Annotating with GO: an overview
Introduction to the Gene Ontology
Department of Genetics • Stanford University School of Medicine
Using the Gene Ontology (GO) for analysis of expression data Jane Lomax EMBL-EBI 25th June 2007 Jane Lomax.
Annotating Gene Products to the GO
Insight into GO and GOA Angelica Tulipano , INFN Bari CNR
Presentation transcript:

Using The Gene Ontology: Gene Product Annotation

Compile structured vocabularies describing aspects of molecular biology Describe gene products using vocabulary terms (annotation) Develop tools: to query and modify the vocabularies and annotations annotation tools for curators GO Project Goals

GO provides two bodies of data: Terms with definitions and cross- references Gene product annotations with supporting data GO Data

Molecular Function — elemental activity or task nuclease, DNA binding, transcription factor Biological Process — broad objective or goal mitosis, signal transduction, metabolism Cellular Component — location or complex nucleus, ribosome, origin recognition complex The Three Ontologies

DAG Structure Directed acyclic graph: each child may have one or more parents

Every path from a node back to the root must be biologically accurate The True Path Rule

Association between gene product and applicable GO terms Provided by member databases Made by manual or automated methods GO Annotation

DAG Structure Annotate to any level within DAG

DAG Structure Annotate to any level within DAG mitotic chromosome condensation S.c. BRN1, D.m. barren

DAG Structure Annotate to any level within DAG mitosis S.c. NNF1 mitotic chromosome condensation S.c. BRN1, D.m. barren

Database object: gene or gene product GO term ID Reference publication or computational method Evidence supporting annotation GO Annotation: Data

IDA - Inferred from Direct Assay IMP - Inferred from Mutant Phenotype IGI - Inferred from Genetic Interaction IPI - Inferred from Physical Interaction IEP - Inferred from Expression Pattern GO Evidence Codes TAS - Traceable Author Statement NAS - Non-traceable Author Statement IC - Inferred by Curator ISS - Inferred from Sequence or structural Similarity IEA - Inferred from Electronic Annotation ND - Not Determined

IDA - Inferred from Direct Assay IMP - Inferred from Mutant Phenotype IGI - Inferred from Genetic Interaction IPI - Inferred from Physical Interaction IEP - Inferred from Expression Pattern GO Evidence Codes TAS - Traceable Author Statement NAS - Non-traceable Author Statement IC - Inferred by Curator ISS - Inferred from Sequence or structural Similarity IEA - Inferred from Electronic Annotation ND - Not Determined From primary literature

IDA - Inferred from Direct Assay IMP - Inferred from Mutant Phenotype IGI - Inferred from Genetic Interaction IPI - Inferred from Physical Interaction IEP - Inferred from Expression Pattern GO Evidence Codes TAS - Traceable Author Statement NAS - Non-traceable Author Statement IC - Inferred by Curator ISS - Inferred from Sequence or structural Similarity IEA - Inferred from Electronic Annotation ND - Not Determined From reviews or introductions From primary literature

IDA - Inferred from Direct Assay IMP - Inferred from Mutant Phenotype IGI - Inferred from Genetic Interaction IPI - Inferred from Physical Interaction IEP - Inferred from Expression Pattern GO Evidence Codes TAS - Traceable Author Statement NAS - Non-traceable Author Statement IC - Inferred by Curator ISS - Inferred from Sequence or structural Similarity IEA - Inferred from Electronic Annotation ND - Not Determined From reviews or introductions From primary literature

IDA - Inferred from Direct Assay IMP - Inferred from Mutant Phenotype IGI - Inferred from Genetic Interaction IPI - Inferred from Physical Interaction IEP - Inferred from Expression Pattern GO Evidence Codes TAS - Traceable Author Statement NAS - Non-traceable Author Statement IC - Inferred by Curator ISS - Inferred from Sequence or structural Similarity IEA - Inferred from Electronic Annotation ND - Not Determined From reviews or introductions From primary literature automated

Manual Automated sequence similarity transitive annotation nomenclature, other text matching GO Annotation: Methods

Experiment 1 - Purification and enzyme assay Purified His-tagged Ylr209cp; can convert various nucleoside substrates to bases + P i ; inosine and guanosine are substrates Experiment 2 - Knockout of YLR209C null mutant excretes inosine and guanosine into medium (compounds in medium separated by chromatography and identified by HPLC separation profiles) Literature-Based Manual Annotation: Experimental Evidence Codes Lecoq, K., et al. (2001) YLR209C Encodes Saccharomyces cerevisiae Purine Nucleoside Phosphorylase. J. Bacteriology 183(16):

Experiment 1 - Purification and enzyme assay Purified His-tagged Ylr209cp; can convert various nucleoside substrates to bases + P i ; inosine and guanosine are substrates Experiment 2 - Knockout of YLR209C null mutant excretes inosine and guanosine into medium (compounds in medium separated by chromatography and identified by HPLC separation profiles) Literature-Based Manual Annotation: Experimental Evidence Codes Lecoq, K., et al. (2001) YLR209C encodes Saccharomyces cerevisiae purine nucleoside phosphorylase. J. Bacteriol. 183(16): 4910–4913. IDA

Experiment 1 - Purification and enzyme assay Purified His-tagged Ylr209cp; can convert various nucleoside substrates to bases + P i ; inosine and guanosine are substrates Experiment 2 - Knockout of YLR209C null mutant excretes inosine and guanosine into medium (compounds in medium separated by chromatography and identified by HPLC separation profiles) Literature-Based Manual Annotation: Experimental Evidence Codes Lecoq, K., et al. (2001) YLR209C encodes Saccharomyces cerevisiae purine nucleoside phosphorylase. J. Bacteriol. 183(16): 4910–4913. FUNCTION: purine nucleoside phosphorylase IDA

Experiment 1 - Purification and enzyme assay Purified His-tagged Ylr209cp; can convert various nucleoside substrates to bases + P i ; inosine and guanosine are substrates Experiment 2 - Knockout of YLR209C null mutant excretes inosine and guanosine into medium (compounds in medium separated by chromatography and identified by HPLC separation profiles) Literature-Based Manual Annotation: Experimental Evidence Codes Lecoq, K., et al. (2001) YLR209C ncodes Saccharomyces cerevisiae purine nucleoside phosphorylase. J. Bacteriol. 183(16): 4910–4913. FUNCTION: purine nucleoside phosphorylase IDA IMP

Experiment 1 - Purification and enzyme assay Purified His-tagged Ylr209cp; can convert various nucleoside substrates to bases + P i ; inosine and guanosine are substrates Experiment 2 - Knockout of YLR209C null mutant excretes inosine and guanosine into medium (compounds in medium separated by chromatography and identified by HPLC separation profiles) Literature-Based Manual Annotation: Experimental Evidence Codes Lecoq, K., et al. (2001) YLR209C encodes Saccharomyces cerevisiae purine nucleoside phosphorylase. J. Bacteriol. 183(16): 4910–4913. FUNCTION: purine nucleoside phosphorylase IDA IMP

Experiment 1 - Purification and enzyme assay Purified His-tagged Ylr209cp; can convert various nucleoside substrates to bases + P i ; inosine and guanosine are substrates Experiment 2 - Knockout of YLR209C null mutant excretes inosine and guanosine into medium (compounds in medium separated by chromatography and identified by HPLC separation profiles) Literature-Based Manual Annotation: Experimental Evidence Codes Lecoq, K., et al. (2001) YLR209C encodes Saccharomyces cerevisiae purine nucleoside phosphorylase. J. Bacteriol. 183(16): 4910–4913. FUNCTION: purine nucleoside phosphorylase IDA PROCESS: purine nucleoside catabolism IMP

Experiment 1 - Purification and enzyme assay Purified His-tagged Ylr209cp; can convert various nucleoside substrates to bases + P i ; inosine and guanosine are substrates Experiment 2 - Knockout of YLR209C null mutant excretes inosine and guanosine into medium (compounds in medium separated by chromatography and identified by HPLC separation profiles) Literature-Based Manual Annotation: Experimental Evidence Codes Lecoq, K., et al. (2001) YLR209C encodes Saccharomyces cerevisiae purine nucleoside phosphorylase. J. Bacteriol. 183(16): 4910–4913. FUNCTION: purine nucleoside phosphorylase IDA PROCESS: purine nucleoside catabolism IMP This paper has no data for cellular component.

InterPro2go links InterPro entries and GO terms Automated Annotation: InterPro Example YFP InterPro entry GO entry

InterPro2go links InterPro entries and GO terms Automated Annotation: InterPro Example YFP InterPro entry GO entry Run InterProScan to link YFP and InterPro entry

InterPro2go links InterPro entries and GO terms Automated Annotation: InterPro Example YFP Infer GO term from the other two links InterPro entry GO entry Run InterProScan to link YFP and InterPro entry

detailed view of term AmiGO Browser

gene products annotated to term

FlyBase WormBase Saccharomyces Genome Database DictyBase Mouse Genome Informatics Gramene The Arabidopsis Information Resource Compugen, Inc. Swiss-Prot/TrEMBL/InterPro Pathogen Sequencing Unit (Sanger Institute) PomBase (Sanger Institute) Rat Genome Database The Institute for Genomic Research GO Annotation: Contributors

Fruit fly (Drosophila melanogaster) Budding yeast (Saccharomyces cerevisiae) Fission yeast (Schizosaccharomyces pombe) Human (Homo sapiens) Mouse (Mus musculus) Rice (Oryza sativa) Rat (Rattus norvegicus) Tsetse fly (G. morsitans) Caenorhabditis elegans Arabidopsis thaliana Vibrio cholerae Dictyostelium discoideum GO Annotation: Organisms

Current GO Annotations

FlyBase & Berkeley Drosophila Genome Project WormBase Saccharomyces Genome Database DictyBase Mouse Genome Informatics Gramene The Arabidopsis Information Resource Compugen, Inc. Swiss-Prot/TrEMBL/InterPro Pathogen Sequencing Unit (Sanger Institute) PomBase (Sanger Institute) Rat Genome Database Genome Knowledge Base (CSHL) The Institute for Genomic Research The Gene Ontology Consortium is supported by NHGRI grant HG02273 (R01). The Gene Ontology project thanks AstraZeneca for financial support. The Stanford group acknowledges a gift from Incyte Genomics.

Conference: Standards and Ontologies for Functional Genomics (SOFG) Towards unified ontologies for describing biology and biomedicine 17 – 20 November 2002 Hinxton Hall Conference Centre Hinxton, Cambridge, UK