Annotating with GO: an overview

Slides:



Advertisements
Similar presentations
Annotation of Gene Function …and how thats useful to you.
Advertisements

GBrowse at TAIR Philippe Lamesch TAIR curator. Seqviewer.
Applications of GO. Goals of Gene Ontology Project.
25th June 2007 Jane Lomax Using the Gene Ontology (GO) for analysis of expression data Jane Lomax EMBL-EBI.
Annotating Gene Products to the GO Harold J Drabkin Senior Scientific Curator The Jackson Laboratory Mouse.
Ontology annotation: mapping genomic regions biological function Paul D Thomas, Huaiyu Mi and Suzanna Lewis.
POC tutorial#3: Annotation This tutorial will run automatically in Quicktime. To run the tutorial at your own pace use the internal controllers within.
Gene function analysis Stem Cell Network Microarray Course, Unit 5 May 2007.
CACAO - Remote training Gene Function and Gene Ontology Fall 2011
Community Annotation of Gene Function with GONUTS Jim Hu EcoliHub/EcoliWiki Dept. of Biochemistry and Biophysics Texas A&M University.
Gene Ontology Luis Tari. Gene Ontology (GO) URL: Gene Ontology is A hierarchy of roles of genes.
COG and GO tutorial.
Bioinformatics master course DNA/Protein structure-function analysis and prediction Lecture 13: Protein Function Centre for Integrative Bioinformatics.
CACAO - Remote training Gene Function and Gene Ontology Fall 2011
Sequence-Structure-Function Sequence Structure Function Threading Ab initio BLAST Folding: impossible but for the smallest structures Function prediction.
Ontologies for Informatics. Infrastructure for Systems Biology. Oxford October
CACAO - Penn State Gene Function and Gene Ontology January 2011
Gene Ontology at WormBase: Making the Most of GO Annotations Kimberly Van Auken.
Genome database & information system for Daphnia Don Gilbert, October 2002 Talk doc at
PAT project Advanced bioinformatics tools for analyzing the Arabidopsis genome Proteins of Arabidopsis thaliana (PAT) & Gene Ontology (GO) Hongyu Zhang,
Using The Gene Ontology: Gene Product Annotation.
CACAO Training Fall Community Assessment of Community Annotation with Ontologies (CACAO)
Annotating Gene Products to the GO Harold J Drabkin Senior Scientific Curator The Jackson Laboratory Mouse.
Ontologies, data standards and controlled vocabularies.
March 24, Integrating genomic knowledge sources through an anatomy ontology Gennari JH, Silberfein A, and Wiley JC Pac Symp Biocomputing 2005:
The Gene Ontology: a real-life ontology, progress and future. Jane Lomax EMBL-EBI.
The Gene Ontology project Jane Lomax. Ontology (for our purposes) “an explicit specification of some topic” – Stanford Knowledge Systems Lab Includes:
Gene Ontology Project
Gene Ontology TM (GO) Consortium Jennifer I Clark EMBL Outstation - European Bioinformatics Institute (EBI), Hinxton, Cambridge CB10 1SD, UK Objectives:
EBI is an Outstation of the European Molecular Biology Laboratory. GOA: Looking after GO annotations Emily Dimmer Gene Ontology Annotation (GOA) Database.
Lecture Four: GO: The Gene Ontology ----Infrastructure for Systems Biology.
Monday, November 8, 2:30:07 PM  Ontology is the philosophical study of the nature of being, existence or reality as such, as well as the basic categories.
From Functional Genomics to Physiological Model: Using the Gene Ontology Fiona McCarthy, Shane Burgess, Susan Bridges The AgBase Databases, Institute of.
Web Databases for Drosophila Introduction to FlyBase and Ensembl Database Wilson Leung6/06.
Manual GO annotation Evidence: Source AnnotationsProteins IEA:Total Manual: Total
Introduction to the GO: a user’s guide Iowa State Workshop 11 June 2009.
SRI International Bioinformatics 1 Submitting pathway to MetaCyc Ron Caspi.
24th Feb 2006 Jane Lomax GO Further. 24th Feb 2006 Jane Lomax GO annotations Where do the links between genes and GO terms come from?
Gene Product Annotation using the GO ml Harold J Drabkin Senior Scientific Curator The Jackson Laboratory.
Part II GO-Vocabulary of Genome. S. cerevisiae D. melanogaster.
Alastair Kerr, Ph.D. WTCCB Bioinformatics Core An introduction to DNA and Protein Sequence Databases.
The Gene Ontology and its insertion into UMLS Jane Lomax.
Getting Started: a user’s guide to the GO GO Workshop 3-6 August 2010.
Functional Annotation and Functional Enrichment. Annotation Structural Annotation – defining the boundaries of features of interest (coding regions, regulatory.
1 Gene function annotation. 2 Outline  Functional annotation  Controlled vocabularies  Functional annotation at TAIR  Resources and tools at TAIR.
Building WormBase database(s). SAB 2008 Wellcome Trust Sanger Insitute Cold Spring Harbor Laboratory California Institute of Technology ● RNAi ● Microarray.
Getting Started: a user’s guide to the GO TAMU GO Workshop 17 May 2010.
Rice Proteins Data acquisition Curation Resources Development and integration of controlled vocabulary Gene Ontology Trait Ontology Plant Ontology
CACAO Training Fall Community Assessment of Community Annotation with Ontologies (CACAO)
Gene Ontology Consortium
Introduction to the GO: a user’s guide NCSU GO Workshop 29 October 2009.
Scope of the Gene Ontology Vocabularies. Compile structured vocabularies describing aspects of molecular biology Describe gene products using vocabulary.
Update Susan Bridges, Fiona McCarthy, Shane Burgess NRI
1 Annotation EPP 245/298 Statistical Analysis of Laboratory Data.
Gene Ontology TM (GO) Consortium
Joined up ontologies: incorporating the Gene Ontology into the UMLS.
Canadian Bioinformatics Workshops
Module 1: Gene Lists 1 Canadian Bioinformatics Workshops
Canadian Bioinformatics Workshops
Sequence-Structure-Function Sequence Structure Function Threading Ab initio BLAST Folding: impossible but for the smallest structures Function prediction.
What’s new in GO?. Priorities Annotation outreach Reference genomes User advocacy Ontology development Software.
Gene Annotation & Gene Ontology
CACAO Training ASM-JGI 2012.
Introduction to the Gene Ontology
Department of Genetics • Stanford University School of Medicine
Using the Gene Ontology (GO) for analysis of expression data Jane Lomax EMBL-EBI 25th June 2007 Jane Lomax.
Part I: Tips and Techniques from curators
Gene expression analysis
Annotating Gene Products to the GO
Insight into GO and GOA Angelica Tulipano , INFN Bari CNR
Presentation transcript:

Annotating with GO: an overview http://www.geneontology.org/ What is a Gene Ontology (GO) annotation? Databases external to GO make cross-links between GO terms and objects in their databases (typically, gene products, or their surrogates, genes), and then provide tables of these links to GO. The GO itself contains no information about genes or gene products. The GO annotation (‘gene association’) files are all publicly available: http://www.geneontology.org/#annotations A gene product is annotated to one or more terms in each of the three ontologies; biological process, cellular component and molecular function. Database name abbreviation Gene products are annotated to the most specific GO term possible for the information available. Abbreviations used by GO are described here: http://www.geneontology.org/doc/GO.xrf_abbs Example annotation: A gene product is annotated with terms reflecting only its normal activities, locations and processes. Database Object identifier. A Database Object is usually a gene product, but can also be a gene or a transcript. When there is no information regarding one or more aspects of a gene product, the gene product is annotated to the GO term ‘unknown’. Fields highlighted in grey are mandatory Used when it is specified in the source that that a gene product is NOT associated with a particular gene product e.g. “we have found that protein Z is not involved in the X cascade”. Annotation of a gene product to one ontology is independent of its annotation to the other two ontologies. Gene Ontology term identifier Object type: gene, transcript or protein The annotation of gene products to GO terms is performed according to two main principles: the recording of the source of the annotation and the type of evidence on which the annotation was based. P = biological process, F = molecular function and C = cellular component. Taxonomic identifier for gene product The evidence describes how the annotation was created, and provides a way of measuring its strength or reliability. GO has developed a set of standard evidence codes which form a loose hierarchy, with ‘inferred by electronic annotation’ (IEA) being the least reliable type of evidence, followed by ‘inferred by sequence similarity’ (ISS). The source of an annotation may be a literature reference, a database record or the type of computational anaylsis. Literature references are entered as an accession number, either from the database in question and/or from PubMed. Annotations based on computational analysis include a reference to the method of analysis. IDA inferred from direct assay IEP inferred from expression pattern IEA inferred from electronic annotation TAS traceable author statement NAS non-traceable author statement ND no biological data available Evidence codes IC inferred by curator IMP inferred from mutant phenotype IGI inferred from genetic interaction IPI inferred from physical interaction ISS inferred from sequence similarity Collaborating databases Many important databases produce GO annotations and contribute to the development of the GO. These include: FlyBase (database for the fruitfly Drosophila melanogaster), Berkeley Drosophila Genome Project (Drosophila informatics; GO database & software), Saccharomyces Genome Database (SGD) (database for the budding yeast Saccharomyces cerevisiae), Mouse Genome Database (MGD) & Gene Expression Database (GXD) (databases for the mouse Mus musculus), The Arabidopsis Information Resource (TAIR) (database for the brassica family plant Arabidopsis thaliana), WormBase (database for the nematode Caenorhabditis elegans), PomBase (database for the fission yeast Schizosaccharomyces pombe), Rat Genome Database (RGD) (database for the rat Rattus norvegicus), DictyBase (informatics resource for the slime mold Dictyostelium discoideum), The Pathogen Sequencing Unit (The Wellcome Trust Sanger Institute), Genome Knowledge Base (GKB) (Cold Spring Harbor Laboratory), EBI : InterPro - SWISS-PROT - TrEMBL groups, The Institute for Genomic Research (TIGR), Gramene (A Comparative Mapping Resource for Monocots), Compugen (with its Internet Research Engine).