Increasing GO Annotation Through Community Involvement Fiona McCarthy*, Nan Wang*, Susan Bridges** and Shane Burgess** GO.


Increasing GO Annotation Through Community Involvement Fiona McCarthy*, Nan Wang*, Susan Bridges** and Shane Burgess** GO Consortium User Meeting, 10 Sept 2006

Who Are We? Dr Susan Bridges, Bryce Magee, Nan Wang, Dr Shane Burgess, Teresia Buza, Divyaswetha Peddinti

GO Consortium Member mentor
MGI: Judith Blake, David Hill, Mary Dolan
GOA: Evelyn Camon, Dan Barrell
GO Editorial Office: Jennifer Clark, Midori Harris
dictyBase: Rex Chisholm, Eric Just

Overview
From genes to function
How much GO do I need?
Getting more GO annotation: targeted GO annotation and community involvement
Targeting GO annotations
– GO annotation quality scores
– community directed annotation
Community annotation
The hook: what’s in it for me?

"Today’s challenge is to realise greater knowledge and understanding from the data-rich opportunities provided by modern high-throughput genomic technology." - Professor Andrew Cossins Consortium for Post-Genome Science

From Genes to Function

Use GO for:
Grouping gene products by biological function
Determining which classes of gene products are over-represented or under-represented
Focusing on particular biological pathways and functions (hypothesis-driven data interrogation)
Relating a protein’s location to its function

(PubMed 06/09/06) The number of publications using GO is increasing exponentially. Use of GO is increasing in species for which there is a dedicated GO annotation effort.

How Much GO Do I Need? Compiled 15 June 06 using GOProfiler.

How Much GO Do I Need? Compiled 15 June 06 using GOProfiler and PubMed.

Analyzing Microarray Data:
‘breadth’ of annotation
– how many gene products have GO annotation?
– for each gene product, how many annotations?
‘depth’ of annotation
– how detailed is the GO annotation?
How can we effectively use our resources to get the best GO annotation?

Getting more GO annotation
electronic GO annotations (IEA)
– get many annotations quickly but lack detail (get ‘breadth’ but lack ‘depth’)
manual GO annotations
– literature curation (gold standard)
– slower: many, many more researchers publishing papers than biocurators reading them
cleverer GO annotation:
– targeted GO annotation
– community involvement: researchers are the ‘local’ experts

Targeting GO Annotation
target gene products with poor GO annotation
– determining GO annotation quality
target gene products most interesting for the community
– community feedback & prioritization

GO Annotation Quality (GAQ)
For a single gene product: GAQ = no. annotations × DAG depth × evidence code
calculate overall GAQ score for an organism
calculate GAQ for functional subsets of gene products
– target GO annotation efforts to genes in functional subsets with low GAQ score
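A minimal sketch of how a GAQ score might be computed. The slide does not say how the three factors are combined across several annotations, so this assumes one possible reading: sum (DAG depth × evidence weight) over a gene product's annotations, so that more annotations, deeper terms, and stronger evidence all raise the score. The evidence weights, the `Annotation` type, and the function names are hypothetical, not AgBase's actual values or code.

```python
from dataclasses import dataclass

# Hypothetical evidence-code weights; the slides do not give the real ranks.
EVIDENCE_WEIGHT = {"IEA": 1, "ISS": 2, "TAS": 3, "IDA": 4}


@dataclass(frozen=True)
class Annotation:
    go_id: str      # GO term identifier
    dag_depth: int  # depth of the term in the GO graph (illustrative values)
    evidence: str   # GO evidence code


def gaq_score(annotations):
    """GAQ for one gene product: sum of depth x evidence weight (assumed reading)."""
    return sum(a.dag_depth * EVIDENCE_WEIGHT[a.evidence] for a in annotations)


def mean_gaq(gene_products):
    """Mean GAQ over a mapping of gene product id -> list of annotations."""
    if not gene_products:
        return 0.0
    total = sum(gaq_score(anns) for anns in gene_products.values())
    return total / len(gene_products)
```

Passing only the gene products that belong to one functional subset to `mean_gaq` gives a per-subset score, which is the quantity the slide suggests using to pick annotation targets.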

Comparative GAQ: Scores for Chicken and Mouse

Comparative GAQ: Scores for Chicken Cell Processes

Community Directed Annotation
AgBase web form to enable community requests
prioritization:
– requests prioritized for each species
– one gene product request equals one count
– gene products with the most counts have higher priority
– annotation time for each species is split proportionally based on the number of requests for each species
– requests annotated using both ISS and available published literature
– higher priority for gene products submitted to the community gene association file
– priority for gene products commonly represented on arrays
– prioritization modified based on community input
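The counting and time-splitting rules above can be sketched as follows. This is an illustration only: it covers just the "one request equals one count" and proportional-time rules, the function names and the `(species, gene_product)` request format are assumptions, and the extra priority boosts (community file submissions, array representation, community input) are left out.

```python
from collections import Counter


def prioritize(requests):
    """Rank gene products within each species, most requested first.

    requests: list of (species, gene_product) tuples, one per request.
    Ties are broken alphabetically so the order is deterministic.
    """
    counts = Counter(requests)
    by_species = {}
    for (species, gene), n in counts.items():
        by_species.setdefault(species, []).append((gene, n))
    return {
        species: [gene for gene, _ in sorted(items, key=lambda x: (-x[1], x[0]))]
        for species, items in by_species.items()
    }


def split_time(requests, total_hours):
    """Split annotation time across species in proportion to request counts."""
    per_species = Counter(species for species, _ in requests)
    total = sum(per_species.values())
    return {sp: total_hours * n / total for sp, n in per_species.items()}
```

For example, with three chicken requests and one cow request, `split_time` assigns three quarters of the available hours to chicken.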

Community Annotation
login allows annotations to be acknowledged and submitters to be notified when the request is completed
users may request annotation of a gene product/PubMed reference
submitters assist in pertinent literature selection for curation
allows annotators to focus on GO annotations most important for the community
submitters notified when the request is completed
advanced submission allows trained users to upload gene association files for inclusion in the Community annotation file
quality checks before annotations are transferred from the Community file to the GOC file

Targeting GO Annotation
use GAQ to target poorly annotated processes
– determining GO annotation quality
use community requests to prioritize annotations
BUT still many, many more researchers publishing papers than biocurators reading them…
…and researchers need their data analyzed yesterday...

Community GO Annotation
AgBase provides 2 annotation files:
1. “GO Consortium” file: fully quality checked annotations that meet current GO Consortium guidelines
2. “Community” file:
– annotations for ‘predicted proteins’ without UniProtKB identifiers (until 10 July 2006, these were not supported by EBI-GOA)
– ISS annotations based on evidence codes that are no longer accepted (as of April 2006)
– annotations from community researchers that have not been fully quality checked by a trained GO curator
– these annotations will eventually be transferred to the GOC file or (for ISS) superseded by higher quality literature annotations
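The two-tier split can be read as a simple routing rule. The sketch below is a hedged illustration of that rule as described on the slide, not AgBase's actual pipeline; the `Submission` fields and the `target_file` function are hypothetical names introduced here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Submission:
    gene_product: str
    has_uniprot_id: bool     # False for 'predicted proteins' lacking a UniProtKB id
    curator_checked: bool    # fully quality checked by a trained GO curator
    evidence_accepted: bool  # False for ISS annotations that no longer meet the rules


def target_file(s):
    """Return which file an annotation belongs in under the assumed rules."""
    if s.has_uniprot_id and s.curator_checked and s.evidence_accepted:
        return "GOC"
    return "Community"
```

An annotation that fails any one check stays in the Community file until the check passes, which mirrors the slide's statement that Community annotations are eventually transferred or superseded.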

AgBase Annotation Files
GOC file = 1,508 GO associations
Community file = 5,146 GO associations

What’s in it for ME?
a two-tier annotation file system provides the most comprehensive annotation where few annotations are available
researchers can choose which annotation file best suits their system
researchers with GO training may submit (& be acknowledged for) their own annotations
researchers with specific knowledge of particular gene products can add to the annotation of their gene products of interest via the AgBase request page
