Why do we need good quality annotations? Pankaj Jaiswal Oregon State University Gene Annotation Workshop July 31, 2010 ASPB Plant Biology 2010 Montreal,

Slides:



Advertisements
Similar presentations
Stein Lab In-House Symposium The Plan  Overview of my lab’s activities  Detailed look at the Gramene Database  Run out of time  Talk really.
Advertisements

Introduction to the Plant Ontology™ Laurel Cooper* 1, Justin Elser 1, Maria A. Gandolfo 3, Chris Mungall 4, Pankaj Jaiswal 1, Barry Smith 5, Dennis Wm.
Pathways analysis Iowa State Workshop 11 June 2009.
Gramene Comparative & Phylogenomics Resources for Plants Joshua C. Stein 1, William Spooner 1, Sharon Wei 1, Liya Ren 1, Doreen Ware 1,2 1 Cold Spring.
The Plant Ontology A Resource for Plant Genomics Laurel Cooper Department of Botany and Plant Pathology Oregon State University,
Gene Ontology John Pinney
Bioinformatics for biomedicine Summary and conclusions. Further analysis of a favorite gene Lecture 8, Per Kraulis
Natural Variation in Arabidopsis thaliana Light Response: Genomic Approaches Justin Borevitz Salk Institute naturalvariation.org.
Genomic Methods for Cloning QTL Justin Borevitz University of Chicago naturalvariation.org.
EBI is an Outstation of the European Molecular Biology Laboratory. UniProt Jennifer McDowall, Ph.D. Senior InterPro Curator Protein Sequence Database:
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
2013 iPlant workshop Marcela Karey Monaco What is Gramene? An integrated plant reference genome resource Comparative genomics hub of data & tools.
Subsystem Approach to Genome Annotation National Microbial Pathogen Data Resource Claudia Reich NCSA, University of Illinois, Urbana.
Ten years of GDR Current Resources and Functionality S Jung, T Lee, S Ficklin, CH Cheng, P Zheng, A Blenda, S Ru, K Evans, C Peace, N Oraguzie, AG Abbott,
Claire O’Donovan EMBL-EBI. In UniProtKB, we aim to provide… o A high quality protein sequence database A non redundant protein database, with maximal.
New Data and Functionality of GDR, the Genome Database for Rosaceae Sook Jung, Taein Lee, Stephen Ficklin, Chun-Huai Cheng, Ping Zheng, Anna Blenda, Sushan.
Modeling Functional Genomics Datasets CVM Lessons 4&5 10 July 2007Bindu Nanduri.
14.5L/9.5DTime7:00- 9:00 9:00- 11:00 11:00- 15:00 15:00- 17:00 17:00- 19:00 19:00- 21:00 21:00- 23:00 23:00- 5:00 5:00- 7:00 Temperature
Review of Ondex Bernice Rogowitz G2P Visualization and Visual Analytics Team March 18, 2010.
Using The Gene Ontology: Gene Product Annotation.
Gramene Objectives Develop a database and tools to store, visualize and analyze data on genetics, genomics, proteomics, and biochemistry of grass plants.
Natural Variation in Arabidopsis ecotypes. Using natural variation to understand diversity Correlation of phenotype with environment (selective pressure?)
Using Gramene: A genomics and genetics resource for rice and other grasses. PAG XIV Jan 15, 2006 Immanuel (Noel) Yap Claire Hebbard.
1 Building Communities Around Ontology Development Pankaj Jaiswal Dept. of Plant Breeding and Genetics Cornell University Ithaca, NY FAO,
A Comparative Genomics Resource for Grains. Tutorial Tips If you are viewing this tutorial with Adobe Acrobat Reader, click the "bookmarks" on the left.
A comparative mapping resource for the grasses Gramene Plant Biology July 25, 2004 Molly Fogleman Doreen Ware Pankaj Jaiswal.
1 of 38 Data Mining in Ensembl with BioMart. 2 of 38 Simple Text-based Search Engine.
Top Four Essential TAIR Resources Debbie Alexander Metabolic Pathway Databases for Arabidopsis and Other Plants Peifen Zhang.
Bioinformatics lectures at Rice University Li Zhang Lecture 9: Networks and integrative genomic analysis
Gramene Objectives Provide researchers working on grasses and plants in general with a bird’s eye view of the grass genomes and their organization. Work.
Introduction to the GO: a user’s guide Iowa State Workshop 11 June 2009.
Alastair Kerr, Ph.D. WTCCB Bioinformatics Core An introduction to DNA and Protein Sequence Databases.
The Plant Ontology Consortium Lincoln Stein 1, Susan McCouch 2, Elizabeth Kellogg 3, Seung Rhee 4, Pankaj Jaiswal 2, Doreen Ware 1, Peter Stevens 5 1 Cold.
NextGen Pipeline: Enabling the Plant Science Community Tom Brutnell (lead), Steve Rounsley (co-lead), Matt Vaughn (Engagement Lead) Ed Buckler, Justin.
Protein and RNA Families
Getting Started: a user’s guide to the GO GO Workshop 3-6 August 2010.
Pathway Database Pathway Comparison Expression Viewer Discovery. Pankaj Jaiswal Oregon State University 1.
Gramene: Interactions with NSF Project on Molecular and Functional Diversity in the Maize Genome Maize PIs (Doebley, Buckler, Fulton, Gaut, Goodman, Holland,
Functional Annotation and Functional Enrichment. Annotation Structural Annotation – defining the boundaries of features of interest (coding regions, regulatory.
TRPGR Gramene: A Platform for Comparative Plant Genomics Doreen Ware USDA ARS Cold Spring Harbor Laboratory Pankaj Jaiswal Oregon State University Joshua.
Getting Started: a user’s guide to the GO TAMU GO Workshop 17 May 2010.
Sequencing the World of Possibilities for Energy & Environment MGM workshop. 19 Oct 2010 Information Sources for Genomics Konstantinos Mavrommatis Genome.
Rice Proteins Data acquisition Curation Resources Development and integration of controlled vocabulary Gene Ontology Trait Ontology Plant Ontology
GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.
Data Mining at PLEXdb : Plant and Plant Pathogen Gene Expression Database.
Exploring Rice Phenotypes Using Gramene Molly Fogleman – Outreach coordinator A Comparative Mapping Resource GRAMENE.
Exploring Rice Phenotypes Using Gramene A Comparative Mapping Resource GRAMENE.
A collaborative tool for sequence annotation. Contact:
Phenotype Curation Susan R. McCouch Department of Plant Breeding Cornell University.
Introduction to the GO: a user’s guide NCSU GO Workshop 29 October 2009.
Cool BaRC Web Tools Prat Thiru. BaRC Web Tools We have.
EBI is an Outstation of the European Molecular Biology Laboratory. UniProtKB Sandra Orchard.
Copyright OpenHelix. No use or reproduction without express written consent1.
What do we already know ? The rice disease resistance gene Pi-ta Genetically mapped to chromosome 12 Rybka et al. (1997). It has also been sequenced Bryan.
GMOD – What Next?. Application Areas Genome –Single annotation –Comparative annotation Genetics –Stocks, strains, mutants –QTL –Variation Protein annotation.
The Future of Genetics Research Lesson 7. Human Genome Project 13 year project to sequence human genome and other species (fruit fly, mice yeast, nematodes,
SRI International Bioinformatics 1 Pathway Tools Features Available Only in the Desktop Version PathoLogic.
 What is MSA (Multiple Sequence Alignment)? What is it good for? How do I use it?  Software and algorithms The programs How they work? Which to use?
(Quantitative, Evolution, & Development)
CottonGen: An Up-to-Date Resource Enabling Genetics, Genomics and Breeding Research for Crop Improvement Plant and Animal Genome Conference XXV Jing Yu1,
Functional Annotation of the Horse Genome
There are four levels of structure in proteins
Genome Annotation w/ MAKER
for the Cotton Community
Working in the Post-Genomic C. elegans World
CottonGen: Enabling Cotton Research through Big-Data Analysis and Integration Jing Yu, Sook Jung, Chun-Huai Cheng, Taein Lee, Katheryn Buble, Ping Zheng,
Key CottonGen Features For more information contact:
Summary of differentially expressed genes.
Germplasm Overview Page Trait Descriptor Standard Images
Tutorial for: Gramene Website Navigation
Presentation transcript:

Why do we need good quality annotations? Pankaj Jaiswal Oregon State University Gene Annotation Workshop July 31, 2010 ASPB Plant Biology 2010 Montreal, Canada

2 Physical and/or Sequence map Map-2 Polymorphism Candidate gene Gene models Transcripts Peptides Function Expression Pathway Homologs Paralogs Orthologs Interactions Comparative and Translational Genomics Genetic markers & phenotypes (genes and QTL) Map-1 Forward Reverse Genotype Phenotype

Jaiswal lab: Strawberry Genome Consortium Schnable et al., pp – 1115 (Maize synteny and Venn diagram) Comparative Genomics

Expression pattern and over-represented metabolic processes of rice diurnally cycling genes Expression data provided by Todd Mockler, Analysis by Palitha Dharmawardhana (jaiswal lab)

Trp pathway genes-Diurnal expression pattern clustering Expression data provided by Todd Mockler, Analysis by Palitha Dharmawardhana (jaiswal lab)

Diurnal variation of Trp biosynthetic gene expression hrs12 hrs24 hrs Expression data provided by Todd Mockler, Analysis by Palitha Dharmawardhana (jaiswal lab)

Over-represented gene ontology categories associated to genes expressed in (A) Fruit and (B) Root. The circles are shaded based on significance level (yellow = FDR below 0.05), and the radius of each circle denotes the number of genes in each category. GO enrichment: RootGO enrichment: Fruit Data provided by Todd Mockler and Kevin Folta

8 InterPro Uniprot & RefSeq MetaCyc Function Function Gene Ontology Genes EC number Text Mining Phenotype KEGG-Rice Gene-Pathway Expression Manual Curation Interactions Pathways Enriched Gene Function MonocotsDicots Orthologs (Compara/Inparanoid) Arabidopsis and OthersMaize, Rice and Others Non plants Genome/maps A3 B1 B2 C A1 A2 F B3 D A B C D

Annotations by homology !

Comparison of Metabolic Pathway Gene Annotations

11 Race to the ……. ! Are we there yet ?

Solutions ! Get involved in the community curation Provide details as much as possible in your publications Work with your species specififc database (if available*) before/after publication on making sure data from your publication is integrated. – * If not available contact archives, succh as NCBI, Uniprot, Gramene. You need it, I need it, community needs it.