Bioinformatics resources for IITA Crops GO Workshop 3-6 August 2010.

Slides:



Advertisements
Similar presentations
Martin John Bishop UK HGMP Resource Centre Hinxton Cambridge CB10 1 SB
Advertisements

Modeling Functional Genomics Datasets CVM Lesson 3 13 June 2007Fiona McCarthy.
Welcome to mini-symposium on ontologies for biological sample description EMBL-EBI Wellcome Trust Genome Campus Deceber 5, 2001.
The design, construction and use of software tools to generate, store, annotate, access and analyse data and information relating to Molecular Biology.
Peter Tsai, Bioinformatics Institute.  University of California, Santa Cruz (UCSC)  A rapid and reliable display of any requested portion of genomes.
Bioinformatics and the Engineering Library ASEE 2008 Amy Stout.
August 19, 2002Slide 1 Bioinformatics at Virginia Tech David Bevan (BCHM) Lenwood S. Heath (CS) Ruth Grene (PPWS) Layne Watson (CS) Chris North (CS) Naren.
A member of CGIAR consortium Climate change research at IITA Michael Abberton Head, Genetic Resources Centre IITA, Ibadan.
The Golden Age of Biology DNA -> RNA -> Proteins -> Metabolites Genomics Technologies MECHANISMS OF LIFE Health Care Diagnostics Medicines Animal Products.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Many genes have unknown function 30% have unknown function only 9% are experimentally verified The Arabidopsis Genome Initiative, Nature 2000 of the 25,498.
Genomics, Proteomics and Metabolomics. Genomics l The complete set of DNA found in each cell is known as the genome l Most crop plant genomes have billions.
We are developing a web database for plant comparative genomics, named Phytome, that, when complete, will integrate organismal phylogenies, genetic maps.
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Genome Annotation BCB 660 October 20, From Carson Holt.
Doug Brutlag Professor Emeritus Biochemistry & Medicine (by courtesy) Genome Databases Computational Molecular Biology Biochem 218 – BioMedical Informatics.
Viewing & Getting GO COST Functional Modeling Workshop April, Helsinki.
Databases in Bioinformatics and Systems Biology Carsten O. Daub Omics Science Center RIKEN, Japan May 2008.
Chapter 14 Genomes and Genomics. Sequencing DNA dideoxy (Sanger) method ddGTP ddATP ddTTP ddCTP 5’TAATGTACG TAATGTAC TAATGTA TAATGT TAATG TAAT TAA TA.
Tomato genome annotation pipeline in Cyrille2
Genome Annotation using MAKER-P at iPlant Collaboration with Mark Yandell Lab (University of Utah) iPlant: Josh Stein (CSHL) Matt Vaughn.
Sequence Databases What are they and why do we need them.
AgBase: bioinformatics enabling knowledge generation from agricultural omics data Fiona McCarthy.
Copyright OpenHelix. No use or reproduction without express written consent 2 Overview of Genome Browsers Materials prepared by Warren C. Lathe, Ph.D.
Managing Data Modeling GO Workshop 3-6 August 2010.
ANALYSIS AND VISUALIZATION OF SINGLE COPY ORTHOLOGS IN ARABIDOPSIS, LETTUCE, SUNFLOWER AND OTHER PLANT SPECIES. Alexander Kozik and Richard W. Michelmore.
DAY 1c: Accessing Completed Genomes 1. UCSC Genome Bioinformatics 2. Ensembl 3. NCBI Genomic Biology.
I. Introduction and Red Line Education for Data-unlimited Science.
What is an Ontology? An ontology is a specification of a conceptualization that is designed for reuse across multiple applications and implementations.
Gene Ontology TM (GO) Consortium Jennifer I Clark EMBL Outstation - European Bioinformatics Institute (EBI), Hinxton, Cambridge CB10 1SD, UK Objectives:
Browsing the Genome Using Genome Browsers to Visualize and Mine Data.
Strategies for functional modeling TAMU GO Workshop 17 May 2010.
Biological Databases Biology outside the lab. Why do we need Bioinfomatics? Over the past few decades, major advances in the field of molecular biology,
Genomics and Arabidopsis. What is ‘genomics’? Study of an organism’s entire genome –All the DNA encoded in the organism –Nucleus, mitochondria, chloroplasts.
Professional Development Course 1 – Molecular Medicine Genome Biology June 12, 2012 Ansuman Chattopadhyay, PhD Head, Molecular Biology Information Services.
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
Workshop Aims NMSU GO Workshop 20 May Aims of this Workshop  WIIFM? modeling examples background information about GO modeling  Strategies for.
Introduction to the GO: a user’s guide Iowa State Workshop 11 June 2009.
Alastair Kerr, Ph.D. WTCCB Bioinformatics Core An introduction to DNA and Protein Sequence Databases.
Biological databases Exercises. Discovery of distinct sequence databases using ensembl.
Genomics.
Building WormBase database(s). SAB 2008 Wellcome Trust Sanger Insitute Cold Spring Harbor Laboratory California Institute of Technology ● RNAi ● Microarray.
Getting Started: a user’s guide to the GO TAMU GO Workshop 17 May 2010.
EB3233 Bioinformatics Introduction to Bioinformatics.
Ontologies Working Group Agenda MGED3 1.Goals for working group. 2.Primer on ontologies 3.Working group progress 4.Example sample descriptions from different.
Introduction to the Gene Ontology GO Workshop 3-6 August 2010.
Data Mining at PLEXdb : Plant and Plant Pathogen Gene Expression Database.
A collaborative tool for sequence annotation. Contact:
Introduction to the GO: a user’s guide NCSU GO Workshop 29 October 2009.
EBI is an Outstation of the European Molecular Biology Laboratory. UniProtKB Sandra Orchard.
Bioinformatics Workshops 1 & 2 1. use of public database/search sites - range of data and access methods - interpretation of search results - understanding.
An Introduction to NCBI & BLAST National Center for Biotechnology Information Richard Johnston Pasadena City College.
The Bovine Genome Database Abstract The Bovine Genome Database (BGD, facilitates the integration of bovine genomic data. BGD is.
Graduate Research with Bioinformatics Research Mentors Nancy Warter-Perez, ECE Robert Vellanoweth Chem and Biochem Fellow Sean Caonguyen 8/20/08.
` Comparison of Gene Ontology Term Annotations Between E.coli K12 Databases REDDYSAILAJA MARPURI WESTERN KENTUCKY UNIVERSITY.
Getting GO annotation for your dataset
Strategies for functional modeling
Daphnia Genome Preview at wFleaBase.org
Introduction to bioinformatics
Workshop Aims GO Workshop 3-6 August 2010.
Functional Annotation of the Horse Genome
Access to Sequence Data and Related Information
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Genome Annotation w/ MAKER
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
A User’s Guide to GO: Structural and Functional Annotation
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Presentation transcript:

Bioinformatics resources for IITA Crops GO Workshop 3-6 August 2010

IITA Crops  Genome annotation  General plant genome resources  IITA crops  What is needed?

Genomic Annotation  Genome annotation is the process of attaching biological information to genomic sequences. It consists of two main steps: 1. identifying functional elements in the genome: “structural annotation” 2. attaching biological information to these elements: “functional annotation”  biologists often use the term “annotation” when they are referring only to structural annotation

CHICK_OLF6 DNA annotation Protein annotation Data from Ensembl Genome browser TRAF 1, 2 and 3TRAF 1 and 2 Structural annotation:

catenin Functional annotation:

Bioinformatics Resources  Model Organism Databases (MODs) & Sequencing Consortia – organise and control genome annotation  NCBI – public sequence resources (collate, annotate, analysis)  UniProt – functional annotation (also displays GO)  Functional genomics tools – e.g. arrays  GO Consortium – members supply GO annotations to support functional modeling

IITA Crops  Global: Cowpea, Soybean, Bananas, Plantain, and Yams  Sub-Saharan Africa: Cassava and Maize

Cowpea  Vigna unguiculata (taxon:3917) & subspecies  JCVI – “reduced representation” sequencing (clone ends) of Blackeye 5  54,123 genome sequences  187,483 ESTs  Annotated via homology to Arabidopsis & other plants  GO annotation via homology – availability?

Cowpea  Vigna unguiculata (cowpea) has 1,510 GO annotations for 175 proteins  Vigna unguiculata subsp. unguiculata (cowpea) has 25 annotations for 5 proteins  University of Cape Town Cowpea 9K drought stress SSH library  some studies using soybean microarray

Soybean  Glycine max (taxon:3847)  genome sequencing at DOE JGI as a preliminary assembly  NCBI: 1,459,639 ESTs, 34,946 proteins, 2,882 genes  UniProt: 12,837 proteins (EBI GOA automatic GO annotation)  UniGene assemblies available  multiple microarrays available

Banana  Musa acuminata (taxon:4641) & subspecies  Global Musa Genomics Consortium (GMGC) - in progress (includes JCVI, NARO)  7,102 genome sequences  14,864 ESTs  1,399 NCBI proteins; 680 UniProt

Banana  Musa acuminata (sweet banana): 3898 GO annotations to 491 proteins  Musa acuminata AAA Group (Cavendish banana): 579 annotations to 96 proteins  SAGE libraries generated for Musa acuminata

Plantain  Musa ABB Group (taxon:214693) - cooking banana or plantain  11,070 ESTs, 112 proteins  173 GO annotations to 53 proteins  functional genomics based on banana?

Yams 55577Dioscorea rotundata white yam 55571Dioscorea alata water yam 29710Dioscorea cayenensis yellow yam  Dioscorea (taxon:4672) & subspecies  NCBI: 31 ESTs, 623 proteins  Genome sequencing for Dioscorea alata – EST development (IITA & VSU)  183 GO annotations to 25 proteins

Cassava  Manihot esculenta (taxon:3983)  ESTs: 80,631  EST projects at Centre National de la Recherche Scientifique & University of Illinois at Urbana- Champaign  Genome sequencing in progress at DOE JGI  NCBI proteins: 568, UniProt:253  2,251 GO annotations assigned to 218 proteins  2 Euphorbia esula (leafy spurge) /cassava arrays

Maize  Zea mays (taxon:4577)  Genome sequencing completed by Washington University – other subspecies being sequenced  Active GO annotation project - 131,925 GO annotations to 20,288 proteins

IITA Crops: what is needed  Do we really need a MOD for every species?  How do we co-ordinate genome updates/structural annotation?  Functional annotation to support functional genomics (arrays, RNA-Seq, proteomics)?  Nomenclature to support comparative genomics?  Functional modeling of disease/infection?  Genetic makers for improvement – do we need to understand molecular mechanisms?  How can we leverage existing resources?

AgBase Collaborative Model  How can we help you?  Can make GO annotations public via the GO Consortium  Have computational pipelines to do rapid, first pass GO annotation (including transcript/EST sequences)  Provide bioinformatics support for collaborators  Developing new tools  Training/support for modeling data