Modeling Functional Genomics Datasets CVM8890-101 Lesson 3 13 June 2007Fiona McCarthy.

Slides:



Advertisements
Similar presentations
SRI International Bioinformatics Comparative Analysis Q
Advertisements

Integration of Protein Family, Function, Structure Rich Links to >90 Databases Value-Added Reports for UniProtKB Proteins iProClass Protein Knowledgebase.
The IntAct Database Sandra Orchard & Birgit Meldal.
European Bioinformatics Institute The Gene Ontology Annotation (GOA) Database and enhancement of GO annotations through InterPro2GO Nicky Mulder
Bioinformatics at WSU Matt Settles Bioinformatics Core Washington State University Wednesday, April 23, 2008 WSU Linux User Group (LUG)‏
Bioinformatics for biomedicine Summary and conclusions. Further analysis of a favorite gene Lecture 8, Per Kraulis
Kate Milova MolGen retreat March 24, Microarray experiments: Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Many genes have unknown function 30% have unknown function only 9% are experimentally verified The Arabidopsis Genome Initiative, Nature 2000 of the 25,498.
Bioinformatics. Analysis of proteomic data. Dr Richard J Edwards 28 August 2009; CALMARO workshop. ©Gary Larson (In not much detail)
We are developing a web database for plant comparative genomics, named Phytome, that, when complete, will integrate organismal phylogenies, genetic maps.
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
UniProt - The Universal Protein Resource
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
GO Enrichment analysis COST Functional Modeling Workshop April, Helsinki.
Genome database & information system for Daphnia Don Gilbert, October 2002 Talk doc at
1 SRI International Bioinformatics Advanced PGDB Editing: Regulation GO Terms Ingrid M. Keseler Bioinformatics Research Group SRI International
Viewing & Getting GO COST Functional Modeling Workshop April, Helsinki.
Copyright OpenHelix. No use or reproduction without express written consent1.
AgBase: bioinformatics enabling knowledge generation from agricultural omics data Fiona McCarthy.
Copyright OpenHelix. No use or reproduction without express written consent1.
Copyright OpenHelix. No use or reproduction without express written consent1.
Intralab Workshop - Reactome CMAP Chang-Feng Quo June 29 th, 2006.
Managing Data Modeling GO Workshop 3-6 August 2010.
Adding GO for Large Datasets COST Functional Modeling Workshop April, Helsinki.
MMAP: mouse Metabolomics Analysis Platform Preeti Bais 09/09/2014.
Strategies for functional modeling TAMU GO Workshop 17 May 2010.
1 SRI International Bioinformatics GO Term Integration and Curation in Pathway Tools and EcoCyc Ingrid M. Keseler Bioinformatics Research Group SRI International.
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
GO-based tools for functional modeling TAMU GO Workshop 17 May 2010.
From Functional Genomics to Physiological Model: Using the Gene Ontology Fiona McCarthy, Shane Burgess, Susan Bridges The AgBase Databases, Institute of.
Workshop Aims NMSU GO Workshop 20 May Aims of this Workshop  WIIFM? modeling examples background information about GO modeling  Strategies for.
Gramene Objectives Provide researchers working on grasses and plants in general with a bird’s eye view of the grass genomes and their organization. Work.
Introduction to the GO: a user’s guide Iowa State Workshop 11 June 2009.
PIRSF Classification System PIRSF: Evolutionary relationships of proteins from super- to sub-families Homeomorphic Family: Homologous proteins sharing.
Getting Started: a user’s guide to the GO GO Workshop 3-6 August 2010.
Increasing GO Annotation Through Community Involvement Fiona McCarthy*, Nan Wang*, Susan Bridges** and Shane Burgess** GO.
Getting Started: a user’s guide to the GO TAMU GO Workshop 17 May 2010.
Generic Database. What should a genome database do? Search Browse Collect Download results Multiple format Genome Browser Information Genomic Proteomic.
Introduction to the Gene Ontology GO Workshop 3-6 August 2010.
ID Mapping to accessions from different databases. COST Functional Modeling Workshop April, Helsinki.
Introduction to the GO: a user’s guide NCSU GO Workshop 29 October 2009.
9/10/06 GO Users Meeting 2006 Seattle, Washington The AgBase GO Annotation Tools Susan Bridges 1,3, Fiona McCarthy 2,3, Nan Wang 1,3, G. Bryce Magee 1,3,
Update Susan Bridges, Fiona McCarthy, Shane Burgess NRI
EBI is an Outstation of the European Molecular Biology Laboratory. UniProtKB Sandra Orchard.
GO based data analysis Iowa State Workshop 11 June 2009.
Copyright OpenHelix. No use or reproduction without express written consent1.
Getting GO: how to get GO for functional modeling Iowa State Workshop 11 June 2009.
AgBase Shane Burgess, Fiona McCarthy Mississippi State University.
Module 5: Future 1 Canadian Bioinformatics Workshops
Prioritization of Avian GO Annotation , , Chicken ,06949,5163.4Rat ,69664, Mouse ,83036, Human.
Biocomputational Languages December 1, 2011 Greg Antell & Khoa Nguyen.
National Cancer Institute Uma Mudunuri ABCC, NCI-Frederick ISRCE Monthly Meeting, Nov 9th 2010 bioDBnet The biological DataBase network.
Getting GO annotation for your dataset
Strategies for functional modeling
Introduction to the Gene Ontology
Workshop Aims TAMU GO Workshop 17 May 2010.
Department of Genetics • Stanford University School of Medicine
Workshop Aims GO Workshop 3-6 August 2010.
Functional Annotation of the Horse Genome
GO Annotation from different sources
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Strategies for annotation of a genome
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Ensembl Genome Repository.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Advanced PGDB Editing: Gene Ontology (GO) Terms
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Presentation transcript:

Modeling Functional Genomics Datasets CVM Lesson 3 13 June 2007Fiona McCarthy

Lesson 3: Tools for functional annotation. Accessing functional data; computational strategies to obtain more complete functional annotation; the AgBase GO annotation pipeline.

Lesson 3 Outline 1.Review: Functional Annotation 2.Tools for functional annotation –Accessing functional data –Computational strategies to obtain more functional data 3.Example: The AgBase GO annotation pipeline 4.Other GO annotation tools

Review: Functional Annotation biologists refer to both the annotation of the genome and functional annotation of gene products: “structural” AND “functional” annotation Functional annotation is required to make biological sense of high throughput datasets eg. genomics, arrays, proteomics COGs, KOGs, GO

Tools for Functional Annotation Need to be able to access functional annotation for your dataset –Breadth and depth –Date updated –No annotation vs function unknown Need to be able to add more annotation Need to be able to use the annotations to model your data –Depth or detail –Compatibility with other programs (eg pathway analysis) –Comparative data?

Tools for Functional Annotation Clusters of Orthologous Groups (COGs) euKaryotic Orthologous Groups (KOGs) UniProt Knowledgebase (UniProtKB) Bioinformatic Harvester FANTOM Puma Gene Ontology (GO)

COGs & KOGs Accessible at ftp download Available for many prokaryotes and 7 eukaryotes Add more annotation using the KOGinator? Modeling: –Has breadth but not always depth –Good for prokaryote comparative analysis?

COGs & KOGs

Automated tools for large numbers of comparisons??

UniProtKB Accessible at ftp download & sophisticated search & download capabilities Available for > 132,000 species Annotation across both literature (for selected species) and biological databases Modeling: –Has breadth but not always depth; many proteins not represented in UniProtKB –Those that are represented have a detailed summary of function from a range of sources –Rapid help and feedback from the database help

UniProtKB

UniProtKB

UniProtKB

Bioinformatic Harvester Accessible at no download Available for 6 model species Integrates data from multiple sources Modeling: –Has breadth and depth; not useful for large datasets –Updates?

Bioinformatic Harvester

FANTOM Mouse only

PUMA

Gene Ontology Accessible at updated downloads for 34 species + downloads for UniProtKB species (>130,000) UniProtKB species annotation: some depth, less breadth GO data mapped from other databases Modeling: –Many tools available for modeling using the GO –Can use computational or manual curation to add annotations

Gene Ontology

Accessing GO Data

EBI-GOA Project

The AgBase GO Annotation Pipeline Accessible at Access available annotations for agriculturally important species Provide your own GO annotations Model GO for your dataset

Coming soon; GOModeler quantitative hypothesis driven modeling using GO

Other GO Annotation Tools

Evaluate: Can I run it from my computer? Does it include my species of interest? When was it last updated? Does it display evidence codes? Does it display IEA annotations? What are the inputs it accepts? Does it do batch searches? Other GO Annotation Tools

Using GO to Analyze Array Data

Evaluate: Does it include my species of interest? When were the annotations last updated? Can I add my own annotations? Does it tell me how many of my genes are used for the analysis? Does it account for “not” annotations? Does it display IEA annotations? What are the input IDS it accepts? Does it analyze both over & under-represented terms? What statistics does it use for the analysis? Does it do a graphical representation? ANY tool will only be as good as the annotations.