From Functional Genomics to Physiological Model: Using the Gene Ontology Fiona McCarthy, Shane Burgess, Susan Bridges The AgBase Databases, Institute of.

Slides:



Advertisements
Similar presentations
Annotation of Gene Function …and how thats useful to you.
Advertisements

Applications of GO. Goals of Gene Ontology Project.
GO : the Gene Ontology “because you know sometimes words have two meanings” Amelia Ireland GO Curator EBI, Cambridge, UK.
Modeling Functional Genomics Datasets CVM Lesson 3 13 June 2007Fiona McCarthy.
Annotating Gene Products to the GO Harold J Drabkin Senior Scientific Curator The Jackson Laboratory Mouse.
GO-based tools for functional modeling GO Workshop 3-6 August 2010.
Gene Ontology John Pinney
Gene function analysis Stem Cell Network Microarray Course, Unit 5 May 2007.
COG and GO tutorial.
CACAO Biocurator Training CACAO Fall CACAO Syllabus What is CACAO & why is it important? Training Examples.
Demonstration Trupti Joshi Computer Science Department 317 Engineering Building North (O)
Protein and Function Databases
BICH CACAO Biocurator Training Session #3.
Gene Ontology at WormBase: Making the Most of GO Annotations Kimberly Van Auken.
GO Enrichment analysis COST Functional Modeling Workshop April, Helsinki.
Methods for Creating GO Annotations Emily Dimmer European Bioinformatics Institute Wellcome Trust Genome Campus Cambridge UK.
PAT project Advanced bioinformatics tools for analyzing the Arabidopsis genome Proteins of Arabidopsis thaliana (PAT) & Gene Ontology (GO) Hongyu Zhang,
Viewing & Getting GO COST Functional Modeling Workshop April, Helsinki.
SPH 247 Statistical Analysis of Laboratory Data 1 May 12, 2015 SPH 247 Statistical Analysis of Laboratory Data.
Using The Gene Ontology: Gene Product Annotation.
GO : the Gene Ontology “because you know sometimes words have two meanings” Amelia Ireland GO Curator EBI, Cambridge, UK.
CACAO Training Fall Community Assessment of Community Annotation with Ontologies (CACAO)
Annotating Gene Products to the GO Harold J Drabkin Senior Scientific Curator The Jackson Laboratory Mouse.
SPH 247 Statistical Analysis of Laboratory Data 1May 14, 2013SPH 247 Statistical Analysis of Laboratory Data.
Managing Data Modeling GO Workshop 3-6 August 2010.
Adding GO for Large Datasets COST Functional Modeling Workshop April, Helsinki.
Adding GO GO Workshop 3-6 August GOanna results and GOanna2ga 2. gene association files 3. getting GO for your dataset 4. adding more GO (introduction)
The Gene Ontology project Jane Lomax. Ontology (for our purposes) “an explicit specification of some topic” – Stanford Knowledge Systems Lab Includes:
Gene Ontology Project
Strategies for functional modeling TAMU GO Workshop 17 May 2010.
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
Monday, November 8, 2:30:07 PM  Ontology is the philosophical study of the nature of being, existence or reality as such, as well as the basic categories.
GO-based tools for functional modeling TAMU GO Workshop 17 May 2010.
Workshop Aims NMSU GO Workshop 20 May Aims of this Workshop  WIIFM? modeling examples background information about GO modeling  Strategies for.
Manual GO annotation Evidence: Source AnnotationsProteins IEA:Total Manual: Total
Introduction to the GO: a user’s guide Iowa State Workshop 11 June 2009.
24th Feb 2006 Jane Lomax GO Further. 24th Feb 2006 Jane Lomax GO annotations Where do the links between genes and GO terms come from?
Gene Product Annotation using the GO ml Harold J Drabkin Senior Scientific Curator The Jackson Laboratory.
Alastair Kerr, Ph.D. WTCCB Bioinformatics Core An introduction to DNA and Protein Sequence Databases.
Getting Started: a user’s guide to the GO GO Workshop 3-6 August 2010.
Functional Annotation and Functional Enrichment. Annotation Structural Annotation – defining the boundaries of features of interest (coding regions, regulatory.
1 Gene function annotation. 2 Outline  Functional annotation  Controlled vocabularies  Functional annotation at TAIR  Resources and tools at TAIR.
Getting Started: a user’s guide to the GO TAMU GO Workshop 17 May 2010.
Rice Proteins Data acquisition Curation Resources Development and integration of controlled vocabulary Gene Ontology Trait Ontology Plant Ontology
CACAO Training Fall Community Assessment of Community Annotation with Ontologies (CACAO)
Introduction to the GO: a user’s guide NCSU GO Workshop 29 October 2009.
9/10/06 GO Users Meeting 2006 Seattle, Washington The AgBase GO Annotation Tools Susan Bridges 1,3, Fiona McCarthy 2,3, Nan Wang 1,3, G. Bryce Magee 1,3,
Update Susan Bridges, Fiona McCarthy, Shane Burgess NRI
CACAO Training Jim Hu and Suzi Aleksander Fall 2015.
GO based data analysis Iowa State Workshop 11 June 2009.
1 Annotation EPP 245/298 Statistical Analysis of Laboratory Data.
Getting GO: how to get GO for functional modeling Iowa State Workshop 11 June 2009.
AgBase Shane Burgess, Fiona McCarthy Mississippi State University.
An example of GO annotation from a primary paper Rebecca E. Foulger (UniProt Curator) GO Annotation Camp, June 2005 PMID:
An example of GO annotation from a primary paper GO Annotation Camp, July 2006 PMID:
Protein. Protein and Roles 1: biological process unknown 1.1 Structural categories 1.2 organism categories 1.3 cellular component o unlocalized.
Gene Ontology TM (GO) Consortium
CACAO Training Jim Hu and Suzi Aleksander Fall 2015.
Gene Annotation & Gene Ontology
Getting GO annotation for your dataset
CACAO Training ASM-JGI 2012.
Annotating with GO: an overview
Strategies for functional modeling
GO : the Gene Ontology & Functional enrichment analysis
Introduction to the Gene Ontology
Workshop Aims TAMU GO Workshop 17 May 2010.
Functional Annotation of the Horse Genome
Modified from slides from Jim Hu and Suzi Aleksander Spring 2016
Gene expression analysis
Insight into GO and GOA Angelica Tulipano , INFN Bari CNR
Presentation transcript:

From Functional Genomics to Physiological Model: Using the Gene Ontology Fiona McCarthy, Shane Burgess, Susan Bridges The AgBase Databases, Institute of Digital Biology, Mississippi State University

From Functional Genomics to Physiological Model 1. A user’s guide to the Gene Ontology (GO) 2. Finding GO for farm animal species 3. Adding GO to your dataset 4. GO based tools for biological modeling 5. Examples: using GO for biological modeling

Presentation available at AgBase Websites available as handout

1. A User’s Guide to GO

What is the Gene Ontology? Emily Dimmer, GOA EBI: “a controlled vocabulary that can be applied to all organisms even as knowledge of gene and protein roles in cells is accumulating and changing” assign functions to gene products at different levels, depending on how much is known about a gene product is used for a diverse range of species structured to be queried at different levels, eg:  find all the chicken gene products in the genome that are involved in signal transduction  zoom in on all the receptor tyrosine kinases human readable GO function has a digital tag to allow computational analysis of large datasets

GO Mapping Example NDUFAB1 (UniProt P52505) Bovine NADH dehydrogenase (ubiquinone) 1, alpha/beta subcomplex, 1, 8kDa Biological Process (BP or P) GO: fatty acid biosynthetic process TAS GO: mitochondrial electron transport, NADH to ubiquinone TAS GO: lipid biosynthetic process IEA Cellular Component (CC or C) GO: mitochondrial matrix IDA GO: mitochondrial respiratory chain complex I IDA GO: mitochondrion IEA NDUFAB1 Molecular Function (MF or F) GO: fatty acid binding IDA GO: NADH dehydrogenase (ubiquinone) activity TAS GO: oxidoreductase activity TAS GO: acyl carrier activity IEA

GO Mapping Example NDUFAB1 (UniProt P52505) Bovine NADH dehydrogenase (ubiquinone) 1, alpha/beta subcomplex, 1, 8kDa Biological Process (BP or P) GO: fatty acid biosynthetic process TAS GO: mitochondrial electron transport, NADH to ubiquinone TAS GO: lipid biosynthetic process IEA Cellular Component (CC or C) GO: mitochondrial matrix IDA GO: mitochondrial respiratory chain complex I IDA GO: mitochondrion IEA NDUFAB1 Molecular Function (MF or F) GO: fatty acid binding IDA GO: NADH dehydrogenase (ubiquinone) activity TAS GO: oxidoreductase activity TAS GO: acyl carrier activity IEA aspect or ontology GO:ID (unique) GO term name GO evidence code

GO Mapping Example NDUFAB1 (UniProt P52505) Bovine NADH dehydrogenase (ubiquinone) 1, alpha/beta subcomplex, 1, 8kDa Biological Process (BP or P) GO: fatty acid biosynthetic process TAS GO: mitochondrial electron transport, NADH to ubiquinone TAS GO: lipid biosynthetic process IEA Cellular Component (CC or C) GO: mitochondrial matrix IDA GO: mitochondrial respiratory chain complex I IDA GO: mitochondrion IEA NDUFAB1 Molecular Function (MF or F) GO: fatty acid binding IDA GO: NADH dehydrogenase (ubiquinone) activity TAS GO: oxidoreductase activity TAS GO: acyl carrier activity IEA GO EVIDENCE CODES Direct Evidence Codes IDA - inferred from direct assay IEP - inferred from expression pattern IGI - inferred from genetic interaction IMP - inferred from mutant phenotype IPI - inferred from physical interaction Indirect Evidence Codes inferred from literature IGC - inferred from genomic context TAS - traceable author statement NAS - non-traceable author statement IC - inferred by curator inferred by computational analysis RCA - inferred from reviewed computational analysis ISS - inferred from sequence or structural similarity IEA - inferred from electronic annotation Other NR - not recorded (historical) ND - no biological data available

Unknown Function vs No GO ND – no data  Biocurators have tried to add GO but there is no functional data available  Previously: “process_unknown”, “function_unknown”, “component_unknown”  Now: “biological process”, “molecular function”, “cellular component” No annotations (including no “ND”): biocurators have not annotated

2. Finding GO for Farm Animals

GO Browsers QuickGO Browser (EBI GOA Project)   Can search by GO Term or by UniProt ID  Includes IEA annotations AmiGO Browser (GO Consortium Project)   Can search by GO Term or by UniProt ID  Does not include IEA annotations

Getting GO includes farm animals

Getting GO filter filter

Getting GO

3. Adding GO to your dataset

GO analysis of array data Probe data is linked to gene product data  gene, cDNA, ESTs IDs For some arrays, gene product data has corresponding GO data  available from vendor (updated?) Not all gene products will have GO annotation  will not be included in modeling Need to get the maximum amount of GO data to do biological modeling

Example: Netaffx

Secondary source of GO annotation

GORetriever + many more

GORetriever

GORetriever Results

save as text file For GOSlimViewer

GORetriever Results

But what about IDs not supported by GORetriever?

GOanna

GOanna Results

query IDs are hyperlinked to BLAST data (files must be in the same directory)

*WHAT IS A GOOD ALIGNMENT? If there is a good alignment* to a protein with GO  transfer GO to your record If there is not a good alignment or the record doesn’t have GO  literature

good alignment add to GO summary file (tab-delimited text file containing ID, GO:ID, aspect)

Contact AgBase to request GO annotation of specific gene products.

GOSlimViewer: summarizing results

GOSlimViewer results

response to stimulus amino acid and derivative metabolic process transport behavior cell differentiation metabolic process regulation of biological process cell communication nucleobase, nucleoside, nucleotide and nucleic acid metabolic process cell death cell motility macromolecule metabolic process multicellular organismal development catabolic process biological_process

response to stimulus amino acid and derivative metabolic process transport behavior cell differentiation metabolic process regulation of biological process cell communication nucleobase, nucleoside, nucleotide and nucleic acid metabolic process cell death cell motility macromolecule metabolic process multicellular organismal development catabolic process biological_process “process unknown” “function unknown” “component unknown” ??

B-cellsStroma immune response apoptosis cell-cell signaling Looking at function, not genes Pie Graphs – relative proportions

GOModeler: quantitative, hypothesis-driven modeling. Coming soon (contact AgBase) GOModeler

McCarthy et al “AgBase: a functional genomics resource for agriculture.” BMC Genomics Sep 8;7:229.

4. GO based tools for biological modeling

However…. many of these tools do not support farm animal species the tools have different computing requirements may be difficult to determine how up-to-date the GO annotations are… Need to evaluate tools for your system.

Evaluating GO tools Some criteria for evaluating GO Tools: 1. Does it include my species of interest (or do I have to “humanize” my list)? 2. What does it require to set up (computer usage/online) 3. What was the source for the GO (primary or secondary) and when was it last updated? 4. Does it report the GO evidence codes (and is IEA included)? 5. Does it report which of my gene products has no GO? 6. Does it report both over/under represented GO groups and how does it evaluate this? 7. Does it allow me to add my own GO annotations? 8. Does it represent my results in a way that facilitates discovery?

5. Using GO for biological modeling

Using GO for biological modeling: hypothesis generating hypothesis driven