Gene function analysis Stem Cell Network Microarray Course, Unit 5 May 2007.

Slides:



Advertisements
Similar presentations
A Comparative mapping resource ONTOLOGY DEVELOPMENT AND INTEGRATION IN GRAMENE Pankaj Jaiswal Cornell University.
Advertisements

Annotation of Gene Function …and how thats useful to you.
Applications of GO. Goals of Gene Ontology Project.
Annotating Gene Products to the GO Harold J Drabkin Senior Scientific Curator The Jackson Laboratory Mouse.
Gene Ontology John Pinney
POC tutorial#3: Annotation This tutorial will run automatically in Quicktime. To run the tutorial at your own pace use the internal controllers within.
Introduction to Functional Analysis J.L. Mosquera and Alex Sanchez.
CACAO - Remote training Gene Function and Gene Ontology Fall 2011
Community Annotation of Gene Function with GONUTS Jim Hu EcoliHub/EcoliWiki Dept. of Biochemistry and Biophysics Texas A&M University.
Gene ontology & hypergeometric test Simon Rasmussen CBS - DTU.
1 Using Gene Ontology. 2 Assigning (or Hypothesizing About) Biological Meaning to Clusters What do you want to be able to to? –Identify over-represented.
COG and GO tutorial.
Gene Set Analysis 09/24/07. From individual gene to gene sets Finding a list of differentially expressed genes is only the starting point. Suppose we.
CACAO - Remote training Gene Function and Gene Ontology Fall 2011
Comprehensive Annotation System for Infectious Disease Data Alexander Diehl University at Buffalo/The Jackson Laboratory IDO Workshop /9/2010.
Protein and Function Databases
BICH CACAO Biocurator Training Session #3.
341: Introduction to Bioinformatics Dr. Natasa Przulj Deaprtment of Computing Imperial College London
PAT project Advanced bioinformatics tools for analyzing the Arabidopsis genome Proteins of Arabidopsis thaliana (PAT) & Gene Ontology (GO) Hongyu Zhang,
SPH 247 Statistical Analysis of Laboratory Data 1 May 12, 2015 SPH 247 Statistical Analysis of Laboratory Data.
Using The Gene Ontology: Gene Product Annotation.
1 Identifying differentially expressed sets of genes in microarray experiments Lecture 23, Statistics 246, April 15, 2004.
CACAO Training Fall Community Assessment of Community Annotation with Ontologies (CACAO)
Annotating Gene Products to the GO Harold J Drabkin Senior Scientific Curator The Jackson Laboratory Mouse.
Introduction to GO Annotation Eurie Hong (SGD), Michelle Gwinn (TIGR), Tanya Berardini (TAIR), Karen Pilcher (DictyBase), Russell Collins (FlyBase), Carol.
Gene Ontology Overview and Perspective Lung Development Ontology Workshop.
SPH 247 Statistical Analysis of Laboratory Data 1May 14, 2013SPH 247 Statistical Analysis of Laboratory Data.
Inferring Function From Known Genes Naomi Altman Nov. 06.
COURSE OF BIOINFORMATICS Exam_31/01/2014 A.
The Gene Ontology project Jane Lomax. Ontology (for our purposes) “an explicit specification of some topic” – Stanford Knowledge Systems Lab Includes:
Gene expression analysis
Monday, November 8, 2:30:07 PM  Ontology is the philosophical study of the nature of being, existence or reality as such, as well as the basic categories.
From Functional Genomics to Physiological Model: Using the Gene Ontology Fiona McCarthy, Shane Burgess, Susan Bridges The AgBase Databases, Institute of.
Bioinformatics lectures at Rice University Li Zhang Lecture 9: Networks and integrative genomic analysis
Manual GO annotation Evidence: Source AnnotationsProteins IEA:Total Manual: Total
Introduction to the GO: a user’s guide Iowa State Workshop 11 June 2009.
SRI International Bioinformatics 1 Submitting pathway to MetaCyc Ron Caspi.
24th Feb 2006 Jane Lomax GO Further. 24th Feb 2006 Jane Lomax GO annotations Where do the links between genes and GO terms come from?
Gene Product Annotation using the GO ml Harold J Drabkin Senior Scientific Curator The Jackson Laboratory.
DAVID R. SMITH DR. MARY DOLAN DR. JUDITH BLAKE Integrating the Cell Cycle Ontology with the Mouse Genome Database.
Alastair Kerr, Ph.D. WTCCB Bioinformatics Core An introduction to DNA and Protein Sequence Databases.
Tutorial 7 Gene expression analysis 1. Expression data –GEO –UCSC –ArrayExpress General clustering methods –Unsupervised Clustering Hierarchical clustering.
Getting Started: a user’s guide to the GO GO Workshop 3-6 August 2010.
Functional Annotation and Functional Enrichment. Annotation Structural Annotation – defining the boundaries of features of interest (coding regions, regulatory.
1 Gene function annotation. 2 Outline  Functional annotation  Controlled vocabularies  Functional annotation at TAIR  Resources and tools at TAIR.
Getting Started: a user’s guide to the GO TAMU GO Workshop 17 May 2010.
A Common Language for Annotation of Genes from Yeast, Flies and Mice The Gene Ontologies …and Plants and Worms …and Humans …and anything else!
Rice Proteins Data acquisition Curation Resources Development and integration of controlled vocabulary Gene Ontology Trait Ontology Plant Ontology
CACAO Training Fall Community Assessment of Community Annotation with Ontologies (CACAO)
Statistical Testing with Genes Saurabh Sinha CS 466.
Gene set analyses of genomic datasets Andreas Schlicker Jelle ten Hoeve Lodewyk Wessels.
Introduction to the GO: a user’s guide NCSU GO Workshop 29 October 2009.
Agenda 1.Biological databases related to microarray 1.Gene Ontology 2.KEGG 3.Biocarta 4.Reactome 5.MSigDB 2.Pathway enrichment analysis 1.GSEA 2.GSA 3.Ingenuity.
Update Susan Bridges, Fiona McCarthy, Shane Burgess NRI
1 Annotation EPP 245/298 Statistical Analysis of Laboratory Data.
Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013.
The Future of Genetics Research Lesson 7. Human Genome Project 13 year project to sequence human genome and other species (fruit fly, mice yeast, nematodes,
Tutorial 8 Gene expression analysis 1. How to interpret an expression matrix Expression data DBs - GEO Clustering –Hierarchical clustering –K-means clustering.
An example of GO annotation from a primary paper GO Annotation Camp, July 2006 PMID:
Module 1: Gene Lists 1 Canadian Bioinformatics Workshops
Canadian Bioinformatics Workshops
Extracting Biological Information from Gene Lists
Gene Annotation & Gene Ontology
Annotating with GO: an overview
Introduction to the Gene Ontology
Department of Genetics • Stanford University School of Medicine
What is an Ontology An ontology is a set of terms, relationships and definitions that capture the knowledge of a certain domain. (common ontology ≠ common.
Gene expression analysis
Annotating Gene Products to the GO
Insight into GO and GOA Angelica Tulipano , INFN Bari CNR
Presentation transcript:

Gene function analysis Stem Cell Network Microarray Course, Unit 5 May 2007

Sections Introduction to Gene Ontology GOstat Example

Gene Ontology Michael Ashburner Annotate genes or proteins Started for Drosophila melanogaster (fly). Now expanded for all taxa

Gene Ontology Biological process A phenomenon marked by changes that lead to a particular result, mediated by one or more gene products. Molecular function Elemental activities, such as catalysis or binding, describing the actions of a gene product at the molecular level. A given gene product may exhibit one or more molecular functions. Cellular component The part of a cell of which a gene product is a component; for purpose of GO includes the extracellular environment of cells; a gene product may be a component of one or more parts of a cell; this term includes gene products that are parts of macromolecular complexes, by the definition that all members of a complex normally copurify under all except extreme conditions.

Gene Ontology Biological process

Gene Ontology

Gene Ontology

Gene Ontology Evidence codes IC: Inferred by Curator IDA: Inferred from Direct Assay IEA: Inferred from Electronic Annotation IEP: Inferred from Expression Pattern (2006) IGC: Inferred from Genomic Context (2007) IGI: Inferred from Genetic Interaction IMP: Inferred from Mutant Phenotype IPI: Inferred from Physical Interaction ISS: Inferred from Sequence or Structural Similarity NAS: Non-traceable Author Statement (2006) ND: No biological Data available RCA: inferred from Reviewed Computational Analysis TAS: Traceable Author Statement NR: Not Recorded (2006)

Gene Ontology Stats. May 29 th biological_process: 13,553 terms (10,894 in 2006; 9,277 in 2005) cellular_component: 1,966 terms (1,815; 1,512) molecular_function: 7,609 terms (7,927; 6,957), Total: 23,128 terms (20,636; 17,746)

Gene Ontology Stats. May 29 th Mouse Genome Informatics (The Jackson Laboratory biological_process: 14,200 genes, 42,675 annotations (3.0 kw/gene) [13,329 genes, 33,783 annotations (2.5 kw/gene) in 2006] cellular_component: 14,713 genes, 31,330 annotations (2.1 kw/gene) [13,547 genes, 26,515 annotations (2.0 kw/gene)] molecular_function: 15,553 genes, 50,343 annotations (3.2 kw/gene) [14,056 genes, 40,806 annotations (2.9 kw/gene)] 8.3 terms per gene [ 7.5 in 2006]

Databases using Gene Ontology NetAffx (Affymetrix probe annotations) Flybase (sequences) was the first SGD (yeast) MGI (mouse) InterPro (Protein sequences) ProDom (Protein domains) Entrez Gene (gene information)

GOstat Find statistically overrepresented properties within a group of genes as selected by......typically, analysis of a DNA microarray experiment Beissbarth & Speed (2004) Bioinformatics, 20:

gene A gene B gene C gene D gene E XXXX Total set of genes 2,000 of 5,000 are X Not significant YYYY Total set of genes 4 of 5000 are Y Very significant Do it for all Gene Ontology terms Take into account the structure of the ontology Sort by p-values GOstat

Contigency Table genes with GO in group total genes in group selected genes (e.g. differentially expressed) reference group (e.g. all genes on array) p-value 8e-52 Chi-square Test (Fisher's Exact Test for small values) Probability of obtaining those values from a random distribution.

Web tool

Output

Example We will study the function of a set of genes selected via StemBase (see corresponding Unit for more info on using StemBase)

1. Select a set of genes Objective: Genes correlated to Lgals3bp (lectin, galactoside-binding, soluble, 3 binding protein) A galectin, a beta-galactoside-binding protein implicated in modulating cell- cell and cell-matrix interactions

1. Select a set of genes

2. Run in GOstat

Calcium ion binding mannosyl-oligosaccharide mannosidase activity

2. Run in GOstat

2. Run in GOstat Calcium ion binding mannosyl-oligosaccharide mannosidase activity

2. Run in GOstat

3. Examine expression MAN2A _at MAN1A _at Lgals3bp _at

3. Examine expression

To know more Gene Ontology. GOstat Beissbarth & Speed (2004) Bioinformatics, 20: StemBase. See corresponding Unit in this course.