1 Using Gene Ontology. 2 Assigning (or Hypothesizing About) Biological Meaning to Clusters What do you want to be able to to? –Identify over-represented.

Slides:



Advertisements
Similar presentations
Annotation of Gene Function …and how thats useful to you.
Advertisements

Gene Set Enrichment Analysis Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein.
24th Feb 2006 Jane Lomax Gene Ontology tutorial Talk:Using the Gene Ontology (GO) for Expression Analysis Practical:Onto-Express analysis tool Talk: GO.
GoMiner: (Zeeberg et al., Genome Biology, March 2003) For Tour of GoMiner: Advance using forward arrow.
Www. GeneOntology.org Gene Ontology Collaboration.
Asking translational research questions using ontology enrichment analysis Nigam Shah
Gene Set Enrichment Analysis Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein.
Integrating Cross-Platform Microarray Data by Second-order Analysis: Functional Annotation and Network Reconstruction Ming-Chih Kao, PhD University of.
Gene Ontology John Pinney
Open Day 2006 From Expression, Through Annotation, to Function Ohad Manor & Tali Goren.
Gene function analysis Stem Cell Network Microarray Course, Unit 5 May 2007.
Introduction to Functional Analysis J.L. Mosquera and Alex Sanchez.
CACAO - Remote training Gene Function and Gene Ontology Fall 2011
Using Gene Ontology Models and Tests Mark Reimers, NCI.
Gene ontology & hypergeometric test Simon Rasmussen CBS - DTU.
COG and GO tutorial.
Biological Interpretation of Microarray Data Helen Lockstone DTC Bioinformatics Course 9 th February 2010.
Sequence-Structure-Function Sequence Structure Function Threading Ab initio BLAST Folding: impossible but for the smallest structures Function prediction.
Cluster analysis  Function  Places genes with similar expression patterns in groups.  Sometimes genes of unknown function will be grouped with genes.
Internet tools for genomic analysis: part 2
POC tutorial #1: Introduction This tutorial will run automatically in Quicktime. To run the tutorial at your own pace use the internal controllers within.
Gene Ontology and Functional Enrichment Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein.
 2 Outline  Review of major computational approaches to facilitate biological interpretation of  high-throughput microarray  and RNA-Seq experiments.
MN-B-C 2 Analysis of High Dimensional (-omics) Data Kay Hofmann – Protein Evolution Group Week 5: Proteomics.
Automatic methods for functional annotation of sequences Petri Törönen.
Using The Gene Ontology: Gene Product Annotation.
Getting the story – biological model based on microarray data Once the differentially expressed genes are identified (sometimes hundreds of them), we need.
A systems biology approach to the identification and analysis of transcriptional regulatory networks in osteocytes Angela K. Dean, Stephen E. Harris, Jianhua.
GO and OBO: an introduction. Jane Lomax EMBL-EBI What is the Gene Ontology? What is OBO? OBO-Edit demo & practical What is the Gene Ontology? What is.
The aims of the Gene Ontology project are threefold: - to compile vocabularies to describe components, functions and processes - to produce tools to query.
Networks and Interactions Boo Virk v1.0.
March 24, Integrating genomic knowledge sources through an anatomy ontology Gennari JH, Silberfein A, and Wiley JC Pac Symp Biocomputing 2005:
Gene Ontology Consortium
Inferring Function From Known Genes Naomi Altman Nov. 06.
COURSE OF BIOINFORMATICS Exam_31/01/2014 A.
The Gene Ontology: a real-life ontology, progress and future. Jane Lomax EMBL-EBI.
The Gene Ontology project Jane Lomax. Ontology (for our purposes) “an explicit specification of some topic” – Stanford Knowledge Systems Lab Includes:
Gene Ontology TM (GO) Consortium Jennifer I Clark EMBL Outstation - European Bioinformatics Institute (EBI), Hinxton, Cambridge CB10 1SD, UK Objectives:
Integrating the Cell Cycle Ontology with the Mouse Genome Database David R. Smith Mary Dolan Dr. Judith Blake.
DAVID R. SMITH DR. MARY DOLAN DR. JUDITH BLAKE Integrating the Cell Cycle Ontology with the Mouse Genome Database.
The Gene Ontology and its insertion into UMLS Jane Lomax.
Protein and RNA Families
1 Gene function annotation. 2 Outline  Functional annotation  Controlled vocabularies  Functional annotation at TAIR  Resources and tools at TAIR.
Gene Ontology Consortium
Statistical Testing with Genes Saurabh Sinha CS 466.
Bioinformatics and Computational Biology
Scope of the Gene Ontology Vocabularies. Compile structured vocabularies describing aspects of molecular biology Describe gene products using vocabulary.
Computer Science Ph. D. Seminar Gene Ontology (GO) Based Search for Protein Structure Similarity Clustering Metrics Ph.D. Candidate Steve Johnson Committee.
MAPPING OF SEQUENCES TO GENE ONTOLOGY. GO consortium.
Tools in Bioinformatics Ontologies and pathways. Why are ontologies needed? A free text is the best way to describe what a protein does to a human reader.
Gene Ontology TM (GO) Consortium
2/3/2005 Gene Ontology (GO) The Gene Ontology (GO) project is a collaborative effort to address the need for consistent descriptions.
COURSE OF BIOINFORMATICS Exam_30/01/2014 A.
Joined up ontologies: incorporating the Gene Ontology into the UMLS.
 What is MSA (Multiple Sequence Alignment)? What is it good for? How do I use it?  Software and algorithms The programs How they work? Which to use?
Gene Annotation & Gene Ontology May 24, Gene lists from RNAseq analysis What do you do with a list of 100s of genes that contain only the following.
` Comparison of Gene Ontology Term Annotations Between E.coli K12 Databases REDDYSAILAJA MARPURI WESTERN KENTUCKY UNIVERSITY.
Sequence-Structure-Function Sequence Structure Function Threading Ab initio BLAST Folding: impossible but for the smallest structures Function prediction.
Clench 2.0 A program for cluster enrichment analysis and integrated visualization of expression, annotation and transcription factor binding site data.
Pathway Informatics 16th August, 2017
Gene Annotation & Gene Ontology
Annotating with GO: an overview
Additional file 6. Gene Ontology (GO) term “enrichment status” for the pollen stage down regulated genes in MPGs and GPGs. A, term enrichment levels along.
GO : the Gene Ontology & Functional enrichment analysis
Statistical Testing with Genes
Fig. S12. Hierarchical tree graph of overrepresented GO terms in biological process category in 18O w.r.t. WT under 100mM NaCl at 24 h stress comparative.
Analysis of GO annotation at cluster level by Agnieszka S. Juncker
Overview Gene Ontology Introduction Biological network data
What is an Ontology An ontology is a set of terms, relationships and definitions that capture the knowledge of a certain domain. (common ontology ≠ common.
Statistical Testing with Genes
Presentation transcript:

1 Using Gene Ontology

2 Assigning (or Hypothesizing About) Biological Meaning to Clusters What do you want to be able to to? –Identify over-represented functional categories in the clusters (i.e., cluster contains may more genes known to be involved in a specific biological process than would be expected by chance) Why do you want to be able to do this? –If you find a gene of unknown function in a cluster of genes in which a known function is overrepresented, your gene of unknown function may have the same or a related function! Requirements for systematic analysis: –Standard assignment of genes into functional categories Gene Ontology or GO project at NCBI –Controlled vocabulary for describing biological processes (protein biosynthesis\translation, apoptosis\programmed cell death)

3 Gene Ontology (GO) project Purpose: 1) Establish a unified framework for organism-independent gene annotation 2) Define controlled terms (ontologies) for description of gene products from 3 aspects: Biological process (DNA repair, mitosis) Molecular function (protein serine/threonine kinase activity, transcription factor activity) Cellular component (nucleus, ribosome) Characteristics: 1) A gene can have multiple associations in each ontology 2) GO terms are organized in hierarchical structures called directed acyclic graphs (DAGs) - The most general classifications are at top levels of the graph - More specialized classifications at lower levels

4 Hierarchical classification scheme for proteins that function in M-phase of mitosis Any one gene can be a member of more than one GO classification

5 Example: Cluster 3, 95 genes

6 Identifying enriched GO categories in clusters In the previous example: –Total number of chip’s genes with annotation = 5000 Remember, some genes may be of unknown function and not be annotated. –Total number of chip’s genes associated with metabolism GO category = 3,600 (72%) –Number of annotated genes in cluster 3 = 73 –Number of metabolic genes in cluster 3 = 50 (68%) Is it reasonable to assume that genes in Cluster 3 are enriched for metabolic function? Statistical tests are essential to determine whether enrichment of a certain class of proteins is significant

7 Online Databases that annotate genes by GO Human –Entrez –GOA Mouse – Mouse Genome Informatics (MGI) – Rat – Rat Genome Database – Fly – FlyBase – Arabidopsis – TAIR – Yeast – Sacchromaces Genome Database – Affymetrix chips – Netaffx –