a Cytoscape plugin to assess enrichment of

Slides:



Advertisements
Similar presentations
Gene Set Enrichment Analysis Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein.
Advertisements

Asking translational research questions using ontology enrichment analysis Nigam Shah
Gene Set Enrichment Analysis Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein.
Pathways analysis Iowa State Workshop 11 June 2009.
Fission Yeast Computing Workshop -1- Exercise 5: Looking for overreprsented GO terms in a gene set using Onto-Express GO annotations can be used to obtain.
Pathways & Networks analysis COST Functional Modeling Workshop April, Helsinki.
Gene Ontology John Pinney
Data mining with the Gene Ontology Josep Lluís Mosquera April 2005 Grup de Recerca en Estadística i Bioinformàtica GOing into Biological Meaning.
Introduction to Functional Analysis J.L. Mosquera and Alex Sanchez.
Gene ontology & hypergeometric test Simon Rasmussen CBS - DTU.
BioPathways SIG, July Networks in Biology Molecular interaction and similarity networks are vital for understanding gene function.
Gene Set Analysis 09/24/07. From individual gene to gene sets Finding a list of differentially expressed genes is only the starting point. Suppose we.
Biological Interpretation of Microarray Data Helen Lockstone DTC Bioinformatics Course 9 th February 2010.
GCB/CIS 535 Microarray Topics John Tobias November 15 th, 2004.
Analysis of GO annotation at cluster level by H. Bjørn Nielsen Slides from Agnieszka S. Juncker.
Gene Ontology at WormBase: Making the Most of GO Annotations Kimberly Van Auken.
Gene Ontology and Functional Enrichment Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein.
341: Introduction to Bioinformatics Dr. Natasa Przulj Deaprtment of Computing Imperial College London
>>> Korean BioInformation Center >>> KRIBB Korea Research institute of Bioscience and Biotechnology GS2PATH: Linking Gene Ontology and Pathways Jin Ok.
1Module 2: Analyzing Gene Lists Canadian Bioinformatics Workshops
MN-B-C 2 Analysis of High Dimensional (-omics) Data Kay Hofmann – Protein Evolution Group Week 5: Proteomics.
Automatic methods for functional annotation of sequences Petri Törönen.
Cytoscape A powerful bioinformatic tool Mathieu Michaud
1Module #: Title of Module. Network visualization and analysis with Cytoscape Gary Bader July 15, 2013 – Network Analysis, UCLA.
GO::TermFinder Gavin Sherlock Department of Genetics Stanford University
Gene Set Enrichment Analysis (GSEA)
A systems biology approach to the identification and analysis of transcriptional regulatory networks in osteocytes Angela K. Dean, Stephen E. Harris, Jianhua.
Extracting Biological Information from Gene Lists
EGAN: Exploratory Gene Association Networks by Jesse Paquette Biostatistics and Computational Biology Core Helen Diller Family Comprehensive Cancer Center.
Fission Yeast Computing Workshop -1- Searching, querying, browsing downloading and analysing data using PomBase Basic PomBase Features Gene Page Overview.
United Nations Economic Commission for Europe Statistical Division The Importance of Databases in the Dissemination Process Steven Vale, UNECE.
Tutorial session 3 Network analysis Exploring PPI networks using Cytoscape EMBO Practical Course Session 8 Nadezhda Doncheva and Piet Molenaar.
Statistical Testing with Genes Saurabh Sinha CS 466.
Nuria Lopez-Bigas Methods and tools in functional genomics (microarrays) BCO17.
Gene set analyses of genomic datasets Andreas Schlicker Jelle ten Hoeve Lodewyk Wessels.
Software Engineering1  Verification: The software should conform to its specification  Validation: The software should do what the user really requires.
GeWorkbench Overview Support Team Molecular Analysis Tools Knowledge Center Columbia University and The Broad Institute of MIT and Harvard.
GO enrichment and GOrilla
Copyright OpenHelix. No use or reproduction without express written consent1.
CuffDiff ran successfully. Output files include gene_exp.diff What are the next steps? Use Navigation bar to find files; they may be under DNA Subway if.
CSC, Dec.15-16,2005. Cytoscape Team Trey Ideker Mark Anderson Nerius Landys Ryan Kelley Chris Workman Past contributors: Nada Amin Owen Ozier Jonathan.
Tutorial 8 Gene expression analysis 1. How to interpret an expression matrix Expression data DBs - GEO Clustering –Hierarchical clustering –K-means clustering.
Network construction and exploration using CORNET and Cytoscape - Excercises SPICY WORKSHOP Wageningen, March 8 th 2012 Stefanie De Bodt.
Canadian Bioinformatics Workshops
Goal: Help users learn to use Cytoscape to answer biologically relevant questions by providing information, tutorials and linking new.
Canadian Bioinformatics Workshops
Canadian Bioinformatics Workshops
Canadian Bioinformatics Workshops
Canadian Bioinformatics Workshops
1 A Discussion of False Discovery Rate and the Identification of Differentially Expressed Gene Categories in Microarray Studies Ames, Iowa August 8, 2007.
Clench 2.0 A program for cluster enrichment analysis and integrated visualization of expression, annotation and transcription factor binding site data.
Canadian Bioinformatics Workshops
Emily Pachunka ● Spring 2017
Module 2: Analyzing gene lists: over-representation analysis
Networks and Interactions
Exploring and Presenting Results
Clustering Manpreet S. Katari.
GO : the Gene Ontology & Functional enrichment analysis
Canadian Bioinformatics Workshops
Statistical Testing with Genes
Canadian Bioinformatics Workshops
Pathway Visualization
Genesets and Enrichment
Analysis of GO annotation at cluster level by Agnieszka S. Juncker
Overview Gene Ontology Introduction Biological network data
What is an Ontology An ontology is a set of terms, relationships and definitions that capture the knowledge of a certain domain. (common ontology ≠ common.
Nicole Anguiano and Anu Varshneya
Boosting Signal-to-Noise in Complex Biology: Prior Knowledge Is Power
Varying Intolerance of Gene Pathways to Mutational Classes Explain Genetic Convergence across Neuropsychiatric Disorders  Shahar Shohat, Eyal Ben-David,
Statistical Testing with Genes
Presentation transcript:

a Cytoscape plugin to assess enrichment of GO categories in biological networks Steven Maere 5th Annual Cytoscape Retreat Amsterdam, November 2007

Gene Ontology (GO) GO provides three structured vocabularies to describe gene function: - biological process - molecular function - cellular component Biological Process Cell cycle M phase crosslinked, hierarchical tree structure level of detail increases with level in hierarchy multiple annotations per gene www.geneontology.org

BiNGO ? Biological Network Gene Ontology tool : assess whether a given set of genes is significantly enriched in genes involved in particular functions/processes/pathways. > 40 tools with similar functionality listed on GO website Why BiNGO ? Its integration in Cytoscape provides several advantages : BiNGO can be used interactively on molecular interaction graphs visualized in Cytoscape. Integrated use of tools : BiNGO can be used in combination with other Cytoscape plugins (e.g. MCODE). intuitive and versatile visualization of results flexibility

BiNGO 2.0 Features assesses overrepresentation or underrepresentation of GO categories in gene sets Cytoscape or gene list input batch mode : analyze several clusters simultaneously using same settings GO and GOSlim ontologies, automated remapping of annotations Wide range of organisms and gene identifiers Evidence code filtering Hypergeometric or binomial test for overrepresentation Multiple testing correction using Bonferroni (FWER) or Benjamini&Hochberg (FDR) correction Cytoscape visualization of results mapped on the GO hierarchy.  extensive results in tab-delimited text file format making and using custom annotation/ontology files is easy open source

GO overrepresentation x out of X genes in group A belong to GO category B which is shared by n out of N genes in the whole genome genome : N Group A : X GO Cat B : n n x we can use the hypergeometric distribution (a.k.a. the 2x2 exact Fisher test) to determine whether category B is significantly overrepresented in cluster A

Multiple testing correction When testing multiple GO categories, a multiple testing correction has to be applied in order to control the false positive rate Several strategies for adjusting p-values : - controlling FWER = prob. of at least 1 type I error e.g. Bonferroni correction very conservative - controlling the FDR = False Discovery Rate controls % of false positives e.g. Benjamini & Hochberg correction valid for independent or + correlated tests less conservative Benjamini, Y. & Hochberg, Y., J. R. Statist. Soc. B 57(1), 289-300 (1995)

BiNGO DEMO

BiNGO Maere S, Heymans K and Kuiper M (2005) BiNGO: a Cytoscape plugin to assess over-representation of Gene Ontology categories in biological networks. Bioinformatics 21, 3448-9. Availability: GO Tools at www.geneontology.org Plugins at www.cytoscape.org Plugin Manager in Cytoscape www.psb.ugent.be/cbd/papers/BiNGO or Google ‘bingo gene ontology’

Thanks ! Karel Heymans, Martin Kuiper Ghent University Andrew Markiel, Iliana Avila Institute for Systems Biology Rowan Christmas Ruth Isserlin, Gary Bader University of Toronto Mike Smoot, Trey Ideker UC San Diego Benno Schwikowski Institut Pasteur