Divining Systems Biology Knowledge from High-throughput Experiments Using EGAN Jesse Paquette ISMB 2010 Biostatistics and Computational Biology Core Helen.

Slides:



Advertisements
Similar presentations
EGAN Tutorial: Loading Network Data October, 2009 Jesse Paquette UCSF Helen Diller Family Comprehensive Cancer Center
Advertisements

BiNoM, a Cytoscape plugin for accessing and analyzing pathways using standard systems biology formats Eric Bonnet Computational Systems Biology of Cancer.
Statistical methods and tools for integrative analysis of perturbation signatures Mario Medvedovic Laboratory for Statistical Genomics and Systems Biology.
EGAN tutorial: Loading experiment results October, 2009 Jesse Paquette UCSF Helen Diller Family Comprehensive Cancer Center
Gene Set Enrichment Analysis Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein.
Oncomine Database Lauren Smalls-Mantey Georgia Institute of Technology June 19, 2006 Note: This presentation contains animation.
Pathways analysis Iowa State Workshop 11 June 2009.
Gene Set Enrichment Analysis (GSEA)
Prof. Carolina Ruiz Computer Science Department Bioinformatics and Computational Biology Program WPI WELCOME TO BCB4003/CS4803 BCB503/CS583 BIOLOGICAL.
Data integration across omics landscapes Bing Zhang, Ph.D. Department of Biomedical Informatics Vanderbilt University School of Medicine
5 EBI is an Outstation of the European Molecular Biology Laboratory. Master title Molecular Interactions – the IntAct Database Sandra Orchard EMBL-EBI.
Pathways & Networks analysis COST Functional Modeling Workshop April, Helsinki.
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
Five Slides About EGAN Jesse Paquette UCSF Helen Diller Family Comprehensive Cancer Center
Exploratory Gene Association Networks October 2009 Jesse Paquette Helen Diller Family Comprehensive Cancer Center University of California, San Francisco.
EGAN Tutorial: A Basic Use-case October, 2009 Jesse Paquette UCSF Helen Diller Family Comprehensive Cancer Center
Cluster analysis of networks generated through homology: automatic identification of important protein communities involved in cancer metastasis Jonsson.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Gene Co-expression Network Analysis BMI 730 Kun Huang Department of Biomedical Informatics Ohio State University.
Kate Milova MolGen retreat March 24, Microarray experiments: Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
ONCOMINE: A Bioinformatics Infrastructure for Cancer Genomics
Gene Set Analysis 09/24/07. From individual gene to gene sets Finding a list of differentially expressed genes is only the starting point. Suppose we.
August 29, 2002InforMax Confidential1 Vector PathBlazer Product Overview.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Pathway Informatics 6 th July, 2015 Ansuman Chattopadhyay, PhD Head, Molecular Biology Information Services Health Sciences Library System University of.
GO Enrichment analysis COST Functional Modeling Workshop April, Helsinki.
Immune Cell Ontology for Networks (ICON) Immunology Ontologies and Their Applications in Processing Clinical Data June 11-13, Buffalo, NY.
341: Introduction to Bioinformatics Dr. Natasa Przulj Deaprtment of Computing Imperial College London
>>> Korean BioInformation Center >>> KRIBB Korea Research institute of Bioscience and Biotechnology GS2PATH: Linking Gene Ontology and Pathways Jin Ok.
MN-B-C 2 Analysis of High Dimensional (-omics) Data Kay Hofmann – Protein Evolution Group Week 5: Proteomics.
Cytoscape A powerful bioinformatic tool Mathieu Michaud
Cancer is heterogeneous disease! -> enabled characterization of new tumor subtypes for improving personalized treatment and ultimately achieving better.
Review of Ondex Bernice Rogowitz G2P Visualization and Visual Analytics Team March 18, 2010.
Gene Set Enrichment Analysis (GSEA)
Detecting enriched regions (Chip- seq, RIP-seq) Statistical evaluation of enriched regions Data displayed in Genome Browser Detection of enriched motifs.
EGAN: Exploratory Gene Association Networks by Jesse Paquette Biostatistics and Computational Biology Core Helen Diller Family Comprehensive Cancer Center.
Gene Regulatory Network Inference. Progress in Disease Treatment  Personalized medicine is becoming more prevalent for several kinds of cancer treatment.
Managing Data Modeling GO Workshop 3-6 August 2010.
GenePattern Overview for MAGE-TAB Workshop Ted Liefeld January 24, 2007.
Taverna Workflow. A suite of tools for bioinformatics Fully featured, extensible and scalable scientific workflow management system – Workbench, server,
Text Mining Special Interest Group Stuart Murray, Wyeth Research Novartis Institute for Biomedical Research, Cambridge, MA 6-8 th October 2004.
EADGENE and SABRE Post-Analyses Workshop 12-14th November 2008, Lelystad, Netherlands 1 François Moreews SIGENAE, INRA, Rennes Cytoscape.
GeWorkbench Highlights caBIG ® Molecular Analysis Tools Knowledge Center AACR Annual Meeting, April 3, 2011.
Network & Systems Modeling 29 June 2009 NCSU GO Workshop.
Integrating the Bioinformatic Technology Group into your research programme Introduction People and Skills Examples Integrating the BTG Contacts BHRC Away.
Bioinformatics lectures at Rice University Li Zhang Lecture 9: Networks and integrative genomic analysis
Bioinformatics Core Facility Guglielmo Roma January 2011.
Developed at the Broad Institute of MIT and Harvard Reich M, Liefeld T, Gould J, Lerner J, Tamayo P, and Mesirov JP. GenePattern 2.0. Nature Genetics 38.
Bioinformatics, Erasmus MC Pathway Analysis Karl Brand, June 2012.
By: Amira Djebbari and John Quackenbush BMC Systems Biology 2008, 2: 57 Presented by: Garron Wright April 20, 2009 CSCE 582.
GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.
A literature network of human genes for high-throughput analysis of gene expression Speaker : Shih-Te, YangShih-Te, Yang Advisor : Ueng-Cheng, YangUeng-Cheng,
GO based data analysis Iowa State Workshop 11 June 2009.
Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013.
CBioPortal Web resource for exploring, visualizing, and analyzing multidimentional cancer genomics data.
EnVisioning Data Integration SME forum 2009, Vienna Henning Hermjakob Henning Hermjakob
Title: Assign Pathways to Gene Set June 21, 2007 Guanming Wu.
Gene Set Analysis using R and Bioconductor Daniel Gusenleitner
CCLE Cancer Cell Line Encyclopedia Alexey Erohskin.
Pathway Informatics 30 th March, 2016 Ansuman Chattopadhyay, PhD Head, Molecular Biology Information Services Health Sciences Library System University.
ARCH/VCDE F2F BoF And the Presentation Subtitle Goes Here Ravi Madduri December 2008.
NCRI Cancer Conference November 1, 2015.
Ingenuity Pathway Analysis Alex Pico. Description "IPA is a software application that enables researchers to analyze and understand the complex biological.
a Cytoscape plugin to assess enrichment of
Canadian Bioinformatics Workshops
Pathway Visualization
Pathway Informatics December 5, 2018 Ansuman Chattopadhyay, PhD
Network biology An introduction to STRING and Cytoscape
Pathway Visualization
Presentation transcript:

Divining Systems Biology Knowledge from High-throughput Experiments Using EGAN Jesse Paquette ISMB 2010 Biostatistics and Computational Biology Core Helen Diller Family Comprehensive Cancer Center University of California, San Francisco (AKA BCBC HDFCCC UCSF)

High-throughput experiments This talk applies to –Expression microarrays –aCGH –SNP/CNV arrays –MS/MS Proteomics –DNA methylation –ChIP-Seq –RNA-Seq –In-silico experiments If parts of the output can be mapped to gene IDs –You can use EGAN

What do you hope to accomplish? Collect data Process data Differential analysisPublish! Clusters and/or gene lists New testable hypotheses Produce insight about the underlying biology New grants!New papers! Drug targets!

Leverage organic intelligence Clusters and/or gene lists New testable hypotheses Produce insight about the underlying biology Summarize Visualize Contextualize

Producing insight from clusters and gene lists Summarize: find enriched pathways (and other gene sets) –Hypergeometric over-representation DAVID –Global trends GSEA Visualize: gene relationships in a graph –Protein-protein interactions Cytoscape –Network module discovery Ingenuity IPA –Literature co-occurrence PubGene Contextualize: pertinent literature PubMed Google iHOP

EGAN: Exploratory Gene Association Networks Methods: state-of-the-art analysis of clusters and gene lists –Hypergeometric enrichment of gene sets –Global statistical trends of gene sets –Hypergraph visualization (via Cytoscape libraries) –Literature identification –Network module discovery User Interface: responds quickly to new queries from the biologist –Sandbox-style functionality –Dynamic adjustment of p-value cutoffs –Point-and-click interface –All data in-memory for immediate access –Links to external websites Modular: integrates as a flexible plug-and-play cog –All data is customizable –Proprietary data can be restricted to the client location –Java runs on almost every OS (PC, Mac, LINUX) –Can be configured and launched from a different application (e.g. GenePattern) –Analyses can be scripted for automation

Gene sets A gene set is a a set of semantically related genes –e.g. Wnt signaling pathway EGAN contains a database of gene sets –> 100k gene sets by default KEGG, Reactome, NCI-Nature, Gene Ontology, MeSH, Conserved Domain, Cytoband, miRNA targets –You can easily add your own Simple file format Download from MSigDB (Broad Institute)

Gene-gene relationships EGAN also contains –Protein-protein interactions (PPI) –Literature co-occurrence –Chromosomal adjacency –Kinase-target relationships Other possibilities –Sequence homology –Expression correlation

Example with microarray and aCGH results Mirzoeva et al. (2009) Cancer Research –UCSF-LBL collaboration –Analysis of breast cancer cell lines Basal vs. luminal Discoveries in this presentation –miRNA regulator of subtype (mir-200) –Annexin (ANXA1) as potential regulator of ER, glucocorticoid and EGFR signaling

Gene list - higher expression in basal cell lines

Gene set/pathway enrichment

Importing gene lists from publications

Combining expression with aCGH

Finding network modules

Where to find EGAN Website – paper in Bioinformatics –

Acknowledgements BCBC HDFCCC UCSF –Taku Tokuyasu –Adam Olshen –Ritu Roy –Ajay Jain LBNL –Debopriya Das –Joe Gray Funding –UCSF Cancer Center Support Grant UCSF –Early adopters Ingrid Revet Antoine Snijders Stephan Gysin Sook Wah Yee Joachim Silber –Cytoscape gurus David Quigley Scooter Morris –OTM David Eramian Ha Nguyen –Laura van ’t Veer –Donna Albertson –Graeme Hodgson