Protein network analysis Network motifs Network clusters / modules Co-clustering networks & expression Network comparison (species, conditions) Integration.

Slides:



Advertisements
Similar presentations
Protein network analysis Network motifs Network clusters / modules Co-clustering networks & expression Network comparison (species, conditions) Integration.
Advertisements

Gene Set Enrichment Analysis Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein.
Charlie Whittaker – BIG meeting 12/3/14
Computational discovery of gene modules and regulatory networks Ziv Bar-Joseph et al (2003) Presented By: Dan Baluta.
Detecting active subnetworks in molecular interaction networks with missing data Luke Hunter Texas A&M University SHURP 2007 Student.
Biological pathway and systems analysis An introduction.
Gene Set Enrichment Analysis Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein.
Putting genetic interactions in context through a global modular decomposition Jamal.
Gene Set Enrichment Analysis (GSEA)
D ISCOVERING REGULATORY AND SIGNALLING CIRCUITS IN MOLECULAR INTERACTION NETWORK Ideker Bioinformatics 2002 Presented by: Omrit Zemach April Seminar.
Global Mapping of the Yeast Genetic Interaction Network Tong et. al, Science, Feb 2004 Presented by Bowen Cui.
Pathways & Networks analysis COST Functional Modeling Workshop April, Helsinki.
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
Biomarkers as networks, not individual loci October 28, 2010 Trey Ideker UCSD BioEng and Med Genetics.
GENIE – GEne Network Inference with Ensemble of trees Van Anh Huynh-Thu Department of Electrical Engineering and Computer Science, Systems and Modeling,
Comparison of Networks Across Species CS374 Presentation October 26, 2006 Chuan Sheng Foo.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Gene Co-expression Network Analysis BMI 730 Kun Huang Department of Biomedical Informatics Ohio State University.
Bioinformatics: a Multidisciplinary Challenge Ron Y. Pinter Dept. of Computer Science Technion March 12, 2003.
Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break 14:45 – 15:15Regulatory pathways lecture 15:15 – 15:45Exercise.
Microarrays and Cancer Segal et al. CS 466 Saurabh Sinha.
ONCOMINE: A Bioinformatics Infrastructure for Cancer Genomics
Gene Set Analysis 09/24/07. From individual gene to gene sets Finding a list of differentially expressed genes is only the starting point. Suppose we.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Epistasis Analysis Using Microarrays Chris Workman.
Gene Ontology and Functional Enrichment Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein.
Modeling Functional Genomics Datasets CVM Lessons 4&5 10 July 2007Bindu Nanduri.
341: Introduction to Bioinformatics Dr. Natasa Przulj Deaprtment of Computing Imperial College London
Paola CASTAGNOLI Maria FOTI Microarrays. Applicazioni nella genomica funzionale e nel genotyping DIPARTIMENTO DI BIOTECNOLOGIE E BIOSCIENZE.
Cytoscape A powerful bioinformatic tool Mathieu Michaud
MATISSE - Modular Analysis for Topology of Interactions and Similarity SEts Igor Ulitsky and Ron Shamir Identification.
Network Analysis and Application Yao Fu
EGAN: Exploratory Gene Association Networks by Jesse Paquette Biostatistics and Computational Biology Core Helen Diller Family Comprehensive Cancer Center.
Gene Regulatory Network Inference. Progress in Disease Treatment  Personalized medicine is becoming more prevalent for several kinds of cancer treatment.
HUMAN-MOUSE CONSERVED COEXPRESSION NETWORKS PREDICT CANDIDATE DISEASE GENES Ala U., Piro R., Grassi E., Damasco C., Silengo L., Brunner H., Provero P.
Course on Functional Analysis
Network & Systems Modeling 29 June 2009 NCSU GO Workshop.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Introduction to Bioinformatics Biological Networks Department of Computing Imperial College London March 18, 2010 Lecture hour 18 Nataša Pržulj
CS5263 Bioinformatics Lecture 20 Practical issues in motif finding Final project.
Intel Confidential – Internal Only Co-clustering of biological networks and gene expression data Hanisch et al. This paper appears in: bioinformatics 2002.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Integrating Biology and Statistics: Gene Set Methods BIOS Winter/Spring 2010.
CSCE555 Bioinformatics Lecture 18 Network Biology: Comparison of Networks Across Species Meeting: MW 4:00PM-5:15PM SWGN2A21 Instructor: Dr. Jianjun Hu.
Bioinformatics MEDC601 Lecture by Brad Windle Ph# Office: Massey Cancer Center, Goodwin Labs Room 319 Web site for lecture:
EB3233 Bioinformatics Introduction to Bioinformatics.
Introduction to biological molecular networks
DNAmRNAProtein Small molecules Environment Regulatory RNA How a cell is wired The dynamics of such interactions emerge as cellular processes and functions.
GO based data analysis Iowa State Workshop 11 June 2009.
CSC, Dec.15-16,2005. Cytoscape Team Trey Ideker Mark Anderson Nerius Landys Ryan Kelley Chris Workman Past contributors: Nada Amin Owen Ozier Jonathan.
Biological Networks. Can a biologist fix a radio? Lazebnik, Cancer Cell, 2002.
Predicting Protein Function Annotation using Protein- Protein Interaction Networks By Tamar Eldad Advisor: Dr. Yanay Ofran Computational Biology.
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS) LECTURE 13 ANALYSIS OF THE TRANSCRIPTOME.
Network Analysis Goal: to turn a list of genes/proteins/metabolites into a network to capture insights about the biological system 1.Types of high-throughput.
Gene Set Analysis using R and Bioconductor Daniel Gusenleitner
Biotechnology and Bioinformatics: Bioinformatics Essential Idea: Bioinformatics is the use of computers to analyze sequence data in biological research.
CCLE Cancer Cell Line Encyclopedia Alexey Erohskin.
 What is MSA (Multiple Sequence Alignment)? What is it good for? How do I use it?  Software and algorithms The programs How they work? Which to use?
Introduction to Oncomine Xiayu Stacy Huang. Oncomine is a cancer-specific microarray database and has a web-based data-mining platform aimed at facilitating.
(1) Genotype-Tissue Expression (GTEx) Largest systematic study of genetic regulation in multiple tissues to date 53 tissues, 500+ donors, 9K samples, 180M.
Canadian Bioinformatics Workshops
Comparative Network Analysis BMI/CS 776 Spring 2013 Colin Dewey
David Amar, Tom Hait, and Ron Shamir
CSCI2950-C Lecture 12 Networks
CSCI2950-C Lecture 13 Network Motifs; Network Integration
Schedule for the Afternoon
Modeling cells with protein networks
Cancer Cell Line Encyclopedia
Presentation transcript:

Protein network analysis Network motifs Network clusters / modules Co-clustering networks & expression Network comparison (species, conditions) Integration of genetic & physical nets Network visualization

OPEN SOURCE Java platform for integration of systems biology data Layout and query of networks (physical, genetic, social, functional) Visual and programmatic integration of network state data (attributes) The ultimate goal is to provide tools to facilitate all aspects of network assembly, annotation, and use in biomedicine. RECENT NEWS Version 2.7 released March 2010 Cytoscape ® Registered Trademark The Cytoscape Consortium is a 501(c)3 non-for-profit in the State of California Centerpiece of the new National Resource for Network Biology, $7 million from NCRR Downloaded approximately 3000 times per month Shannon et al. Genome Research 2003 Cline et al. Nature Protocols 2007

Cytoscape Plugin Usage Statistics

Integration of networks and expression

Querying biological networks for “Active Modules” Ideker et al. Bioinformatics (2002) Interaction Database Dump, aka “Hairball” Active Modules Color network nodes (genes/proteins) with: Patient expression profile Protein states Patient genotype (SNP state) Enzyme activity RNAi phenotype

A scoring system for expression “activity” ABCD

Scoring over multiple perturbations/conditions Perturbations /conditions

Searching for “active” pathways in a large network Score subnetworks according to their overall amount of activity Finding the highest scoring subnetworks is NP hard, so we use heuristic search algs. to identify a collection of high-scoring subnetworks (local optima) Simulated annealing and/or greedy search starting from an initial subnetwork “seed” During the search we must also worry about issues such as local topology and whether a subnetwork’s score is higher than would be expected at random

Simulated Annealing Algorithm

Network regions whose genes change on/off or off/on after knocking out different genes

Initial Application to Toxicity: Networks responding to DNA damage in yeast Tom Begley and Leona Samson; MIT Dept. of Bioengineering Systematic phenotyping of gene knockout strains in yeast Evaluation of growth of each strain in the presence of MMS (and other DNA damaging agents) Sensitive Not sensitive Not tested MMS sensitivity in ~25% of strains Screening against a network of protein interactions…

Begley et al., Mol Cancer Res, (2002)

Networks responding to DNA damage as revealed by high-throughput phenotypic assays Begley et al., Mol Cancer Res, (2002)

Host-pathogen interactions regulating early stage HIV-1 infection Genome-wide RNAi screens for genes required for infection utilizing a single cycle HIV-1 reporter virus engineered to encode luciferase and bearing the Vesicular Stomatitis Virus Glycoprotein (VSV-G) on its surface to facilitate efficient infection… Sumit Chanda

Project onto a large network of human-human and human-HIV protein interactions

Network modules associated with infection Konig et al. Cell 2008

Network-based classification

NETWORK-BASED CLASSIFICATION Disease aggression (Time from Sample Collection SC to Treatment TX) Chuang et al. MSB 2007 Lee et al. PLoS Comp Bio 2008 Ravasi et al. Cell 2010

The Mammalian Cell Fate Map: Can we classify tissue type using expression, networks, etc? Gilbert Developmental Biology 4 th Edition

Interaction coherence within a tissue class B B A A B B A A B B A A Endoderm Mesoderm Ectoderm (incl. CNS) r = 0.9 r = 0.0 r = 0.2 Taylor et al. Nature Biotech 2009

Protein interactions, not levels, dictate tissue specification

Functional Enrichment

Gene Set Enrichment Analysis - GSEA - ::: Introduction. MIT Broad Institute v 2.0 available since Jan 2007 v available since Feb 16th 2007 Version 2.0 includes Biocarta, Broad Institute, GeneMAPP, KEGG annotations and more... Platforms: Affymetrix, Agilent, CodeLink, custom... GSEA (Subramanian et al. PNAS )

GSEA applies Kolmogorov-Smirnof test to find assymmetrical distributions for defined blocks of genes in datasets whole distribution. Gene Set Enrichment Analysis - GSEA - ::: Introduction. Is this particular Gene Set enriched in my experiment? Genes selected by researcher, Biocarta pathways, GeneMAPP sets, genes sharing cytoband, genes targeted by common miRNAs …up to you…

Dataset distribution Number of genes Gene Expression Level Gene Set Enrichment Analysis - GSEA - ::: Introduction. ::: K-S test The Kolmogorov–Smirnov test is used to determine whether two underlying one-dimensional probability distributions differ, or whether an underlying probability distribution differs from a hypothesized distribution, in either case based on finite samples. The one-sample KS test compares the empirical distribution function with the cumulative distribution functionspecified by the null hypothesis. The main applications are testing goodness of fit with the normal and uniform distributions. The two-sample KS test is one of the most useful and general nonparametric methods for comparing two samples, as it is sensitive to differences in both location and shape of the empirical cumulative distribution functions of the two samples. Gene set 1 distribution Gene set 2 distribution

ClassA ClassB ttest cut-off FDR< testing genes independently... Biological meaning? Gene Set Enrichment Analysis - GSEA - ::: Introduction.

Correlation with CLASS - + ClassA ClassB Gene Set 1 ttest cut-off Gene Set 2 Gene Set 3 Gene set 3 enriched in Class B Gene set 2 enriched in Class A Gene Set Enrichment Analysis - GSEA - ::: Introduction.

Subramaniam, PNAS 2005

NES pval FDR Gene Set Enrichment Analysis - GSEA - ::: Introduction. The Enrichment Score ::: Benjamini-Hochberg

Network Alignment Species 1 vs. species 2 Physical vs. genetic

Kelley et al. PNAS 2003 Ideker & Sharan Gen Res 2008 Cross-comparison of networks: (1) Conserved regions in the presence vs. absence of stimulus (2) Conserved regions across different species Sharan et al. RECOMB 2004 Scott et al. RECOMB 2005Sharan & Ideker Nat. Biotech Suthram et al. Nature 2005

Conserved Plasmodium / Saccharomyces protein complexes Plasmodium-specific protein complexes Suthram et al. Nature 2005 La Count et al. Nature 2005 Plasmodium: a network apart?

Human vs. Mouse TF-TF Networks in Brain Tim Ravasi, RIKEN Consortium et al. Cell 2010

Finding physical pathways to explain genetic interactions Adapted from Tong et al., Science 2001 Genetic Interactions: Classical method used to map pathways in model species Highly analogous to multi-genic interaction in human disease and combination therapy Thousands are being uncovered through systematic studies Thus as with other types, the number of known genetic interactions is exponentially increasing…

Integration of genetic and physical interactions 160 between- pathway models 101 within- pathway models Num interactions: 1,102 genetic 933 physical Kelley and Ideker Nature Biotechnology (2005)

Systematic identification of “parallel pathway” relationships in yeast

Unified Whole Cell Model of Genetic and Physical interactions

A dynamic DNA damage module map Bandyopadhyay et al. Science (2010)