Protein network analysis Network motifs Network clusters / modules Co-clustering networks & expression Network comparison (species, conditions) Integration.

Protein network analysis: network motifs; network clusters / modules; co-clustering networks & expression; network comparison (species, conditions); integration of genetic & physical nets; network visualization

Network motifs

Network Motifs (Milo, Alon et al.). Motifs are “patterns of interconnections occurring in complex networks”, that is, connected subgraphs of a particular topology (all occurrences of a motif are isomorphic to one another). The approach queries the network for small motifs (e.g., of < 5 nodes) that occur much more frequently than would be expected in random networks. Significant motifs have been found in a variety of biological networks and, for instance, correspond to feed-forward and feedback loops that are well known in circuit design and other engineering fields. Pioneered by Uri Alon and colleagues.

Motif searches in 3 different contexts. How many motifs (connected subgraph topologies) exist involving three nodes if the graph is undirected? If the graph is directed?

All 3-node directed subgraphs. What is the frequency of each in the network?
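The counting question above can be checked by brute force. The sketch below is not from the slides; the function names `canonical`, `is_connected`, and `count_topologies` are illustrative. It enumerates every edge set on three labeled nodes, keeps the (weakly) connected ones, and groups them by isomorphism class.

```python
from itertools import permutations, product

def canonical(edges, n, directed):
    """Smallest relabeled edge list over all node permutations (isomorphism key)."""
    best = None
    for perm in permutations(range(n)):
        mapped = [(perm[u], perm[v]) for u, v in edges]
        if not directed:
            mapped = [tuple(sorted(e)) for e in mapped]
        key = tuple(sorted(mapped))
        if best is None or key < best:
            best = key
    return best

def is_connected(edges, n):
    """Weak connectivity: edge direction is ignored."""
    adj = {i: set() for i in range(n)}
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    seen, stack = {0}, [0]
    while stack:
        for w in adj[stack.pop()]:
            if w not in seen:
                seen.add(w)
                stack.append(w)
    return len(seen) == n

def count_topologies(n, directed):
    """Number of distinct connected subgraph topologies on n nodes."""
    if directed:
        slots = [(u, v) for u in range(n) for v in range(n) if u != v]
    else:
        slots = [(u, v) for u in range(n) for v in range(u + 1, n)]
    classes = set()
    for mask in product([0, 1], repeat=len(slots)):
        edges = [e for e, bit in zip(slots, mask) if bit]
        if edges and is_connected(edges, n):
            classes.add(canonical(edges, n, directed))
    return len(classes)

print(count_topologies(3, directed=False))  # path and triangle
print(count_topologies(3, directed=True))   # the 3-node directed subgraphs
```

For three nodes this gives 2 undirected and 13 directed topologies. The enumeration is exponential in the number of possible edges, so it is only practical for very small n.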

Outline of the Approach: (1) Search the network to identify all possible n-node connected subgraphs (here n = 3 or 4). (2) Get the number of occurrences of each subgraph type. (3) Determine the significance of each type using permutation testing, in which the above process is repeated for many randomized networks (preserving node degrees; why?). (4) Use the random distributions to compute a p-value for each subgraph type. The “network motifs” are subgraphs with p < 0.001.
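The steps in this outline can be sketched for a single subgraph type, the feed-forward loop. This is a minimal illustration under several assumptions, not the published implementation. The helper names are hypothetical, and randomization uses simple edge swaps that preserve every node's in-degree and out-degree. Occurrences are counted as node triples x, y, z with edges x->y, y->z, and x->z, without requiring the subgraph to be induced. The p-value is an empirical one with a +1 correction.

```python
import random

def count_ffl(edges):
    """Count feed-forward loops: x->y, y->z and x->z (no self-loops assumed)."""
    succ = {}
    for u, v in set(edges):
        succ.setdefault(u, set()).add(v)
    count = 0
    for x, targets in succ.items():
        for y in targets:
            for z in succ.get(y, ()):
                if z != x and z in targets:
                    count += 1
    return count

def degree_preserving_shuffle(edges, n_swaps, rng):
    """Swap edge targets (a->b, c->d become a->d, c->b); degrees are unchanged."""
    edges = list(edges)
    edge_set = set(edges)
    for _ in range(n_swaps):
        i, j = rng.sample(range(len(edges)), 2)
        (a, b), (c, d) = edges[i], edges[j]
        # Reject swaps that would create self-loops or duplicate edges.
        if a == d or c == b or (a, d) in edge_set or (c, b) in edge_set:
            continue
        edge_set -= {(a, b), (c, d)}
        edge_set |= {(a, d), (c, b)}
        edges[i], edges[j] = (a, d), (c, b)
    return edges

def motif_p_value(edges, n_random=200, seed=0):
    """Empirical p-value: fraction of randomized networks with >= observed count."""
    rng = random.Random(seed)
    observed = count_ffl(edges)
    at_least = 0
    for _ in range(n_random):
        shuffled = degree_preserving_shuffle(edges, 10 * len(edges), rng)
        if count_ffl(shuffled) >= observed:
            at_least += 1
    return observed, (1 + at_least) / (1 + n_random)

# Toy directed network (made up for illustration).
example_edges = [(0, 1), (1, 2), (0, 2), (2, 3), (3, 4), (2, 4),
                 (4, 5), (5, 6), (4, 6), (6, 7), (7, 0)]
observed, p = motif_p_value(example_edges)
print(observed, p)
```

Preserving degrees matters because hubs alone create many subgraphs of every type. The null model should therefore keep the same degree sequence, so that only wiring beyond degree is tested. Resolving p < 0.001 empirically needs at least about 1,000 randomized networks.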

Schematic view of network motif detection Networks are randomized preserving node degree

Concentration of feedforward motif: mean ± SD of 400 subnetworks (number of appearances of the motif divided by the number of all 3-node connected subgraphs)

Transcriptional network results

Neural networks

Food webs

World Wide Web

Electronic circuits

Interesting questions: Which networks have motifs in common? Which networks have completely distinct motifs versus the others? Does this tell us anything about the design constraints on each network? E.g., the feedforward loop may function to activate output only if the input signal is persistent (i.e., reject noisy or transient signals) and to allow rapid deactivation when the input turns off. E.g., food webs evolve to allow flow of energy from top to bottom (?!), whereas transcriptional networks evolve to process information.

Identifying modules in the network (Rives/Galitski, PNAS 2003). Define the distance between each pair of proteins in the interaction network, e.g., d = shortest path length. To compute shortest path lengths, use Dijkstra’s algorithm. Cluster with pairwise node similarity = 1/d².
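A minimal sketch of the similarity computation, assuming an unweighted, undirected interaction network. With unit edge weights Dijkstra's algorithm reduces to breadth-first search, which is what is used here. The function names are illustrative, and unreachable pairs are given similarity 0 as an assumption.

```python
from collections import deque

def shortest_path_lengths(adj, source):
    """Breadth-first search: hop distance from source to every reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    return dist

def similarity(edges):
    """Pairwise similarity 1/d^2, where d is the shortest path length."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    sim = {}
    for u in adj:
        dist = shortest_path_lengths(adj, u)
        for v in adj:
            if v != u:
                d = dist.get(v)  # None if v is unreachable from u
                sim[(u, v)] = 1.0 / d ** 2 if d else 0.0
    return sim

# Toy network: a path A-B-C plus a separate interacting pair X-Y.
sim = similarity([("A", "B"), ("B", "C"), ("X", "Y")])
print(sim[("A", "B")], sim[("A", "C")], sim[("A", "X")])
```

The resulting similarity values can be passed to any standard clustering routine, for example hierarchical clustering, to group proteins into candidate modules.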

Integration of networks and expression

Querying biological networks for “Active Modules” (Ideker et al. Bioinformatics 2002). Interaction database dump, aka “Hairball” → Active Modules. Color network nodes (genes/proteins) with: patient expression profile, protein states, patient genotype (SNP state), enzyme activity, RNAi phenotype.

A scoring system for expression “activity” (figure panels A to D)

Scoring over multiple perturbations/conditions (figure axis: perturbations/conditions)

Searching for “active” pathways in a large network. Score subnetworks according to their overall amount of activity. Finding the highest-scoring subnetworks is NP-hard, so we use heuristic search algorithms to identify a collection of high-scoring subnetworks (local optima): simulated annealing and/or greedy search starting from an initial subnetwork “seed”. During the search we must also worry about issues such as local topology and whether a subnetwork’s score is higher than would be expected at random.
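The slide does not spell out the scoring formula. One common formulation is sketched here as an assumption. Each gene's expression p-value is converted to a z-score. The z-scores of a subnetwork with k genes are aggregated as their sum divided by sqrt(k). The aggregate is then calibrated against randomly drawn gene sets of the same size, to ask whether the score is higher than expected at random. All function names are illustrative.

```python
import math
import random
from statistics import NormalDist, mean, stdev

def z_from_p(p):
    """Convert a p-value in (0, 1) to a z-score; small p gives large z."""
    return NormalDist().inv_cdf(1.0 - p)

def aggregate_z(nodes, z):
    """Aggregate z-score of a gene set: sum of z-scores over sqrt(k)."""
    return sum(z[n] for n in nodes) / math.sqrt(len(nodes))

def calibrated_score(nodes, z, n_random=1000, seed=0):
    """Standardize the aggregate score against random gene sets of equal size."""
    rng = random.Random(seed)
    genes = sorted(z)
    k = len(nodes)
    background = [aggregate_z(rng.sample(genes, k), z) for _ in range(n_random)]
    return (aggregate_z(nodes, z) - mean(background)) / stdev(background)

# Toy data (made up): four "active" genes among twenty.
z = {f"g{i:02d}": (3.0 if i < 4 else 0.0) for i in range(20)}
active = ["g00", "g01", "g02", "g03"]
print(aggregate_z(active, z), calibrated_score(active, z))
```

The calibration step matters because larger gene sets have a different background score distribution than small ones, so raw aggregate scores are not directly comparable across subnetwork sizes.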

Simulated Annealing Algorithm
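A minimal sketch of a simulated annealing search for a high-scoring connected subnetwork, under stated assumptions. Each move toggles one node, either a current member or a neighbor of the current subnetwork. Moves that disconnect the subnetwork are rejected. The score is the sum of z-scores divided by sqrt(k), and the temperature decays geometrically. The function names, the cooling schedule, and the toy data are illustrative rather than taken from the slides.

```python
import math
import random

def score(nodes, z):
    """Aggregate activity score of a node set."""
    return sum(z[n] for n in nodes) / math.sqrt(len(nodes))

def is_connected(nodes, adj):
    """True if the induced subgraph on `nodes` is connected."""
    nodes = set(nodes)
    start = next(iter(nodes))
    seen, stack = {start}, [start]
    while stack:
        for w in adj[stack.pop()]:
            if w in nodes and w not in seen:
                seen.add(w)
                stack.append(w)
    return seen == nodes

def anneal(adj, z, seed_node, steps=2000, t_start=1.0, t_end=0.01, rng_seed=0):
    rng = random.Random(rng_seed)
    current = {seed_node}
    best = set(current)
    for step in range(steps):
        temp = t_start * (t_end / t_start) ** (step / steps)
        boundary = {w for n in current for w in adj[n]} - current
        node = rng.choice(sorted(boundary) + sorted(current))
        proposal = current ^ {node}  # toggle the node in or out
        if not proposal or not is_connected(proposal, adj):
            continue
        delta = score(proposal, z) - score(current, z)
        # Always accept improvements; accept worse moves with prob exp(delta/T).
        if delta > 0 or rng.random() < math.exp(delta / temp):
            current = proposal
            if score(current, z) > score(best, z):
                best = set(current)
    return best

# Toy path network a-b-c-d-e with three active genes (made-up z-scores).
adj = {"a": {"b"}, "b": {"a", "c"}, "c": {"b", "d"}, "d": {"c", "e"}, "e": {"d"}}
z = {"a": 2.0, "b": 2.0, "c": 2.0, "d": -3.0, "e": -3.0}
print(sorted(anneal(adj, z, "a")))
```

Accepting some score-decreasing moves early on lets the search escape local optima. As the temperature falls, the procedure behaves more and more like a greedy search.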

Network regions whose genes change on/off or off/on after knocking out different genes

Initial Application to Toxicity: Networks responding to DNA damage in yeast. Tom Begley and Leona Samson; MIT Dept. of Bioengineering. Systematic phenotyping of gene knockout strains in yeast: evaluation of growth of each strain in the presence of MMS (and other DNA damaging agents). Figure legend: sensitive, not sensitive, not tested. MMS sensitivity in ~25% of strains. Screening against a network of protein interactions…

Begley et al., Mol Cancer Res, (2002)

Networks responding to DNA damage as revealed by high-throughput phenotypic assays Begley et al., Mol Cancer Res, (2002)

Host-pathogen interactions regulating early stage HIV-1 infection Genome-wide RNAi screens for genes required for infection utilizing a single cycle HIV-1 reporter virus engineered to encode luciferase and bearing the Vesicular Stomatitis Virus Glycoprotein (VSV-G) on its surface to facilitate efficient infection… Sumit Chanda

Project onto a large network of human-human and human-HIV protein interactions

Network modules associated with infection Konig et al. Cell 2008

Network-based classification

Network-based classification: disease aggression (time from sample collection, SC, to treatment, TX). Chuang et al. MSB 2007; Lee et al. PLoS Comp Bio 2008; Ravasi et al. Cell 2010

The Mammalian Cell Fate Map: Can we classify tissue type using expression, networks, etc.? (Gilbert, Developmental Biology, 4th Edition)

Interaction coherence within a tissue class. Figure: protein pairs A and B across tissue classes Endoderm, Mesoderm, Ectoderm (incl. CNS); panel correlations r = 0.9, r = 0.0, r = 0.2. Taylor et al. Nature Biotech 2009

Protein interactions, not levels, dictate tissue specification

Functional Enrichment

Gene Set Enrichment Analysis - GSEA - ::: Introduction. MIT Broad Institute. v 2.0 available since Jan 2007. Version 2.0 includes Biocarta, Broad Institute, GenMAPP, KEGG annotations and more... Platforms: Affymetrix, Agilent, CodeLink, custom... GSEA (Subramanian et al. PNAS 2005)

Is this particular gene set enriched in my experiment? GSEA applies a Kolmogorov-Smirnov-like test to find asymmetrical distributions for defined blocks of genes within the dataset’s whole distribution. Gene sets can be genes selected by the researcher, Biocarta pathways, GenMAPP sets, genes sharing a cytoband, genes targeted by common miRNAs … up to you…

K-S test. The Kolmogorov-Smirnov test is used to determine whether two underlying one-dimensional probability distributions differ, or whether an underlying probability distribution differs from a hypothesized distribution, in either case based on finite samples. The one-sample KS test compares the empirical distribution function with the cumulative distribution function specified by the null hypothesis. The main applications are testing goodness of fit with the normal and uniform distributions. The two-sample KS test is one of the most useful and general nonparametric methods for comparing two samples, as it is sensitive to differences in both location and shape of the empirical cumulative distribution functions of the two samples. Figure: dataset distribution (number of genes vs. gene expression level) with the gene set 1 and gene set 2 distributions.
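As an illustration of the two-sample case, the KS statistic is the largest absolute difference between the two empirical cumulative distribution functions. The sketch below computes only that statistic, using the standard library. The function name is illustrative, and no p-value is computed.

```python
from bisect import bisect_right

def ks_statistic(sample_a, sample_b):
    """Two-sample KS statistic: max |ECDF_a(x) - ECDF_b(x)| over observed x."""
    a, b = sorted(sample_a), sorted(sample_b)
    d = 0.0
    for x in a + b:
        cdf_a = bisect_right(a, x) / len(a)  # fraction of a that is <= x
        cdf_b = bisect_right(b, x) / len(b)  # fraction of b that is <= x
        d = max(d, abs(cdf_a - cdf_b))
    return d

# Made-up expression values for a gene set and for the remaining genes.
gene_set = [2.1, 2.5, 3.0, 3.2]
background = [0.1, 0.4, 0.9, 1.5, 2.2, 2.8]
print(ks_statistic(gene_set, background))
```

Because the ECDFs only change at observed values, checking the difference at each data point is enough to find the maximum.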

Class A vs. Class B: t-test cut-off, FDR<, testing genes independently... Biological meaning?

Correlation with class (- to +), Class A vs. Class B, with the t-test cut-off marked. Gene set 1; gene set 2 enriched in Class A; gene set 3 enriched in Class B.

Subramanian et al., PNAS 2005

The Enrichment Score: NES, p-value, FDR (Benjamini-Hochberg)
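The slide names Benjamini-Hochberg for the FDR column. A minimal sketch of that adjustment follows. It is a generic implementation of the standard procedure, not code from the GSEA software, and the function name is illustrative.

```python
def benjamini_hochberg(pvalues):
    """Return BH-adjusted p-values (q-values) in the original order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotone adjusted values.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted

# Made-up p-values for four gene sets.
print(benjamini_hochberg([0.01, 0.04, 0.03, 0.005]))
```

A gene set is then reported as enriched when its adjusted value falls below the chosen FDR threshold.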

Network Alignment: species 1 vs. species 2; physical vs. genetic

Cross-comparison of networks: (1) Conserved regions in the presence vs. absence of stimulus (2) Conserved regions across different species. Kelley et al. PNAS 2003; Ideker & Sharan Gen Res 2008; Sharan et al. RECOMB 2004; Scott et al. RECOMB 2005; Sharan & Ideker Nat. Biotech; Suthram et al. Nature 2005

Plasmodium: a network apart? Conserved Plasmodium / Saccharomyces protein complexes vs. Plasmodium-specific protein complexes. Suthram et al. Nature 2005; LaCount et al. Nature 2005

Human vs. Mouse TF-TF Networks in Brain. Tim Ravasi, RIKEN Consortium et al. Cell 2010

Finding physical pathways to explain genetic interactions (adapted from Tong et al., Science 2001). Genetic interactions: classical method used to map pathways in model species; highly analogous to multi-genic interaction in human disease and combination therapy; thousands are being uncovered through systematic studies. Thus, as with other types, the number of known genetic interactions is exponentially increasing…

Integration of genetic and physical interactions: 160 between-pathway models, 101 within-pathway models. Number of interactions: 1,102 genetic, 933 physical. Kelley and Ideker, Nature Biotechnology (2005)

Systematic identification of “parallel pathway” relationships in yeast

Unified Whole Cell Model of Genetic and Physical interactions

A dynamic DNA damage module map Bandyopadhyay et al. Science (2010)