Inferring subnetworks from perturbed expression profiles Dana Pe’er, Aviv Regev, Gal Elidan and Nir Friedman Bioinformatics, Vol.17 Suppl. 1 2001.

Slides:



Advertisements
Similar presentations
Motif Mining from Gene Regulatory Networks
Advertisements

Systems biology SAMSI Opening Workshop Algebraic Methods in Systems Biology and Statistics September 14, 2008 Reinhard Laubenbacher Virginia Bioinformatics.
Bayesian network for gene regulatory network construction
Inferring Quantitative Models of Regulatory Networks From Expression Data Iftach Nachman Hebrew University Aviv Regev Harvard Nir Friedman Hebrew University.
Computational discovery of gene modules and regulatory networks Ziv Bar-Joseph et al (2003) Presented By: Dan Baluta.
Detecting active subnetworks in molecular interaction networks with missing data Luke Hunter Texas A&M University SHURP 2007 Student.
D ISCOVERING REGULATORY AND SIGNALLING CIRCUITS IN MOLECULAR INTERACTION NETWORK Ideker Bioinformatics 2002 Presented by: Omrit Zemach April Seminar.
. Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman.
Rich Probabilistic Models for Gene Expression Eran Segal (Stanford) Ben Taskar (Stanford) Audrey Gasch (Berkeley) Nir Friedman (Hebrew University) Daphne.
Learning Module Networks Eran Segal Stanford University Joint work with: Dana Pe’er (Hebrew U.) Daphne Koller (Stanford) Aviv Regev (Harvard) Nir Friedman.
Threshold selection in gene co- expression networks using spectral graph theory techniques Andy D Perkins*,Michael A Langston BMC Bioinformatics 1.
Regulatory Network (Part II) 11/05/07. Methods Linear –PCA (Raychaudhuri et al. 2000) –NIR (Gardner et al. 2003) Nonlinear –Bayesian network (Friedman.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
Cs726 Modeling regulatory networks in cells using Bayesian networks Golan Yona Department of Computer Science Cornell University.
Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break 14:45 – 15:15Regulatory pathways lecture 15:15 – 15:45Exercise.
Modularity in Biological networks.  Hypothesis: Biological function are carried by discrete functional modules.  Hartwell, L.-H., Hopfield, J. J., Leibler,
Indiana University Bloomington, IN Junguk Hur Computational Omics Lab School of Informatics Differential location analysis A novel approach to detecting.
Simulation and Application on learning gene causal relationships Xin Zhang.
27803::Systems Biology1CBS, Department of Systems Biology Schedule for the Afternoon 13:00 – 13:30ChIP-chip lecture 13:30 – 14:30Exercise 14:30 – 14:45Break.
6. Gene Regulatory Networks
Bayesian Networks Alan Ritter.
Systems Biology, April 25 th 2007Thomas Skøt Jensen Technical University of Denmark Networks and Network Topology Thomas Skøt Jensen Center for Biological.
Causal Models, Learning Algorithms and their Application to Performance Modeling Jan Lemeire Parallel Systems lab November 15 th 2006.
Cristina Manfredotti D.I.S.Co. Università di Milano - Bicocca An Introduction to the Use of Bayesian Network to Analyze Gene Expression Data Cristina Manfredotti.
Modeling the Gene Expression of Saccharomyces cerevisiae Δcin5 Under Cold Shock Conditions Kevin McKay Laura Terada Department of Biology Loyola Marymount.
Efficient Algorithms for Detecting Signaling Pathways in Protein Interaction Networks Jacob Scott, Trey Ideker, Richard M. Karp, Roded Sharan RECOMB 2005.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Bayesian integration of biological prior knowledge into the reconstruction of gene regulatory networks Dirk Husmeier Adriano V. Werhli.
Statistical Bioinformatics QTL mapping Analysis of DNA sequence alignments Postgenomic data integration Systems biology.
MATISSE - Modular Analysis for Topology of Interactions and Similarity SEts Igor Ulitsky and Ron Shamir Identification.
Detecting robust time-delayed regulation in Mycobacterium tuberculosis Iti Chaturvedi and Jagath C Rajapakse INCOB 2009.
Cis-regulation Trans-regulation 5 Objective: pathway reconstruction.
Using Bayesian Networks to Analyze Expression Data N. Friedman, M. Linial, I. Nachman, D. Hebrew University.
Learning Structure in Bayes Nets (Typically also learn CPTs here) Given the set of random variables (features), the space of all possible networks.
Using Bayesian Networks to Analyze Expression Data By Friedman Nir, Linial Michal, Nachman Iftach, Pe'er Dana (2000) Presented by Nikolaos Aravanis Lysimachos.
Networks and Interactions Boo Virk v1.0.
Microarrays to Functional Genomics: Generation of Transcriptional Networks from Microarray experiments Joshua Stender December 3, 2002 Department of Biochemistry.
Paper prepared for presentation at the 16 th ICABR Conference – 128 th EAAE Seminar “The Political Economy of the Bioeconomy: Biotechnology and Biofuel”
Creating Metabolic Network Models using Text Mining and Expert Knowledge J.A. Dickerson, D. Berleant, Z. Cox, W. Qi, and E. Wurtele Iowa State University.
Data Analysis with Bayesian Networks: A Bootstrap Approach Nir Friedman, Moises Goldszmidt, and Abraham Wyner, UAI99.
Reverse engineering gene regulatory networks Dirk Husmeier Adriano Werhli Marco Grzegorczyk.
Reconstructing gene networks Analysing the properties of gene networks Gene Networks Using gene expression data to reconstruct gene networks.
Using Bayesian Networks to Analyze Whole-Genome Expression Data Nir Friedman Iftach Nachman Dana Pe’er Institute of Computer Science, The Hebrew University.
Learning Linear Causal Models Oksana Kohutyuk ComS 673 Spring 2005 Department of Computer Science Iowa State University.
A Biology Primer Part IV: Gene networks and systems biology Vasileios Hatzivassiloglou University of Texas at Dallas.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Course files
Problem Limited number of experimental replications. Postgenomic data intrinsically noisy. Poor network reconstruction.
DeepDive Model Dongfang Xu Ph.D student, School of Information, University of Arizona Dec 13, 2015.
Integrated Genomic and Proteomic Analyses of a Systematically Perturbed Metabolic Network Science, Vol 292, Issue 5518, , 4 May 2001.
Reverse engineering of regulatory networks Dirk Husmeier & Adriano Werhli.
Shortest Path Analysis and 2nd-Order Analysis Ming-Chih Kao U of M Medical School
Discovering functional interaction patterns in Protein-Protein Interactions Networks   Authors: Mehmet E Turnalp Tolga Can Presented By: Sandeep Kumar.
Dynamic Networks: How Networks Change with Time? Vahid Mirjalili CSE 891.
04/21/2005 CS673 1 Being Bayesian About Network Structure A Bayesian Approach to Structure Discovery in Bayesian Networks Nir Friedman and Daphne Koller.
Network Analysis Goal: to turn a list of genes/proteins/metabolites into a network to capture insights about the biological system 1.Types of high-throughput.
Computational methods for inferring cellular networks II Stat 877 Apr 17 th, 2014 Sushmita Roy.
Network applications Sushmita Roy BMI/CS 576 Dec 9 th, 2014.
Biological Network Analysis
Inferring Regulatory Networks from Gene Expression Data BMI/CS 776 Mark Craven April 2002.
Journal club Jun , Zhen.
Learning gene regulatory networks in Arabidopsis thaliana
Dynamics and context-specificity in biological networks
Building and Analyzing Genome-Wide Gene Disruption Networks
CISC 841 Bioinformatics (Spring 2006) Inference of Biological Networks
CSCI2950-C Lecture 13 Network Motifs; Network Integration
Schedule for the Afternoon
Evaluation of inferred networks
Network Inference Chris Holmes Oxford Centre for Gene Function, &,
Bioinformatics, Vol.17 Suppl.1 (ISMB 2001) Weekly Lab. Seminar
Presentation transcript:

Inferring subnetworks from perturbed expression profiles Dana Pe’er, Aviv Regev, Gal Elidan and Nir Friedman Bioinformatics, Vol.17 Suppl

Motivation Expression profiles give genome wide information about the state of metabolism, gene regulation, signal transduction, etc. One would like to infer functional relationships between the genes from this data. Perturbations such as mutations give insight into the effects of particular genes and help us infer causal relationships.

Tool – Bayesian Networks Random Variables: Gene Expression Levels Probabilistic Dependencies: Regulatory Interactions Z YX Pr(X|Z)Pr(Y|Z)

Goal of Paper Extend Bayesian framework in cellular context to deal with mutations Develop better methods to discretize data Define and learn new features in our model such as mediators, activators, and inhibitors Construct subnetworks of strong statistical significance

Learning Networks Network is learned through maximizing a score function with respect to the collected data. D = Data; G = Graph; Pa = Parent; X = expression level; m = sample number

Equivalent Graphs Two graphs may imply the same dependencies and are called equivalent. XYXY = So instead of directed graphs we make partially directed graphs. XYZ ZZ

Learning with Mutations If gene X is mutated we replace its expression level by a constant. For example if X is knocked out, its expression is replaced by 0. Our new score function is: Where Int(m) is the set of “intervened” (mutated) variables in experiment m. Notice that two structurally equivalent graph are no longer guaranteed to get the same score. If two graphs get the same score under this scoring function they are called “intervention equivalent.”

Other Perturbations Temperature sensitivity, kinetic mutations, and environmental stress can also be model in the Bayesian Network framework. A node is added for each condition which can take the values “on” or “off.” YX Temperature Z

What Do Bayesian Networks Buy Us 1) Markov Neighbors (Direct Relationships) XY XY Z XY

2) Activator/Inhibitors XY Let U = Parents(Y) – X. If for all states u of U we have: A B increasing as X increases then we say X is an activator. decreasing as X decreases then we say X is an inhibitor.

3) d-Seperation: Mediators X Y Z U Both Z and U d-separate X and Y. In this framework they would be called mediators of X and Y.

Feature Confidence A confidence can be associated with each feature which measures how sure we are about truth of the detected feature. This confidence is given by: where f(G) is the indicator function of the feature of interest

Building Significant Subnetworks 1)Naïve Approach: For some threshold, T, find all if edges such that confidence is above T. For all maximally connected subgraphs of size greater than 3, grow out the graph by adding edges which have confidence greater than some weaker threshold S. X B Z Y A

2) Score-based Approach They want to build a subnetwork and associate a score measuring the networks significance. If we build a network with k nodes from a possible n nodes and include k edges, the score we assign the network is: where K is (k choose 2), the number of possible edges on k nodes, c i is the confidence of edge i, and F(x) is probability that an edge has confidence greater than or equal to x. F(x) is estimated by calculated by counting the number of edges with confidence greater than x. Using this criteria networks are built from seeds as in the naïve approach are grown one node at a time.

Data The Rosetta Inpharmatics Compendium Organism: S. cerevisiae 300 complete genomes (experiments) 276 deletion mutations 11 tetracyclin regulatable alleles 13 chemical treated cultures In this paper 565 genes analyzed

Pairwise Relations The method can recognize functional relationships missed by similiarity. Scores are reported as (Confidence, Pearson Correlation) Purine Biosynthesis pair: Novel Predictions: Literature search reveal strong support for this interaction. ADE2ADE1 (.797,.518) ESC4KU70 (.914,.162) Chromatin silencing DNA break repair

Seperator Relations Transcription Regulators: Nuclear Fusion Post-Translational Activation (by phosphorylation): cell wall integrity pathway Post-Translational Negative Regulation: G-protein mating signalling pathway FUS1 KAR4 AGA1 Rlm1p SLT2 Swi4/6 TEC1 SST2 STE6 -

Subnetwork Analysis They claim they often get modular components More structure than clustering alone Visual inspection can give clue to unknown gene functions STE12 missing and marginal position of FUS3 disturbing SST2 KAR4 TEC1SLT2KSS1 YLR343W YLR334CSLT2STE6 FUS1PRM1AGA1 FIG1FUS3AGA2TOM6 YEL059W

Conclusions This technique is better than clustering alone because confidence measures can detect interactions previously undetected. Also, we get more specific information about structure of interaction networks so it is easier to guess at unknown gene functions. Statistical significance of features allows biological exploration of interaction network. Can not recover all interactions No incorporation of previous biological knowledge