Download presentation
Presentation is loading. Please wait.
1
Inferring subnetworks from perturbed expression profiles Dana Pe’er, Aviv Regev, Gal Elidan and Nir Friedman Bioinformatics, Vol.17 Suppl. 1 2001
2
Motivation Expression profiles give genome wide information about the state of metabolism, gene regulation, signal transduction, etc. One would like to infer functional relationships between the genes from this data. Perturbations such as mutations give insight into the effects of particular genes and help us infer causal relationships.
3
Tool – Bayesian Networks Random Variables: Gene Expression Levels Probabilistic Dependencies: Regulatory Interactions Z YX Pr(X|Z)Pr(Y|Z)
4
Goal of Paper Extend Bayesian framework in cellular context to deal with mutations Develop better methods to discretize data Define and learn new features in our model such as mediators, activators, and inhibitors Construct subnetworks of strong statistical significance
5
Learning Networks Network is learned through maximizing a score function with respect to the collected data. D = Data; G = Graph; Pa = Parent; X = expression level; m = sample number
6
Equivalent Graphs Two graphs may imply the same dependencies and are called equivalent. XYXY = So instead of directed graphs we make partially directed graphs. XYZ ZZ
7
Learning with Mutations If gene X is mutated we replace its expression level by a constant. For example if X is knocked out, its expression is replaced by 0. Our new score function is: Where Int(m) is the set of “intervened” (mutated) variables in experiment m. Notice that two structurally equivalent graph are no longer guaranteed to get the same score. If two graphs get the same score under this scoring function they are called “intervention equivalent.”
8
Other Perturbations Temperature sensitivity, kinetic mutations, and environmental stress can also be model in the Bayesian Network framework. A node is added for each condition which can take the values “on” or “off.” YX Temperature Z
9
What Do Bayesian Networks Buy Us 1) Markov Neighbors (Direct Relationships) XY XY Z XY
10
2) Activator/Inhibitors XY Let U = Parents(Y) – X. If for all states u of U we have: A B increasing as X increases then we say X is an activator. decreasing as X decreases then we say X is an inhibitor.
11
3) d-Seperation: Mediators X Y Z U Both Z and U d-separate X and Y. In this framework they would be called mediators of X and Y.
12
Feature Confidence A confidence can be associated with each feature which measures how sure we are about truth of the detected feature. This confidence is given by: where f(G) is the indicator function of the feature of interest
13
Building Significant Subnetworks 1)Naïve Approach: For some threshold, T, find all if edges such that confidence is above T. For all maximally connected subgraphs of size greater than 3, grow out the graph by adding edges which have confidence greater than some weaker threshold S. X B Z Y A
14
2) Score-based Approach They want to build a subnetwork and associate a score measuring the networks significance. If we build a network with k nodes from a possible n nodes and include k edges, the score we assign the network is: where K is (k choose 2), the number of possible edges on k nodes, c i is the confidence of edge i, and F(x) is probability that an edge has confidence greater than or equal to x. F(x) is estimated by calculated by counting the number of edges with confidence greater than x. Using this criteria networks are built from seeds as in the naïve approach are grown one node at a time.
15
Data The Rosetta Inpharmatics Compendium Organism: S. cerevisiae 300 complete genomes (experiments) 276 deletion mutations 11 tetracyclin regulatable alleles 13 chemical treated cultures In this paper 565 genes analyzed
16
Pairwise Relations The method can recognize functional relationships missed by similiarity. Scores are reported as (Confidence, Pearson Correlation) Purine Biosynthesis pair: Novel Predictions: Literature search reveal strong support for this interaction. ADE2ADE1 (.797,.518) ESC4KU70 (.914,.162) Chromatin silencing DNA break repair
17
Seperator Relations Transcription Regulators: Nuclear Fusion Post-Translational Activation (by phosphorylation): cell wall integrity pathway Post-Translational Negative Regulation: G-protein mating signalling pathway FUS1 KAR4 AGA1 Rlm1p SLT2 Swi4/6 TEC1 SST2 STE6 -
18
Subnetwork Analysis They claim they often get modular components More structure than clustering alone Visual inspection can give clue to unknown gene functions STE12 missing and marginal position of FUS3 disturbing http://www.cs.huji.ac.il/labs/compbio/ismb01/ SST2 KAR4 TEC1SLT2KSS1 YLR343W YLR334CSLT2STE6 FUS1PRM1AGA1 FIG1FUS3AGA2TOM6 YEL059W
19
Conclusions This technique is better than clustering alone because confidence measures can detect interactions previously undetected. Also, we get more specific information about structure of interaction networks so it is easier to guess at unknown gene functions. Statistical significance of features allows biological exploration of interaction network. Can not recover all interactions No incorporation of previous biological knowledge
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.