Mass-action equilibrium and non-specific interactions in protein interaction networks Sergei Maslov Brookhaven National Laboratory.

Slides:



Advertisements
Similar presentations
Unravelling the biochemical reaction kinetics from time-series data Santiago Schnell Indiana University School of Informatics and Biocomplexity Institute.
Advertisements

Non-Markovian dynamics of small genetic circuits Lev Tsimring Institute for Nonlinear Science University of California, San Diego Le Houches, 9-20 April,
Theory. Modeling of Biochemical Reaction Systems 2 Assumptions: The reaction systems are spatially homogeneous at every moment of time evolution. The.
Outlines Background & motivation Algorithms overview
Molecular mechanisms of long-term memory
An Intro To Systems Biology: Design Principles of Biological Circuits Uri Alon Presented by: Sharon Harel.
A model for the Initiation of Replication in Escherichia coli Author Joseph M. Mahaffy Judith W Zyskind J. theor. Biol
CSE Fall. Summary Goal: infer models of transcriptional regulation with annotated molecular interaction graphs The attributes in the model.
VL Netzwerke, WS 2007/08 Edda Klipp 1 Max Planck Institute Molecular Genetics Humboldt University Berlin Theoretical Biophysics Networks in Metabolism.
Max-Planck-Institut für molekulare Genetik Workshop „Systems Biology“ Berlin, Robustness and Entropy of Biological Networks Thomas Manke Max.
Multiscale Stochastic Simulation Algorithm with Stochastic Partial Equilibrium Assumption for Chemically Reacting Systems Linda Petzold and Yang Cao University.
Spontaneous recovery in dynamic networks Advisor: H. E. Stanley Collaborators: B. Podobnik S. Havlin S. V. Buldyrev D. Kenett Antonio Majdandzic Boston.
Weighted networks: analysis, modeling A. Barrat, LPT, Université Paris-Sud, France M. Barthélemy (CEA, France) R. Pastor-Satorras (Barcelona, Spain) A.
Complex Networks Third Lecture TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA TexPoint fonts used in EMF. Read the.
Detecting topological patterns in protein networks Sergei Maslov Brookhaven National Laboratory.
Extracting hidden information from knowledge networks Sergei Maslov Brookhaven National Laboratory, New York, USA.
Work Process Using Enrich Load biological data Check enrichment of crossed data sets Extract statistically significant results Multiple hypothesis correction.
A Real-life Application of Barabasi’s Scale-Free Power-Law Presentation for ENGS 112 Doug Madory Wed, 1 JUN 05 Fri, 27 MAY 05.
Signal Processing in Single Cells Tony 03/30/2005.
Genome-wide prediction and characterization of interactions between transcription factors in S. cerevisiae Speaker: Chunhui Cai.
Protein Binding Networks: from Topology to Kinetics Sergei Maslov Brookhaven National Laboratory.
Adaptive evolution of bacterial metabolic networks by horizontal gene transfer Chao Wang Dec 14, 2005.
Propagation of noise and perturbations in protein binding networks Sergei Maslov Brookhaven National Laboratory.
Copyright  2004 limsoon wong Assessing Reliability of Protein- Protein Interaction Experiments Limsoon Wong Institute for Infocomm Research.
Universality in ultra-cold fermionic atom gases. with S. Diehl, H.Gies, J.Pawlowski S. Diehl, H.Gies, J.Pawlowski.
Graph, Search Algorithms Ka-Lok Ng Department of Bioinformatics Asia University.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
DEMO CSE fall. What is GeneMANIA GeneMANIA finds other genes that are related to a set of input genes, using a very large set of functional.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Gaussian Processes for Transcription Factor Protein Inference Neil D. Lawrence, Guido Sanguinetti and Magnus Rattray.
Objectives of Multiple Regression
Lecture 3 Protein binding networks. C. elegans PPI from Li et al. (Vidal’s lab), Science (2004) Genome-wide protein binding networks Nodes - proteins.
Electrical Circuits Dr. Sarika Khushalani Solanki
Large-scale organization of metabolic networks Jeong et al. CS 466 Saurabh Sinha.
Analyzing the systemic function of genes and proteins Rui Alves.
Black hole production in preheating Teruaki Suyama (Kyoto University) Takahiro Tanaka (Kyoto University) Bruce Bassett (ICG, University of Portsmouth)
ANALYZING PROTEIN NETWORK ROBUSTNESS USING GRAPH SPECTRUM Jingchun Chen The Ohio State University, Columbus, Ohio Institute.
Network Biology Presentation by: Ansuman sahoo 10th semester
Converting Macromolecular Regulatory Models from Deterministic to Stochastic Formulation Pengyuan Wang, Ranjit Randhawa, Clifford A. Shaffer, Yang Cao,
Clustering of protein networks: Graph theory and terminology Scale-free architecture Modularity Robustness Reading: Barabasi and Oltvai 2004, Milo et al.
ECEN 5341/4341 Lecture 15 Feb 17, Noise Sources 2. Minimal levels of signal detection. 3. Some characteristic s of Neurons. An Important Reference.
Summary Various mathematical models describing gene regulatory networks as well as algorithms for network reconstruction from experimental data have been.
KIAS July 2006 RNA secondary structure Ground state and the glass transition of the RNA secondary structure RNA folding: specific versus nonspecific pairing.
Rates of Reactions Why study rates?
Intel Confidential – Internal Only Co-clustering of biological networks and gene expression data Hanisch et al. This paper appears in: bioinformatics 2002.
PH4705 & ET4305: Measurements Measurement: assign numbers to property of object or event to describe it The absolute true value of a measurement can’t.
1 Departament of Bioengineering, University of California 2 Harvard Medical School Department of Genetics Metabolic Flux Balance Analysis and the in Silico.
Mathematical Modeling of Signal Transduction Pathways Biplab Bose IIT Guwahati.
1 Stochasticity and robustness Steve Andrews Brent lab, Basic Sciences Division, FHCRC Lecture 5 of Introduction to Biological Modeling Oct. 20, 2010.
Unravelling the biochemical reaction kinetics from time-series data Santiago Schnell Indiana University School of Informatics and Biocomplexity Institute.
FAT Average lifetime (ps) GFP- Pax GFP-Pax + FAT- mCherry Lifetime (ns) Pax FAT Advanced Fluorescence Microscopy I: Fluorescence (Foster)
Protein folding dynamics and more Chi-Lun Lee ( 李紀倫 ) Department of Physics National Central University.
IMPROVED RECONSTRUCTION OF IN SILICO GENE REGULATORY NETWORKS BY INTEGRATING KNOCKOUT AND PERTURBATION DATA Yip, K. Y., Alexander, R. P., Yan, K. K., &
Genome Biology and Biotechnology The next frontier: Systems biology Prof. M. Zabeau Department of Plant Systems Biology Flanders Interuniversity Institute.
Introduction to biological molecular networks
Kinesin hydrolyses one ATP per 8-nm step Mark J. Schnitzer*† & Steven M. Block†‡ Departments of * Physics and † Molecular Biology, and ‡ Princeton Materials.
Transport in weighted networks: optimal path and superhighways Collaborators: Z. Wu, Y. Chen, E. Lopez, S. Carmi, L.A. Braunstein, S. Buldyrev, H. E. Stanley.
1 Lesson 12 Networks / Systems Biology. 2 Systems biology  Not only understanding components! 1.System structures: the network of gene interactions and.
Response network emerging from simple perturbation Seung-Woo Son Complex System and Statistical Physics Lab., Dept. Physics, KAIST, Daejeon , Korea.
CORRELATION-REGULATION ANALYSIS Томский политехнический университет.
Algorithms and Computational Biology Lab, Department of Computer Science and & Information Engineering, National Taiwan University, Taiwan Network Biology.
Kinetics State of reactionscan be described a couple of ways: State of reactionscan be described a couple of ways: Equilibrium – overall reactions (our.
Identifying submodules of cellular regulatory networks Guido Sanguinetti Joint work with N.D. Lawrence and M. Rattray.
Structural Bioinformatics Elodie Laine Master BIM-BMC Semester 3, Genomics of Microorganisms, UMR 7238, CNRS-UPMC e-documents:
A Stochastic Model of Cell Differentiation
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
Modelling Structure and Function in Complex Networks
Andreas Hilfinger, Thomas M. Norman, Johan Paulsson  Cell Systems 
Presentation transcript:

Mass-action equilibrium and non-specific interactions in protein interaction networks Sergei Maslov Brookhaven National Laboratory

Living cells contain crowded and diverse molecular environments Proteins constitute ~30% of E. coli and ~5% of yeast cytoplasm by weight ~2000 protein types are co-expressed co-localized in yeast cytoplasm

If that’s not difficult enough: they are all interconnected >80% of proteins are all connected in one giant cluster of PPI network Small-world effect median network distance – 6 steps Map of reproducible (>2 publications) protein-protein interactions in yeast

Why small-world property might cause problems? Interconnected binding networks could indiscriminately spread perturbations Systematic changes in expression: large changes in concentrations of a small number of proteins SM, I. Ispolatov, PNAS and NJP (2007) Noise: small changes in concentrations of a large number of proteins K.-K. Yan, D. Walker, SM, PRL (2008) How individual pathways can be turned on and off without upsetting the whole system ?

What about non-specific interactions? Proteins form transient non-specific bonds with random, non-functional partners For an organism to function specific interactions between proteins must dominate over non-specific ones: How much stronger ~N specific interactions between N proteins need to be to overcome ~N 2 non-specific interactions? What limits it imposes on the number of protein types and their concentrations? J. Zhang, SM, E. Shakhnovich, Molecular Systems Biology (2008)

My “spherical cow” assumptions Protein concentrations C i of all yeast proteins (under the rich growth medium conditions) and subcellular localizations are experimentally known (group of UCSF) Consider only reproducible independently confirmed protein-protein interactions for non-catalytic binding (kinase-substrate pairs~5%) The network: ~4000 heterodimers and ~100 multi-protein complexes (we assume no cooperative binding in complexes) connecting ~1700 proteins Know the relevant average of dissociation constants K ij ~10nM. Turned out their distribution around this average DOES NOT MATTER MUCH!!! Use “evolutionary motivated” binding strength: K ij =max(C i, C j )/const, which is sufficient to bind considerable fraction of twoproteins in a heterodimer

Law of Mass Action (LMA) dD AB /dt = r (on) AB F A F B – r (off) AB D AB In the equilibrium: D AB =F A F B /K AB ; C A = F A +D AB ; C B = F B +D AB or F A = C A /(1+ F B /K AB ) and F B = C B /(1+ F A /K AB ) In a network: A system of ~2000 nonlinear equations for F i that can be solved only numerically

Propagation of perturbations: the in silico study Calculate the unperturbed (wildtype) LMA equilibrium Simulate a twofold increase of the concentration C A  2C A of just one type of protein and recalculate equilibrium free concentrations F i of all other proteins Look for cascading perturbations: A  B  C  D with sign-alternation: A (  up), B (  down), C (  up), D (  down)

Cascades of perturbations exponentially decay (and sign alternate) with network distance S. Maslov, I. Ispolatov, PNAS, (2007);

Mapping to resistor network Conductivities  ij – heterodimer concentrations D ij Losses to the ground  iG – free (unbound) concentrations F i Perturbations spread along linear chains loosely conducting to neighbors and ground Mapping is exact for bi-partite networks  odd- length loops dampen perturbations S.Maslov, K. Sneppen, I. Ispolatov, New J. Phys, (2007)

Perturbations – large changes of few proteins Fluctuations – small changes of many proteins

Two types of fluctuations in equilibrium concentrations Driven fluctuations: changes in D ij driven by stochastic variations in total concentrations C i (random protein production/degradation) Spontaneous fluctuations: stochastic changes in D ij at fixed C i – described by equlibrium thermodynamics Both types propagate through network  network  isolated

Image by Cell Signaling Technology, Inc: Mitochondrial control of apoptosis

What limits do non-specific interactions impose on robust functioning of protein networks? J. Zhang, S. Maslov, E. Shakhnovich, MSB (2008) see talk on 8:48 AM in Room 411 (V39)

The effect of non-specific interactions grows with genome diversity m -- the number of co-expressed & co- localized proteins Compare 3 equilibrium concentrations of a typical protein: free (monomer) specific heterodimer, all non-specific heterodimers Need to know: protein concentrations: C i specific and non-specific dissociation constants: K (s) =K 0 exp(E (s) /kT), K (ns) =K 0 exp(E (ns) /kT Competition between specific and nonspecific interactions

log(C/K 0 ) K ij “Evolutionary motivated” K ij =max(C i, C j )/10 1  M 1 nM C i

We estimate the median non-specific energy to be E (ns) =-4kT  2.5kT or K (ns) =18mM Still thousands of pairs are below the 1  M (-14kT) detection threshold of Y2H which is 3.6 std. dev. away J. Zhang, SM, E. Shakhnovich, Molecular Systems Biology (2008) How to estimate E (ns) ? 1M1M 18mM log K (ns) Use false-positives in noisy high-throughput data!

cytoplasm mitochondria nucleus Phase diagram in yeast J. Zhang, SM, E. Shakhnovich, Molecular Systems Biology (2008) Evolution pushes the number of protein types m up for higher functional complexity, while keeping the concentration is as low as possible to reduce the waste due to non-specific interactions Still, on average proteins in yeast cytoplasm spend 20% of time bound in non-specific complexes

Collaborators and support Koon-Kiu Yan, Dylan Walker, Tin Yau Pang (BNL/Stony Brook) Iaroslav Ispolatov (Ariadne Genomics/BNL) Kim Sneppen (Center for Models of Life, Niels Bohr Institute, Denmark) Eugene Shakhnovich, Jingshan Zhang (Harvard) DOE DMS DE-AC02-98CH10886 NIH/NIGMS R01 GM068954

Thank you!

Conclusions Time to go beyond topology of PPI networks! Interconnected networks present a challenge for robustness: Perturbations and noise Non-specific interactions We were the first to attempt quantifying these effects on genome-wide scale Estimates will get better as we get better data on kinetic & equilibrium constants

Collaborators, papers, and support Koon-Kiu Yan, Dylan Walker, Tin Yau Pang (BNL/Stony Brook) Iaroslav Ispolatov (Ariadne Genomics/BNL) Kim Sneppen (Center for Models of Life, Niels Bohr Institute, Denmark) Eugene Shakhnovich, Jingshan Zhang (Harvard) DOE Division of Material Science, DE-AC02-98CH10886 NIH/NIGMS, R01 GM Propagation of large concentration changes in reversible protein binding networks, S. Maslov, I. Ispolatov, PNAS 104:13655 (2007); 2.Constraints imposed by non-functional protein–protein interactions on gene expression and proteome size, J. Zhang, S. Maslov, E. Shakhnovich, Molecular Systems Biology 4:210 (2008); 3.Fluctuations in Mass-Action Equilibrium of Protein Binding Networks K-K. Yan, D. Walker, S. Maslov, Phys Rev. Lett., 101, (2008); 4.Spreading out of perturbations in reversible reaction networks S. Maslov, K. Sneppen, I. Ispolatov, New Journal of Physics 9: 273 (2007); 5.Topological and dynamical properties of protein interaction networks. S. Maslov, book chapter in the " Protein-protein interactions and networks: Identification, Analysis and Prediction“, Springer-Verlag (2008);

Collective Effects Amplify Spontaneous Noise Collective effects significantly amplify (up to a factor of 20) spontaneous noise Is there an upper bound to this amplification?

Stochastic fluctuations in D* ij at fixed C i Free energy G, for a given occupation state Here is not independent but related to via

What limits do non-specific interactions impose on robust functioning of protein networks? J. Zhang, S. Maslov, E. Shakhnovich, Molecular Systems Biology (2008)

The effect of non-specific interactions grows with m -- the number of co-expressed & co-localized proteins Assume a protein is biologically active when bound to its unique specific interaction partner Compare 3 equilibrium concentrations: free (monomer), specific dimer, all non-specific dimers Need to know the average and distributions of: protein concentrations: C specific and non-specific dissociation constants: K (s) =K 0 exp(E (s) /kT), K (ns) =K 0 exp(E (ns) /kT) Dimensionless parameters: log(C/K 0 ), E (s) /kT, E (ns) /kT Competition between specific and nonspecific interactions

Limits on parameters For specific dimers to dominate over monomers: C  K (s)= =K 0 exp(E (s) /kT) For specific interactions to dominate over non- specific: C/K (s)  mC/K (ns) or m  exp[(E (ns)- E (s) )/kT] m C

Intra-cellular noise Noise typically means fluctuations in total concentrations C i (e.g. cell-to-cell variability measured for of all yeast proteins by Weissman UCSF) Needs to be converted into noise in biologically relevant dimer (D ij ) or monomer (F i ) concentrations Two types of noise: intrinsic (uncorrelated) and extrinsic (correlated) (M. Elowitz, U. Alon, et. al. (2005)) Intrinsic noise could be amplified by the conversion (sometimes as much as 30 times!) Extrinsic noise partially cancels each other Essential proteins seem to be more protected from noise and perturbations PNAS (2007), Phys. Rev. Lett. (2008)

Going beyond topology We already know a lot about topology of complex networks (scale-free, small-world, clustering, etc) Network is just a backbone for complex dynamical processes Time to put numbers on nodes/edges and study these processes For binding networks – governed by law of mass action

SM, I. Ispolatov, PNAS (2007) The total number of cascades is still significant The fraction of significantly (> noise level ~ 20%) affected proteins at distance D quickly decays --> exp(-  D) The total number of neighbors at distance D quickly rises --> exp( D) The number of affected proteins at distance D slowly decays --> exp(- (  - )D) D

Robustness with respect to assignment of K ij Spearman rank correlation: 0.89 Pearson linear correlation: 0.98 Bound concentrations: D ij Spearman rank correlation: 0.89 Pearson linear correlation: Free concentrations: F i SM, I. Ispolatov, PNAS, 104, (2007)

OK, protein binding networks are robust, but can cascading changes be used to send signals?

Robustness: Cascades of perturbations on average exponentially decay S.Maslov, K. Sneppen, I. Ispolatov, NJP (2007)

How robust is the mass-action equilibrium against perturbations? Less robust More robust

SM, I. Ispolatov, PNAS, 104, (2007) HHT1

SM, I. Ispolatov, PNAS, 104, (2007)

Perturbations propagate along dimers with large concentrations They cascade down the concentration gradient and thus directional Free concentrations of intermediate proteins are low SM, I. Ispolatov, PNAS, 104, (2007)

Non-specific phase diagram

Three states of a protein Each protein i has 3 possible states: C i =[ii’]+[i]+[iR] Concentrations are related by the Law of Mass Action Compare the 3 concentrations: [ii’] should dominate

Non-specific binding energies

Assume for nonspecific interactions scales with sum of surface hydrophobicities of two proteins Distribution of fraction of hydrophobic Aas on protein’s surface Distribution of is Gaussian (proportional to hydrophobicity) Model of nonspecific interactions E. J. Deeds, O. Ashenberg, and E. I. Shakhnovich, PNAS 103, 311 (2006)

Parameters of non-specific interactions out of high-throughput Y2H experiments Detection threshold K d * of K ij in Yeast 2-Hybrid experiments J. Estojak, R. Brent and E. A. Golemis. Mol. Cell. Biol. 15, 5820 (1995) If pairwise interactions are detected among N protein types < E* Interaction detected in Y2H if

Chemical potential description of non-specific interactions between proteins

Chemical potential of the system More hydrophobic surface  more likely to bind nonspecific. Probability to be monomeric follows the Fermi-Dirac distribution [ i ]>[ iR ] for E i > , and vise versa Find the chemical potential  by solving

Network Equilibrium Given a set of total concentrations and the protein interaction network, we can determine the equilibrium bound and unbound concentrations We can numerically solve these equations by iteration At equilibrium: This leads to a set of nonlinear equations: Of course, the network is not always in equilibrium. There are fluctuations away from equilibrium: Thus, given a set of total concentrations and a set of dissociation constants, equilibrium free and bound concentrations are uniquely determined.

Empirical PPI Network Curated genome-wide network of PPI interactions in Baker’s Yeast (S. cerevisiae) BIOGRID database: Interactions independently confirmed in at least two published experiments Genome wide set of protein abundances during log-phase growth Retain only interactions between proteins of known total concentration 1740 proteins involved in 4085 heterodimers PPI Net Protein abundance Dissociation constants Dissociation constants are not presently empirically known Denominator is chose to conform to the average association from the PINT database Evolutionary motivated dissociation: Minimum association necessary to bind a sizable fraction of dimers and 77 multi-protein complexes

Driven Fluctuations Consider a set of total concentrations that are typical in the cell (i.e., when the cell is in log growth phase) We want to examine small deviations in total concentration that arise as a result of: 1) Upstream noise in genetic regulation 2) Stochastic fluctuations in protein production/degradation mechanisms The typical time-scale of these small fluctuations in total concentration is minutes. We refer to these as driving fluctuations because they propagate through the network and drive fluctuations in dimer concentration network Driving fluctuations Driven fluctuations

Collective effects amplify fluctuations Significant amplification (up to 20-fold) compared to isolated dimers

Is Collective Amplification Bound? where: To answer this question, let us calculate the noise from the partition function using an alternate formalism Calculate average statistical quantities in the usual way: We can think of the set of total copy numbers as the size of the system notation: Suppressed concentration are unchanged