Protein Biophysics Explains Why Highly Abundant Proteins Evolve Slowly

Slides:



Advertisements
Similar presentations
Neutral Theory of Molecular Evolution most base substitutions are selectively neutral drift dominates evolution at the molecular level Under drift, rate.
Advertisements

TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Chapter 4 Linear Regression 1. Introduction Managerial decisions are often based on the relationship between two or more variables. For example, after.
Models of Molecular Evolution III Level 3 Molecular Evolution and Bioinformatics Jim Provan Page and Holmes: Sections 7.5 – 7.8.
Selectionist view: allele substitution and polymorphism
NEW TOPIC: MOLECULAR EVOLUTION.
In populations of finite size, sampling of gametes from the gene pool can cause evolution. Incorporating Genetic Drift.
The neutral theory of molecular evolution
Michael Epstein, Ben Calderhead, Mark A. Girolami, Lucia G. Sivilotti 
Low Dimensionality in Gene Expression Data Enables the Accurate Extraction of Transcriptional Programs from Shallow Sequencing  Graham Heimberg, Rajat.
Jacek Majewski  The American Journal of Human Genetics 
Adaptive Evolution of Gene Expression in Drosophila
Volume 112, Issue 7, Pages (April 2017)
Volume 13, Issue 9, Pages (December 2015)
Leigh K. Harris, Julie A. Theriot  Trends in Microbiology 
A Role for Codon Order in Translation Dynamics
Volume 66, Issue 6, Pages (June 2010)
Bhaswar Ghosh, Uddipan Sarma, Victor Sourjik, Stefan Legewie 
A Bacterial Growth Law out of Steady State
Single-Cell Analysis of Growth in Budding Yeast and Bacteria Reveals a Common Size Regulation Strategy  Ilya Soifer, Lydia Robert, Ariel Amir  Current.
Simple Linear Regression
Volume 27, Issue 18, Pages e3 (September 2017)
Yuki Hara, Akatsuki Kimura  Current Biology 
Volume 140, Issue 5, Pages (March 2010)
Fitness Effects of Fixed Beneficial Mutations in Microbial Populations
Volume 111, Issue 2, Pages (July 2016)
Ultrasensitivity in the Regulation of Cdc25C by Cdk1
Anastasia Baryshnikova  Cell Systems 
Volume 10, Issue 11, Pages (March 2015)
Volume 14, Issue 2, Pages (January 2016)
Volume 49, Issue 1, Pages (January 2013)
Volume 23, Issue 10, Pages (October 2016)
Andreas Hilfinger, Thomas M. Norman, Johan Paulsson  Cell Systems 
Multiple Timescales of Memory in Lateral Habenula and Dopamine Neurons
Volume 5, Issue 4, Pages e4 (October 2017)
Volume 14, Issue 7, Pages (February 2016)
Volume 17, Issue 12, Pages (December 2016)
Volume 16, Issue 1, Pages (June 2016)
Volume 49, Issue 2, Pages (January 2013)
Cofilin-Linked Changes in Actin Filament Flexibility Promote Severing
Evolution by Genetic Drift : Main Points (p. 231)
Volume 23, Issue 11, Pages (November 2015)
Michael M. Desai, Daniel S. Fisher, Andrew W. Murray  Current Biology 
Critical Timing without a Timer for Embryonic Development
Volume 8, Issue 4, Pages (August 2014)
Costs and Benefits of Mutational Robustness in RNA Viruses
Søren Vedel, Harry Nunns, Andrej Košmrlj, Szabolcs Semsey, Ala Trusina 
Wenfeng Qian, Di Ma, Che Xiao, Zhi Wang, Jianzhi Zhang  Cell Reports 
Volume 7, Issue 3, Pages e12 (September 2018)
Pier Francesco Palamara, Todd Lencz, Ariel Darvasi, Itsik Pe’er 
Stochastic Pacing Inhibits Spatially Discordant Cardiac Alternans
Michael Schlierf, Felix Berkemeier, Matthias Rief  Biophysical Journal 
Interbeat Interval Modulation in the Sinoatrial Node as a Result of Membrane Current Stochasticity—A Theoretical and Numerical Study  Hila Dvir, Sharon.
Leigh K. Harris, Julie A. Theriot  Trends in Microbiology 
Volume 15, Issue 11, Pages (June 2016)
Volume 20, Issue 7, Pages (August 2017)
Andrew E. Blanchard, Chen Liao, Ting Lu  Biophysical Journal 
Stochastic Pacing Inhibits Spatially Discordant Cardiac Alternans
Volume 11, Issue 4, Pages (April 2015)
Rafael Couñago, Stephen Chen, Yousif Shamoo  Molecular Cell 
Qian Cong, Dominika Borek, Zbyszek Otwinowski, Nick V. Grishin 
Volume 4, Issue 2, Pages e4 (February 2017)
Parul Mishra, Julia M. Flynn, Tyler N. Starr, Daniel N.A. Bolon 
Matthew R. Roesch, Adam R. Taylor, Geoffrey Schoenbaum  Neuron 
Fig. 1. —GO categories enriched in gene families showing high or low omega (dN/dS) values for Pneumocystis jirovecii. ... Fig. 1. —GO categories enriched.
Joana Pinto Vieira, Julien Racle, Vassily Hatzimanikatis 
Evolution by Genetic Drift : Main Points (p. 231)
Jérémie Barral, Frank Jülicher, Pascal Martin  Biophysical Journal 
Volume 106, Issue 8, Pages (April 2014)
Evolution of Specificity in Protein-Protein Interactions
Presentation transcript:

Protein Biophysics Explains Why Highly Abundant Proteins Evolve Slowly Adrian W.R. Serohijos, Zilvinas Rimas, Eugene I. Shakhnovich  Cell Reports  Volume 2, Issue 2, Pages 249-256 (August 2012) DOI: 10.1016/j.celrep.2012.06.022 Copyright © 2012 The Authors Terms and Conditions

Cell Reports 2012 2, 249-256DOI: (10.1016/j.celrep.2012.06.022) Copyright © 2012 The Authors Terms and Conditions

Figure 1 Multiscale Model of Protein Evolution Model: a constant population of Ne = 104 cells. Each cell contains 103 genes defined by its cellular abundance and the folding stability of its protein product. Because of intrinsic cytotoxicity (Bucciantini et al., 2002), protein misfolding is assumed to be the major determinant of organismic fitness (Drummond and Wilke, 2008). Assuming a two-state protein folding model (Privalov and Khechinashvili, 1974; Shakhnovich and Finkelstein, 1989), the number of misfolded proteins contributed by gene i (with abundance Ai and folding stability ΔGi) is a product of abundance and the probability of being unfolded, 1/(1 + exp(−βΔGi)), where β = 1/kBT. The total number of misfolded proteins in the cell is the sum of all misfolded proteins contributed by each gene. The factor c is the intrinsic fitness cost for every misfolded protein, measured in yeast to be ∼32/(total protein concentration) (Geiler-Samerotte et al., 2011). A random mutation in a gene changes its folding stability by ΔΔG (i.e., ΔGmutant − ΔGwild-type), which consequently increases or decreases the fitness of the cell. From classical population genetics, and assuming monoclonality, the fate of a mutation is determined by the selection coefficient s and the fixation probability Pfix (Crow and Kimura, 1970; Sella and Hirsh, 2005). Dynamics: all genes were assigned an initial stability value of −5 kcal/mol, then subjected to mutation, selection, and drift as indicated by the flowchart. The choice of initial ΔG values was irrelevant because the population would eventually reach the dynamic equilibrium imposed by mutation-selection balance (Movie S1). All analyses and calculations of ERs were performed only during the interval that the population is under mutation-selection balance. Cell Reports 2012 2, 249-256DOI: (10.1016/j.celrep.2012.06.022) Copyright © 2012 The Authors Terms and Conditions

Figure 2 Abundant Proteins Do Not Evolve Slowly When ΔΔG Is Independent of ΔG (A) Abundance-ER correlation from multiscale evolutionary simulations without ΔΔG versus ΔG correlation (r = 0.01). Red line indicates lowess-smoothed data. (B) Molecular clock surface without ΔΔG versus ΔG correlation. The molecular clock (Equation 4) can be integrated over possible ΔΔG (Equation 1) to yield a surface that is solely a function of premutation gene properties. The surface heuristically governs the evolution of a gene: a gene in the neutral regime (green) will acquire mutations (on average with ΔΔG > 0) that make it approach the gully (red line). A gene on the verge of misfolding (yellow) will strongly select for beneficial mutations (ΔΔG < 0) that bring it back to the gully or the neutral regime. A gene in the gully will wait the longest time before fixing a mutation. The gully defines the steady-state relationship between abundance, ΔG, and ER. Projection of the gully predicts no abundance-ER correlation. Instantaneous abundance and ΔG values of genes from evolutionary simulations are shown as blue dots (see Experimental Procedures). Cell Reports 2012 2, 249-256DOI: (10.1016/j.celrep.2012.06.022) Copyright © 2012 The Authors Terms and Conditions

Figure 3 ΔΔG versus ΔG Dependence Is Necessary for Abundance-ER Correlation (A) Biophysical origin of ΔΔG versus ΔG correlation. In the limit of the most stable sequence (blue), all mutations will lead to less stable sequences, hence its ΔΔG distribution will all be destabilizing; for the set of least stable sequences (i.e., random coils) (red), all mutations lead to sequences that are more stable, hence the distribution of ΔΔG will be biased toward all stabilizing mutations. Between these two extremes are wild-type sequences (green), which are not maximally stable (Kumar et al., 2006; Taverna and Goldstein, 2002; Zeldovich et al., 2007). See also Figure S1. (B) ΔΔG versus ΔG correlation in real proteins. Compiled empirical measurements of ΔG and ΔΔG values in the Protherm database (Kumar et al., 2006) exhibit a negative correlation (r = −0.2; p value = 10−22). A linear fit (red line) yields a slope of mΔG,ΔΔG = −0.13. (C) Molecular clock surface with ΔΔG versus ΔG correlation from ProTherm. Abundance and ΔG values of genes from multiscale evolutionary simulation validate the heuristic interpretation (blue dots) (see Movie S1). Two-dimensional projections of the gully predict highly abundant genes will evolve slowly. (D) Abundance-ER correlation is recapitulated in multiscale evolutionary simulations when ΔΔG versus ΔG correlation is taken into account (Spearman rank correlation r = −0.3; p value = 10−9). Red line indicates lowess-smoothed data. Cell Reports 2012 2, 249-256DOI: (10.1016/j.celrep.2012.06.022) Copyright © 2012 The Authors Terms and Conditions

Figure 4 Abundance-ER Anticorrelation Varies with Divergence Time ER is measured by counting the number of fixed mutations between two diverged genomes. Divergence time is measured as the number of substitutions per gene (Na) averaged over all genes in the genome (<Na>) normalized by the average length <L> of genes in the genome. See Experimental Procedures. (A and B) Abundance-ER correlation from multiscale evolutionary simulations. Rates estimated from recently diverged genomes (A, left) exhibit a broader distribution and a weaker correlation with abundance than distantly diverged genome (A, right). (B) The magnitude of abundance-ER correlation varies with divergence time. This variation is influenced by the dependence of ΔG on ΔΔG. (C and D) Abundance-ER correlation in γ-proteobacteria. Nonsynonymous ER (dN) was calculated between E. coli and orthologous genes in other γ-proteobacteria. For example, (C, left) is between E. coli and S. typhimurium and (C, right) is between E. coli and P. ingrahamii. (D) Full time variation of abundance-ER correlation. See Figure S2 for the complete abundance-ER correlations in other γ-proteobacteria. (E) γ-Proteobacteria phylogeny (adapted from Williams et al., 2010). Cell Reports 2012 2, 249-256DOI: (10.1016/j.celrep.2012.06.022) Copyright © 2012 The Authors Terms and Conditions

Figure S1 ΔΔG versus ΔG Correlation in the Phenomenological Model of ΔΔG Estimation, Related to Figure 3 We randomly draw a ΔΔG value from the distribution defined by Equation 5 to demonstrate that the phenomenological model captures the ΔΔG versus ΔG statistical correlation in Figure 3B. Cell Reports 2012 2, 249-256DOI: (10.1016/j.celrep.2012.06.022) Copyright © 2012 The Authors Terms and Conditions

Figure S2 Abundance-ER Correlations in γ-Proteobacteria, Related to Figure 4 ERs (non-synonymous substitutions, dN) are estimated using the orthologs between E. coli and the indicated organisms. Abundance is adopted from (Ishihama et al., 2008). r is Spearman’s rank correlation and n is the number of data points. Histograms show that the data points sample the broad range of abundance values in E. coli. Cell Reports 2012 2, 249-256DOI: (10.1016/j.celrep.2012.06.022) Copyright © 2012 The Authors Terms and Conditions