Combinatorially Complex Equilibrium Model Selection Tom Radivoyevitch Assistant Professor Epidemiology and Biostatistics Case Western Reserve University Website:
Ultimate Goal Present Future Safer flying airplanes with autopilots Safer flying airplanes with autopilots Ultimate Goal: individualized, state feedback based clinical trials Ultimate Goal: individualized, state feedback based clinical trials Better understanding => better control Better understanding => better control Conceptual models help trial designs today Conceptual models help trial designs today Computer models of airplanes help train pilots and autopilots Computer models of airplanes help train pilots and autopilots Radivoyevitch et al. (2006) BMC Cancer 6:104 Systems Cancer Biology
dNTP Supply System Figure 1. dNTP supply. Many anticancer agents act on or through this system to kill cells. The most central enzyme of this system is RNR. UDP CDP GDP ADP dTTP dCTP dGTP dATP dT dC dG dA DNA dUMP dU TS CDA dCK DNA polymerase TK1 cytosol mitochondria dT dC dG dA TK2 dGK dTMP dCMP dGMP dAMP dTTP dCTP dGTP dATP 5NT NT2 cytosol nucleus dUDP dUTP U-DNA dUTPase dN dCK flux activation inhibition ATP RNR dCK
R1 R2 R1 R2 UDP, CDP, GDP, ADP bind to catalytic site dTTP, dGTP, dATP, ATP bind to selectivity site dATP inhibits at activity site, ATP activates at activity site? Selectivity site binding promotes R1 dimers. R2 is always a dimer. ATP drives hexamer. Controversy: dATP drives inactive tetramer vs. inactive hexamer Controversy: Hexamer binds one R2 2 vs. three R2 2 Total concentrations of R1, R2 2, dTTP, dGTP, dATP, ATP and NDPs control the distribution of R1-R2 complexes and this changes in S, G 1 -G 2 and G 0 ATP activates at hexamerization site?? RNR Literature R2
Michaelis-Menten Model With RNR: no NDP and no R2 dimer => k cat of complex is zero. Otherwise, many different R1-R2-NDP complexes can have many different k cat values. E + S ES
solid line = Eqs. (1-2) dotted = Eq. (3) Data from Scott, C. P., Kashlan, O. B., Lear, J. D., and Cooperman, B. S. (2001) Biochemistry 40(6), R=R1 r=R2 2 G=GDP t=dTTP Substitute this in here to get a quadratic in [S] which solves as Bigger systems of higher polynomials cannot be solved algebraically => use ODEs (above) Michaelis-Menten Model [S] vs. [S T ] [S] vs. [S T ] (3)
E ES EI ESI E ES EI ESI E ES EI ESI E ES EI E ES EI ESI E EI ESI E ES ESI E EI ESI E ES ESI = = E EI E ESI E ES E Competitive inhibition uncompetitive inhibition if k cat_ESI =0 E | ES EI | ESI noncompetitive inhibition Example of K d =K d ’ Model = = Let p be the probability that an E molecule is undamaged. Then in each model [E T ] can be replace with p[E T ] to double the number of models to 2*( )=24. E EI E ESI E ES K j =0 Models as K ES approaches 0 Enzyme, Substrate and Inhibitor
Total number of spur graph models is 16+4=20 Radivoyevitch, (2008) BMC Systems Biology 2:15 Rt Spur Graph Models R RR RRtt RRt Rt R RRtt RRt Rt R RR RRtt Rt R RR RRt Rt R RR RRtt RRt R RRtt Rt R RRt Rt R RR Rt 3B 3C 3D3E 3F3G3H 3A R RRtt RRt R RR RRtt R RR RRt R Rt R RRtt R RRt 3J 3I 3K R RR R 3L 3M3N 3O 3P R Rt R RRtt R RRt R RR 3Q 3R 3S3T R = R1 t = dTTP for dTTP induced R1 dimerization
R t RR t Rt R t RRt t Rt RRtt K d_R_R K d_Rt_R K d_Rt_Rt K d_R_t K d_RRt_t K d_RR_t = = = = 2A R t RR t Rt R t RRt t Rt RRtt K d_R_R K d_Rt_R K d_Rt_Rt K d_R_t K d_RRt_t K d_RR_t = = = 2B R t RR t Rt R t RRt t Rt RRtt K d_R_R K d_Rt_R K d_Rt_Rt K d_R_t K d_RRt_t K d_RR_t 2C | | | | | = = = | | R t RR t Rt R t RRt t Rt RRtt K d_R_R K d_Rt_R K d_Rt_Rt K d_R_t K d_RRt_t K d_RR_t 2D | | R t RR t Rt R t RRt t Rt RRtt K d_R_R K d_Rt_R K d_Rt_Rt K d_R_t K d_RRt_t K d_RR_t 2E = = 2A 2B 2C 2D2E 2G 2I 2K | | | | | | | | | | | | | | | = = = = 2F 2H 2J 2L 2M 2N Figure 3. Spur graph models. The following models are equivalent: 3A=2F, 3B=2H, 3C=2J, 3D=2L, 3E=2N Acyclic spanning subgraphs are reparameterizations of equilvalent models 2F0 Figure 2. Grid graph models. 2F12F22F32F42F52F62F72F8 Standardize: take E-shapes and sub E-shapes as defaults Use n-shapes if needed. Other shapes are possible 3B 3C 3D3E3F3G3H3I 3J3K3L3M 3N 3O3P 3Q 3R 3S 3T 3A Rt Grid Graph Models
AIC c = 2P+N*log(SSE/N)+2P(P+1)/(N-P-1) Data and fit from Scott, C. P., Kashlan, O. B., Lear, J. D., and Cooperman, B. S. (2001) Biochemistry 40(6), ~10-fold deviations from Scott et al. (initial values) titration model has lowest AIC Radivoyevitch, (2008) BMC Systems Biology 2:15 Application to Data 2E = = R RRtt 3Rp Infinitely tight binding situation wherein free molecule annihilation (the initial linear ramp) continues in a one-to-one fashion with increasing [dTTP] T until [dTTP] T equals [R1] T =7.6 µ M, the plateau point where R exists solely as RRtt. Experiment becomes a titration scan of [t T ] to estimate [R T ], but [R T ]=7.6 µ M was already known.
Model Space Fit with New Data Radivoyevitch, (2008) BMC Systems Biology 2:15
Yeast R1 structure. Chris Dealwis’ Lab, PNAS 102, , 2006
One additional data point here would reject 3Rp If so, new data here would be logical next No need to constrain data collection to orthogonal profiles
Model Space Predictions Best next 10 measurements if 3Rp is rejected Show new data + R Code
Comments on Methods
= 28 parameters => 2 28 =2.5x10 8 spur graph models via K j =∞ hypotheses (if average fit = 60 sec, then need 60 cores running for 8 straight years) 28 models with 1 parameter, 428 models with 2, 3278 models with 3, with 4 [dATP] uM a = dATP Kashlan et al. Biochemistry :462
Human Thymidine Kinase 1 Assumptions (1)binding of enzyme monomers to form enzyme dimers is approximately infinitely tight across the enzyme concentrations of interest (2)behavior of any higher order enzyme structures (if they exist) is dominated by the behavior of dimers within such structures (3)enzyme concentrations are low enough that free substrate concentrations approximately equal total substrate concentrations. Max(E 0 )=200 pM Min(S T )=50 nM => OK OK Questionable (but I need to start simple)
3-parameter models 2-parameter models 4-parameter model.
Eq. (25): k 1 = k 2 = 0, k 3 > 0 and K 1 = K 2 = K 3 Fits to human thymidine kinase 1 data of Birringer et al. Protein Expr Purif 2006, 47(2): Eq. (7): k 1 = k 3 and K 2 = ∞ and K 1 = K 3 EQSSEAICc α 1 V∞V∞ V1V1 α2α2 n eq eq eq eq eq eq eq eq eq eq eq eq eq213.00E eq eq eq eq eq eq Same top 4 models found by fits to Frederiksen H, Berenstein D, Munch-Petersen B: Eur J Biochem 2004, 271(11): Hill rises from 5 th, 6 th and worse to 2 nd if older data is merged in Conclusion: across the board Hill fits should be avoided since it is not biologically plausible at non-integer n and since it needs competition to guide subsequent experiments
Acknowledgements Case Comprehensive Cancer Center Case Comprehensive Cancer Center NIH (K25 CA104791) NIH (K25 CA104791) Chris Dealwis (CWRU Pharmacology) Chris Dealwis (CWRU Pharmacology) Sanath Wijerathna (CWRU Pharmacology) Sanath Wijerathna (CWRU Pharmacology) Anders Hofer (Umea) Anders Hofer (Umea) Thank you Thank you