Download presentation
Presentation is loading. Please wait.
Published byShanon Jennings Modified over 9 years ago
1
Evolutionary Computing in the Study of Bio/Chemical Mechanisms Petr Kuzmič, Ph.D. BioKin, Ltd. Problem: Finding initial parameter estimates Solution: Differential Evolution (DE) Model selection strategy: "Supermodel Evolution" Example: Inhibition of Lethal Factor protease by curcumin
2
Supermodel Evolution in Bio/Chemical Kinetics 2 The task of mechanistic enzyme kinetics SELECT AMONG MULTIPLE CANDIDATE MECHANISMS concentration initial rate computer Select most plausible model MODELS competitive ? uncompetitive ? mixed type ? competitive ? DATA
3
Supermodel Evolution in Bio/Chemical Kinetics 3 Most models in bio/chemical kinetics are nonlinear LINEAR VS. NONLINEAR MODELS Linear y = A + k x Nonlinear y = A [1 - exp(-k x)] y = 2 [1 - exp(-5 x)] y = 2 +5 x
4
Supermodel Evolution in Bio/Chemical Kinetics 4 We need initial estimates of model parameters NONLINEAR MODELS REQUIRE INITIAL ESTIMATES OF PARAMETERS concentration initial rate DATA computer k +1 k -1 k +2 k +3 k -3 A GIVEN MODEL ESTIMATED PARAMETERS k +1 k -1 k +2 k +3 k -3 BEST-FIT PARAMETERS
5
Supermodel Evolution in Bio/Chemical Kinetics 5 Anthrax bacillus CUTANEOUS AND INHALATION ANTHRAX DISEASE
6
Supermodel Evolution in Bio/Chemical Kinetics 6 Lethal Factor (LF) protease from B. anthracis CLEAVES MITOGEN ACTIVATED PROTEIN KINASE KINASE (MAPKK) Inhibitor?
7
Supermodel Evolution in Bio/Chemical Kinetics 7 Neomycin B: an aminoglycoside inhibitor A POTENT INHIBITOR OF LF PROTEASE Streptomyces fradiae Neomycin B
8
Supermodel Evolution in Bio/Chemical Kinetics 8 Neomycin B mechanism: mixed-type noncompetitive NEOMYCIN B INHIBITION FOLLOWS A COMPLEX MOLECULAR MECHANISM Kuzmic et al. (2006) FEBS J. 273, 3054-3062.
9
Supermodel Evolution in Bio/Chemical Kinetics 9 Curcumin: an natural product inhibitor INHIBITION MECHANISM UNKNOWN Cuminum cyminum L. curcumin cumin
10
Supermodel Evolution in Bio/Chemical Kinetics 10 LF protease inhibition by curcumin: raw data SUBSTRATE INHIBITION (MAXIMUM ON SUBSTRATE SATURATION CURVE) [I] = 0 [I] = 22 M [I] = 44 M [I] = 88 M substrate concentration initial reaction rate
11
Supermodel Evolution in Bio/Chemical Kinetics 11 Two separate problems to solve A PREREQUISITE FOR MODEL DISCRIMINATION = FITTING INDIVIDUAL CANDIDATE MODELS 1. Focus on a single reaction mechanism: Given a model (rate equation), find the best-fit kinetic constants 2. Focus on multiple reaction mechanisms: a. Repeat 1. for all candidate models (mechanisms) b. Select the most plausible model
12
Supermodel Evolution in Bio/Chemical Kinetics 12 The mixed-type inhibition model CONTAINS FIVE KINETIC CONSTANTS DATA computer K m = ? K s = ? K i = ? K is = ? k cat = ? MODEL ESTIMATED PARAMETERS BEST-FIT PARAMETERS
13
Supermodel Evolution in Bio/Chemical Kinetics 13 First major difficulty: Sensitivity to initial estimates TRADITIONAL DATA-FITTING: RESULTS DEPEND ON THE INITIAL GUESS K m = 1 K s = 1 K i = 1 K is = 1 k cat = 1 K m = K s = K i ~ K is = k cat = 2000 0.6 0.0001 5100 "uncompetitive" ? ESTIMATE #1
14
Supermodel Evolution in Bio/Chemical Kinetics 14 First major difficulty: Sensitivity to initial estimates TRADITIONAL DATA-FITTING: RESULTS DEPEND ON THE INITIAL GUESS K m = 100 K s = 100 K i = 100 K is = 100 k cat = 100 K m < K s = K i = K is = k cat = 0.000001 0.004 0.002 0.02 54000 ESTIMATE #2
15
Supermodel Evolution in Bio/Chemical Kinetics 15 First major difficulty: Sensitivity to initial estimates TRADITIONAL DATA-FITTING: RESULTS DEPEND ON THE INITIAL GUESS K m = 10 K s = 100 K i = 10 K is = 100 k cat = 100 K m = K s = K i = K is = k cat = 10 110 40 14 68 mixed-type ? ESTIMATE #3
16
Supermodel Evolution in Bio/Chemical Kinetics 16 First major difficulty: Sensitivity to initial estimates TRADITIONAL DATA-FITTING: RESULTS DEPEND ON THE INITIAL GUESS K m = 10 K s = 100 K i = 10 K is = 100 k cat = 100 K m = K s = K i = K is = k cat = 10 110 40 14 68 ESTIMATE #3 Is this the best we can do with the mixed-type model? ?
17
Supermodel Evolution in Bio/Chemical Kinetics 17 The crux of the problem: Finding global minima Least-squares fitting only goes "downhill" How do we know where to start? MODEL PARAMETER SUM OF SQUARED DEVIATIONS (data - model) 2 global minimum data - model = "residual"
18
Supermodel Evolution in Bio/Chemical Kinetics 18 Charles Darwin to the rescue BIOLOGICAL EVOLUTION IMITATED IN "DE" ISBN-10: 3540209506 Charles Darwin (1809-1882)
19
Supermodel Evolution in Bio/Chemical Kinetics 19 Biological metaphor: "Gene, allele" gene BIOLOGYCOMPUTER...AAGTCG...GTAACCGG... four-letter alphabet variable length "keratin" sequence of bits representing a number...01110011000001101101110011... two letter alphabet fixed length (16 or 32 bits) "K M ""k cat "
20
Supermodel Evolution in Bio/Chemical Kinetics 20 "Chromosome, genotype, phenotype" genotype BIOLOGYCOMPUTER...AAGTCGGTTCGGAAGTCGGTTTA... keratin oncoprotein phenotype particular combination of all model parameters 011010110110011110011010001111101101 K M =4.56 V max =1.23 K is =78.9 full set of parameters
21
Supermodel Evolution in Bio/Chemical Kinetics 21 "Organism, fitness" genotype BIOLOGYCOMPUTER...AAGTCGGTTCGGAAGTCGGTTTA... keratin oncoprotein FITNESS: agreement between the data and the model FITNESS: "agreement" with the environment
22
Supermodel Evolution in Bio/Chemical Kinetics 22 "Population" BIOLOGYCOMPUTER low fitness V max KMKM K is medium fitness V max KMKM K is high fitness V max KMKM KisKis
23
Supermodel Evolution in Bio/Chemical Kinetics 23 DE Population size in DynaFit number of population members per optimized model parameter number of population members per order of magnitude
24
Supermodel Evolution in Bio/Chemical Kinetics 24 "Sexual reproduction, crossover" BIOLOGYCOMPUTER 01101011011001111001101 00011111011 01101011011001111001101 11100011011 mother father "sexual mating" probability p cross 01101011011001111001101 11100011011 child random crossover point V max KMKM K is
25
Supermodel Evolution in Bio/Chemical Kinetics 25 DE Crossover probability in DynaFit probability that child inherits father's genes, not mother's genes
26
Supermodel Evolution in Bio/Chemical Kinetics 26 "Mutation, genetic diversity" BIOLOGYCOMPUTER 01101011011 001111001101 11100011011 father V max KMKM K is 11100111011 001011010101 11001011001 mutant father V max (*) K M (*) K is (*) mutation
27
Supermodel Evolution in Bio/Chemical Kinetics 27 "Mutation, genetic diversity" 01101011011 001111001101 11100011011 uncle#1 V max (1) K M (1) K is (1) 11100111011 001011010101 11001011001 uncle#2 V max (2) K M (2) K is (2) THE "DIFFERENTIAL" IN DIFFERENTIAL EVOLUTION ALGORITHM - STEP 1 Compute difference between two randomly chosen "uncle" phenotypes subtract 11100111011 001011010101 11001011001 uncle#2 minus uncle #1 V max (2) -V max (1) K M (2) -K M (1) K is (2) -K is (1)
28
Supermodel Evolution in Bio/Chemical Kinetics 28 "Mutation, genetic diversity" 01101011011 001111001101 11100011011 father V max KMKM K is 11100111011 001011010101 11001011001 mutant father THE "DIFFERENTIAL" IN DIFFERENTIAL EVOLUTION ALGORITHM - STEP 2 Add weighted difference between two "uncle" phenotypes to "father" add a fraction of 11100111011 001011010101 11001011001 uncle#2 minus uncle #1 V max (2) -V max (2) K M (2) -K M (1) K is (2) -K is (1) V max * KM*KM* K is *
29
Supermodel Evolution in Bio/Chemical Kinetics 29 "Mutation, genetic diversity" THE "DIFFERENTIAL" IN DIFFERENTIAL EVOLUTION ALGORITHM K M * = K M + F (K M (1) K M (2) ) EXAMPLE: Michaelis-Menten equation "father""uncle 1""uncle 2" "mutant father" weight (fraction) mutation rate
30
Supermodel Evolution in Bio/Chemical Kinetics 30 DE Mutation rate in DynaFit fractional difference used in mutations K M * = K M + F (K M (1) K M (2) )
31
Supermodel Evolution in Bio/Chemical Kinetics 31 DE Mutation strategies in DynaFit six different mutation strategies (1, 2,... 6) details in the book:
32
Supermodel Evolution in Bio/Chemical Kinetics 32 "Selection" BIOLOGYCOMPUTER high fitness more likely to breed 0110101101100111100110100011111011 V max KMKM KisKis more likely to be carried to the next generation low sum of squares low fitness less likely to breed 0000000000111111111111100000000000 V max KMKM K is less likely to be carried to the next generation high sum of squares
33
Supermodel Evolution in Bio/Chemical Kinetics 33 Basic Differential Evolution Algorithm - Summary 1 Randomly create the initial population (size N) Repeat until almost all population members have very high fitness: 2 Evaluate fitness: sum of squares for all population members 5 Natural selection: keep child in gene pool if more fit than mother 4 Sexual reproduction: random crossover with probability P cross 3 Mutation: random gene modification (mutate father, weight F)
34
Supermodel Evolution in Bio/Chemical Kinetics 34 Application to curcumin: Mixed-type mechanism THREE EXAMPLES OF POPULATION MEMBERS (POPULATION SIZE n = 845) Km65.7630 Ks1.0488e-3 Ki8.7928e-10 Kis8.1374e+20 kcat3760400 POPUL. MEMBER #642POPUL. MEMBER #7 Km3713600 Ks60218 Ki427880 Kis153.55 kcat918170 POPUL. MEMBER #1 Km3.3355e-11 Ks7868500 Ki106.99 Kis4.0737e-9 kcat29.783
35
Supermodel Evolution in Bio/Chemical Kinetics 35 Application to curcumin: Mixed-type mechanism INITIAL DISTRIBUTION OF THE MICHAELIS-CONSTANT K M 24 orders of magnitude ESTIMATE #
36
Supermodel Evolution in Bio/Chemical Kinetics 36 Application to curcumin: Mixed-type mechanism INITIAL DISTRIBUTION OF ALL MODEL PARAMETERS Generation #1
37
Supermodel Evolution in Bio/Chemical Kinetics 37 Application to curcumin: Mixed-type mechanism INTERMEDIATE DISTRIBUTION OF MODEL PARAMETERS (EVOLUTION IS ONGOING) Generation #20
38
Supermodel Evolution in Bio/Chemical Kinetics 38 Application to curcumin: Mixed-type mechanism FINAL DISTRIBUTION OF MODEL PARAMETERS (EVOLUTION IS COMPLETED) Generation #125
39
Supermodel Evolution in Bio/Chemical Kinetics 39 Application to curcumin: Mixed-type mechanism THE SUPER-ORGANISM K m = K s = K i = K is = k cat = 10 110 40 14 68 The fittest "phenotype" in generation #125 Yes, this is the best we can do with the mixed-type model! !
40
Supermodel Evolution in Bio/Chemical Kinetics 40 The model selection problem remains A PREREQUISITE FOR MODEL DISCRIMINATION = FITTING INDIVIDUAL CANDIDATE MODELS 1. Focus on a single reaction mechanism: Given a model (rate equation), find the best-fit kinetic constants 2. Focus on multiple reaction mechanisms: a. Repeat 1. for all candidate models (mechanisms) b. Select the most plausible model ?
41
Supermodel Evolution in Bio/Chemical Kinetics 41 Model proliferation: CYP450 / reductase example COMBINATORIAL PROLIFERATION OF POSSIBLE MECHANISMS Global Analysis of Protein-Protein Interactions Reveals Multiple Cytochrome P450 2E1 / Reductase Complexes Arvind P. Jamakhandi ‡, Petr Kuzmič §, Daniel E. Sanders ‡, and Grover P. Miller ‡ Journal: Biochemistry Ms# BI-2006-003476.R2 IN PRESS EXAMPLE
42
Supermodel Evolution in Bio/Chemical Kinetics 42 Model proliferation: CYP450 / reductase example COMBINATORIAL PROLIFERATION OF POSSIBLE MECHANISMS P = cytochrome P450 (2E1) R = cytochrome reductase A = catalytically active N = inactive 42 separate mechanisms were examined
43
Supermodel Evolution in Bio/Chemical Kinetics 43 The "Supermodel" approach CREATE AN AGGREGATE MODEL ENCOMPASSING ALL POSSIBLE INTERACTIONS 1. Consider only the most complex model 2. Evolve all parameters using "DE" 3. Identify redundant parameters by examining the final distribution of fittest phenotypes 4. Eliminate redundant parameters thereby reducing the model ("small is beautiful") the most complex (realistic) model
44
Supermodel Evolution in Bio/Chemical Kinetics 44 The "Supermodel" for LF inhibition by curcumin COMPILE AN AGGREGATE OF ALL POSSIBLE MOLECULAR INTERACTIONS Substrate can bind with 1:1 or 2:1 stoichiometry Inhibitor can bind with 1:1 or 2:1 stoichiometry Substrate and inhibitor can bind at the same time Any enzyme-substrate complex can have catalytic activity ASSUMPTIONS
45
Supermodel Evolution in Bio/Chemical Kinetics 45 The "Supermodel" includes all standard mechanisms STANDARD MECHANISMS DIFFER ONLY BY VALUES OF KINETIC CONSTANTS IN THE "SUPERMODEL" "Competitive" mechanism is a subset of the "Supermodel" K ii K ss k sp = 0 K si k ip = 0
46
Supermodel Evolution in Bio/Chemical Kinetics 46 The "Supermodel" includes all standard mechanisms STANDARD MECHANISMS DIFFER ONLY BY VALUES OF KINETIC CONSTANTS IN THE "SUPERMODEL" "Uncompetitive" mechanism is a subset of the "Supermodel" K ii K ss k sp = 0 Ki Ki k ip = 0
47
Supermodel Evolution in Bio/Chemical Kinetics 47 "Supermodel" evolution: Curcumin inhibition of LF INITIAL DISTRIBUTION OF MODEL PARAMETERS (POPULATION SIZE = 1355) total stability constants
48
Supermodel Evolution in Bio/Chemical Kinetics 48 "Supermodel" evolution: Curcumin inhibition of LF FINAL DISTRIBUTION OF MODEL PARAMETERS 10 -12 ZERO
49
Supermodel Evolution in Bio/Chemical Kinetics 49 Curcumin inhibition of LF: Confidence intervals ARE ALL PARAMETERS NECESSARY? logarithm of parameter sum of squares kpkp K ii 95% likelihood
50
Supermodel Evolution in Bio/Chemical Kinetics 50 Curcumin inhibition of LF: Final model FULLY EVOLVED "SUPERMODEL" "Partial Mixed-Type Noncompetitive" "With Substrate Inhibition"
51
Supermodel Evolution in Bio/Chemical Kinetics 51 Summary and Conclusions DIFFERENTIAL EVOLUTION HAS PROVED USEFUL IN BIO/CHEMICAL KINETICS Given a particular model, we can always find globally optimal parameters: DE avoids false minima and other pathologies in nonlinear data fitting Given a set of possible bio/chemical processes, we can "evolve" the most plausible model: - 1. Evolve a "Supermodel" - 2. Exclude redundant reaction steps - 3. Check confidence intervals
52
Supermodel Evolution in Bio/Chemical Kinetics 52 Acknowledgements: Lethal Factor protease work Hawaii Biotech currently Panthera BioPharma Mark Goldman Sheri Millis Lynne Cregar Aiea, Island of Oahu, Hawaii National Institutes of Health Grant No. R43 AI52587-02 U.S. Army Medical Research and Materials Command Contract No. V549P-6073
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.