Presentation is loading. Please wait.

Presentation is loading. Please wait.

Pathway Modeling and Problem Solving Environments Cliff Shaffer Department of Computer Science Virginia Tech Blacksburg, VA 24061.

Similar presentations


Presentation on theme: "Pathway Modeling and Problem Solving Environments Cliff Shaffer Department of Computer Science Virginia Tech Blacksburg, VA 24061."— Presentation transcript:

1 Pathway Modeling and Problem Solving Environments Cliff Shaffer Department of Computer Science Virginia Tech Blacksburg, VA 24061

2 The Fundamental Goal of Molecular Cell Biology

3 Application: Cell Cycle Modeling How do cells convert genes into behavior? Create proteins from genes Protein interactions Protein effects on the cell Our study organism is the cell cycle of the budding yeast Saccharomyces cerevisiae.

4 S cell division G1 DNA replication G2 M (mitosis)

5 growth Clb5 MBF P Sic1 SCF Sic1 Swi5 Clb2 Mcm1 APC Cdc14 CDKs Cln2 SBF ? and Cln3 Bck2 DNA synthesis Inactive trimer P Clb2 Budding Cdc20 Cdh1 Mcm1 Mad2 unaligned chromosomes RENT Cdc14 APC-P Cln2 Clb2 Clb5 Lte1 SBF Esp1 Pds1 Net1 Net1P PPX Cdc15/MEN Tem1-GDP Tem1-GTP Bub2 unaligned chromosomes Cdh1 Sister chromatid separation Mcm1 Cdc20 Mitosis

6 Modeling Techniques One method: Use ODEs that describe the rate at which each protein concentration changes Protein A degrades protein B: … with initial condition [A](0) = A 0. Parameter c determines the rate of degradation. Sometimes modelers use “creative” rate laws to approximate subsystems

7 synthesisdegradation synthesis degradation binding activation inactivation Mathematical Model

8 Time (min) CKI mass Clb2 Cln2 Cdh1 Simulation of the budding yeast cell cycle G1S/M Cdc20

9

10 Differential equations Parameter values k1 = 0.0013, v2’ = 0.001, v2” = 0.17, k3’ = 0.02, k3” = 0.85, k4’ = 0.01, k4” = 0.9, J3 = 0.01, J4 = 0.01, k9 = 0.38, k10 = 0.2, k5’ = 0.005, k5” = 2.4, J5 = 0.5, k6 = 0.33, k7 = 2.2, J7 = 0.05, k8 = 0.2, J8 = 0.05, … Experimental Data

11 Tyson’s Budding Yeast Model Tyson’s model contains over 30 ODEs, some nonlinear. Events can cause concentrations to be reset. About 140 rate constant parameters Most are unavailable from experiment and must set by the modeler

12 Fundamental Activities Collect information Search literature (databases), Lab notebooks Define/modify models A user interface problem Run simulations Equation solvers (ODEs, PDEs, deterministic, stochastic) Compare simulation results to experimental data Analysis

13 Modeling Lifecycle

14 Our Mission: Build Software to Help the Modelers Typical cycle time for changing the model used to be one month Collect data on paper lab notebooks Convert to differential equations by hand Calibrate the model by trial and error Inadequate analysis tools Goal: Change the model once per day. Bottleneck should shift to the experimentalists

15 Another View Current models of simple organisms contain a few 10s of equations. To model mammalian systems might require two orders of magnitude in additional complexity. We hope our current vision for tools can supply one order of magnitude. The other order of magnitude is an open problem.

16 JigCell Current Primary Software Components: JigCell Model Builder JigCell Run Manager JigCell Comparator Automated Parameter Estimation (PET) Bifurcation Analysis (Oscill8) http://jigcell.biol.vt.edu

17 Model Builder Run Manager Comparator Parameter Values Parameter Optimizer Optimum Parameter Values

18 From a wiring diagram… JigCell Model Builder

19 N.B. Parameters are given names, not numerical values! …to a reaction mechanism … to ordinary differential equations (ode files, SBML) JigCell Model Builder

20 Mutations Wild type cell Mutations Typically caused by gene knockout Consider a mutant with no B to degrade A. Set c = 0 We have about 130 mutations each requires a separate simulation run

21 Inheritance patterns Basal Set (wild-type) Derived Set (mutant A) Derived Set (mutant B) Derived Set (mutant C) Derived Set (mutant A’) Derived Set (mutant AB) Derived Set (mutant A’C) Run Manager

22 JigCell Run Manager

23 Phenotypes Each mutant has some observed outcome (“experimental” data). Generally qualitative. Cell lived Cell died in G1 phase Model should match the experimental data. Model should not be overly sensitive to the rate constants. Overly sensitive biological systems tend not to survive

24 Visualize results Kumagai1Kumagai2 Comparator

25

26 Optimization How to decide on parameter values? Key features of optimization Each problem is a point in multidimensional space Each point can be assigned a value by an objective function The goal is to find the best point in the space as defined by the objective function We usually settle for a “good” point

27 Parameter Optimization

28 Error Function orthogonal distance regression Levenberg-Marquardt algorithm Parameter Optimization

29 Only 1 experiment shown here. The model must be fitted simultaneously to many different experiments. Parameter Optimization

30 Global DIRECT Search (DIViding RECTangles)

31

32

33

34 Composition Motivation Models are reaching the limits of manageability due to an increase in: Size Complexity Making a model suitable for stochastic simulation increases the number of reactions by a factor of 3-5. Models of the mammalian cell cycle will require 100-1000 reactions (even more for stochastic simulation).

35 Model Composition Notice that the yeast cell diagram contains natural components

36 Composition Processes Fusion Merging two or more existing models Composition Build up model hierarchy from existing models by describing their interactions and connections Aggregation Connects modular blocks using controlled interfaces (ports) Flattening Convert hierarchy back into a single “flat” model for use with standard simulators

37 Composition Processes

38 Sample Sub-models

39 Sample Composed Model

40 Composition Wizard Final Species Mapping Table

41 Composition Wizard Final Reaction Mapping Table

42 Aggregated Submodels

43 Final Aggregated Model

44 Aggregation Connector

45 Composition in SBML Virginia Tech’s proposed language features to support composition/aggregation being written into forthcoming SBML Level 3 definition

46 Stochastic Simulation ODE-based (deterministic) models cannot explain behaviors introduced by random nature of the system. Variations in mass of division Variations in time of events Differences in gross outcomes

47 Gillespie’s Stochastic Simulation Algorithm There is a population for each chemical species There is a “propensity” for each reaction, in part determined by population Each reaction changes population for associated species Loop: Pick next reaction (random, propensity) Update populations, propensities Slow, there are approximations to speed it up

48 Comments on Collaboration Domain team routinely underestimates how difficult it is to create reliable and usable software. CS team routinely underestimates how difficult it is to stay focused on the needs of the domain team. Partial solution: truly integrate.

49 How to Succeed in CBB Programming skills are necessary but not sufficient Math is usually the biggest bottleneck Statistics for Bioinformatics Numerical analysis, optimization, differential equations for computational biology Chemistry/biochemistry are good choices for domain knowledge You have to have an “interdisciplinary attitude”


Download ppt "Pathway Modeling and Problem Solving Environments Cliff Shaffer Department of Computer Science Virginia Tech Blacksburg, VA 24061."

Similar presentations


Ads by Google