Deterministic Global Parameter Estimation for a Budding Yeast Model T.D Panning*, L.T. Watson*, N.A. Allen*, C.A. Shaffer*, and J.J Tyson + Departments.

Slides:



Advertisements
Similar presentations
Yinyin Yuan and Chang-Tsun Li Computer Science Department
Advertisements

The Effects of Molecular Noise and Size Control on Variability in the Budding Yeast Cell Cycle  Talia et al, Nature, 23 August 2007  William Morejón.
John J. Tyson Biological Sciences, Virginia Tech
Computational Modeling of the Cell Cycle
Data Mining Methodology 1. Why have a Methodology  Don’t want to learn things that aren’t true May not represent any underlying reality ○ Spurious correlation.
Experimental Design, Response Surface Analysis, and Optimization
Global Mapping of the Yeast Genetic Interaction Network Tong et. al, Science, Feb 2004 Presented by Bowen Cui.
Chapter 6 Feature-based alignment Advanced Computer Vision.
Optimal Design Laboratory | University of Michigan, Ann Arbor 2011 Design Preference Elicitation Using Efficient Global Optimization Yi Ren Panos Y. Papalambros.
Introduction to yeast genetics Michelle Attner July 24, 2012.
DARPA BioComp PI Meeting, 2001 “The Eukaryotic Cell Cycle as a Test Case for Modeling Cellular Regulation in a Collaborative Problem Solving Environment”
1 Learning Dynamic Models from Unsequenced Data Jeff Schneider School of Computer Science Carnegie Mellon University joint work with Tzu-Kuo Huang, Le.
Profile Hidden Markov Models Bioinformatics Fall-2004 Dr Webb Miller and Dr Claude Depamphilis Dhiraj Joshi Department of Computer Science and Engineering.
Content Based Image Clustering and Image Retrieval Using Multiple Instance Learning Using Multiple Instance Learning Xin Chen Advisor: Chengcui Zhang Department.
Multiobjective VLSI Cell Placement Using Distributed Simulated Evolution Algorithm Sadiq M. Sait, Mustafa I. Ali, Ali Zaidi.
Evaluation of Fast Electrostatics Algorithms Alice N. Ko and Jesús A. Izaguirre with Thierry Matthey Department of Computer Science and Engineering University.
MAE 552 – Heuristic Optimization Lecture 6 February 6, 2002.
Detecting Image Region Duplication Using SIFT Features March 16, ICASSP 2010 Dallas, TX Xunyu Pan and Siwei Lyu Computer Science Department University.
Introduction of Cancer Molecular Epidemiology Zuo-Feng Zhang, MD, PhD University of California Los Angeles.
Aguda & Friedman Chapter 6 The Eukaryotic Cell-Cycle Engine.
Modeling the Cell Cycle with JigCell and DARPA’s BioSPICE Software Departments of Computer Science* and Biology +, Virginia Tech Blacksburg, VA Faculty:
Genetic Algorithms Overview Genetic Algorithms: a gentle introduction –What are GAs –How do they work/ Why? –Critical issues Use in Data Mining –GAs.
Pathway Modeling and Problem Solving Environments Cliff Shaffer Department of Computer Science Virginia Tech Blacksburg, VA
BIONFORMATIC ALGORITHMS Ryan Tinsley Brandon Lile May 9th, 2014.
Operations Research Models
The DARPA BioSPICE Project Clifford A. Shaffer Department of Computer Science Virginia Tech.
BsysE595 Lecture Basic modeling approaches for engineering systems – Summary and Review Shulin Chen January 10, 2013.
Composition and Aggregation in Modeling Regulatory Networks Clifford A. Shaffer* Ranjit Randhawa* John J. Tyson + Departments of Computer Science* and.
Converting Macromolecular Regulatory Models from Deterministic to Stochastic Formulation Pengyuan Wang, Ranjit Randhawa, Clifford A. Shaffer, Yang Cao,
Comparison of methods for reconstruction of models for gene expression regulation A.A. Shadrin 1, *, I.N. Kiselev, 1 F.A. Kolpakov 2,1 1 Technological.
Pairwise Sequence Alignment. The most important class of bioinformatics tools – pairwise alignment of DNA and protein seqs. alignment 1alignment 2 Seq.
ENM 503 Lesson 1 – Methods and Models The why’s, how’s, and what’s of mathematical modeling A model is a representation in mathematical terms of some real.
GENETIC ALGORITHMS.  Genetic algorithms are a form of local search that use methods based on evolution to make small changes to a popula- tion of chromosomes.
The Frequency Dependence of Osmo-Adaptation in Saccharomyces cerevisiae Jerome T. Mettetal, et al. Science 319, 482 (2008); William J. Gibson.
BioSPICE and Problem-Solving Environments for Systems Biology Clifford A. Shaffer Department of Computer Science Virginia Tech.
1 Using JigCell and other BioSPICE Tools to Understand the Regulation of Cell Growth and Division John J. Tyson Virginia Polytechnic Institute and State.
2005MEE Software Engineering Lecture 11 – Optimisation Techniques.
User Interface Paradigms for Describing Pathway Models Cliff Shaffer Department of Computer Science Virginia Tech.
1 Effect of Spatial Locality on An Evolutionary Algorithm for Multimodal Optimization EvoNum 2010 Ka-Chun Wong, Kwong-Sak Leung, and Man-Hon Wong Department.
1 Presented by Sarbagya Buddhacharya. 2 Increasing bandwidth demand in telecommunication networks is satisfied by WDM networks. Dimensioning of WDM networks.
MANAGEMENT SCIENCE The Art of Modeling with Spreadsheets STEPHEN G. POWELL KENNETH R. BAKER Compatible with Analytic Solver Platform FOURTH EDITION OPTIMIZATION.
The Eukaryotic Cell Cycle : Molecules, Mechanisms, and Mathematical Models John J. Tyson Virginia Tech Bela Novak Tech Univ Budapest.
Alice E. Smith and Mehmet Gulsen Department of Industrial Engineering
Fusing and Composing Macromolecular Regulatory Network Models Ranjit Randhawa* Clifford A. Shaffer* John J. Tyson + Departments of Computer Science* and.
Naren Ramakrishnan and Richard F. Helm Departments of Computer Science and Biochemistry Virginia Tech, Blacksburg, VA
JigCell Nicholas A. Allen*, Kathy C. Chen**, Emery D. Conrad**, Ranjit Randhawa*, Clifford A. Shaffer*, John J. Tyson**, Layne T. Watson* and Jason W.
Nonlinear differential equation model for quantification of transcriptional regulation applied to microarray data of Saccharomyces cerevisiae Vu, T. T.,
Genetic algorithms: A Stochastic Approach for Improving the Current Cadastre Accuracies Anna Shnaidman Uri Shoshani Yerach Doytsher Mapping and Geo-Information.
Learning A Better Compiler Predicting Unroll Factors using Supervised Classification And Integrating CPU and L2 Cache Voltage Scaling using Machine Learning.
Inferring Regulatory Networks from Gene Expression Data BMI/CS 776 Mark Craven April 2002.
The Nudged Elastic Band Method and its Implementation in VASP
Journal club Jun , Zhen.
Artificial Neural Networks
Auburn University COMP7330/7336 Advanced Parallel and Distributed Computing Data Partition Dr. Xiao Qin Auburn University.
Chapter 17: Optimization In Simulation
Deep reinforcement learning
Pathway Modeling and Problem Solving Environments
Maria Okuniewski Nuclear Engineering Dept.
Optimization Based Design of Robust Synthetic
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
1 Department of Engineering, 2 Department of Mathematics,
MITOSIS.
An Adaptive Middleware for Supporting Time-Critical Event Response
Runs, Parameter Sets, Outputs and Projects
Computational Biology
Estimating Rate Constants in Cell Cycle Models
Volume 111, Issue 5, Pages (September 2016)
Akram Bitar and Larry Manevitz Department of Computer Science
Presentation transcript:

Deterministic Global Parameter Estimation for a Budding Yeast Model T.D Panning*, L.T. Watson*, N.A. Allen*, C.A. Shaffer*, and J.J Tyson + Departments of Computer Science* and Biology +, Virginia Tech Blacksburg, VA 24061

Outline Application: Cell Cycle Modeling Optimization Techniques: Dividing RECTangles (DIRECT) Mesh Adaptive Direct Search (MADS) Computational Results

The Fundamental Goal of Molecular Cell Biology

Application: Cell Cycle Modeling How do cells convert genes into behavior? Create proteins from genes Protein interactions Protein effects on the cell Our study organism is the cell cycle of the budding yeast Saccharomyces cerevisiae.

mitosis (M phase) DNA replication (S phase) cell division G1 G2

Modeling Techniques We use ODEs that describe the rate at which each protein concentration changes Protein A degrades protein B: … with initial condition [A](0) = A 0. Parameter c determines the rate of degradation.

Tyson’s Budding Yeast Model Tyson’s model contains over 30 ODEs, some nonlinear. Events can cause concentrations to be reset. About 140 rate constant parameters Most are unavailable from experiment and must set by the modeler “Parameter twiddling” Far better is automated parameter estimation

Mutations Wild type cell Mutations Typically caused by gene knockout Consider a mutant with no B to degrade A. Set c = 0 We have about 130 mutations each requires a separate simulation run

Phenotypes Each mutant has some observed outcome (“experimental” data). Generally qualitative. Cell lived Cell died in G1 phase Model should match the experimental data. Model should not be overly sensitive to the rate constants. Overly sensitive biological systems tend not to survive

Transforms The output from ODE solvers are time course data Need to convert this to match the qualitative experimental data We call the function that does this conversion a “transform”

Rules of Viability 1. Modeled cell must execute a series of events, in order a) [Clb2] + [Clb5] drops below K ez2. b) [ORI] goes over 1 before two divisions of wild cell. c) [SPN] increases above 1. d) [Esp1] increases above 1. e) [Clb2] drops below K ez. 2. Cell is inviable if [BUD] does not reach 0.8 before (e) 3. Squared relative differences of masses and G1 phase lengths in last two cycles is less than Cell is inviable if mass has higher ratio than 4 to wild cell.

Organizing the Observations Budding yeast phenotype for a given mutant is defined by a 6-tuple (v, g, m, a, t, c). v is {viable, inviable} Real g > 0 is steady state length of G1 phase Real m > 0 is steady state mass at division a (arrest stage) is {unlicensed, licensed, fired, aligned, separated} Integer t > 0 is the arrest type Integer c >= 0 is number of successful cycles Define 6-tuples O (observed) and P (predicted).

Objective Function We require a scoring mechanism to compare O and P for each mutant.

Objective Function (cont) The constants are tuned such that a rating of around 10 for a given mutant is a critical error (the model effectively fails for that mutant) Note that this objective function is not continuous

Optimization Techniques 143 parameters to optimize DIviding RECTangles (DIRECT) Global optimization Mesh Adaptive Direct Search (MADS) Local optimization

DIviding RECTangles (DIRECT) Global optimization algorithm (Jones et al, 1993) Does not require gradient, but the convergence criteria does require the objective function to be continuous At each iteration, subdivide boxes considered to be “potentially optimal” into three along their longest dimensions The algorithm can be tuned to favor “exploration” for good regions, or focus on known good regions

Potentially Optimal Boxes

Mesh Adaptive Direct Search (MADS) A class of algorithms Alternate SEARCH and POLL steps. All evaluated points are on a mesh, but the mesh can be adjusted each iteration. Each SEARCH step selects some points on the mesh to evaluate. If an improved point is found, MADS may jump directly to resizing the mesh. (GPS) If no better point is found in the SEARCH step, the POLL searches for a better point within a fixed distance (the frame) of the current best point. (Frame – Coope & Price) Resize the mesh up or down depending on success in the last iteration.

Mesh Adaptive Direct Search MADS in action

Computational Results All computation took place on Virginia Tech’s System X supercomputer We used parallel implementations for DIRECT and MADS. Experiment 1 MADS started from the modeler’s best point DIRECT used a box normalized around the modeler’s best point

Experiment 1 MADS evaluated 135,000 points (813 iterations, 128 processors). Final objective function value was 299. DIRECT evaluated 1.5M points (473 iterations, 1024 processors). Final objective value was 212.

Experiment 2 Looking at the results of Experiment 1, DIRECT does not progress much after 200,000 points. What if we start MADS from this point? What about other points on the plateau? Mixed results: The MADS runs starting at the beginning and end of the plateau were worse than DIRECT’s best point. The MADS run starting in the middle of the plateau was better than DIRECT’s best point. MADS made effectively no progress when starting from DIRECT’s best point.

Contributions Demonstrated that it is computationally feasible for search algorithms to improve on the modelers’ best point. The best points found by DIRECT were more stable than the modelers’. MADS can (sometimes!) improve on DIRECT when starting from DIRECT’s good points. The relationship is unclear. Demonstrated the relationship in DIRECT’s tuning parameter of tradeoff between splitting large boxes and refining small boxes.