The Multistrand Simulator: Stochastic Simulation of the Kinetics of Multiple Interacting DNA Strands Joseph Schaeffer, Caltech (slides by John Reif)

Slides:



Advertisements
Similar presentations
Clustering.
Advertisements

Natural Selection Simulation Learning objectives Students are expected to be proficient with either Microsoft PowerPoint or Google Docs Presentation tool.
Counting Outcomes and Tree Diagrams
A model for the Initiation of Replication in Escherichia coli Author Joseph M. Mahaffy Judith W Zyskind J. theor. Biol
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
A Randomized Linear-Time Algorithm to Find Minimum Spanning Trees David R. Karger David R. Karger Philip N. Klein Philip N. Klein Robert E. Tarjan.
1 CE 530 Molecular Simulation Lecture 8 Markov Processes David A. Kofke Department of Chemical Engineering SUNY Buffalo
Accessing Spatial Data
Evolutionary Games The solution concepts that we have discussed in some detail include strategically dominant solutions equilibrium solutions Pareto optimal.
1 On Compressing Web Graphs Michael Mitzenmacher, Harvard Micah Adler, Univ. of Massachusetts.
Lecture 9 Hidden Markov Models BioE 480 Sept 21, 2004.
This material in not in your text (except as exercises) Sequence Comparisons –Problems in molecular biology involve finding the minimum number of edit.
Cluster Analysis (1).
What is Cluster Analysis?
Backtracking.
Radial Basis Function Networks
Genetic Algorithm.
Screens appear here to display the: Data Tables Plots Bibliographic Info Add New Data Form, Molecular Structure-Drawing Form ‘Tree’ for navigation between.
Object-Oriented Software Engineering Practical Software Development using UML and Java Chapter 8: Modelling Interactions and Behaviour.
Chapter 25 Electric Potential Electrical Potential and Potential Difference When a test charge is placed in an electric field, it experiences a.
Chapter 3 Section 1 Copyright © 2008 Pearson Education, Inc. Publishing as Pearson Addison-Wesley.
Copyright © Cengage Learning. All rights reserved.
RNA Secondary Structure Prediction Spring Objectives  Can we predict the structure of an RNA?  Can we predict the structure of a protein?
Proliferation cluster (G12) Figure S1 A The proliferation cluster is a stable one. A dendrogram depicting results of cluster analysis of all varying genes.
Phylogenetic Prediction Lecture II by Clarke S. Arnold March 19, 2002.
Rates of Reactions Why study rates?
INF380 - Proteomics-101 INF380 – Proteomics Chapter 10 – Spectral Comparison Spectral comparison means that an experimental spectrum is compared to theoretical.
Quantum Mechanical Model of the Atom Quantum Numbers & Electron Configurations.
Single-Factor Studies KNNL – Chapter 16. Single-Factor Models Independent Variable can be qualitative or quantitative If Quantitative, we typically assume.
1 Finding Spread Blockers in Dynamic Networks (SNAKDD08)Habiba, Yintao Yu, Tanya Y., Berger-Wolf, Jared Saia Speaker: Hsu, Yu-wen Advisor: Dr. Koh, Jia-Ling.
Pre-Algebra 9-7 Independent and Dependent Events Learn to find the probabilities of independent and dependent events.
This tutorial will describe how to navigate the section of Gramene that allows you to view various types of maps (e.g., genetic, physical, or sequence-based)
VizTree Huyen Dao and Chris Ackermann. Introducing example
CSE280Stefano/Hossein Project: Primer design for cancer genomics.
Chemical Kinetics. A brief note on Collision Theory All matter is made up of particles: atoms, molecules and ions. Kinetics is all about how chemicals.
A Place-based Model for the Internet Topology Xiaotao Cai Victor T.-S. Shi William Perrizo NDSU {Xiaotao.cai, Victor.shi,
Hiroki Sayama NECSI Summer School 2008 Week 2: Complex Systems Modeling and Networks Network Models Hiroki Sayama
Figure 3. Definition of the outside distance, Dout, as the sum of the distances from each of the two protein binding sites to their corresponding.
Volume 15, Issue 3, Pages (April 2016)
Pearson Unit 6 Topic 15: Probability 15-1: Experimental and Theoretical Probability Pearson Texas Geometry ©2016 Holt Geometry Texas ©2007.
Characterization and quantification of clonal heterogeneity among hematopoietic stem cells: a model-based approach by Ingo Roeder, Katrin Horn, Hans-Bernd.
Volume 112, Issue 7, Pages (April 2017)
Volume 6, Issue 2, Pages (February 1998)
Volume 17, Issue 24, Pages (December 2007)
Value Representations in the Primate Striatum during Matching Behavior
Ancestral Protein Reconstruction Yields Insights into Adaptive Evolution of Binding Specificity in Solute-Binding Proteins  Ben E. Clifton, Colin J. Jackson 
Looping Probabilities in Model Interphase Chromosomes
Volume 94, Issue 2, Pages e6 (April 2017)
Volume 40, Issue 6, Pages (December 2003)
MunJu Kim, Katarzyna A. Rejniak  Biophysical Journal 
Mean-Independent Noise Control of Cell Fates via Intermediate States
Fuqing Wu, David J. Menn, Xiao Wang  Chemistry & Biology 
DNA Hairpins: Fuel for Autonomous DNA Devices
Volume 23, Issue 10, Pages (October 2016)
Monika Sharma, Alexander V. Predeus, Nicholas Kovacs, Michael Feig 
Volume 94, Issue 2, Pages e6 (April 2017)
Johannes Schöneberg, Martin Heck, Klaus Peter Hofmann, Frank Noé 
Ion Specificity and Nonmonotonic Protein Solubility from Salt Entropy
Volume 5, Issue 4, Pages e4 (October 2017)
G. Fiorin, A. Pastore, P. Carloni, M. Parrinello  Biophysical Journal 
Probability.
Volume 96, Issue 7, Pages (April 2009)
Information Integration and Energy Expenditure in Gene Regulation
Volume 101, Issue 4, Pages (May 2000)
Sequence and Crowding Effects in the Aggregation of a 10-Residue Fragment Derived from Islet Amyloid Polypeptide  Eva Rivera, John Straub, D. Thirumalai 
Volume 15, Issue 11, Pages (June 2016)
Multiple Folding Pathways of the SH3 Domain
Yongli Zhang, Junyi Jiao, Aleksander A. Rebane  Biophysical Journal 
Matthew R. Roesch, Adam R. Taylor, Geoffrey Schoenbaum  Neuron 
Presentation transcript:

The Multistrand Simulator: Stochastic Simulation of the Kinetics of Multiple Interacting DNA Strands Joseph Schaeffer, Caltech (slides by John Reif)

Loops: -Nearest neighbor interactions between the nucleic acid bases. - These interactions divide the secondary structure of the system into local components which are referred to as loops

Loops Broken into Various Categories: - Parameter tables for each category are determined from experimental data. Energy of Loops: (1) Each loop l has an energy, ∆G(l) which can be retrieved from the appropriate parameter table for its category. (2) Each complex also has an energy contribution associated with the entropic initiation cost (e.g. rotational) of bringing two strands together, ∆G assoc. (The total contribution is proportional to the number of strands L within the complex: (L − 1) ∗ ∆G assoc. ) The energy of a complex microstate c: is the sum of these two types of contributions. Can also divide free energy ∆G into enthalpic (∆H) and entropic (∆S) components, with ∆G = ∆H + T ∗ ∆S, where T is the temperature.

Secondary structure representations: (A) Original loop diagram representation. (B) Base pair list representation: Each base pairing is represented by the indices of the bases involved. (C) Dot-paren representation (also called the flat representation): Each base is represented by either a period, representing an unpaired base, or by a parenthesis, representing a pairing with the base that has the (balanced) matching parenthesis. An underscore represents a break between multiple strands. (D) Loop graph representation: Each loop in the secondary structure is a single node in the graph, which contains the sequence information within the loop.

System microstates i, q adjacent to current state j (with many others not shown) with kinetics rates indicated

Probabilistic Moves between Microstates: If currently in microstate i, the next microstate m in a simulated trajectory is chosen randomly among the adjacent microstate j, weighted by the rate of transition to each. Probability of Transition from microstate i to microstate m: Pr(m) = k im / Σ j k ij The time taken to transition to the next microstate: - Chosen randomly from an exponential distribution with rate parameter λ, where λ is the total rate out of the current state, Σ j k ij. Pr(∆t) = λ exp(−λ∆t)

Choice of Kinetic Rate Model: sets the rates of a pair of reactions so that they obey detailed balance. Detailed Balance of Kinetic Rates: if there is a transition from system microstate i to system microstate j with rate k ij there must be a transition from j to i with rate k ji which are related by: k ij / k ji = e -(∆G (j )−∆G (i) )/RT

Classify transitions between microstates into two exclusive types: Unimolecular Transitions: Transitions where changes are within a single complex. Bimolecular Transitions: Transitions that change the number of complexes present in the system.

Unimolecular Transitions: transitions between microstates where changes are within a single complex. - Define these transitions in terms of the complex microstate which changed, rather than the full system microstate. - Define a complex microstate d as being adjacent to a complex microstate c if it differs by exactly one base pair. Creation Move: a transition from c to d that adds a base pair a. Deletion Move: transition from c to d that removes a base pair a.

Moves: (A) Creation moves (blue line) and deletion moves (red highlight) are represented here by rectangles. Either type of move is associated with a particular loop, and has indices to designate which bases within the loop are affected. (B) All possible moves which affect the interior loop in the center of the structure. These are then arranged into a tree (green area), which can be used to quickly choose a move. (C) Each loop in the loop graph then has a tree of moves that affect it, and we can arrange these into another tree (black boxes), each node of which is associated with a particular loop (dashed line) and thus a tree of moves (blue line). This resulting tree then contains all the moves available in the complex.

Unimolecular Rate Models: Kawasaki method: both “downhill” (energetically favorable) and uphill transitions scale directly with the steepness of their slopes. (There is no bound on the maximum rate.) Metropolis method: all downhill moves occur at the same fixed rate, and only the uphill moves scale with the slope. (Maximum rate for any move is bounded, and all downhill moves occur at this rate.) Entropy/Enthalpy Method: uses the division of free energies into entropic and enthalpic components to assign the transition rates in an intuitive manner: base pair creation moves must overcome the entropic energy barrier to bring the bases into contact, and base pair deletion moves must overcome the enthalpic energy barrier in order to break them apart.

Bimolecular Transitions: transitions between microstates that change the number of complexes present in the system. - A bimolecular transition from system microstate i to system microstate j is one where the single base pair difference between them leads to a differing number of complexes within each system microstate. - Join Move: differing number of complexes is due to a base pair joining two complexes in i to form a single complex in j. - Break Move: the removal of this base pair from i causes one complex in i to break into two complexes within j. Every bimolecular move is reversible: If i to j is a join move, then j to i must be a break move, and vice versa.

Moves of varying types which take current state i to state j: (A) A creation move. (B) A deletion move. - The changed region is highlighted in cyan. - Loops that are in j but not i are highlighted in red (in the loop graph) and must be created and have their moves generated. - Loops shown highlighted in blue have had an adjacent loop change, and thus must have their deletion moves recalculated.

An interior loop: - With all theoretically possible creation moves for the first two bases on the top strand shown as cyan lines, and - All possible deletion moves shown as red boxes. Note that for each creation move shown here, we must check whether the bases could actually pair, and only then continue with the energy computations to determine the rate of the move.

- All plots are log/log except for the inset, which is a linear/linear plot of the same data is in the uniform random sequence plot. - The density of test cases is shown using overlaid regions of varying intensity. - From lightest to darkest, these correspond to 80%, 40% and 10% of the test cases being within the region. Comparison of real time used per simulated time unit between Multistrand and Kinfold 1.0, for four different single stranded systems with varying total length.