An Introduction to Infinite HMMs for Single-Molecule Data Analysis

Presentation transcript:

An Introduction to Infinite HMMs for Single-Molecule Data Analysis Ioannis Sgouralis, Steve Pressé  Biophysical Journal  Volume 112, Issue 10, Pages 2021-2029 (May 2017) DOI: 10.1016/j.bpj.2017.04.027 Copyright © 2017 Biophysical Society Terms and Conditions

Figure 1 A synthetic time trace illustrating measurements of a hypothetical biomolecule that undergoes conformational transitions. (Left) The state space consists of conformations depicted discretely as σ_1, σ_2, …. (Middle) Time series of noisy observations, x_n, produced by the biomolecule (blue) and the corresponding noiseless trace (red). Over the time course of the measurements, the biomolecule attains only conformations σ_1–σ_5, though additional conformations might be visited at subsequent times. For the sake of concreteness only, we label these states in order of appearance from 1 through 5. (Right) Binning the collected observations reveals “emission distributions,” F_{σ_k}, associated with each conformation. These distributions are highlighted with red lines. The centers (mean values) of the emission distributions are used to obtain the noiseless trace in the middle panel. The illustration on the left is created using data from (47) (PDB: 2N4G). To see this figure in color, go online.
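
As a rough illustration of how a noiseless trace can be built from the centers of the emission distributions, the sketch below groups observations by state and replaces each observation with the mean of its group. It assumes the state labels are already known, which is not the case in a real analysis; the function name `noiseless_trace` and the toy numbers are hypothetical and not taken from the paper.

```python
from collections import defaultdict


def noiseless_trace(observations, states):
    """Replace each observation by the empirical mean of its state's observations.

    Assumption: `states` holds a known label for every observation.
    """
    if len(observations) != len(states):
        raise ValueError("observations and states must have equal length")

    # Group ("bin") the observations by the state that produced them.
    grouped = defaultdict(list)
    for x, s in zip(observations, states):
        grouped[s].append(x)

    # The center of each empirical emission distribution is its mean.
    means = {s: sum(xs) / len(xs) for s, xs in grouped.items()}

    # The noiseless trace follows the state sequence through those centers.
    return [means[s] for s in states], means


# Toy data (hypothetical): three states visited in order of appearance.
observations = [0.9, 1.1, 3.2, 2.8, 1.0, 5.1, 4.9]
states = [1, 1, 2, 2, 1, 3, 3]
trace, means = noiseless_trace(observations, states)
```

Here `trace` has one entry per observation, and `means` maps each visited state to the center of its empirical emission distribution.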

Figure 2 Graphical representation of the HMM. In the HMM, a biomolecule of interest transitions between unobserved states s_n according to the probability vectors π̃_{s_n} and generates observations x_n according to the probability distributions F_{s_n} that depend on the parameter ϕ_{s_n}. Here, following convention, the x_n values are shaded to denote that these quantities are observed, whereas the s_n values are hidden. Arrows denote the dependences among the model variables and red lines denote the model parameters. To see this figure in color, go online.
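
The generative structure in this caption can be sketched as a short simulation: the next hidden state is drawn from the transition probability vector of the current state, and each observation is drawn from the emission distribution of the current state. The sketch assumes Gaussian emissions with a shared standard deviation, which the caption does not specify; `simulate_hmm` and all parameter values are hypothetical.

```python
import random


def simulate_hmm(transitions, means, sigma, n_steps, initial_state=0, seed=0):
    """Simulate hidden states and noisy observations from a finite HMM.

    transitions[k] is the probability vector over next states given state k.
    Assumption: emissions are Gaussian with mean means[k] and std sigma.
    """
    rng = random.Random(seed)
    n_states = len(transitions)
    states, observations = [], []
    s = initial_state
    for _ in range(n_steps):
        states.append(s)
        # Emit x_n from the emission distribution of the current state.
        observations.append(rng.gauss(means[s], sigma))
        # Draw the next hidden state from the current state's transition vector.
        s = rng.choices(range(n_states), weights=transitions[s])[0]
    return states, observations


# Hypothetical two-state example with "sticky" transitions.
transitions = [[0.95, 0.05],
               [0.10, 0.90]]
states, observations = simulate_hmm(transitions, means=[0.0, 2.0],
                                    sigma=0.5, n_steps=200)
```

Only `observations` would be available in an experiment; `states` plays the role of the hidden sequence that the analysis tries to recover.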

Figure 3 Graphical representation of the iHMM. The hidden Markov model that formulates the observations to be analyzed (black lines) is shown together with its priors (red lines). For completeness, we also show the concentration parameters α and γ and the prior probability distribution on the emission parameters, H, that fully characterize the iHMM. The key difference from the HMM shown in Fig. 2 is that now the model parameters π̃_{σ_k} and ϕ_{σ_k} are treated as random variables similar to the hidden states, s_n, and observations, x_n. For details, see the main text. To see this figure in color, go online.
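
To make the role of α, γ, and H more concrete, the following sketch draws transition vectors and emission parameters from a truncated approximation of a hierarchical Dirichlet process prior, a common way to write down the iHMM prior. Several things here are assumptions rather than details from the caption: the truncation level, the use of a finite Dirichlet draw for each transition vector (a weak-limit style approximation), the small floor on the Dirichlet concentrations, and the choice of a Normal(0, 5²) distribution for H. All function names are hypothetical.

```python
import random


def stick_breaking(gamma, truncation, rng):
    """Truncated stick-breaking weights with concentration gamma.

    The leftover stick mass is assigned to the last entry so the weights sum to 1.
    """
    weights, remaining = [], 1.0
    for _ in range(truncation - 1):
        v = rng.betavariate(1.0, gamma)
        weights.append(v * remaining)
        remaining *= 1.0 - v
    weights.append(remaining)
    return weights


def dirichlet(concentrations, rng):
    """Dirichlet draw via normalized Gamma variates (floor avoids zero shapes)."""
    draws = [rng.gammavariate(max(c, 1e-6), 1.0) for c in concentrations]
    total = sum(draws)
    return [d / total for d in draws]


def sample_ihmm_prior(alpha, gamma, truncation=10, seed=0):
    """Draw (base weights, transition vectors, emission means) from the prior."""
    rng = random.Random(seed)
    # Shared base weights over states, controlled by gamma.
    beta = stick_breaking(gamma, truncation, rng)
    # Each state's transition vector is centered on beta; alpha sets how closely.
    transitions = [dirichlet([alpha * b for b in beta], rng)
                   for _ in range(truncation)]
    # Emission parameters drawn from H (assumed Normal(0, 5^2) for illustration).
    emission_means = [rng.gauss(0.0, 5.0) for _ in range(truncation)]
    return beta, transitions, emission_means


beta, transitions, emission_means = sample_ihmm_prior(alpha=2.0, gamma=3.0)
```

In this sketch the transition vectors and emission means are random draws rather than fixed inputs, which mirrors the caption's point that the iHMM treats the model parameters as random variables.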

Figure 4 Synthetic data sets resembling a hypothetical biomolecule undergoing transitions between discrete states that we analyzed with the iHMM. (Left) Time series x̄ = (x_1, …, x_N) of noisy observations. During the measuring period, the biomolecule attains five conformations, σ_1, …, σ_5. The number of conformations is a priori unknown, and the iHMM seeks to determine the probability over the number of states, as well as their properties, given the data available. In data set 1, the biomolecule transitions often through every state. By contrast, in data set 2, transitions to some states are rare. As a result, all states in data set 1 are almost equally visited throughout the experiment time course, whereas in data set 2, higher states are visited, by chance, only toward the end of the trace. (Right) The corresponding emission distributions, F_{σ_k}, as obtained by simply binning the observations (blue) and plotting the exact ones used for the simulations (red). For both data sets, the emission distributions show significant overlap. In all panels, dotted lines indicate the exact mean values, μ_{σ_k}, of the emission distributions. To see this figure in color, go online.

Figure 5 After some iterations, the sampler used in the iHMM to analyze data set 1 of Fig. 4 eventually converges to the correct number of states. The number of visited states, K^{(r)} (top), and the means of the emission distributions, μ_{σ_k}^{(r)} (bottom), change throughout the sampler’s iterations. Unlike the HMM, which uses a finite and fixed state space, the iHMM learns the number of available states and grows/shrinks the state space as required by the data.
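
One simple way to track the number of visited states across sampler iterations is to count the distinct labels in each sampled hidden-state sequence. The sketch below assumes the sampler output is available as a list of state sequences, one per iteration; the function name and the toy sequences are hypothetical and do not come from the paper.

```python
def visited_state_counts(sampled_sequences):
    """Return the number of distinct states in each sampled state sequence."""
    return [len(set(sequence)) for sequence in sampled_sequences]


# Hypothetical sampler output: the state space grows, then shrinks again.
sampled_sequences = [
    [0, 0, 1, 1, 0, 1],   # iteration 1: two states in use
    [0, 2, 1, 1, 0, 2],   # iteration 2: a third state is instantiated
    [0, 0, 1, 1, 0, 0],   # iteration 3: the third state is abandoned
]
k_per_iteration = visited_state_counts(sampled_sequences)
```

Plotting `k_per_iteration` against the iteration index gives a trace of the kind described for the top panel.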

Figure 6 We may use samples from the iHMM posterior probability to infer the size of the state space and the location of each state. In particular, we illustrate histograms for P(K | x̄) (top) and P(μ_{σ_k} | x̄) (bottom) using data set 1 of Fig. 4. In both panels, dashed lines indicate the exact (ground-truth) values used to produce the data in Fig. 4. To see this figure in color, go online.
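
A histogram estimate of P(K | x̄) can be obtained by counting how often each value of K appears among the posterior samples. The sketch below assumes the sampled values of K are stored in a list and that early iterations are discarded as burn-in, a common practice that the caption does not mention; `posterior_over_k` and the sample values are hypothetical.

```python
from collections import Counter


def posterior_over_k(k_samples, burn_in=0):
    """Estimate P(K | data) as relative frequencies of the sampled K values.

    Assumption: the first `burn_in` samples are discarded before counting.
    """
    kept = k_samples[burn_in:]
    if not kept:
        raise ValueError("no samples left after discarding burn-in")
    counts = Counter(kept)
    return {k: count / len(kept) for k, count in sorted(counts.items())}


# Hypothetical sampled values of K, one per sampler iteration.
k_samples = [2, 3, 4, 5, 5, 5, 6, 5, 5, 5]
posterior = posterior_over_k(k_samples, burn_in=2)
```

The value of K with the largest estimated probability can then be compared against the ground truth when it is known, as with synthetic data.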

Figure 7 We may use the iHMM to estimate portions of the complete state space such as those contained in different segments of data set 2 provided in Fig. 4. (Upper) Estimated noiseless traces for two cases: 1) using a limited segment of the full trace; and 2) using the full trace. Although only the latter case allows an estimate of all five states, both cases provide similar estimates over those states that they mutually visit. (Lower) Corresponding estimates of the number of states contained in each trace. To see this figure in color, go online.