Zachary Frazier, Min Xu, Frank Alber  Structure 

Slides:



Advertisements
Similar presentations
Protein Structure Fitting and Refinement Guided by Cryo-EM Density
Advertisements

Hippocampal Attractor Dynamics Predict Memory-Based Decision Making
Three-Dimensional Structure of the Human DNA-PKcs/Ku70/Ku80 Complex Assembled on DNA and Its Implications for DNA DSB Repair  Laura Spagnolo, Angel Rivera-Calzada,
Robert Englmeier, Stefan Pfeffer, Friedrich Förster  Structure 
Volume 17, Issue 2, Pages (February 2009)
Nir London, Ora Schueler-Furman  Structure 
An Approach for De Novo Structure Determination of Dynamic Molecular Assemblies by Electron Cryomicroscopy  Bjoern Sander, Monika M. Golas, Reinhard Lührmann,
A Fully Automated Approach to Spike Sorting
Signal, Noise, and Variation in Neural and Sensory-Motor Latency
Volume 21, Issue 9, Pages (September 2013)
A Stochastic Hill Climbing Approach for Simultaneous 2D Alignment and Clustering of Cryogenic Electron Microscopy Images  Cyril F. Reboul, Frederic Bonnet,
Volume 14, Issue 11, Pages (November 2006)
Hahnbeom Park, Frank DiMaio, David Baker  Structure 
Volume 16, Issue 5, Pages (May 2008)
Compressed Representations of Macromolecular Structures and Properties
Efficient and Robust Analysis of Biomacromolecular Flexibility Using Ensembles of Network Topologies Based on Fuzzy Noncovalent Constraints  Christopher.
Autofocused 3D Classification of Cryoelectron Subtomograms
Hans Elmlund, Dominika Elmlund, Samy Bengio  Structure 
Ab Initio Structure Determination from Electron Microscopic Images of Single Molecules Coexisting in Different Functional States  Dominika Elmlund, Ralph.
Volume 19, Issue 7, Pages (July 2011)
Volume 15, Issue 12, Pages (December 2007)
Volume 18, Issue 8, Pages (August 2010)
Structure of the Human Dicer-TRBP Complex by Electron Microscopy
Volume 111, Issue 2, Pages (July 2016)
Structural Modeling of Heteromeric Protein Complexes from Disassembly Pathways and Ion Mobility-Mass Spectrometry  Zoe Hall, Argyris Politis, Carol V.
A Switching Observer for Human Perceptual Estimation
Γ-TEMPy: Simultaneous Fitting of Components in 3D-EM Maps of Their Assembly Using a Genetic Algorithm  Arun Prasad Pandurangan, Daven Vasishtan, Frank.
Volume 20, Issue 12, Pages (December 2012)
Solution Structures of Engineered Vault Particles
Volume 5, Issue 4, Pages e4 (October 2017)
Volume 24, Issue 5, Pages (May 2016)
Increased Reliability of Nuclear Magnetic Resonance Protein Structures by Consensus Structure Bundles  Lena Buchner, Peter Güntert  Structure  Volume.
Volume 22, Issue 6, Pages (June 2014)
Filip Lankaš, Richard Lavery, John H. Maddocks  Structure 
Cell Optical Density and Molecular Composition Revealed by Simultaneous Multimodal Label-Free Imaging  Nicolas Pavillon, Alison J. Hobro, Nicholas I.
Electron Cryotomography of the E
Volume 22, Issue 8, Pages (August 2014)
A Scalable Population Code for Time in the Striatum
Volume 20, Issue 2, Pages (February 2012)
Volume 17, Issue 11, Pages (November 2009)
Volume 20, Issue 2, Pages (February 2012)
Volume 20, Issue 3, Pages (March 2012)
Volume 23, Issue 9, Pages (September 2015)
Ryan C. Wilson, Meghan A. Jackson, Janice D. Pata  Structure 
Computational Modeling Reveals that Signaling Lipids Modulate the Orientation of K- Ras4A at the Membrane Reflecting Protein Topology  Zhen-Lu Li, Matthias.
Volume 17, Issue 7, Pages (July 2009)
Deciphering the “Fuzzy” Interaction of FG Nucleoporins and Transport Factors Using Small-Angle Neutron Scattering  Samuel Sparks, Deniz B. Temel, Michael.
Zheng Liu, Fei Guo, Feng Wang, Tian-Cheng Li, Wen Jiang  Structure 
Combining Efficient Conformational Sampling with a Deformable Elastic Network Model Facilitates Structure Refinement at Low Resolution  Gunnar F. Schröder,
Nir London, Ora Schueler-Furman  Structure 
Volume 20, Issue 9, Pages (September 2012)
Katharina Hipp, Clemens Grimm, Holger Jeske, Bettina Böttcher 
Volume 16, Issue 3, Pages (March 2008)
Volume 15, Issue 10, Pages (October 2007)
Volume 21, Issue 11, Pages (November 2013)
Semblance of Heterogeneity in Collective Cell Migration
Volume 23, Issue 9, Pages (September 2015)
Volume 26, Issue 9, Pages (May 2016)
Structure of the Kinesin13-Microtubule Ring Complex
Γ-TEMPy: Simultaneous Fitting of Components in 3D-EM Maps of Their Assembly Using a Genetic Algorithm  Arun Prasad Pandurangan, Daven Vasishtan, Frank.
Kevin R. Foster, Thomas Bell  Current Biology 
Volume 23, Issue 1, Pages (January 2015)
Volume 17, Issue 2, Pages (February 2009)
Volume 23, Issue 6, Pages (June 2015)
Validation of Cryo-EM Structure of IP3R1 Channel
Molecular Mechanism of Antibody-Mediated Activation of β-galactosidase
Volume 15, Issue 12, Pages (December 2007)
Volume 15, Issue 10, Pages (October 2007)
Volume 6, Issue 4, Pages (April 1998)
Qing Yao, Sara J. Weaver, Jee-Young Mock, Grant J. Jensen  Structure 
Presentation transcript:

TomoMiner and TomoMinerCloud: A Software Platform for Large-Scale Subtomogram Structural Analysis  Zachary Frazier, Min Xu, Frank Alber  Structure  Volume 25, Issue 6, Pages 951-961.e2 (June 2017) DOI: 10.1016/j.str.2017.04.016 Copyright © 2017 Elsevier Ltd Terms and Conditions

Figure 1 Parallel Processing Architecture Structure 2017 25, 951-961.e2DOI: (10.1016/j.str.2017.04.016) Copyright © 2017 Elsevier Ltd Terms and Conditions

Figure 2 Efficiency and Scalability (A) The time required for a single round of alignment and averaging as a function of subtomogram number, for a constant 256 workers. The curve is close to linear across the entire range of data. (B) The relative efficiency of the data scalability when additional data is added, for a constant 256 workers. The rate of processing is very stable across several orders of magnitude. (C) The time required for a single round of alignment and averaging for two different datasets, with 8,000 and 16,000 subtomograms. The number of workers varies from 32 to 256. For a relatively small number of subtomograms, there are not enough subproblems generated to occupy 256 workers, so some are idle, creating the plateau seen in the graphs. (D) The relative efficiency of strong scalability. For these problem sizes TomoMiner scales well, with very little overhead for the increased communication and coordination load of additional workers. There is a clear loss of efficiency when using too many workers for a given problem size, but this demonstrates that even for medium-sized datasets (10,000 + subtomgorams) TomoMiner is far away from reaching its computational limits. Structure 2017 25, 951-961.e2DOI: (10.1016/j.str.2017.04.016) Copyright © 2017 Elsevier Ltd Terms and Conditions

Figure 3 Reference-free Classification of 100,000 Subtomograms A total of 20,000 subtomograms are generated for each of five different structures, using the procedure defined in the STAR Methods. The subtomograms were simulated using a signal-to-noise ratio of 0.01 and a tilt angle of ±60°. The clusters converged after 10 iterations of reference-free subtomogram classification, using a cluster number of 10. (A) After ten iterations, the averaged subtomograms in each cluster converged to structures close to the ground truth. Since there are more clusters than structures, some clusters have converged to the same structure. (B) Pairwise correlations between the averaged density maps of all ten clusters. Clusters corresponding to the same complex are easily identified by their high correlation values, then can be combined into a single cluster. (C) The number of subtomograms in each cluster (top). Each cluster is dominated by a single complex. The percentages of subtomograms generated from the dominant complex are 96.2%, 97.8%, 99.9%, 100%, 95.6%, 98.5%, 97.7%, 90.7%, 89.4%, and 99.9% for clusters 1 to 10, respectively. Cluster IDs are shown on the horizontal axis. Since the numbers are arbitrary labels, they have been arranged so that similar clusters are adjacent. The correlations between the true structures (bottom), and the averaged density maps demonstrate that the clustering is accurate. Structure 2017 25, 951-961.e2DOI: (10.1016/j.str.2017.04.016) Copyright © 2017 Elsevier Ltd Terms and Conditions

Figure 4 Accuracy of the Averaged Density Maps Generated from Reference-free Alignment and Averaging The accuracy is measured as the Pearson correlation between the generated averages and the template of the true structure. Several correlations are shown, for averages generated with an increasing number of subtomograms in the dataset. We generated 100,000 subtomograms of a randomly oriented complex (PDB: 2AWB) using the procedure described in the STAR Methods, with a signal-to-noise ratio of 0.005 and a tilt angle range of ±60°. The computed average is more accurate when using more subtomograms. TomoMiner's ability to handle large numbers of subtomograms therefore efficiently allows for accurate reconstructions and classifications of structures from noisy data, given sufficiently large datasets. Structure 2017 25, 951-961.e2DOI: (10.1016/j.str.2017.04.016) Copyright © 2017 Elsevier Ltd Terms and Conditions

Figure 5 Reference-free Classification of 886 Experimental Subtomograms Containing the GroEL and GroEL/GroES Complexes Taken from Förster et al. (2008) Convergence was reached after five iterative rounds of reference-free classification. (A) Slice through the resulting cluster averages. Scale bar, 5 nm. (B) Cluster averages depicted by isosurface rendering. The atomic structure of the GroEL/GroES complex is fitted into both cluster averages for comparison. Structure 2017 25, 951-961.e2DOI: (10.1016/j.str.2017.04.016) Copyright © 2017 Elsevier Ltd Terms and Conditions