Instruct Image Processing Centre

Slides:



Advertisements
Similar presentations
Electron Microscopy Chelsea Aitken Peter Aspinall
Advertisements

Copyright 2009 FUJITSU TECHNOLOGY SOLUTIONS PRIMERGY Servers and Windows Server® 2008 R2 Benefit from an efficient, high performance and flexible platform.
Methods: Cryo-Electron Microscopy Biochemistry 4000 Dr. Ute Kothe.
Bridging the solution divide: comprehensive structural analyses of dynamic RNA, DNA, and protein assemblies by small-angle X-ray scattering By Rambo and.
Computing and Chemistry 3-41 Athabasca Hall Sept. 16, 2013.
Picture Archiving And Communication System (PACS)
Scipion: Toward software integration, reproducibility and validation in EM image processing Biocomputing Unit, Instruct Image Processing Center, CNB-CSIC.
Automation in Single-Particle Electron Microscopy
Introducing the LEO 1400 Series
Basic principles Geometry and historical development
Macromolecular Electron Microscopy Michael Stowell MCDB B231
Seamless Medical Image Processing on the Grid on the Example of Segmentation and Partition of the Airspaces Andrzej Rutkowski 1, Michał Chlebiej 1, Marcelina.
lecture 2 : Visualization Basics
DOE Genomics: GTL Program IT Infrastructure Needs for Systems Biology David G. Thomassen Office of Biological and Environmental Research DOE Office of.
DDDs, Screening micrographs and CTF (Practical work) Vahid Abrishami.
KEY CONCEPT Technology continually changes the way biologists work.
GTL Facilities Computing Infrastructure for 21 st Century Systems Biology Ed Uberbacher ORNL & Mike Colvin LLNL.
Design and simulation of micro-SPECT: A small animal imaging system Freek Beekman and Brendan Vastenhouw Section tomographic reconstruction and instrumentation.
CP467 Image Processing and Pattern Recognition Instructor: Hongbing Fan Introduction About DIP & PR About this course Lecture 1: an overview of DIP DIP&PR.
ENTERPRISE COMPUTING QUIZ By: Lean F. Torida
Guide to Linux Installation and Administration, 2e1 Chapter 2 Planning Your System.
Protect Your Business-Critical Data in the Cloud with SoftNAS, a Full-Featured, Highly Available Solution for the Agile Microsoft Azure Platform MICROSOFT.
VIRUS STRUCTURE Basic rules of virus architecture, structure, and assembly are the same for all families Some structures are much more complex than others,
Chapter 27 Quantum Physics.
Why Diffraction, Why Neutrons? J. A. Dura Neutron Small Angle Scattering and Reflectometry NCNR Summer School on June 26, 2006.
Structural Biomedicine
Structural Study of the  12 Virus By:Elizabeth Brown.
 Our mission Deploying and unifying the NMR e-Infrastructure in System Biology is to make bio-NMR available to the scientific community in.
Reaching the Information Limit in Cryo- EM of Biological Macromolecules: Experimental Aspects -Robert M. Glaeser and Richard J. Hall (2011)
Institute of Physical Biology Concept of protocol generator
High-Performance and Grid Computing for Neuroinformatics: NIC and Cerebral Data Systems Allen D. Malony University of Oregon Professor Department of Computer.
3D-EM DAS Extending DAS to 3D-EM and Fitting /02/26.
ELECTRON CRYSTALLOGRAPHY:
Modeling Protein Secondary Structures from Three Dimensional Cryo-EM Density Images Dong Si June,30 th 2014.
Structural Biology on the GRID Dr. Tsjerk A. Wassenaar Biomolecular NMR - Utrecht University (NL)
Snapshot of DAQ challenges for Diamond Martin Walsh.
Discussion Summary for the Challenge Wah Chiu
Grid Computing Unit I Introduction. Information anytime anywhere!!! support computation across administrative domains Generally  virtualizing computing.
Reducing server sprawl and IT power/cooling costs Moving from reactive to proactive state Quickly troubleshooting PC and laptop issues Deploying new.
X-Ray Diffraction Spring 2011.
Using Technology to Study Cellular and Molecular Biology.
Hierarchy of Biological Complexity Interactions of machines (molecular and cellular dynamics) Macromolecular machines Proteins and nucleic acids Sequences.
Biomarkers from Dynamic Images – Approaches and Challenges
 Cloud Computing technology basics Platform Evolution Advantages  Microsoft Windows Azure technology basics Windows Azure – A Lap around the platform.
Wednesday NI Vision Sessions
Worcester College, Oxford 8 th March Integration of different sensors, detectors or instruments for multi-parameter analysis Challenge – integration.
An short overview of INSTRUCT and its computational requirements Alexandre M.J.J. Bonvin Project coordinator Bijvoet Center for Biomolecular Research Faculty.
A Competence Center to Serve Translational Research from Molecule to Brain Alexandre M.J.J. Bonvin MoBrain CC coordinator Bijvoet Center for Biomolecular.
Page 1 Cryo-EM Services by Creative Biostructure.
Grid Computing Unit I Introduction.
Freezing Immunoglobulins to see them move
What is cryo EM? EM = (Transmission) Electron Microscopy
Alexandre M.J.J. Bonvin MoBrain CC coordinator
Cryo-em Electron microscopy (EM) has become an extremely popular method for the ultrastructural study of macromolecules, cells and tissues. With our in-house.
Electron microscope Electron microscopy (EM) has become an extremely popular method for the ultrastructural study of macromolecules, cells and tissues.
Cryo-em services Electron microscopy (EM) has become an extremely popular method for the ultrastructural study of macromolecules, cells and tissues. With.
Cryo-EM Services Electron microscopy (EM) has become an extremely popular method for the ultrastructural study of macromolecules, cells and tissues. An.
University of Technology
Azure Lays the Path to Achieving Holistic App Performance Management, Cloud Optimization MINI-CASE STUDY “Microsoft Azure has allowed us to build a unified,
Cryo-em Electron microscopy (EM) has become an extremely popular method for the ultrastructural study of macromolecules, cells and tissues. An aqueous.
Cryo-EM Services Cryo-EM Services in Creative Biostructure.
FRED Optimization FRED: A software tool for modern engineering.
Dell Data Protection | Rapid Recovery: Simple, Quick, Configurable, and Affordable Cloud-Based Backup, Retention, and Archiving Powered by Microsoft Azure.
EOSCpilot All Hands Meeting 8 March 2018 Pisa
Volume 23, Issue 9, Pages (September 2015)
A Primer to Single-Particle Cryo-Electron Microscopy
PRESENTER GUIDANCE: These charts provide data points on how IBM BaaS mid-market benefits a client with the ability to utilize a variety of backup software.
Single-Particle Cryo-EM at Crystallographic Resolution
4CeeD: Private Cloud and Data Cyber-Infrastructure for Scientific Instruments Steve Konstanty, Senior Research Programmer, CSL.
Computed Tomography (C.T)
Presentation transcript:

Instruct Image Processing Centre I2PC Instruct Image Processing Centre JM CARAZO

CryoEM and Cloud: From “first principles” José María Carazo (carazo@cnb.csic.es) Spanish National Center for Biotechnology Instruct Image Processing Center

THE CELL: The basic block of life Pericentriolar material

An integrated view of Life Optical microscopy Electron microscopy M-Cell Salk Institute, Cornell Univesity X-ray diffraction Nuclear Magnetic Resonance Computational Modelling

Life is based on macromolecular machines DNA replication Protein synthesis Dynein motion

Resolution and Thickness range Resolution range IPC 6

An electron microscope

DnaB·DnaC in vitreous ice

The cryo-EM SPA pledge In 3D Electron Microscopy individual macromolecules are visualized down to atomic resolution. Trapped in ice, these molecules are free to expose their internal flexibility/plasticity.

The value of a «radiography»

Compared to full 3D CT

Tomography principles

Limits the comprehension In a first approximation….. Limits the comprehension of complex objects 2D projections : lack of information

Tomography Principle Acquisition of tilted image series Correction of microscope default (mechanical drift, CTF...) Reconstruction

Cryo 3D-EM Conceptual bases Experimental situation in cryo 3D EM

Krios (MRC- Cambridge) 1day 2.4 Tb Adquire Data as Reconstruct Understand Amount of data In this example Data is adquired with an electron microscope like this one The data look like this dark areas is a virus Merging the information contained in all the images we can get a 3D reconstruction like this Reconstruction that may be used to better understand the viral life-cycle

Tomography Principle Acquisition of tilted image series Correction of microscope default (mechanical drift, CTF...) Reconstruction

The “a priori unkown” geometry in SPA

Parameter space JUST for Geometry characterization For each particle we need to determine 3 angles and 2 shifts. FIVE parameters. If we have 100.000 particle images. We then have a space of 500.000 parameters!

Reconstruction as a linear set of equations

Parameter space of cryo EM SPA Target (the X’s): A volume of (for example) 100 x 100 x 100 voxels = 10**6 variables (Plus 500.000 = 5 x 10**5 geometry variables) (plus 100.000 x k (classes)) Measurements (the Y’s): 100.000 particle images of 100 x 100 pixels = 10**9 But we have noise!: 2 + 2 = 5 (or 3, or 6 …)

The 3D flexibility challenge

Molecular machines 15 m 15 10-9 m Dutch windmill

The 3D flexibility challenge NOW in RELION

Everything is mixed!!! alignment & classification are strongly intertwined! noise forms a serious problem!

Parameter space of cryo EM SPA f(x) local minima x global minimum

Parameter space of cryo EM SPA f(x) fs (x) x

Workflows: How do we do it in practice? Using different EM software packages is now like the tower of Babel Why we are working in Scipion? Simply put, the EM field needs software integration. Currently processing with different software packages is like a Babel Tower. User needs to deals with files convertion between packages, which wastes time and is error prone.

Task 2: Virtualize the execution hosts Internet Scipion client Cloud computing Big data transfers By Virtualizing the Execution Hosts, we will be able use computing resources more efficiently. First, we can computing nodes will be allocated when needed, which avoid having them when There are not computing jobs. And second, the number of nodes can be adapted for each Job requirements (ie. Maybe some jobs requires more RAM memory while other make use of more CPU power) Data storage Scipion server

Task 2: Virtualize server and storage Execution hosts Internet Scipion client Cloud computing Big data transfers By Virtualizing the Execution Hosts, we will be able use computing resources more efficiently. First, we can computing nodes will be allocated when needed, which avoid having them when There are not computing jobs. And second, the number of nodes can be adapted for each Job requirements (ie. Maybe some jobs requires more RAM memory while other make use of more CPU power) Data storage Scipion server

Scipion have specific goals Integrate EM software packages to be used in the same project. Full project traceability, improving reproducibility. Execute complete workflows in an automated manner. Easy to install and use. Easy to extend with new protocols. In this document we describe the main concepts and new features of Xmipp 3.0 for users. 32

Goal 1: Integrate EM software packages to be used in the same project. Our main goal, the reason of why we have started working in the Scipion project, is the need of software Integration for the field, as JMC mentioned. 33

We bridge across package differences by modeling our domain 3D Reconstruction Set of Images Initial Model 3D Volume Protocols Data In order to address the integration problem, we starting by creating a model of the EM domain. This model is composed by abstract entities (or objects), that will reflect the concepts more than The specificities of each software package. In this model we have two type of objects: Data and Protocols(or operations). Data objects will serve as input-output for the protocols. Protocols are like “big steps”, which wraps at higher level the logic of low level operations.

We bridge across package differences by modeling our domain We can´t modify all existing software packages to adopt this model. What we can do is to implement conversion routines that know how To map from our “objects” to the package specific files and operations. So we need conversions in both directions in order to execute the Packages programs and communicate with other protocols. With this approach, we can build tools in the upper world, and them can Be reused for all existing packages and even for future ones.

Goal 2: Full project traceability, improving reproducibility. Having a well-define model of the problem also facilitate to solve the issue of having full traceability. 36

We bet for a simple storage mechanism Data Objects Protocol Objects Mapper Layer We starting by modeling our domain. We consider two main type of objects: Data and Protocols(or operations). Data objects will serve as input-output for the protocols. Protocols are like “big steps”, which wraps at higher level the logic of low level operations.

Results should be reproducible, not more “black boxes” We starting by modeling our domain. We consider two main type of objects: Data and Protocols(or operations). Data objects will serve as input-output for the protocols. Protocols are like “big steps”, which wraps at higher level the logic of low level operations.

Goal 3: Execute complete workflows in an automated manner. In this document we describe the main concepts and new features of Xmipp 3.0 for users. 39

Designed to perform distributed execution Worker Host 2 Worker Host 1 Scipion client Big data transfers Relatives: There are a number of very good existing “Workflow Engines”, such as Taverna (Manchester) or Pegasus (San Diego) ….. BUT SCIPION is NOT a Workflow Engine, but can use any WE in the future Distributed data storage Bookeeping Scipion Server

The Initial Volume Problem in SPA f(x) local minima x global minimum

The Initial Volume Problem (in the Web)

The Initial Volume Problem (in the Web)

Breaking News…… FEI, la principal empresa proveedora de microscopios electrónicos de alta gama, acaba de expresar su deseo de que el I2PC sea su centro de referencia mundial para soluciones y servicios de procesamiento de imagen para Pharma Scipion y Cloud son partes importantes de esta estrategia

Instruct Open call for Access

www.structuralbiology.eu A distributed infrastructure for integrated structural biology