Presentation is loading. Please wait.

Presentation is loading. Please wait.

Instruct Image Processing Centre

Similar presentations


Presentation on theme: "Instruct Image Processing Centre"— Presentation transcript:

1 Instruct Image Processing Centre
I2PC Instruct Image Processing Centre JM CARAZO

2 CryoEM and Cloud: From “first principles”
José María Carazo Spanish National Center for Biotechnology Instruct Image Processing Center

3 THE CELL: The basic block of life
Pericentriolar material

4 An integrated view of Life
Optical microscopy Electron microscopy M-Cell Salk Institute, Cornell Univesity X-ray diffraction Nuclear Magnetic Resonance Computational Modelling

5 Life is based on macromolecular machines
DNA replication Protein synthesis Dynein motion

6 Resolution and Thickness range
Resolution range IPC 6

7 An electron microscope

8 DnaB·DnaC in vitreous ice

9 The cryo-EM SPA pledge In 3D Electron Microscopy individual macromolecules are visualized down to atomic resolution. Trapped in ice, these molecules are free to expose their internal flexibility/plasticity.

10 The value of a «radiography»

11 Compared to full 3D CT

12 Tomography principles

13 Limits the comprehension
In a first approximation….. Limits the comprehension of complex objects 2D projections : lack of information

14 Tomography Principle Acquisition of tilted image series Correction of
microscope default (mechanical drift, CTF...) Reconstruction

15 Cryo 3D-EM Conceptual bases
Experimental situation in cryo 3D EM

16 Krios (MRC- Cambridge)
1day 2.4 Tb Adquire Data as Reconstruct Understand Amount of data In this example Data is adquired with an electron microscope like this one The data look like this dark areas is a virus Merging the information contained in all the images we can get a 3D reconstruction like this Reconstruction that may be used to better understand the viral life-cycle

17 Tomography Principle Acquisition of tilted image series Correction of
microscope default (mechanical drift, CTF...) Reconstruction

18 The “a priori unkown” geometry in SPA

19 Parameter space JUST for Geometry characterization
For each particle we need to determine 3 angles and 2 shifts. FIVE parameters. If we have particle images. We then have a space of parameters!

20 Reconstruction as a linear set of equations

21 Parameter space of cryo EM SPA
Target (the X’s): A volume of (for example) x 100 x 100 voxels = 10**6 variables (Plus = 5 x 10**5 geometry variables) (plus x k (classes)) Measurements (the Y’s): particle images of 100 x 100 pixels = 10**9 But we have noise!: = 5 (or 3, or 6 …)

22 The 3D flexibility challenge

23 Molecular machines 15 m m Dutch windmill

24 The 3D flexibility challenge
NOW in RELION

25 Everything is mixed!!! alignment & classification are strongly intertwined! noise forms a serious problem!

26 Parameter space of cryo EM SPA
f(x) local minima x global minimum

27 Parameter space of cryo EM SPA
f(x) fs (x) x

28 Workflows: How do we do it in practice?
Using different EM software packages is now like the tower of Babel Why we are working in Scipion? Simply put, the EM field needs software integration. Currently processing with different software packages is like a Babel Tower. User needs to deals with files convertion between packages, which wastes time and is error prone.

29

30 Task 2: Virtualize the execution hosts
Internet Scipion client Cloud computing Big data transfers By Virtualizing the Execution Hosts, we will be able use computing resources more efficiently. First, we can computing nodes will be allocated when needed, which avoid having them when There are not computing jobs. And second, the number of nodes can be adapted for each Job requirements (ie. Maybe some jobs requires more RAM memory while other make use of more CPU power) Data storage Scipion server

31 Task 2: Virtualize server and storage
Execution hosts Internet Scipion client Cloud computing Big data transfers By Virtualizing the Execution Hosts, we will be able use computing resources more efficiently. First, we can computing nodes will be allocated when needed, which avoid having them when There are not computing jobs. And second, the number of nodes can be adapted for each Job requirements (ie. Maybe some jobs requires more RAM memory while other make use of more CPU power) Data storage Scipion server

32 Scipion have specific goals
Integrate EM software packages to be used in the same project. Full project traceability, improving reproducibility. Execute complete workflows in an automated manner. Easy to install and use. Easy to extend with new protocols. In this document we describe the main concepts and new features of Xmipp 3.0 for users. 32

33 Goal 1: Integrate EM software packages to be used in the same project.
Our main goal, the reason of why we have started working in the Scipion project, is the need of software Integration for the field, as JMC mentioned. 33

34 We bridge across package differences by modeling our domain
3D Reconstruction Set of Images Initial Model 3D Volume Protocols Data In order to address the integration problem, we starting by creating a model of the EM domain. This model is composed by abstract entities (or objects), that will reflect the concepts more than The specificities of each software package. In this model we have two type of objects: Data and Protocols(or operations). Data objects will serve as input-output for the protocols. Protocols are like “big steps”, which wraps at higher level the logic of low level operations.

35 We bridge across package differences by modeling our domain
We can´t modify all existing software packages to adopt this model. What we can do is to implement conversion routines that know how To map from our “objects” to the package specific files and operations. So we need conversions in both directions in order to execute the Packages programs and communicate with other protocols. With this approach, we can build tools in the upper world, and them can Be reused for all existing packages and even for future ones.

36 Goal 2: Full project traceability, improving reproducibility.
Having a well-define model of the problem also facilitate to solve the issue of having full traceability. 36

37 We bet for a simple storage mechanism
Data Objects Protocol Objects Mapper Layer We starting by modeling our domain. We consider two main type of objects: Data and Protocols(or operations). Data objects will serve as input-output for the protocols. Protocols are like “big steps”, which wraps at higher level the logic of low level operations.

38 Results should be reproducible, not more “black boxes”
We starting by modeling our domain. We consider two main type of objects: Data and Protocols(or operations). Data objects will serve as input-output for the protocols. Protocols are like “big steps”, which wraps at higher level the logic of low level operations.

39 Goal 3: Execute complete workflows in an automated manner.
In this document we describe the main concepts and new features of Xmipp 3.0 for users. 39

40 Designed to perform distributed execution
Worker Host 2 Worker Host 1 Scipion client Big data transfers Relatives: There are a number of very good existing “Workflow Engines”, such as Taverna (Manchester) or Pegasus (San Diego) ….. BUT SCIPION is NOT a Workflow Engine, but can use any WE in the future Distributed data storage Bookeeping Scipion Server

41 The Initial Volume Problem in SPA
f(x) local minima x global minimum

42 The Initial Volume Problem (in the Web)

43 The Initial Volume Problem (in the Web)

44 Breaking News…… FEI, la principal empresa proveedora de microscopios electrónicos de alta gama, acaba de expresar su deseo de que el I2PC sea su centro de referencia mundial para soluciones y servicios de procesamiento de imagen para Pharma Scipion y Cloud son partes importantes de esta estrategia

45 Instruct Open call for Access

46 A distributed infrastructure for integrated structural biology


Download ppt "Instruct Image Processing Centre"

Similar presentations


Ads by Google