AUTOMATION OF MACROMOLECULAR DATA COLLECTION - INTEGRATION OF DATA COLLECTION AND DATA PROCESSING Harold R. Powell 1, Graeme Winter 1, Andrew G.W. Leslie.

Slides:



Advertisements
Similar presentations
Chapter 20 Oracle Secure Backup.
Advertisements

Database Planning, Design, and Administration
COURSE: COMPUTER PLATFORMS
PiMS overview: version 0.3 & beyond Robert Esnouf, PiMS Project Sponsor, Oxford.
WebIce A web browser based application for experiment monitoring and control Ana Gonzalez, SSRL.
Automated data collection Peter Zwart Berkeley Center for Structural Biology Physical Biosciences Division, LBL.
Automated collection and processing of macromolecular diffraction data with DNA Project started in 2001 following an ESRF user meeting Currently involves.
Case study - usability evaluation Howell Istance.
Data collection, data processing and scaling (1) relationship of Mosflm to CCP4 (2) some thoughts on data collection (3) simple processing with Mosflm.
A Fully Automated Small Molecule Single Crystal X-Ray Diffraction System Mark E Light * & Michael B Hursthouse, Department of Chemistry, University of.
Tiered architectures 1 to N tiers. 2 An architectural history of computing 1 tier architecture – monolithic Information Systems – Presentation / frontend,
Working with SQL and PL/SQL/ Session 1 / 1 of 27 SQL Server Architecture.
By Godfrey Aziyo Department of LIS Telephone:
6 th Annual Focus Users’ Conference Application Editor and Form Builder Presented by: Mike Morris.
–Streamline / organize Improve readability of code Decrease code volume/line count Simplify mechanisms Improve maintainability & clarity Decrease development.
The Project AH Computing. Functional Requirements  What the product must do!  Examples attractive welcome screen all options available as clickable.
Computer System Lifecycle Chapter 1. Introduction Computer System users, administrators, and designers are all interested in performance evaluation. Whether.
26-28 th April 2004BioXHIT Kick-off Meeting: WP 5.2Slide 1 WorkPackage 5.2: Implementation of Data management and Project Tracking in Structure Solution.
Introduction to Information System Development.
Introduction to Systems Analysis and Design Trisha Cummings.
Chapter 9 Elements of Systems Design
Database System Development Lifecycle © Pearson Education Limited 1995, 2005.
Languages and Environments Higher Computing Unit 2 – Software Development.
1 BTEC HNC Systems Support Castle College 2007/8 Systems Analysis Lecture 9 Introduction to Design.
Peter J. Briggs, Liz Potterton *, Pryank Patel, Alun Ashton, Charles Ballard, Martyn Winn CLRC Daresbury Laboratory, Warrington, Cheshire WA4 4AD, UK *
Database Technical Session By: Prof. Adarsh Patel.
C++ Programming Language Lecture 1 Introduction By Ghada Al-Mashaqbeh The Hashemite University Computer Engineering Department.
MET280: Computing for Bioinformatics Introduction to databases What is a database? Not a spreadsheet. Data types and uses DBMS (DataBase Management System)
BLU-ICE and the Distributed Control System Constraints for Software Development Strategies Timothy M. McPhillips Stanford Synchrotron Radiation Laboratory.
3. Spot Finding 7(i). 2D Integration 2. Image Handling 7(ii). 3D Integration 4. Indexing 8. Results Gwyndaf Evans 1, Graeme Winter 1, David Waterman 2,
High and low level system integration Kappa meeting Grenoble 20 th Sept
HARDWARE INPUT DEVICES GETTING DATA INTO THE COMPUTER.
17 th October 2005CCP4 Database Meeting (York) CCP4(i)/BIOXHIT Database Project: Scope, Aims, Plans, Status and all that jazz Peter Briggs, Wanjuan Yang.
9 Systems Analysis and Design in a Changing World, Fourth Edition.
Introduction to Morpho BEAM Workshop Samantha Romanello Long Term Ecological Research University of New Mexico.
Chapter 3: Software Project Management Metrics
POINTLESS & SCALA Phil Evans. POINTLESS What does it do? 1. Determination of Laue group & space group from unmerged data i. Finds highest symmetry lattice.
Project Database Handler The Project Database Handler dbCCP4i is a brokering application that mediates interactions between the project database and an.
E-HTPX: A User Perspective Robert Esnouf, University of Oxford.
Data Harvesting: automatic extraction of information necessary for the deposition of structures from protein crystallography Martyn Winn CCP4, Daresbury.
Project Database Handler The Project Database Handler is a brokering application that mediates interactions between the project database and the external.
An Introduction to Mosflm (1) what Mosflm does (2) where it fits in the crystallography process (3) run through a typical job (4) introduction to the CCP4.
17 th October 2005CCP4 Database Meeting York University Database Requirements for CCP4 Projects Monday 17 th October 2005 Abstract Gather information on.
3. Spot Finding 7(i). 2D Integration 2. Image Handling 7(ii). 3D Integration 4. Indexing 8. Results 1. Introduction5. Refinement Background mask and plane.
Mantid Stakeholder Review Nick Draper 01/11/2007.
Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.
Status & development of the software for CALICE-DAQ Tao Wu On behalf of UK Collaboration.
Presentation on Database management Submitted To: Prof: Rutvi Sarang Submitted By: Dharmishtha A. Baria Roll:No:1(sem-3)
Systems Development Life Cycle
Active-HDL Server Farm Course 11. All materials updated on: September 30, 2004 Outline 1.Introduction 2.Advantages 3.Requirements 4.Installation 5.Architecture.
Bringing structural biology services through collaboration.
SOFTWARE DESIGN & SOFTWARE ENGINEERING Software design is a process in which data, program structure, interface and their details are represented by well.
9 Systems Analysis and Design in a Changing World, Fifth Edition.
Automating the Single Crystal X-Ray Diffraction Experiment – Mark Light – School of Chemistry - University of Southampton – ECM22 – Budapest 2004 Automating.
What does the future hold? SAPHIRE CCP4 libraries Program Developments More automation 3D viewer Project CCP4 Study Weekend 2003 BAR!
Systems Analysis and Design in a Changing World, Fourth Edition
Automatic Data Processing at the ESRF
The Crystal Screening Interface at ALS
System Design Ashima Wadhwa.
Database Requirements for CCP4 17th October 2005
Software Quality Engineering
Graeme Winter STFC Computational Science & Engineering
Autoprocessing updates at the MX beamlines
Current CRIMS – ISPyB communication status
Project tracking system for the structure solution software pipeline
Software life cycle models
Automation from a user perspective
Chapter 1 Introduction(1.1)
Recent changes to Mosflm
ftp://ftp.mrc-lmb.cam.ac.uk/mosflm
Presentation transcript:

AUTOMATION OF MACROMOLECULAR DATA COLLECTION - INTEGRATION OF DATA COLLECTION AND DATA PROCESSING Harold R. Powell 1, Graeme Winter 1, Andrew G.W. Leslie 1, Colin Nave 2, Elizabeth Duke 2, Stephen H. Kinder 2, Dave Love 2, Sean McSweeney 3, Olof Svensson 3, Darren Spruce 3, Solange Delageniere 3 (1) MRC-LMB, Hills Road, Cambridge, UK (2) Daresbury Laboratory, Daresbury, Warrington, UK (3) ESRF, BP 220, F-38043, Grenoble Cedex, France In view of the limited availability of beamtime at synchrotron sources and the large number of projects requiring this resource, automation of both data collection and data processing has become increasingly important. Improvements in area detector technology (e.g. the introduction of fast readout devices such as CCDs) also emphasize the fact that human intervention at this stage and that of subsequent data processing is responsible for decreasing the possible levels of throughput attainable. With this in mind we have made considerable progress in integrating data collection and processing and in automating each of these two components. Implementation The project is divided into five phases and is intended to provide useful added functionality at all stages. Phase I and II functionality will be available at the ESRF (beamlines ID14 EH2, EH1) and SRS (beamline 14.2) in the next few months. Phase I. In this phase, the Expert System will simply provide a communication pathway between the data processing software and the beamline control software. Phase II. The parameters of the data collection will be presented to the user in a GUI where they can be edited. Phase III. An additional button will allow the user to integrate the images as they are collected. Information about the results of the integration (eg as a function of resolution or image) will be fed back to the GUI from the data processing. Results of merging the data will also be displayed. Phase IV. Implementation of fully automated data collection and processing. A single button will activate initial characterisation of the crystal, and providing that user-defined criteria regarding resolution, mosaicity etc. are met, the data will be collected and integrated without any user intervention. Phase V. Automated sample loading (including crystal centring in the beam) and a project management system will be integrated with the Expert System. This will allow rank ordering of multiple crystals based on their diffraction properties, and fully automated beamline operation. Mosflm Expert system Key to division of labour CCP4 Phase I/II autoindex estimate mosaicity integrate single image determine data collection strategy Postrefine cell parameters integrate image determine effective resolution limit collect images for postrefinement collect next image in dataset Merge/Scale data collect two images at 0º and 90º start crystal still okay? finish (error) Determine strategy for new point group data collection finished? finish (okay) n n point group consistent? select next highest symmetry point group Merge/Scale data point group changed? n n y y y y Procedures involved in fully automated data collection and processing beamline software to collect the required images, and the data processing software to process the images as they are collected. Automation of the data processing steps is possible because of improvements to Mosflm itself, which allow the appropriate sequence of operations to be carried out in a flexible and robust manner. Commands which were previously only available from the GUI are now accessible on the command line. All the features listed in the flowchart exist in Mosflm version This work has been funded by CCP4 and the EU via the Autostruct initiative and the Max-Inf network. DNA stands for DNA’s Not Autostruct. For further information, visit The primary objective of the DNA project is the provision of software that will allow fully automated collection and processing of diffraction data, including rapid crystal screening. Ultimately the project will be extended to include automated sample loading and a project management system, which will enable a large number of crystals from a number of different projects to be handled without any manual intervention. Three modules (data processing, beamline control and sample control) are linked together to provide a complete system for controlling data collection and processing. Communication between the three modules is handled by an expert system; this makes the crucial decisions about the data collection based on information provided by the data processing module and some basic parameters relating to the project supplied by the user. Direct communication between the Expert system and the different modules is through a server program which uses TCP/IP sockets and a command language conforming to XML standards. The server program has been developed to provide an extendable interface to a “next-generation” GUI for Mosflm. The modular nature of this system simplifies installation on different beamlines, while the open source communications protocol will allow the straightforward integration of other software into the system. At present the system has been implemented as an additional button on the data collection GUIs (PXGEN++ at SRS, ProDC at ESRF) which gives a "characterize crystal" command. The "characterize crystal" command issues instructions to collect two images from a crystal (at 0º and 90º in phi) autoindex each image individually and also both together estimate the effective mosaicity integrate the first image to determine the effective resolution limit calculate a suitable data collection strategy to give maximum completeness for both unique and anomalous data. The success or failure of the autoindexing of one or more test images is used as the initial indicator of crystal quality. This will be judged by the rms error in predicted spot positions and the fraction of spots that are rejected from indexing or refinement. If the autoindexing is successful, the crystal mosaicity will be estimated and the test images will be integrated to obtain an indication of data quality and the effective resolution, deduced from the values as a function of resolution. A data collection strategy based on the deduced Laue group (with the lowest possible symmetry) will also be calculated. The Expert System uses this information to determine if the crystal is really suitable for data collection (for example, if the required resolution can be achieved) and to determine data collection parameters. It then instructs the Expert System Data processing module Beamline control module Sample control module Disk storage database PXGEN++ (left) and ProDC (above) GUIs before and after characterizing crystal