Developments in xia2 Graeme Winter CCP4 Dev Meeting 2008.

Slides:



Advertisements
Similar presentations
OCCULAR OCCUltation Limovie Analysis Routine Presented by: Bob Anderson and Tony George (IOTA) A program to easily detect and time occultations and standardize.
Advertisements

CCP4 workshop Argonne, June 2011 Strategy of diffraction data collection (at synchrotrons) Zbigniew Dauter MCL-NCI & ANL-BIO.
Global Business Management Group
Why python? Automate processes Batch programming Faster Open source Easy recognition of errors Good for data management What is python? Scripting programming.
SIGMA Workshop Part 3: Statistical Screening Gönenç Yücel SESDYN Research Group Boğaziçi University, Istanbul 1.
RAMADDA for Big Climate Data Don Murray NOAA/ESRL/PSD and CU-CIRES Boulder/Denver Big Data Meetup - June 18, 2014.
Bob Sweet Bill Furey Considerations in Collection of Anomalous Data.
Experimental Phasing stuff. Centric reflections |F P | |F PH | FHFH Isomorphous replacement F P + F H = F PH FPFP F PH FHFH.
Data Collection and Processing Using APEX2, SHELXTL and the Bruker PHOTON 100 Kevin J. Gagnon
1 Team Skill 4 - Team Skill 5 - Scope Refining the Systems Definition (Chapters of the requirements text) CSSE 371 Software Requirements and Specification.
Data collection, data processing and scaling (1) relationship of Mosflm to CCP4 (2) some thoughts on data collection (3) simple processing with Mosflm.
FRAGMENTAION UNCERTAINTIES IN THE EXTRACTION OF FROM THE SEMI-INCLUSIVE MEASUREMENT OF B  X d  Tim Beck, Bruce Schumm Radiative Penguin Meeting April.
Automated protein structure solution for weak SAD data Pavol Skubak and Navraj Pannu Automated protein structure solution for weak SAD data Pavol Skubak.
Data Analysis I19 Upgrade Workshop 11 Feb Overview Short history of automated processing for Diamond MX beamlines Effects of adding Pilatus detectors.
CCP4 Study Weekend 3rd January 2003 CCP4i - “Tricks and Tools” Peter Briggs CCP4 Daresbury.
Peter J. Briggs, Liz Potterton *, Pryank Patel, Alun Ashton, Charles Ballard, Martyn Winn CLRC Daresbury Laboratory, Warrington, Cheshire WA4 4AD, UK *
28 Mar 06Automation1 Overview of developments within CCP4 Generation 1 ccp4i tasks Generation 2 isolated scripts / web service Generation 3 integrated.
Progress report on Crank: Experimental phasing Biophysical Structural Chemistry Leiden University, The Netherlands.
Kappa – from a users perspective ?. Standard today Most Labs/synchrotrons use single rotation axis for data collection Most Labs/synchrotrons use single.
28 th March 2007CCP4 Developers Meeting BIOXHIT/CCP4(i) Database Wanjuan Yang & Peter Briggs.
3. Spot Finding 7(i). 2D Integration 2. Image Handling 7(ii). 3D Integration 4. Indexing 8. Results Gwyndaf Evans 1, Graeme Winter 1, David Waterman 2,
An Introduction to CCP4i The CCP4 Graphical User Interface Peter Briggs CCP4.
Abab presents today. A closer look at the production process of a movie soundtrack. What elements are affecting recorded sound quality ?
Overview of MR in CCP4 II. Roadmap
Crank and Databases Steven Ness Leiden University The Netherlands.
Bulk Model Construction and Molecular Replacement in CCP4 Automation Ronan Keegan, Norman Stein, Martyn Winn.
1. Diffraction intensity 2. Patterson map Lecture
17 th October 2005CCP4 Database Meeting (York) CCP4(i)/BIOXHIT Database Project: Scope, Aims, Plans, Status and all that jazz Peter Briggs, Wanjuan Yang.
Zhang, T., He, Y., Wang, J.W., Wu, L.J., Zheng, C.D., Hao, Q., Gu, Y.X. and Fan, H.F. (2012) Institute of Physics, Chinese Academy of Sciences Beijing,
CONFIDENTIAL MATERIALS CATASTROPHE MODELING, PORTFOLIO BUILDING AND OPTIMIZATION.
POINTLESS & SCALA Phil Evans. POINTLESS What does it do? 1. Determination of Laue group & space group from unmerged data i. Finds highest symmetry lattice.
In context…. xia2: what is it? Automated expert data reduction – images in, reflections suitable for phasing out. Handles: –MAD data –Multiple passes.
Project Database Handler The Project Database Handler dbCCP4i is a brokering application that mediates interactions between the project database and an.
Optimizing structure determination How many are we solving? What is the limit? Are we there yet? Why not? What are the biggest problems?
OPERATING SYSTEMS BY LANDON, KYLE, AND ETHAN. WHAT IS THEIR PURPOSE? (1) manage the computer's resources, such as the central processing unit (2) establish.
Data Harvesting: automatic extraction of information necessary for the deposition of structures from protein crystallography Martyn Winn CCP4, Daresbury.
Project Database Handler The Project Database Handler is a brokering application that mediates interactions between the project database and the external.
An Introduction to Mosflm (1) what Mosflm does (2) where it fits in the crystallography process (3) run through a typical job (4) introduction to the CCP4.
3. Spot Finding 7(i). 2D Integration 2. Image Handling 7(ii). 3D Integration 4. Indexing 8. Results 1. Introduction5. Refinement Background mask and plane.
Computational Crystallography InitiativePhysical Biosciences Division First Aid & Pathology Data quality assessment in PHENIX Peter Zwart.
Direct Use of Phase Information in Refmac Abingdon, University of Leiden P. Skubák.
BEST strategy / SAD optimization Gleb Bourenkov EMBL-Hamburg Kappa Workgroup Meeting September 28-29, 2009 MAXLAB.
Atomic structure model
Anomalous Differences Bijvoet differences (hkl) vs (-h-k-l) Dispersive Differences 1 (hkl) vs 2 (hkl) From merged (hkl)’s.
CCP4 output Presentation of program output to users The problem: Nobody reads anything Developers can’t be bothered to explain (and don’t know about tools.
Software automation – What STAB sees as key aims? 1.Brief review of activities and recommendations (so far) 2.Reality checks 3. Things to do…
Center for Structures of Membrane Proteins © 2006 Optimizing x-ray structure determination James Holton LBNL/UCSF April 6, 2006.
Peter Hobson - Brunel University, UK3 July 2003 VPT faceplate testing  Samples of faceplate glass are irradiated to approximately 20 kGy gamma ray dose.
, Dan Peterson Apparent inconsistencies and other issues in the xBSM measurements of IBS Scans We have studied the pinhole and CodedAperture.
Electron Spectrometer: Status July 14 Simon Jolly, Lawrence Deacon 1 st July 2014.
Managing crystallization experiments within PIMS.
AUTOMATION OF MACROMOLECULAR DATA COLLECTION - INTEGRATION OF DATA COLLECTION AND DATA PROCESSING Harold R. Powell 1, Graeme Winter 1, Andrew G.W. Leslie.
Alan Williams University of Manchester.  Allows you to call a command line script as part of a workflow  Simplest case is calling a single tool  Can.
ISPyB for MX at Diamond Pierre Aller. -Before beamtime Shipping preparation Sample registration -During beamtime Beamline status (remote) Puck allocation.
Why are. we not solving more struct tures? James Holton University of California San Francisco and Advanced Light Source Lawrence.
C. Rogers, ASTeC Intense Beams Group Rutherford Appleton Laboratory
OASIS-2004 A direct-method program for
Brian Leonard ブライアン レオナルド
CCP4 6.1 and beyond: Tools for Macromolecular Crystallography
Database Requirements for CCP4 17th October 2005
GRAPHICS Source:
Project tracking system for the structure solution software pipeline
Almost at the end … “If you don’t remember anything else, remember this !!!!”
SAFS Quantitative Seminar
Provide quick feedback to data collection experiments.
CCP4 Daresbury Laboratory
ftp://ftp.mrc-lmb.cam.ac.uk/mosflm
The site to download BALBES:
Presentation transcript:

Developments in xia2 Graeme Winter CCP4 Dev Meeting 2008

What is xia2? Automated robust data reduction and analysis Thorough – takes additional steps when many users wouldn’t bother In: images from e.g. synchrotron beamline Out: measurements for downstream phasing via e.g. HAPPy, Mr BUMP, Phenix…

Recent changes Inclusion in CCP4 6.1 Many command line options Integrated with AutoRickshaw (EMBL H) Robust lattice determination Support for Q270, Pilatus Zero input option

3 Month plans BioXHit ends in June => so does xia2 development Include robust system to decide resolution limits etc (next slides) Finish release to go with release version of CCP4 6.1

Chef Let’s cook them books!

What is chef? A tool to help you use the best of the reflections you have Uses unmerged intensities Uses robust statistics to decide:  d* min for different functions (resolution)  D max for different functions (dose) Additional program “doser” to add dose information to unmerged MTZ files

In MTZ files from scala with “output unmerged” set DOSE / TIME information for doser:  BATCH 1 DOSE 2.5 TIME 2.5  BATCH 2 DOSE 7.5 TIME 8.2  …

Running doser hklin TS03_12287_chef_INFL.mtz hklout infl.mtz < doser.in doser hklin TS03_12287_chef_LREM.mtz hklout lrem.mtz < doser.in doser hklin TS03_12287_chef_PEAK.mtz hklout peak.mtz < doser.in chef hklin1 infl.mtz hklin2 lrem.mtz hklin3 peak.mtz << eof isigma 2.0 resolution 1.65 range width 30 max 1500 print comp rd rdcu anomalous on labin BASE=DOSE eof

Output Resolution vs. dose Completeness vs. dose for each data set

Methods Based on “new” cumulative-pairwise R factor R CP: Inspired by R d in Diederichs (2006)

And R CP means..? How well do the measurements up to dose D agree? Closely related to I/σ Reasonably robust as it does not depend on sigma estimates or means Gets bigger when systematic variation contributes to spread

Requirements Radiation damaged MAD data – what do I want for:  Substructure determination – big anomalous / dispersive signal  Phase calculation – well measured ΔF  Phase extension & improvement – good F  Refinement – good F 85% Limit R CP < R(I/σ) + S(I/σ, N m, N u )

Example JCSG TB0541 – heavily radiation damaged… 3 wavelength MAD – INFL + LREM, PEAK Massive signal P43212, 90 degrees * 3 => plenty of data Chef says “use data to 1.65A, D=~600s”

Before (INFL) For TS03/12287/INFL High resolution limit Low resolution limit Completeness Multiplicity I/sigma Rmerge Rmeas(I) Rmeas(I+/-) Rpim(I) Rpim(I+/-) Wilson B factor Anomalous completeness Anomalous multiplicity Anomalous correlation

After (INFL – first 60 degrees) For TEST001/12287/LREM High resolution limit Low resolution limit Completeness Multiplicity I/sigma Rmerge Rmeas(I) Rmeas(I+/-) Rpim(I) Rpim(I+/-) Wilson B factor Anomalous completeness Anomalous multiplicity Anomalous correlation

Why improvement? Limit radiation damage => σF more meaningful Limit damage => ΔF better Without systematic damage get higher resolution for given I/σ

However… Pipe MTZ through scaleit / solve / cad / resolve / Arp/Warp and get very similar results – slight improvement though This is most interesting, because it means that 55% of the “data” did not add to the quality of the result

Plans Currently writing this up for J. Appl. Cryst Chef will be included in CCP4 6.1 Next: include this as part of xia2 (makes 0.3.0) Extend chef to make decisions about anomalous / dispersive differences