Data collection, data processing and scaling (1) relationship of Mosflm to CCP4 (2) some thoughts on data collection (3) simple processing with Mosflm.

Presentation transcript:

Data collection, data processing and scaling
(1) relationship of Mosflm to CCP4
(2) some thoughts on data collection
(3) simple processing with Mosflm
(4) improving data collection with Mosflm
(5) simple processing with SCALA (using ccp4i)
Harry Powell, San Antonio, May 25th 2002

* Number of Datasets = 1
* Dataset ID, project/crystal name, dataset name, cell dimensions, wavelength:
  1 lys_fine / 02_05_02:13:53:
* Number of Columns = 16
* Number of Reflections =
* Missing value set to NaN in input mtz file
* Number of Batches = 50
* HISTORY for current MTZ file :
  From MOSFLM run on 2/ 5/02
* Resolution Range :
  ( A )
* There is no sort order recorded in the MTZ header
* Space group = P43212 (number 96)

The X-ray Experiment
crystal -> X-rays -> images -> integrate (Mosflm) -> scale (SCALA) -> phase

Optimization of Data Collection
Pre-process at least one image (preferably two at 90° to each other) to obtain:
- Cell parameters, crystal orientation and putative Laue group
- Estimate of mosaicity
- Effective resolution limit
- Crystal to detector distance
- Exposure time
- Strategy for data collection
Remember! This is the last experimental stage: if you collect bad data now you are stuck with it. No data processing program can rescue the irredeemable!
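The link between effective resolution limit and crystal to detector distance follows from Bragg's law. The sketch below is a minimal illustration, assuming a flat detector perpendicular to the beam with the direct beam at its centre; the function names and the numbers in the usage note are hypothetical and do not come from the slides or from Mosflm.

```python
import math


def resolution_at_radius(wavelength_A, distance_mm, radius_mm):
    """Bragg spacing d (in Angstrom) recorded radius_mm from the direct beam.

    Assumes a flat detector normal to the beam, so tan(2*theta) = radius / distance
    and Bragg's law gives d = wavelength / (2 * sin(theta)).
    """
    two_theta = math.atan2(radius_mm, distance_mm)
    return wavelength_A / (2.0 * math.sin(two_theta / 2.0))


def distance_for_resolution(wavelength_A, d_min_A, radius_mm):
    """Crystal to detector distance (mm) that places spacing d_min_A at radius_mm.

    Valid while the scattering angle 2*theta stays below 90 degrees.
    """
    two_theta = 2.0 * math.asin(wavelength_A / (2.0 * d_min_A))
    return radius_mm / math.tan(two_theta)
```

For instance, `resolution_at_radius(1.0, 150.0, 100.0)` gives the spacing at the edge of a hypothetical detector of 100 mm radius placed 150 mm from the crystal, and `distance_for_resolution` inverts the calculation when the diffraction limit seen on the test images suggests moving the detector.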

What is needed prior to running Mosflm?
- X-ray images
- Experimental details (e.g. detector type, direct beam position, wavelength, etc.)
- The program itself and a computer to run it on!

[localhost:~/test/muldlx1] harry% ls *mar2000
muldlx1_301.mar2000 muldlx1_307.mar2000 muldlx1_313.mar2000 muldlx1_319.mar2000
muldlx1_302.mar2000 muldlx1_308.mar2000 muldlx1_314.mar2000 muldlx1_320.mar2000
muldlx1_303.mar2000 muldlx1_309.mar2000 muldlx1_315.mar2000 muldlx1_321.mar2000
muldlx1_304.mar2000 muldlx1_310.mar2000 muldlx1_316.mar2000 muldlx1_322.mar2000
muldlx1_305.mar2000 muldlx1_311.mar2000 muldlx1_317.mar2000
muldlx1_306.mar2000 muldlx1_312.mar2000 muldlx1_318.mar2000
[localhost:~/test/muldlx1] harry% ipmosflm
************ Version for Image plate and CCD data 23 April 2002 ***********
A.G.W. Leslie, MRC Laboratory Of Molecular Biology, HILLS ROAD, CAMBRIDGE CB2 2QH, UK
New auto-indexing using DPS due to Ingo Steller Robert Bolotovsky and Michael Rossmann (1998) J. Appl. Cryst. 30,
Original auto-indexing using REFIX due to Wolfgang Kabsch (Kabsch,W. (1993), J.Appl.Cryst. 24, )
X-windows interface using xdl_view due to John Campbell (Daresbury Laboratory, UK.) (Campbell,J.W. (1995) J. Appl. Cryst. 28,
MOSFLM => image muldlx1_301.mar2000
MOSFLM => go
(Q)QOPEN: file opened on unit 1 Status: READONLY
Logical Name: muldlx1_301.mar2000 Filename: muldlx1_301.mar2000
Crystal to detector distance of mm taken from image header
Wavelength of A taken from image header

Crystal to detector distance of mm taken from image header
Wavelength of A taken from image header
Pixel size of mm taken from image header.
Start and end phi values for image 1 from image header are and degrees.
image FILENAME: muldlx1_301.mar2000
(Q)QOPEN: file opened on unit 1 Status: READONLY
Logical Name: muldlx1_301.mar2000 Filename: muldlx1_301.mar2000
The red circle denotes the region behind the backstop shadow. (Use the BACKSTOP keyword to set this.)

Running the STRATEGY option
In the GUI, click on the STRATEGY button.
On the command line, type STRATEGY.
In either case, determining a suitable strategy for data collection is straightforward once you have a cell, orientation and crystal symmetry.
Then run TESTGEN to check for overlaps.

Checking the output (1)
There are two useful log files:
- SUMMARY: most useful when viewed with the CCP4 graph viewer LOGGRAPH, as it contains graphs of the parameters that varied during data processing.
- mosflm.lp: this can be very large, and contains a complete record of the experiment.

Checking the output (2)
If everything has gone right so far, check the MTZ file: is it as you’d expect?

Command-line processing
Most commands in the GUI are available from the command line; e.g. for the test images, the following could be done:
# detector mar ! not necessary here!
template muldlx1_###.pck
beam
autoindex dps image 301
mosaic estimate
go
postref segment 1
process
go
postref nosegment fix all
process
go
exit
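The template line uses a run of # characters to stand for the image number. The sketch below shows one way to expand such a template into a filename, for example to check that it matches the files actually on disk before starting a job. It assumes the number is zero-padded to the width of the # run; `expand_template` is a hypothetical helper written for illustration and is not part of Mosflm.

```python
def expand_template(template, image_number):
    """Replace the single run of '#' in template with the zero-padded image number.

    Assumption: the width of the '#' run sets the number of digits.
    Raises ValueError if there is no '#', more than one run, or too many digits.
    """
    width = template.count("#")
    start = template.index("#")  # ValueError if the template has no '#'
    if template[start:start + width] != "#" * width:
        raise ValueError("expected one contiguous run of '#' in the template")
    digits = str(image_number).zfill(width)
    if len(digits) > width:
        raise ValueError("image number does not fit the template width")
    return template[:start] + digits + template[start + width:]
```

Combined with `os.path.exists`, this can be used to confirm that every image in the intended range is present, and that the template extension agrees with the image files, before processing begins.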

Scaling with SCALA in ccp4i
Scaling and merging the data is the next step following integration. It is important because:
- it attempts to put all observations on a common scale
- it provides the main diagnostics of data quality and whether the data collection is satisfactory
Because of this diagnostic role, it is important that data are scaled as soon as possible after collection, or during collection, preferably while the crystal is still on the camera.
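A widely used agreement statistic for merged data is R_merge = Σ_hkl Σ_i |I_i − ⟨I⟩| / Σ_hkl Σ_i I_i, where the I_i are the repeated observations of one reflection and ⟨I⟩ is their mean. The sketch below illustrates the idea under simplifying assumptions: the observations are already on a common scale and reduced to unique hkl indices, and an unweighted mean is used. It is not SCALA's algorithm, which also refines the scale model and weights the observations; the function name and the data layout are hypothetical.

```python
from collections import defaultdict


def merge_and_rmerge(observations):
    """Merge repeated observations and compute an unweighted R_merge.

    observations: iterable of ((h, k, l), intensity) pairs, assumed to be
    already scaled and reduced to unique indices.
    Returns (merged, rmerge), where merged maps hkl to the mean intensity.
    """
    groups = defaultdict(list)
    for hkl, intensity in observations:
        groups[hkl].append(intensity)

    merged = {hkl: sum(vals) / len(vals) for hkl, vals in groups.items()}

    numerator = 0.0
    denominator = 0.0
    for hkl, vals in groups.items():
        if len(vals) < 2:
            continue  # a single observation says nothing about agreement
        mean = merged[hkl]
        numerator += sum(abs(i - mean) for i in vals)
        denominator += sum(vals)

    rmerge = numerator / denominator if denominator else float("nan")
    return merged, rmerge
```

Reflections measured only once are kept in the merged output but excluded from the statistic, since they contribute no information about how well repeated measurements agree.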

Checking the output of SCALA
Check these files/plots:
- ROGUES
- Normal probability plot(s)
- Surface plot
- SCALA log file
- loggraph output