Progress report on Crank: Experimental phasing Biophysical Structural Chemistry Leiden University, The Netherlands.

Slides:



Advertisements
Similar presentations
Active Appearance Models
Advertisements

A Robust Super Resolution Method for Images of 3D Scenes Pablo L. Sala Department of Computer Science University of Toronto.
1 Regression as Moment Structure. 2 Regression Equation Y =  X + v Observable Variables Y z = X Moment matrix  YY  YX  =  YX  XX Moment structure.
Learning from spectropolarimetric observations A. Asensio Ramos Instituto de Astrofísica de Canarias aasensio.github.io/blog.
M.I.R.(A.S.) S.M. Prince U.M.I.S.T.. The only generally applicable way of solving macromolecular crystal structure No reliance on homologous structure.
M.I.R.(A.S.) S.M. Prince U.M.I.S.T.. The only generally applicable way of solving macromolecular crystal structure No reliance on homologous structure.
Computational Methods for Management and Economics Carla Gomes Module 8b The transportation simplex method.
Analysis of variance (ANOVA)-the General Linear Model (GLM)
Experimental Phasing stuff. Centric reflections |F P | |F PH | FHFH Isomorphous replacement F P + F H = F PH FPFP F PH FHFH.
Refinement of Macromolecular structures using REFMAC5 Garib N Murshudov York Structural Laboratory Chemistry Department University of York.
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem, random variables, pdfs 2Functions.
(Re)introduction to Mx. Starting at the beginning Data preparation Mx expects 1 line per case/family Almost limitless number of families and variables.
ENGG 1801 Engineering Computing MATLAB Lecture 7: Tutorial Weeks Solution of nonlinear algebraic equations (II)
Don't fffear the buccaneer Kevin Cowtan, York. ● Map simulation ⇨ A tool for building robust statistical methods ● 'Pirate' ⇨ A new statistical phase improvement.
In Macromolecular Crystallography Use of anomalous signal in phasing
Automated protein structure solution for weak SAD data Pavol Skubak and Navraj Pannu Automated protein structure solution for weak SAD data Pavol Skubak.
Phasing based on anomalous diffraction Zbigniew Dauter.
CALIBRATION METHODS.
3.7. O THER G AME P HYSICS A PPROACHES Overview of other game engine physics approaches.
Section 8.3 – Systems of Linear Equations - Determinants Using Determinants to Solve Systems of Equations A determinant is a value that is obtained from.
Objectives of Multiple Regression
MODEL REFERENCE ADAPTIVE CONTROL
9/7/ Gauss-Siedel Method Chemical Engineering Majors Authors: Autar Kaw Transforming.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Example Clustered Transformations MAP Adaptation Resources: ECE 7000:
The Campbell Collaborationwww.campbellcollaboration.org Introduction to Robust Standard Errors Emily E. Tanner-Smith Associate Editor, Methods Coordinating.
CS 360 Lecture 3.  The software process is a structured set of activities required to develop a software system.  Fundamental Assumption:  Good software.
28 Mar 06Automation1 Overview of developments within CCP4 Generation 1 ccp4i tasks Generation 2 isolated scripts / web service Generation 3 integrated.
Analytical vs. Numerical Minimization Each experimental data point, l, has an error, ε l, associated with it ‣ Difference between the experimentally measured.
Transmit beam-pattern synthesis Waveform design for active sensing Chapters 13 – 14.
23 rd March 2005CCP4 Annual Developers’ Meeting 1 DL: Releases, Conferences and Other Activities Peter Briggs, CCP4 Daresbury.
An Introduction to CCP4i The CCP4 Graphical User Interface Peter Briggs CCP4.
Danila Filipponi Simonetta Cozzi ISTAT, Italy Outlier Identification Procedures for Contingency Tables in Longitudinal Data Roma,8-11 July 2008.
Crank and Databases Steven Ness Leiden University The Netherlands.
Chapter 6. Control Charts for Variables. Subgroup Data with Unknown  and 
Parallel Fine Sampling to Solve Large or Difficult Structures Manually exploring large parameter space to find right combination of parameters is time-
Statistical-Time Access Fairness Index of One-Bit Feedback Fair Scheduler Fumio Ishizaki Dept. of Systems Design and Engineering Nanzan University, Japan.
Zhang, T., He, Y., Wang, J.W., Wu, L.J., Zheng, C.D., Hao, Q., Gu, Y.X. and Fan, H.F. (2012) Institute of Physics, Chinese Academy of Sciences Beijing,
Silesian University of Technology in Gliwice Inverse approach for identification of the shrinkage gap thermal resistance in continuous casting of metals.
Background Subtraction and Likelihood Method of Analysis: First Attempt Jose Benitez 6/26/2006.
Prediction of NMR Chemical Shifts. A Chemometrical Approach К.А. Blinov, Y.D. Smurnyy, Т.S. Churanova, М.Е. Elyashberg Advanced Chemistry Development (ACD)
Linear Systems – Iterative methods
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Supervised Learning Resources: AG: Conditional Maximum Likelihood DP:
Computational Crystallography InitiativePhysical Biosciences Division First Aid & Pathology Data quality assessment in PHENIX Peter Zwart.
Direct Use of Phase Information in Refmac Abingdon, University of Leiden P. Skubák.
Week 41 How to find estimators? There are two main methods for finding estimators: 1) Method of moments. 2) The method of Maximum likelihood. Sometimes.
Progress report on Crank: Model building Biophysical Structural Chemistry Leiden University, The Netherlands.
Handout Seven: Independent-Samples t Test Instructor: Dr. Amery Wu
17 th October 2005CCP4 Database Meeting (York) CCP4i Database Overview Peter Briggs.
Developments in xia2 Graeme Winter CCP4 Dev Meeting 2008.
September 28, 2000 Improved Simultaneous Data Reconciliation, Bias Detection and Identification Using Mixed Integer Optimization Methods Presented by:
Handout Six: Sample Size, Effect Size, Power, and Assumptions of ANOVA EPSE 592 Experimental Designs and Analysis in Educational Research Instructor: Dr.
Software Development Process CS 360 Lecture 3. Software Process The software process is a structured set of activities required to develop a software.
1 Efficiency calculation using the System8 method Benoit Clément, Thomas Delemontex, Fabienne Ledroit.
Thomas F. Edgar (UT-Austin) RLS – Linear Models Virtual Control Book 12/06 Recursive Least Squares Parameter Estimation for Linear Steady State and Dynamic.
Stony Brook Integrative Structural Biology Organization
Gauss-Siedel Method.
CH 5: Multivariate Methods
CCP4 6.1 and beyond: Tools for Macromolecular Crystallography
Complete automation in CCP4 What do we need and how to achieve it?
CCP4 from a user perspective
A quadratic equation is written in the Standard Form,
ENGG 1801 Engineering Computing
Almost at the end … “If you don’t remember anything else, remember this !!!!”
Experimental phasing in Crank2 Pavol Skubak and Navraj Pannu Biophysical Structural Chemistry, Leiden University, The Netherlands
Core Activities Review of 06/07 Plans for 07/08 29 Mar 05 Review.
Unfolding with system identification
Solving Multi Step Equations
Solving Multi Step Equations
Presentation transcript:

Progress report on Crank: Experimental phasing Biophysical Structural Chemistry Leiden University, The Netherlands

Crank developments available in CCP4 6.1 “Greatly enhanced” – better tested Underlying programs haven’t changed (much), but crank almost completely re-written from version in Better ccp4i interface Support for more programs (PIRATE, BUCCANEER, RESOLVE, COOT) Faster substructure detection –Use BP3 to (quickly) check trials and look at deviations between different CRUNCH2 trials significantly decreases the time required for successful substructure detection.

Speeding up CRUNCH2: Results showing improvement Resol. (Å) Anom. atoms Exp.Time (old) (min) Time (new) (min) subtilisin1.773 Ca,SAD Carboxyl proteinase 1.89 BrSAD-peak gere SeMAD p/i/h cyanase SeMAD p/i/h thioesterase BrSAD-peak

Improved CCP4i interface

Preliminary substructure detection results from JCSG test cases 144 mostly MAD Se-Met data sets Defaults only: the only input was number of Se- Met per monomer (number of monomer was guessed). Mtz files, f’, f”. Some data sets had f” < 1 (solved by MR) Some data sets had incorrectly labelled X-PLOR files as mtz. DISCLAIMER: 1 st logfiles produced and analyzed yesterday after dinner (until 4 a.m.).

AFRO/CRUNCH2 vs SHELXC/D (both run in CRANK) CRUNCH2SHELXD 100 % found % found15 Input error25 My input error032 total144 Of the 79 jobs in common, crunch2 was faster in 20 jobs, while shelxd was faster in 59.

Comparison not fair Same algorithm to identify solution with BP3 can be used in SHELXD SHELXD uses much better Fa values (i.e. using the MAD data – at the moment, Afro just uses delta F from the data set with the greatest anomalous signal).

Improving F A values An early step in solving a structure by SAD/MAD or SIRAS is to determine F A values. F A is the structure factor amplitude corresponding to the substructure to input to direct methods and/or Patterson programs (i.e. SHELXD or CRUNCH2)

Current F A estimation F A is currently estimated by | |F + | - |F - | | for SAD data in most programs. Direct method programs are very sensitive to F A values. Improving estimates can improve hit rates of direct methods and solve substructures that can not previously been solved.

Multivariate SAD equation E(|F A |,|F + |,|F - |) =     |F A | P(|F A |, α A,| |F + |, α +,|F - |, α - ) d|F A | dα A dα + dα - Giacovazzo previously proposed multivariate F A estimation, with an implementation assuming Bijvoet phases are equal. An equation can be obtained without the equal phase assumption requiring only one numerical integration. The equation has been implemented – which reduces to Giacovazzo’s equation if Bijvoet phases are equal.

Covariance matrix properties The covariance matrix considers experimental sigmas and correlations between F +, F - and F A. Problem: Covariance matrix also depends on (overall) substructure occupancy and b-factor. Solution: Obtain a multivariate likelihood estimate for unknown parameters.

Refining overall substructure parameters Initial guess of number of substructure atoms per monomer obtained from user. Initial guess of B-factor obtained from likelihood estimate of overall B-factor of data set. Result: Refinement is stable and maximizes correlation with calculated final E’s. Another possible application: Use refined overall occupancy and B-factor for anomalous signal estimation.

Test cases: Correlations with final calculated E’s ResoAnom Atom f ˝ Corr ΔE Corr E multi Ferrodoxin0.94Fe Thioesterase2.5Se Lyso S Lyso S DNA P DNA 901.5P

More robustness in difficult cases with CRUNCH2 Using default parameters (resolution cutoff of 0.5 from the high resolution limit). Hit rate: ΔEHit rate: E multi Ferrodoxin1/204/20 DNA 90*0/202/20 * Can be solved with ΔE by using data to 1.5 Angstroms