Complete automation in CCP4 What do we need and how to achieve it?

Slides:



Advertisements
Similar presentations
Unravelling the biochemical reaction kinetics from time-series data Santiago Schnell Indiana University School of Informatics and Biocomplexity Institute.
Advertisements

Crystallography, Birkbeck MOLECULAR SIMULATIONS ALL YOU (N)EVER WANTED TO KNOW Julia M. Goodfellow Dynamic Processes: Lecture 1 Lecture Notes.
Automated phase improvement and model building with Parrot and Buccaneer Kevin Cowtan
Introduction to protein x-ray crystallography. Electromagnetic waves E- electromagnetic field strength A- amplitude  - angular velocity - frequency.
Determination of Protein Structure. Methods for Determining Structures X-ray crystallography – uses an X-ray diffraction pattern and electron density.
Not just a pretty picture Jul. 14 Hyunwook Lee.
CHE Inorganic, Physical & Solid State Chemistry Advanced Quantum Chemistry: lecture 4 Rob Jackson LJ1.16,
Small Molecule Example – YLID Unit Cell Contents and Z Value
Lipinski’s rule of five
Iterative Relaxation of Constraints (IRC) Can’t solve originalCan solve relaxed PRMs sample randomly but… start goal C-obst difficult to sample points.
Refinement of Macromolecular structures using REFMAC5 Garib N Murshudov York Structural Laboratory Chemistry Department University of York.
A Molecular Replacement Pipeline Garib Murshudov Chemistry Department, University of York 
Macromolecular structure refinement Garib N Murshudov York Structural Biology Laboratory Chemistry Department University of York.
Automated protein structure solution for weak SAD data Pavol Skubak and Navraj Pannu Automated protein structure solution for weak SAD data Pavol Skubak.
A genetic algorithm for structure based de-novo design Scott C.-H. Pegg, Jose J. Haresco & Irwin D. Kuntz February 21, 2006.
Carbon Chemistry - Properties of Carbon Carbon Atoms and Bonding Carbon atoms and the bonds between them can be modeled in several ways.
A Molecular Replacement Pipeline Garib Murshudov Chemistry Department, University of York 
BALBES (Current working name) A. Vagin, F. Long, J. Foadi, A. Lebedev G. Murshudov Chemistry Department, University of York.
 Four levels of protein structure  Linear  Sub-Structure  3D Structure  Complex Structure.
1 Lesson 8: Basic Monte Carlo integration We begin the 2 nd phase of our course: Study of general mathematics of MC We begin the 2 nd phase of our course:
Function first: a powerful approach to post-genomic drug discovery Stephen F. Betz, Susan M. Baxter and Jacquelyn S. Fetrow GeneFormatics Presented by.
17 th October 2005CCP4 Database Meeting (York) CCP4(i)/BIOXHIT Database Project: Scope, Aims, Plans, Status and all that jazz Peter Briggs, Wanjuan Yang.
Topic 4 - Database Design Unit 1 – Database Analysis and Design Advanced Higher Information Systems St Kentigern’s Academy.
Atomic structure model
Fitting EM maps into X-ray Data Alexei Vagin York Structural Biology Laboratory University of York.
1 Chapter 10 States of Matter. Essential Question What are physical & chemical properties of liquids and solids? Standard 2h Students will identify solids.
RAPPER Nick Furnham Blundell Group – Department of Biochemistry Cambridge University UK
Guide to Functional Decomposition
SFCHECK Alexei Vagin YSBL, Chemistry Department, University of York.
Investigate Plan Design Create Evaluate (Test it to objective evaluation at each stage of the design cycle) state – describe - explain the problem some.
Page 1 Phase Determination by Creative BiostructureCreative Biostructure.
Lipinski’s rule of five
Lesson 8: Basic Monte Carlo integration
Stony Brook Integrative Structural Biology Organization
PDBe Protein Interfaces, Surfaces and Assemblies
CHEMISTRY 103 Fall 2017.
Model Building and Refinement for CHEM 645
CCP4 6.1 and beyond: Tools for Macromolecular Crystallography
Chapter 10 Properties of Solids and Liquids
Database Requirements for CCP4 17th October 2005
8.2 Families and Periods of the Periodic Table
Object oriented system development life cycle
An In-Depth Look at the GES Family of Enzymes
Reduce the need for human intervention in protein model building
Crystal structure determination
1.b What are current best practices for selecting an initial target ligand atomic model(s) for structure refinement from X-ray diffraction data?
Sequence comparison: Multiple testing correction
Virtual Screening.
Engineering Design Process
Current Status at BioChemtek
Nobel Laureates of X Ray Crystallography
S. Takeda, A. Yamashita, K. Maeda, Y. Maeda
Flotation Circuits.
CSE (c) S. Tanimoto, 2001 Search-Introduction
Protein structure prediction.
Chemistry – Final Exam Review
Scientific Method—designing an experiment.
MrBUMP: progress and plans
Volume 15, Issue 9, Pages (September 2007)
Solution and Crystal Structures of a Sugar Binding Site Mutant of Cyanovirin-N: No Evidence of Domain Swapping  Elena Matei, William Furey, Angela M.
Scientific Method—designing an experiment.
CSE (c) S. Tanimoto, 2002 State-Space Search
The Chemistry of Carbon
Crystal Structure of the RuvA-RuvB Complex
Volume 87, Issue 2, Pages (October 1996)
Refinement quantity and quality
What is a mole? Why do we use the mole?
CSE (c) S. Tanimoto, 2004 State-Space Search
Auto molecular replacement
Database for MR.
Presentation transcript:

Complete automation in CCP4 What do we need and how to achieve it?

Starting point Given data and experimental setup (protein name or sequence, crystallisation condition(s), probable ligands) to solve the structure, complete refinement with complete model building, fit ligand(s), validate (against experiment as well as prior knowledge), do early analysis or calculations to help early analysis. All calculations and decisions should be made automatically I estimate success rate using current technology roughly around 80% (may be more).

Overall view Interpretation DATA Optimisation feedback data MR/heavy atom/ab in Interpretation DATA Optimisation feedback data Critical eval Validation Completion Early analysis

Aim: Automation of X-ray Analysis (Equivalent to Microscope) Data Data base of chemical, structural knowledge Molecular replacement Experimental phasing Refine/Build/Rebuild Refine/Complete ! Final structure

Knowledgebase: Discuss Structural knowledge domains, multimers, rotamers, links between similar structures, fragments and their properties. Crystals - cells, intensities, solvent contents, resolutions … Chemical knowledge Monomers and their valence bond chemistry Possible modifications Smaller structural units Protocols and structure solution techniques How to organise them? How to use them?

Components DATA – Experimental data, sequence/protein name, crystallisation condition(s), ligand(s) Interpretation – Using DB and accumulated knowledge build the model Optimisation - Refine the model against data and prior knowledge Criticism – Is model consistent with the data and prior knowledge. Some of the prior knowledge may have not been used in optimisation Feedback data – Electron density map, information about misfitted parts, new prior information taken from the DB Completion – Tidying up waters, multiple conformation, hydrogen bonding network. Are there heavier atoms. Etc Validation – Is model consistent with data and prior knowledge. More sophisticated analysis Early analysis – Is there similar structures, family of proteins, similarity of ligand binding etc.

Information flow What is needed by the next stage? In the absence of previous step how to generate necessary information? CCP4i works partially deals with this.

Feedback data: discuss!

Components: Cont Completion – that is at the moment not very well worked out part of the whole system. It requires careful analysis of waters, hydrogen bonding network, multiple conformation. Ligand fitting need to be done here if it has not already been done. For ligands we can use several techniques: For large ligands we can use divide and conqure. I.e. divide ligand into smaller building blocks and try to fit them first and then try to make sense of things. For small ligands we need to generate all possible conformations and try to fit them one after another and score according to experimental data as well as prior knowledge.

Components: Cont Each component should be independent. It means that they can be replaced at any time with another “better” component. User should be able to enter at any point. For example user can start from interpretation. The system will have to be able to work with minimal information. Data Base of knowledge is very important component. It needs to be worked on very carefully.