Not retired: Hall symbols + CIF or How Syd influenced my life without me noticing it. Ralf W. Grosse-Kunstleve Computational Crystallography Initiative.

Slides:



Advertisements
Similar presentations
© S.J. Coles 2006 Usability WS, NeSC Jan 06 Enabling the reusability of scientific data: Experiences with designing an open access infrastructure for sharing.
Advertisements

Continuous improvement of macromolecular crystal structures Tom Terwilliger (Los Alamos National Laboratory) DDD WG member ECM 2012: Diffraction Data Deposition.
CCP4 Molecular Graphics (CCP4MG)
Phasing Goal is to calculate phases using isomorphous and anomalous differences from PCMBS and GdCl3 derivatives --MIRAS. How many phasing triangles will.
Introduction to protein x-ray crystallography. Electromagnetic waves E- electromagnetic field strength A- amplitude  - angular velocity - frequency.
Changing methods of data sharing in crystallography Professor John R Helliwell Imperial College, June 28th, 2006 The University of Manchester
Introduction to Training and Learning in Neural Networks n CS/PY 399 Lab Presentation # 4 n February 1, 2001 n Mount Union College.
Recent developments 1) Tests (outlier analysis) and Bug fixing ( with Paul) 2) Regeneration of Values of Bonds and Bond-angles existing all structures.
Reusing phenix.refine for powder data? Ralf W. Grosse-Kunstleve Computational Crystallography Initiative Lawrence Berkeley National Laboratory Workshop.
Small Molecule Example – YLID Unit Cell Contents and Z Value
C van Ingen, D Agarwal, M Goode, J Gupchup, J Hunt, R Leonardson, M Rodriguez, N Li Berkeley Water Center John Hopkins University Lawrence Berkeley Laboratory.
Structure Outline Solve Structure Refine Structure and add all atoms
The MEMOPS Programming Framework Wayne Boucher, Cambridge
A Brief Description of the Crystallographic Experiment
Refinement of Macromolecular structures using REFMAC5 Garib N Murshudov York Structural Laboratory Chemistry Department University of York.
The TEXTAL System: Automated Model-Building Using Pattern Recognition Techniques Dr. Thomas R. Ioerger Department of Computer Science Texas A&M University.
The TEXTAL System for Automated Model Building Thomas R. Ioerger Texas A&M University.
Open Statistics: Envisioning a Statistical Knowledge Network Ben Shneiderman Founding Director ( ), Human-Computer Interaction.
3. Crystals What defines a crystal? Atoms, lattice points, symmetry, space groups Diffraction B-factors R-factors Resolution Refinement Modeling!
Current Status and Future Directions for TEXTAL March 2, 2003 The TEXTAL Group at Texas A&M: Thomas R. Ioerger James C. Sacchettini Tod Romo Kreshna Gopal.
TEXTAL - Automated Crystallographic Protein Structure Determination Using Pattern Recognition Principal Investigators: Thomas Ioerger (Dept. Computer Science)
Don't fffear the buccaneer Kevin Cowtan, York. ● Map simulation ⇨ A tool for building robust statistical methods ● 'Pirate' ⇨ A new statistical phase improvement.
Automated Model-Building with TEXTAL Thomas R. Ioerger Department of Computer Science Texas A&M University.
The Crystallographic Refinement of TM1389- A methyl-transferase from Thermotoga maritima Rosanne Joseph SLAC Summer Intern Joint Center for Structural.
TEXTAL: A System for Automated Model Building Based on Pattern Recognition Thomas R. Ioerger Department of Computer Science Texas A&M University.
The P HENIX project Crystallographic software for automated structure determination Computational Crystallography Initiative (LBNL) -Paul Adams, Ralf Grosse-Kunstleve,
Recommendations and Questions wwPDB/CCDC/D3R Ligand Validation Workshop Center for Integrative Proteomics Research, Rutgers 7/30-31/2015 Group D, Academic.
Metadata Creation with the Earth System Modeling Framework Ryan O’Kuinghttons – NESII/CIRES/NOAA Kathy Saint – NESII/CSG July 22, 2014.
Cab55342 Autobuild model Density-modified map Autobuilding starting with morphed model.
Model-Building with Coot An Introduction Bernhard Lohkamp Karolinska Institute June 2009 Chicago (Paul Emsley) (University of Oxford)
Increasing the Value of Crystallographic Databases Derived knowledge bases Knowledge-based applications programs Data mining tools for protein-ligand complexes.
First Aid & Pathology Data quality assessment in PHENIX
BALBES (Current working name) A. Vagin, F. Long, J. Foadi, A. Lebedev G. Murshudov Chemistry Department, University of York.
Data quality and model parameterisation Martyn Winn CCP4, Daresbury Laboratory, U.K. Prague, April 2009.
Coot Tools for Model Building and Validation
Module 3: Creating Maps. Overview Lesson 1: Creating a BizTalk Map Lesson 2: Configuring Basic Functoids Lesson 3: Configuring Advanced Functoids.
Chem Patterson Methods In 1935, Patterson showed that the unknown phase information in the equation for electron density:  (xyz) = 1/V ∑ h ∑ k.
Crystallographic Databases I590 Spring 2005 Based in part on slides from John C. Huffman.
Crank and Databases Steven Ness Leiden University The Netherlands.
R. Keegan 1, J. Bibby 3, C. Ballard 1, E. Krissinel 1, D. Waterman 1, A. Lebedev 1, M. Winn 2, D. Rigden 3 1 Research Complex at Harwell, STFC Rutherford.
Phasing Today’s goal is to calculate phases (  p ) for proteinase K using PCMBS and EuCl 3 (MIRAS method). What experimental data do we need? 1) from.
17 th October 2005CCP4 Database Meeting (York) CCP4(i)/BIOXHIT Database Project: Scope, Aims, Plans, Status and all that jazz Peter Briggs, Wanjuan Yang.
Data Integration and Management A PDB Perspective.
Data Harvesting: automatic extraction of information necessary for the deposition of structures from protein crystallography Martyn Winn CCP4, Daresbury.
Siena Computational Crystallography School 2005
Project Database Handler The Project Database Handler is a brokering application that mediates interactions between the project database and the external.
Computational Crystallography InitiativePhysical Biosciences Division Exploring Symmetry, Outlier Detection & Twinning update Peter Zwart.
1 COMPUTER SCIENCE DEPARTMENT COLORADO STATE UNIVERSITY 1/9/2008 SAXS Software.
Dictionary based interchanges for iSURF -An Interoperability Service Utility for Collaborative Supply Chain Planning across Multiple Domains David Webber.
3. Spot Finding 7(i). 2D Integration 2. Image Handling 7(ii). 3D Integration 4. Indexing 8. Results 1. Introduction5. Refinement Background mask and plane.
Computational Crystallography InitiativePhysical Biosciences Division First Aid & Pathology Data quality assessment in PHENIX Peter Zwart.
Direct Use of Phase Information in Refmac Abingdon, University of Leiden P. Skubák.
Atomic structure model
Towards a Structural Biology Work Bench Chris Morris, STFC.
Software automation – What STAB sees as key aims? 1.Brief review of activities and recommendations (so far) 2.Reality checks 3. Things to do…
EMBL-EBI Data Archives – An Overview. The EMBL-EBI mission Provide freely available data and bioinformatics services to all facets of the scientific community.
Interpreting difference Patterson Maps in Lab this week! Calculate an isomorphous difference Patterson Map (native-heavy atom) for each derivative data.
Bethesda, March 4 th 2009 Semi-automatic structure solution with HKL-3000 Structural Biology.
Zach Miller Computer Sciences Department University of Wisconsin-Madison Supporting the Computation Needs.
Cloud-Based Visualization of Value-Added Model Annotations Using Jmol Bob Hanson St. Olaf College, Northfield, MN
Crystallography images
CCP4 6.1 and beyond: Tools for Macromolecular Crystallography
CSc4730/6730 Scientific Visualization
Basic procedure for MD simulations
Ton Spek Utrecht University The Netherlands Vienna –ECM
Presentation transcript:

Not retired: Hall symbols + CIF or How Syd influenced my life without me noticing it. Ralf W. Grosse-Kunstleve Computational Crystallography Initiative Lawrence Berkeley National Laboratory White-Hall Retirement Symposium, July 16/17, 2007

My connections to Syd PhD project: Zeolite structure determination from powder data using extracted intensities –focus –sginfo: Hall symbols Contributions to Xplor/CNS –Joined Axel Brunger’s group encouraged by Syd (ECM 1995) –Single-crystal protein crystallography –About 80% of all PDB entries refined with Xplor/CNS –CNS PDB deposition via mmCIF files Phenix project –Automation of protein structure determination –Fresh start after losing a legal battle –The P in Phenix is for Python: recommended by Syd

Concise space group symbols Symbols for crystallographic space groups Designed to overcome limitations of the familiar Hermann- Mauguin symbols (e.g. P ) H-M symbols are defined in International Tables for Crystallography –Created by a generation of scientist that didn’t have computers Define the space group type uniquely But not the exact setting: inadequate for automatic processing Attempts to add a rule set to H-M symbols leads to complicated algorithms (never standardized) H-M symbols cover only a very limited subset of settings that appear, e.g. in the generation of group-subgroup relations

Hall (1981) symbols Symbols for crystallographic space groups Designed to overcome limitations of the familiar Hermann- Mauguin symbols (e.g. P ) H-M symbols are defined in International Tables for Crystallography –Created by a generation of scientist that didn’t have computers Define the space group type uniquely But not the exact setting: inadequate for automatic processing Attempts to add a rule set to H-M symbols leads to complicated algorithms (never standardized) H-M symbols cover only a very limited subset of settings that appear, e.g. in the generation of group-subgroup relations

Hall symbols Designed for automatic processing –No ambiguities Via attached transformation symbols, any setting of any crystallographic space group can be represented (Int. Tab. Vol. B, 2001) Applications –Determination of space group type –Automatic determination of allowed origin shifts –Automatic group-subgroup processing –Automatic derivation of twin laws –Primitive setting of centered space groups Reduces memory requirements

Translation table Hall Hermann-Mauguin /* 081 */ " P -4", /* P -4 */ /* 082 */ " I -4", /* I -4 */ /* 083 */ "-P 4", /* P 4/m */ /* 084 */ "-P 4c", /* P 42/m */ /* 085 */ "-P 4a", /* P 4/n :2 */ /* 086 */ "-P 4bc", /* P 42/n :2 */ /* 087 */ "-I 4", /* I 4/m */ /* 088 */ "-I 4ad", /* I 41/a :2 */ /* 089 */ " P 4 2", /* P */ /* 090 */ " P 4ab 2ab", /* P */ /* 091 */ " P 4w 2c", /* P */

STAR + CIF Situation before CIF –Vast, diverse variety of data formats –Need to reformat data all the time is a real impediment to scientific progress –What does “gof” mean? –What does it mean exactly? STAR: Self-defining Text Archival and Retrieval format Hall, S. R., "The STAR File: A New Format for Electronic Data Transfer and Archiving," J. Chem. Inf. Comput. Sci. 31, (1991). –Defines format –Framework for defining semantics CIF (1991) –Based on STAR, defines semantics –Similar in concept to XML schema, but a decade ahead –CIF is de-facto standard in small molecule crystallography –Macromoleclar community has more difficulties with the semantics part of CIF

Python Situation: forced end of CNS development –Legal reasons Axel Brunger wanted Paul Adams and me to continue methods development in a different way We started exploring the world around us –We wanted a scripting language like CNS with additional compiled components IUCr meeting Glasgow 1999 –Watching solar eclipse with Syd –BTW: use Python

The PHENIX project Funding: NIH Program Project (NIGMS, PSI), Director - Paul Adams A collaboration between several groups CCI APPS SOLVE / RESOLVE PHASER TEXTAL MolProbity / REDUCE Computational Crystallography Initiative (LBNL) -Paul Adams, Ralf Grosse-Kunstleve, Pavel Afonine -Nigel Moriarty, Nicholas Sauter, Peter Zwart Los Alamos National Lab (LANL) -Tom Terwilliger, Li-Wei Hung Cambridge University -Randy Read, Airlie McCoy Texas A&M University -Tom Ioerger, Jim Sacchettini, Erik McKee Duke University - Jane Richardson, David Richardson, Ian Davis

Spectrum of phenix components Automated analysis of data quality: phenix.xtriage Rapid substructure determination: phenix.hyss Phasing: Maximum likelihood – SOLVE, PHASER for SAD Density modification: Statistical density modification (RESOLVE) Automated model building: –Pattern matching methods (RESOLVE or TEXTAL) Structure refinement: phenix.refine (likelihood, annealing, TLS) Advanced automation: AutoSol – hkl to map Ligand building and fitting: eLBOW, AutoLigand Validation and Hydrogens: MolProbity + Reduce

phenix.refine - Group ADP refinement - Rigid body refinement - Restrained refinement (xyz, iso/aniso ADP) - Automatic water picking - Bond density - Unrestrained refinement - FFT or direct summation - Hydrogens - Automatic NCS restraints - Simulated Annealing - Occupancies (individual, group) - TLS refinement - Twinned data - X-ray, Neutron, joint X-ray + Neutron refinement

Refinement flowchart Input data and model processing Refinement strategy selection Bulk-solvent, Anisotropic scaling, Twinning parameters refinement Ordered solvent (add / remove) Target weights calculation Coordinate refinement (rigid body, individual) (minimization or Simulated Annealing) ADP refinement (TLS, group, individual iso / aniso) Occupancy refinement (individual, group) Output: Refined model, various maps, structure factors, complete statistics PDB model, Any data format (CNS, Shelx, MTZ, …) Files for COOT, O, PyMol Repeated several times

Summary Hall symbols open the door to automatic processing of crystallographic symmetry STAR + CIF enable automation of data flow between researchers and archives Our experience with Python suggests: it is a good idea to listen to Syd! Syd’s wisdom! … Gnu Xtal System, available as an open source project, … but with the expectation that Sourceforge may be its twilight resting place, afore it's entombed by the sands of time, right alongside The King of Kings.

Acknowledgments Syd –for shining the light in the right direction –for showing me Perth Phenix developers –P.D. Adams –P. Afonine –T.R. Ioerger –A.J. McCoy –E.W. McKee –N.W. Moriarty –R.J. Read –N.K. Sauter –J.N. Smith –L.C. Storoni –T.C. Terwilliger –P.H. Zwart Funding: –LBNL (DE-AC03-76SF00098) –NIH/NIGMS (1P01GM063210) –P HENIX Industrial Consortium