Automated Refinement (distinct from manual building) Two TERMS: E total = E data ( w data ) + E stereochemistry E data describes the difference between.

Slides:



Advertisements
Similar presentations
Determination of Protein Structure. Methods for Determining Structures X-ray crystallography – uses an X-ray diffraction pattern and electron density.
Advertisements

A brief refresher on protein structure Topic 3. Perhaps the most important structural bioinformatics result ever published… Chothia, C. & Lesk, A. M.
Rosetta Energy Function Glenn Butterfoss. Rosetta Energy Function Major Classes: 1. Low resolution: Reduced atom representation Simple energy function.
Molecular Geometry Lewis structures show the number and type of bonds between atoms in a molecule. –All atoms are drawn in the same plane (the paper).
Refinement procedure Copy your best coordinate file to “prok-native-r1.pdb”: cp yourname-coot-99.pdb prok-native-r1.pdb Start refinement phenix.refine.
Protein Secondary Structure II Lecture 2/24/2003.
Structure Outline Solve Structure Refine Structure and add all atoms
Protein Planes Bob Fraser CSCBC Overview Motivation Points to examine Results Further work.
A Brief Description of the Crystallographic Experiment
The Structure and Functions of Proteins BIO271/CS399 – Bioinformatics.
Protein-a chemical view A chain of amino acids folded in 3D Picture from on-line biology bookon-line biology book Peptide Protein backbone N / C terminal.
1 Levels of Protein Structure Primary to Quaternary Structure.
The ConQuest Interface query space perform Boolean algebra with queries Boolean algebra with hit lists 2D & 3D browser for results.
Proteins are made by linking amino acids Protein Structure Review and Refinement Introduction Brian Bahnson Dept of Chemistry & Biochemistry, University.
Protein Basics Protein function Protein structure –Primary Amino acids Linkage Protein conformation framework –Dihedral angles –Ramachandran plots Sequence.
Basics of protein structure and stability III: Anatomy of protein structure Biochem 565, Fall /29/08 Cordes.
A PEPTIDE BOND PEPTIDE BOND Polypeptides are polymers of amino acid residues linked by peptide group Peptide group is planar in nature which limits.
Proteins: Levels of Protein Structure Conformation of Peptide Group
Two parts to successful model building BUILDING TOOLS –how to use Coot –Initiate trace of protein chain (“Place helix here”) –Test sidechain assignments.
Computational Structure Prediction Kevin Drew BCH364C/391L Systems Biology/Bioinformatics 2/12/15.
Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.
Empirical energy function Summarizing some points about typical MM force field In principle, for a given new molecule, all force field parameters need.
Introduction to Macromolecular X-ray Crystallography Biochem 300 Borden Lacy Print and online resources: Introduction to Macromolecular X-ray Crystallography,
2008, Prentice Hall Chemistry: A Molecular Approach, 1 st Ed. Nivaldo Tro Roy Kennedy Massachusetts Bay Community College Wellesley Hills, MA.
What are proteins? Proteins are important; e.g. for catalyzing and regulating biochemical reactions, transporting molecules, … Linear polymer chain composed.
Protein Secondary Structure Lecture 2/19/2003. Three Dimensional Protein Structures Confirmation: Spatial arrangement of atoms that depend on bonds and.
Proteins: Secondary Structure Alpha Helix
Proteins. Proteins? What is its How does it How is its How does it How is it Where is it What are its.
Model-Building with Coot An Introduction Bernhard Lohkamp Karolinska Institute June 2009 Chicago (Paul Emsley) (University of Oxford)
02/03/10 CSCE 769 Dihedral Angles Homayoun Valafar Department of Computer Science and Engineering, USC.
Protein Planes Bob Fraser Protein Folding 882 Project November, 2006.
The ‘phase problem’ in X-ray crystallography What is ‘the problem’? How can we overcome ‘the problem’?
Molecular visualization
CS790 – BioinformaticsProtein Structure and Function1 Review of fundamental concepts  Know how electron orbitals and subshells are filled Know why atoms.
Protein Structure 1 Primary and Secondary Structure.
Phasing Today’s goal is to calculate phases (  p ) for proteinase K using PCMBS and EuCl 3 (MIRAS method). What experimental data do we need? 1) from.
EBI is an Outstation of the European Molecular Biology Laboratory. Sanchayita Sen, Ph.D. PDB Depositions Validation & Structure Quality.
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
Crystallography -- Lecture 22 Refinement and Validation.
Protein Structure and Bioinformatics. Chapter 2 What is protein structure? What are proteins made of? What forces determines protein structure? What is.
Refinement is the process of adjusting an atomic model to:
Proteins: 3D-Structure Chapter 6 (9 / 17/ 2009)
Protein backbone Biochemical view:
Forward and inverse kinematics in RNA backbone conformations By Xueyi Wang and Jack Snoeyink Department of Computer Science UNC-Chapel Hill.
Levels of Protein Structure. Why is the structure of proteins (and the other organic nutrients) important to learn?
Today: compute the experimental electron density map of proteinase K Fourier synthesis  (xyz)=  |F hkl | cos2  (hx+ky+lz -  hkl ) hkl.
Levels of Protein Structure. Why is the structure of proteins (and the other organic nutrients) important to learn?
Lecture 47: Structure II -- Proteins. Today’s Outline The monomers: amino acids – Side chain characteristics – Acid-base equilibria and pK a Peptide backbone.
Organic Chemistry Review Part II. Organic Chemistry: Carbon Atom 1. Structural Classifications 2. Atomic Theory 3. Dipoles & Resonance 4. Isomers 5. Functional.
Protein Structure BL
Computational Structure Prediction
Refinement procedure for native structure
Model Building and Refinement for CHEM 645
Reduce the need for human intervention in protein model building
Hierarchical Structure of Proteins
Lecture 5 Protein Structure.
Chapter 10 Chemical Bonding II
Protein Planes Bob Fraser CSCBC 2007.
Comparison of Exemplars of Rotamer Clusters Across the Proteinogenic Amino Acids
Chapter 10 Chemical Bonding II
Chapter 10 Chemical Bonding II
Figure: UN Title: Lewis structure. Caption: CCl4.
Goals for Today Introduce automated refinement and validation.
Protein Structure Prediction
Goals for Today Introduce automated refinement and validation.
Volume 8, Issue 12, Pages (December 2001)
Volume 8, Issue 7, Pages (July 2000)
Richard C. Page, Sanguk Kim, Timothy A. Cross  Structure 
The sequence, crystal structure determination and refinement of two crystal forms of lipase B from Candida antarctica  Jonas Uppenberg, Mogens Trier Hansen,
Note for 2019 If you get positive peaks on the sulfurs after phenix.refine, try setting all the B-factors to a constants, such as 5.00 Å2 or Å2.
Presentation transcript:

Automated Refinement (distinct from manual building) Two TERMS: E total = E data ( w data ) + E stereochemistry E data describes the difference between observed and calculated data. w data is a weight chosen to balance the gradients arising from the two terms. E stereochemistry comprises empirical information about chemical interactions between atoms in the model. It is a function of all atomic positions and includes information about both covalent and non-bonded interactions.

E data (R-factor) Move atoms to minimize the R-factor. Discrepancy between F obs and F calc. Specifically, minimize E E=  w(F obs -F calc ) 2 Over all hkl. Least squares refinement. Atoms shift toward positive density in a difference Fourier electron density map.  x  =1/V*  |F obs -F calc |e -2  i(hx-  calc ) positive negative density Radius of convergence is limited

E stereochemistry (Geometry) –BOND LENGTHS & ANGLES have standard values. Engh & Huber dictionary.  CHIRALITY of  -carbons –PLANARITY of peptide bonds and aromatic side chains –NONBONDED CONTACTS -two atoms cannot occupy the same space at the same time -TORSION ANGLE PREFERENCES side chains have preferred rotamers. –some values of  and  are forbidden. -Ramachandran. Not restrained- used for validation.

Jeopardy clue: The appearance of the atomic model when stereochemical restraints are not included in crystallographic refinement. What is spaghetti, Alex? E total =E stereochemistry + w data E data

Importance of supplementing the Data to Parameter Ratio in crystallographic refinement. PARAMETERS Each atom has 4 parameters (variables) to refine: x coordinate y coordinate z coordinate B factor In proteinase K there are approximately 2000 atoms to refine. This corresponds to 2000*4= 8000 variables. At 1.7 A resolution we have 25,000 observations. About 3 observations per variable. The reliability of the model is still questionable. Adding stereochemical restraints is equivalent to adding observations DATA At 2.5 A resolution we have 8400 observations (data points) (Fobs). When # of observations= # of variables A perfect fit can be obtained irrespective of the accuracy of the model.

2 nd Jeopardy clue: The value of the R-factor resulting when stereochemical restraints are not included in crystallographic refinement. What is zero, Alex? E total =E stereochemistry + w data E data

An atomic model should be validated by several unbiased indicators R free is an unbiased indicator of the discrepancy between the model and the data. The data used in this R-factor calculation were not used in determining atomic shifts in the refinement process. Ramachandran plot is unbiased because phi and psi torsion angles are not restrained in the refinement process. therefore

N O Asn H H O N H BACKBONE AMIDE 2.8 Å BAD

N O H H O N H BACKBONE AMIDE 2.8 Å Asn GOOD

ERRAT plot examines the geometric relationship between non-bonded atoms. Looks at the fraction of non-bonded contacts with C, N, O as a function of distance.

RuBisCo chain traced backwards Verify 3D plot –Gives an indication if the sequence has been improperly threaded through the backbones. Each of the 20 amino acid types has a characteristic (1) Surface area buried (2) fraction of side-chain area covered by polar atoms (3) local secondary structure. Verify 3D plots correlation between ideal and your model. Compatibility of a model with its sequence.

Plan for today 1)The refinement of native proteinase K is assumed to be complete thanks to ARP/wARP. 2)You will refine the structure of the proteinase K- PMSF complex using the |Fobs| data measured earlier in the course. 3)The starting model for the refinement will be the native proteinase K structure. 4)Begin 5 cycles of automated refinement. This will only move atoms. It will not add new atoms. 5)Then manually build the PMSF inhibitor into an Fo-Fc difference Fourier map. Refinement process typically iterates between automated and manual building. Automated refinement has a limited radius of convergence. For example- automated refinement cannot jump between rotamers or flip between cis and trans peptides. 6)Validate structure. Fill out Refinement Statistics table.

Difference Fourier map  x  =1/V*  |F obs -F calc |e -2  i(hx-  calc ) Here, Fobs will correspond to the ProteinaseK-PMSF complex. Fcalc will correspond to the model of Proteinase K by itself after a few cycles of automated refinement. Positive electron density will correspond to features present in the PMSF Complex that are not in the native structure. Negative electron density will correspond to features present in the native structure that should be removed in the inhibitor complex. After model building, do more automated refinement and then validate.

R R Cis vs. Trans peptide CC C O N CC peptide plane R R C O N CC CC

Cis OK with glycine or proline R H CC C O N CC peptide plane R CC C O N CC Steric hindrance equivalent for cis or trans.

Steric hindrance equivalent for cis or trans proline. R CC CN CC peptide plane O R CC CN O CC CC CC CC CC CC CC

Name _______________________ Proteinase K –PMSF complex