Protein NMR terminology COSY-Correlation spectroscopy Gives experimental details of interaction between hydrogens connected via a covalent bond NOESY-Nuclear Overhauser effect spectroscopy Gives peaks between pairs of hydrogen atoms near in space (1.5-5 Å) (and not necessarily sequence)
NH Fingerprint region Ala COSY TOCSY H H H NOE Walk along the sequence
Connectivites by NOE LEUVAL ALA
Hi-NHi+1 Hi-NHi+3
H i i+3 H i+2 NOE H N An -helix can be recognised by repeating patterns of short range nOes. A short range nOe is defined as a contact between residues less than five apart in the sequence (sequential nOes connect neighbouring residues) For an -helix we see H i -NH i+3 and H i -NH i+4 nOes. i+4
Assignment of secondary structural segments sequential stretches of residues with consistent secondary structure characteristics provide a reliable indication of the location of these structural segments
A -strand is distinguished by strong C H i -NH i+1 contacts and long range nOes connecting the strands. A long range nOe connects residues more than 5 residues apart in the chain.
A real example. The rat fatty acid acyl carrier protein. Involved in fatty acid biosynthesis and part of a larger subunit, the synthase, Is it structured by itself??
Summary of the Sequential and Secondary NOEs observed for rat FAS ACP - most definitely structured
So I have assigned the NMR spectrum and connected the amino acids. I have a good idea of the secondary structure. What next?? At this point we notice there are still many nOes we have not assigned on the 2D spectrum. These are neither sequential or short range. They are long-range and connect residues more more than 5 amino acids apart (But still close in space!). AsnGly Glu Identified as an asparagine amino- hydrogen from COSY spectra NOE indicated the asparagine amino-hydrogen is near a glutamate acidic hydrogen
Schematic showing long range nOes in the lac headpiece protein
What next? STRUCTURE CALCULATIONS From NOE I know close atom-atom distances, but that doesnt give a structure The information you have up to this stage is a list of distance constraints The structure can be determined by inputting this information to computer minimization software. The computer program also contains information about amino acids, bond lengths/angles and standard information about atom-atom interactions such as minimum distance (i.e. Van der Waals radii) With all this information you can generate a model of the structure. Important: NMR gives you a number of possible solutions (all almost identical, rmsd <1Å), This can range from 5-20 models X-ray crystallography give one average structure NMR structures can be averaged to give one average structure as well
Excerpt from an NOE table for Actinorhodin Polyketide ACP This file contained ~ 700 lines of nOe restraints
The simulated annealing protocol - begin by simulating a 1000K heat bath and generate an extended model strand Apply the distance restraints from the NOE data (perhaps 1000 restraints for a protein of 90 amino acids). Weight the nOes to favour the formation of local secondary structure and later long range structure. Allow chain to move through itself Start 30 ps Start to cool the system and increase the penalty for bad contacts. Minimize the final structure to see if it satisfies all the nOes 20 ps
A simulated annealing trajectory over the first few picoseconds 4 helices begin to condense
Unfolded Correctly folded
Challenges for Interpreting 3D Structures To correctly represent a structure (not a model), the uncertainty in each atomic coordinate must be shown Polypeptides are dynamic and therefore occupy more than one conformation –Which is the biologically relevant one?
Representation of Structure Conformational Ensemble Uncertainty RMSD of the ensemble Neither crystal nor solution structures can be properly represented by a single conformation Intrinsic motions Imperfect data
Representations of 3D Structures C N
These 2D methods work for proteins up to about 100 amino acids, and even here, anything from amino acids is difficult. We need to reduce the complexity of these 2D spectra. We can start by replacing 14 N with 15 N, a spin 1/2 nucleus.
Run a COSY type experiment that correlates an amide proton with the 15 N nuclei. This is a heteronuclear experiment, I.e. we are looking at two different nuclei, a 1 H and a 15 N nucleus. The COSY type experiment is beyond the scope of these lectures but is known as HSQC, or heteronuclear single quantum coherence spectroscopy. This refers to how the magnetisation is transferred from the 1 H to the 15 N. So how well dispersed are the 15 N shifts? Is it worth trying to separate our spectra out based on their differences?
1 H- 15 N HSQC of rat FAS ACP
Why? The more we understand about a protein and its function, the more we can do with it. It can be used for a new specific purpose or even be redesigned too carry out new useful functions (biotechnology & industry). We can use this knowledge to help understand the basis of diseases and to design new drugs (medicine & drug design). The more knowledge we have how proteins behave in general, the more we can apply it to others (protein families etc)
A case study - Leukocyte function associated protein-1 (LFA-1) This protein is involved in tethering a leukocyte to a endothelium, allowing migration through the tissue to a site of inflammation. One domain of LFA-1, the I-domain is 181 amino acids and undergoes a conformational change where helix 7 slides down the protein, switching it into an active open form. This open form is competent for cell surface binding. If we can stop this switch, we may have an anti-inflammatory mechanism Inflammation (chronic) is responsible for asthma and arthritis.
Developed small molecule inhibitors and test binding
Weak binding M to mM see a migration of the peaks
A more successful inhibitor- nM tight binding. See unbound and bound populations
Solve NMR structure of complex… Helix 7 is prevented from shifting
NMR is a diverse tool with which we can study protein structure. It gives us information in solution under physiological conditions 2D and 3D techniques combined with modern assignment methods have allowed proteins up to 40 kDa to be solved. The power of NMR lies not just with its ability to solve structures but also its ability to probe binding of ligands and partner proteins in real time. Many aspects we have not had time to deal with. NMR reveals how proteins move in solution - can see domains flexing with different timescale motions. These often correlate with binding patches on the protein.
Textbook I recommend reading. J Evans - Biomolecular NMR Spectroscopy. Chapter 4. Protein Structure, pages After p174 numerous examples of NMR structures, labelling etc. Chapter 2. More high level NMR approach - description of how pulse sequences (I.e. COSY, TOCSY, HNCA etc) work. Beyond the scope of the course but may be of interest. Chapter 3. Details of calculations - for you details not important but will give you more of an idea of how we use the NMR data to calculate the structure.