Principles of protein structure and stability.


A peptide bond is formed between two amino acids.

Backbone conformation is described by φ and ψ angles. Picture from T. Przytycka, 2002

Hierarchy of protein structure: amino acid sequence (primary structure), secondary structure, tertiary structure, quaternary structure. Picture from Branden & Tooze “Introduction to protein structure”

Right-handed alpha-helix. The helix is stabilized by hydrogen bonds (HB) between the backbone –NH group of residue i+4 and the backbone carbonyl oxygen of residue i. Geometrical characteristics: 3.6 residues per turn; translation of 5.4 Å per turn; translation of 1.5 Å per residue.
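
The three helix parameters quoted on this slide are not independent: the rise per residue follows from the pitch and the number of residues per turn. A minimal sketch of that arithmetic is given here; the function names are illustrative and not part of the original slides.

```python
# Helix geometry as quoted on the slide.
RESIDUES_PER_TURN = 3.6
PITCH_ANGSTROM = 5.4  # translation along the helix axis per full turn


def rise_per_residue() -> float:
    """Translation along the axis per residue, in Å."""
    return PITCH_ANGSTROM / RESIDUES_PER_TURN


def rotation_per_residue() -> float:
    """Rotation about the axis per residue, in degrees."""
    return 360.0 / RESIDUES_PER_TURN


def helix_length(n_residues: int) -> float:
    """Approximate axial length of an ideal helix of n residues, in Å."""
    return n_residues * rise_per_residue()


if __name__ == "__main__":
    print(f"rise per residue: {rise_per_residue():.2f} Å")
    print(f"rotation per residue: {rotation_per_residue():.0f} degrees")
    print(f"20-residue helix: {helix_length(20):.1f} Å")
```

The same relation can be used to estimate how many residues an ideal helix needs to span a given distance.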

β-strand and β-sheet.

Loop regions are at the surface of protein molecules. Adjacent antiparallel β-strands are joined by hairpin loops. Loops are more flexible than helices and strands. Loops can carry functionally important sites, such as binding and active sites. (Branden & Tooze “Introduction to protein structure”)

Protein classification based on the secondary structure content. Class α: proteins with only α-helices. Class β: proteins with only β-sheets. Class α+β: proteins with both α-helices and β-sheets.

Protein stability. Anfinsen’s experiments:

Native proteins have low stability… Scale of interactions in proteins:
- Interactions weaker than kT ≈ 0.6 kcal/mol are neglected.
- Interactions stronger than ΔG = 10 kcal/mol are too large.
Potential energy = Van der Waals + Electrostatic + Hydrophobic.
[Figure: free energy G versus reaction coordinate, showing the unfolded (U) and folded (F) states separated by ΔG.]
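
The energy scale on this slide can be checked with a few lines of arithmetic. The sketch below computes kT per mole (RT) at room temperature and, assuming a simple two-state folded/unfolded equilibrium (an assumption made here for illustration, not a statement from the slides), the fraction of folded molecules for a given unfolding free energy. The function names and the default temperature are illustrative.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol*K)


def thermal_energy(temp_k: float = 298.15) -> float:
    """Thermal energy RT (kT per mole) in kcal/mol."""
    return R_KCAL * temp_k


def fraction_folded(delta_g_unfold: float, temp_k: float = 298.15) -> float:
    """Fraction of folded molecules in a two-state model.

    delta_g_unfold = G(unfolded) - G(folded) in kcal/mol,
    positive when the folded state is the more stable one.
    """
    ratio_unfolded_to_folded = math.exp(-delta_g_unfold / thermal_energy(temp_k))
    return 1.0 / (1.0 + ratio_unfolded_to_folded)


if __name__ == "__main__":
    print(f"RT at 298.15 K: {thermal_energy():.2f} kcal/mol")
    for dg in (0.0, 1.0, 5.0, 10.0):
        print(f"dG = {dg:4.1f} kcal/mol -> fraction folded {fraction_folded(dg):.6f}")
```

In this model a stability of only a few kcal/mol, several times kT, already keeps nearly all molecules folded.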

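Electrostatic force. Coulomb’s law for two point charges at a distance d: $E = \frac{q_1 q_2}{\varepsilon d}$, where q is a point charge and ε is the dielectric constant (ε = 1 in a vacuum, ε = 2-3 inside the protein, ε = 80 in water). Example: Na+ and Cl- in a vacuum at d = 2.76 Å, E = 120 kcal/mol in magnitude (the interaction is attractive).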
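
The Na+/Cl- figure on this slide can be reproduced from Coulomb's law, E = q1·q2/(ε·d). The sketch below uses the conversion factor of about 332 kcal·Å/(mol·e²), which turns charges in units of the electron charge and distances in Å into kcal/mol; the function name is illustrative.

```python
COULOMB_KCAL = 332.06  # kcal*Å/(mol*e^2): conversion factor for charges in e, distances in Å


def coulomb_energy(q1: float, q2: float, distance_a: float, dielectric: float = 1.0) -> float:
    """Coulomb interaction energy in kcal/mol for two point charges."""
    return COULOMB_KCAL * q1 * q2 / (dielectric * distance_a)


if __name__ == "__main__":
    d = 2.76  # Na+ ... Cl- distance from the slide, in Å
    for label, eps in (("vacuum", 1.0), ("protein interior", 3.0), ("water", 80.0)):
        energy = coulomb_energy(+1.0, -1.0, d, eps)
        print(f"{label:17s} eps = {eps:4.0f}: E = {energy:8.2f} kcal/mol")
```

The same ion pair that is worth about 120 kcal/mol in a vacuum is worth only about 1.5 kcal/mol at ε = 80, which is why the dielectric environment matters so much.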

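Dipolar interactions. Dipole moment: $\vec{\mu} = q\,\vec{d}$ for charges +q and -q separated by $\vec{d}$. Interaction energy of two dipoles separated by the vector r: $E = \frac{\vec{\mu}_1 \cdot \vec{\mu}_2 - 3(\vec{\mu}_1 \cdot \hat{r})(\vec{\mu}_2 \cdot \hat{r})}{\varepsilon r^3}$. Partial charges in the peptide group: O -0.42, C +0.42, N -0.20, H +0.20. Peptide bond: μ = 3.5 D; water molecule: μ = 1.85 D.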

Van der Waals interactions. Lennard-Jones potential: repulsion at short distances; attraction at larger distances from the London dispersion energy between transient dipoles (δ+ δ-). [Plot: E (kcal/mol), from -0.2 to 0.2, versus distance between centers of atoms, from 2 to 12.]
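
A minimal sketch of a Lennard-Jones curve, assuming the common 12-6 form E(r) = ε[(r0/r)^12 - 2(r0/r)^6], where r0 is the distance at the energy minimum and ε the well depth. The well depth of 0.2 kcal/mol is chosen to match the energy axis on the slide; r0 = 4.0 Å is a hypothetical value picked only for illustration.

```python
def lennard_jones(r: float, well_depth: float = 0.2, r_min: float = 4.0) -> float:
    """12-6 Lennard-Jones energy in kcal/mol.

    well_depth and r_min are illustrative values, not parameters from the slides.
    The r**-12 term models repulsion, the r**-6 term London dispersion attraction.
    """
    if r <= 0:
        raise ValueError("distance must be positive")
    ratio6 = (r_min / r) ** 6
    return well_depth * (ratio6 * ratio6 - 2.0 * ratio6)


if __name__ == "__main__":
    for r in (3.0, 3.5, 4.0, 5.0, 6.0, 8.0, 10.0):
        print(f"r = {r:4.1f} Å  E = {lennard_jones(r):8.4f} kcal/mol")
```

The energy is strongly positive below r0, reaches its minimum of -ε at r0, and decays toward zero as the atoms separate.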

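Hydrogen bonds. A donor group (D), such as backbone N-H with δ+ on H, binds an acceptor group (A), such as carbonyl O=C with δ- on O: N-H····O=C, with a donor-acceptor distance of about 3 Å. [Diagram: donor D and acceptor A exchanging hydrogen bonds with water, HOH····OHH.]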

Hydrogen bonding patterns in globular proteins. 1. Most HB are local, close in sequence. 2. Most HB are between backbone atoms. 3. Most HB are within single elements of secondary structure. 4. Proteins are almost equally saturated by HB: 0.75 HB per amino acid.

Disulfide bonds. Exchange with glutathione: PROTEIN(SH)2 + GS-SG ⇌ PROTEIN(SH)(S-SG) + GSH ⇌ PROTEIN(S-S) + 2GSH.
- Breakdown and formation of S-S bonds are catalyzed by disulfide isomerase.
- In the cell S-S bonds are reversible; the free-energy difference between the reduced and oxidized forms is close to zero.
- Secreted proteins have many S-S bonds, since outside the cell the equilibrium is shifted towards their formation.

Hydrophobic effect. Hydrophobic interaction: the tendency of nonpolar compounds to transfer from an aqueous solution to an organic phase. The entropy of water molecules decreases when they contact a nonpolar surface, so the free energy increases. As a result, upon folding nonpolar amino acids (AA) are buried inside the protein, while polar and charged AA stay outside. [Diagram: water molecules.]

Hydrophobicities of amino acids.

Cooperativity of protein interactions. Protein denaturation is a first-order (“all-or-none”) transition. As T increases: 1. Globule expansion, loose packing. 2. As expansion crosses the barrier, liberation of side chains and increase in entropy. [Plot: energy distribution W(E) at temperatures T1, T’, T2.]

Summary: Hydrophobic effect is mostly responsible for making a compact globule. Final specific tertiary structure is formed by van der Waals interactions, HB, disulfide bonds. Secret of stability of native structures is not in the magnitude of the interactions but in their cooperativity.

Classwork I: CN3D viewer. Go to http://ncbi.nlm.nih.gov. Select an alpha-helical protein (hemoglobin). Select a beta-stranded protein (immunoglobulin). Select the multidomain protein 1I50, chain “A”. View them in CN3D.

PDB databank. The archive of protein crystal structures was established in 1971 with several structures; in 2002 it held 17000 structures, including NMR structures. Data processing: data deposition, annotation and validation. PDB code: nXYZ, where n is an integer and X, Y, Z are characters.
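
The nXYZ code format can be checked with a short pattern. The sketch below assumes that n is a single digit and that X, Y, Z may each be a letter or a digit (as in the code 1I50 used in the classwork); the function name is illustrative.

```python
import re

# Assumed format: one digit followed by three letters or digits, e.g. "1I50".
PDB_CODE_PATTERN = re.compile(r"[0-9][A-Za-z0-9]{3}")


def is_pdb_code(code: str) -> bool:
    """Return True if the string has the four-character nXYZ shape."""
    return PDB_CODE_PATTERN.fullmatch(code) is not None


if __name__ == "__main__":
    for candidate in ("1I50", "4hhb", "ABCD", "1I501"):
        print(candidate, is_pdb_code(candidate))
```

This only checks the shape of the code; it does not tell whether a structure with that code exists in the archive.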

Content of Data in the PDB.
- Organism, species name
- Full protein sequence
- Chemical structure of cofactors and prosthetic groups
- Names of all components of the structure
- Qualitative description of the structural characteristics
- Literature citations
- Three-dimensional coordinates

Protein secondary structure prediction. Assumptions: There should be a correlation between amino acid sequence and secondary structure (SS). A short amino acid sequence is more likely to form one type of SS than another. Local interactions determine SS: the SS of a residue is determined by its neighbors (usually a sequence window of 13-17 residues is used). Exceptions: short identical amino acid sequences can sometimes be found in different SS. Accuracy: 65% - 75%; the highest accuracy is for prediction of an α helix.
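
Accuracy figures such as the 65% - 75% on this slide are commonly reported as the fraction of residues whose predicted state matches the observed one. A minimal sketch of that per-residue score follows; the one-letter state codes (H = helix, E = strand, C = coil) and the example strings are assumptions made for illustration.

```python
def per_residue_accuracy(predicted: str, observed: str) -> float:
    """Fraction of positions where predicted and observed SS states agree."""
    if len(predicted) != len(observed):
        raise ValueError("predicted and observed strings must have equal length")
    if not observed:
        raise ValueError("empty input")
    matches = sum(p == o for p, o in zip(predicted, observed))
    return matches / len(observed)


if __name__ == "__main__":
    # Hypothetical example strings: H = helix, E = strand, C = coil.
    predicted = "HHHHCCEEEE"
    observed = "HHHCCCEEEC"
    print(f"accuracy: {per_residue_accuracy(predicted, observed):.0%}")
```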

Methods of SS prediction: Chou-Fasman method; GOR (Garnier, Osguthorpe and Robson) method; neural network method.

Chou-Fasman method. Analysis of the frequencies with which each amino acid occurs in different types of SS. Ala, Glu, Leu and Met are strong predictors of alpha-helices; Pro and Gly tend to break the helix.
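
The idea of scanning a sequence for stretches rich in helix formers can be sketched as follows. This is a toy illustration only: the scores of +1 for Ala, Glu, Leu, Met and -1 for Pro, Gly are placeholders, not the published Chou-Fasman propensities, and the window size and threshold are hypothetical.

```python
HELIX_FORMERS = set("AELM")   # strong helix predictors named on the slide
HELIX_BREAKERS = set("PG")    # helix breakers named on the slide


def residue_score(aa: str) -> int:
    """Toy score: +1 for a helix former, -1 for a breaker, 0 otherwise."""
    if aa in HELIX_FORMERS:
        return 1
    if aa in HELIX_BREAKERS:
        return -1
    return 0


def helix_windows(sequence: str, window: int = 6, threshold: int = 4) -> list[int]:
    """Start indices of windows whose summed toy score reaches the threshold."""
    seq = sequence.upper()
    hits = []
    for start in range(len(seq) - window + 1):
        score = sum(residue_score(aa) for aa in seq[start:start + window])
        if score >= threshold:
            hits.append(start)
    return hits


if __name__ == "__main__":
    # Hypothetical sequence: a helix-favoring stretch followed by Pro/Gly.
    print(helix_windows("AELMAEPGPG"))
```

A real implementation would use per-residue propensities derived from known structures and additional rules for extending and terminating the predicted helix.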

GOR method. Assumption: formation of the SS of an amino acid is determined by the neighboring residues (usually a window of 17 residues is used). GOR uses principles of information theory for predictions. The method maximizes the information difference between two competing hypotheses: that residue “a” is in structure “S”, and that “a” is not in structure “S”.
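
For a single residue, the information difference between the two hypotheses can be written as ln[P(S|a)/P(not S|a)] - ln[P(S)/P(not S)]. The sketch below estimates it from counts; the counts in the example are hypothetical, and the full GOR method sums such terms over the whole sequence window rather than using one residue.

```python
import math


def information_difference(n_a_in_s: int, n_a: int, n_s: int, n_total: int) -> float:
    """Information (in nats) favoring 'residue a is in S' over 'a is not in S'.

    n_a_in_s: occurrences of residue type a observed in structure S
    n_a:      all occurrences of residue type a
    n_s:      all residues observed in structure S
    n_total:  all residues in the data set
    """
    if not (0 < n_a_in_s < n_a) or not (0 < n_s < n_total):
        raise ValueError("counts must give probabilities strictly between 0 and 1")
    p_s_given_a = n_a_in_s / n_a
    p_s = n_s / n_total
    return math.log(p_s_given_a / (1.0 - p_s_given_a)) - math.log(p_s / (1.0 - p_s))


if __name__ == "__main__":
    # Hypothetical counts: residue a occurs 100 times, 50 of them in helices;
    # helices make up 300 of 1000 residues overall.
    print(f"{information_difference(50, 100, 300, 1000):.3f} nats")
```

A positive value means that seeing residue a makes structure S more likely than its background frequency suggests; a negative value means the opposite.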

Neural network method. [Diagram: a window of the input sequence (e.g. L A W P G E V S T Y) is fed to the input layer (units Sj); the input layer is connected through weights Wij to a hidden layer (units Hj) and then to the output layer (units Oi), which gives the predicted SS: α, β or coil.]

PHD – a neural network program that uses multiple sequence alignments. A BLAST search with the input sequence is performed and similar sequences are collected. A multiple alignment of the similar sequences is used as input to the neural network. The sequence pattern is more pronounced in a multiple alignment than when a single sequence is used as input.
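
One way to see why an alignment carries more signal than a single sequence is to turn it into a per-column frequency profile. The sketch below does that for a toy alignment; it illustrates the general idea of a profile and is not a description of PHD's actual input encoding. The sequences and the gap character "-" are assumptions.

```python
from collections import Counter


def alignment_profile(aligned: list[str]) -> list[dict[str, float]]:
    """Per-column residue frequencies of a multiple alignment (gaps ignored)."""
    if not aligned:
        raise ValueError("alignment is empty")
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("all aligned sequences must have the same length")
    profile = []
    for column in zip(*aligned):
        residues = [aa for aa in column if aa != "-"]
        counts = Counter(residues)
        total = len(residues)
        profile.append({aa: n / total for aa, n in counts.items()} if total else {})
    return profile


if __name__ == "__main__":
    # Hypothetical three-sequence alignment.
    for position, freqs in enumerate(alignment_profile(["LAWP", "LAFP", "LS-P"])):
        print(position, freqs)
```

Conserved columns give a single dominant residue, while variable columns show which substitutions are tolerated at that position.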

Classwork. Go to http://ncbi.nlm.nih.gov, search for protein “flavodoxin” in Entrez, retrieve its amino acid sequence. Go to http://cubic.bioc.columbia.edu/predictprotein and run PHD on the sequence.

Definition of protein domains.
- Geometry: a group of residues with high contact density; the number of contacts within domains is higher than the number of contacts between domains. Domains can be chain-continuous or chain-discontinuous.
- Kinetics: domain as an independently folding unit.
- Physics: domain as a rigid body linked to other domains by flexible linkers.
- Genetics: minimal fragment of a gene that is capable of performing a specific function.
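
The geometric definition can be made concrete by counting residue contacts within and between proposed domains. The sketch below assumes one coordinate per residue and a distance cutoff of 8 Å; both the cutoff and the toy coordinates are hypothetical choices for illustration.

```python
import math
from itertools import combinations


def count_contacts(coords, domains, cutoff=8.0):
    """Count residue pairs within `cutoff` Å, split by domain membership.

    coords:  list of (x, y, z) positions, one per residue
    domains: list of domain labels, one per residue
    Returns (contacts within domains, contacts between domains).
    """
    if len(coords) != len(domains):
        raise ValueError("need one domain label per residue")
    intra = inter = 0
    for i, j in combinations(range(len(coords)), 2):
        if math.dist(coords[i], coords[j]) <= cutoff:
            if domains[i] == domains[j]:
                intra += 1
            else:
                inter += 1
    return intra, inter


if __name__ == "__main__":
    # Toy example: two compact clusters of three residues each.
    coords = [(0, 0, 0), (3, 0, 0), (0, 3, 0),
              (10.5, 0, 0), (13.5, 0, 0), (10.5, 3, 0)]
    domains = ["A", "A", "A", "B", "B", "B"]
    print(count_contacts(coords, domains))
```

A good domain assignment is one for which the first number is much larger than the second.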

Domains as recurrent units of proteins. The same or similar domains are found in different proteins. Each domain performs a specific function. Proteins evolve through domain duplication and domain shuffling. The total number of different types of domains is small (~1000 – 3000).

The Conserved Domain Architecture Retrieval Tool (CDART). Performs similarity searches of the NCBI Entrez Protein Database based on domain architecture, defined as the sequential order of conserved domains in proteins. The algorithm finds protein similarities across significant evolutionary distances using sensitive protein domain profiles. Proteins similar to a query protein are grouped and scored by architecture.
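
Treating a domain architecture as the ordered sequence of domains in a protein, proteins can be grouped by identical architecture with a few lines of code. This is a simplified sketch of the grouping idea only; it does not reproduce CDART's profile search or scoring, and the protein names in the example are hypothetical.

```python
from collections import defaultdict


def group_by_architecture(proteins: dict[str, list[str]]) -> dict[tuple[str, ...], list[str]]:
    """Group protein names by their ordered tuple of domain names."""
    groups = defaultdict(list)
    for name, domain_order in proteins.items():
        groups[tuple(domain_order)].append(name)
    return dict(groups)


if __name__ == "__main__":
    # Hypothetical proteins; the domain order runs from N- to C-terminus.
    proteins = {
        "protA": ["SH3", "SH2", "kinase"],
        "protB": ["SH3", "SH2", "kinase"],
        "protC": ["SH2", "SH3", "kinase"],
    }
    for architecture, names in group_by_architecture(proteins).items():
        print(" - ".join(architecture), "->", names)
```

Because the order matters, protC falls into its own group even though it contains the same three domains as protA and protB.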