Protein Structure Prediction

Slides:



Advertisements
Similar presentations
1 Amino acid and proteins Ghollam-Reza Moshtaghi-Kashanian Biochemistry Department Medical School Kerman University of Medical sciences.
Advertisements

Protein Structure C483 Spring 2013.
Protein Structure Prediction
The amino acids in their natural habitat. Topics: Hydrogen bonds Secondary Structure Alpha helix Beta strands & beta sheets Turns Loop Tertiary & Quarternary.
Protein Threading Zhanggroup Overview Background protein structure protein folding and designability Protein threading Current limitations.
Biology 107 Macromolecules II September 9, Macromolecules II Student Objectives:As a result of this lecture and the assigned reading, you should.
Biology 107 Macromolecules II September 5, Macromolecules II Student Objectives:As a result of this lecture and the assigned reading, you should.
Biology 107 Macromolecules II September 8, 2003.
Protein Tertiary Structure. Primary: amino acid linear sequence. Secondary:  -helices, β-sheets and loops. Tertiary: the 3D shape of the fully folded.
Molecular modelling / structure prediction (A computational approach to protein structure) Today: Why bother about proteins/prediction Concepts of molecular.
1 Protein Structure Prediction Charles Yan. 2 Different Levels of Protein Structures The primary structure is the sequence of residues in the polypeptide.
Roadmap The topics: basic concepts of molecular biology more on Perl
Protein Basics Protein function Protein structure –Primary Amino acids Linkage Protein conformation framework –Dihedral angles –Ramachandran plots Sequence.
A PEPTIDE BOND PEPTIDE BOND Polypeptides are polymers of amino acid residues linked by peptide group Peptide group is planar in nature which limits.
Computer-Aided Protein Structure Prediction Dr. G.P.S. Raghava, F.N.A. Sc. Bioinformatics Centre Institute of Microbial Technology Institute of Microbial.
Proteins Dr. Sumbul Fatma Clinical Chemistry Unit
Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.
7.5: PROTEINS Proteins Function Structure. Function 7.5.4: State four functions of proteins, giving a named example of each. [Obj. 1] Proteins are the.
Proteins. Proteins? What is its How does it How is its How does it How is it Where is it What are its.
02/03/10 CSCE 769 Dihedral Angles Homayoun Valafar Department of Computer Science and Engineering, USC.
Protein Folding & Biospectroscopy F14PFB David Robinson Mark Searle Jon McMaster
CS790 – BioinformaticsProtein Structure and Function1 Review of fundamental concepts  Know how electron orbitals and subshells are filled Know why atoms.
Mrs. Einstein Research in Molecular Biology. Importance of proteins for cell function: Proteins are the end product of the central dogma YOU are your.
Part I : Introduction to Protein Structure A/P Shoba Ranganathan Kong Lesheng National University of Singapore.
Protein Structure 1 Primary and Secondary Structure.
Protein Structure (Foundation Block) What are proteins? Four levels of structure (primary, secondary, tertiary, quaternary) Protein folding and stability.
Protein structure and function Part - I
THE STRUCTURE AND FUNCTION OF MACROMOLECULES Proteins - Many Structures, Many Functions 1.A polypeptide is a polymer of amino acids connected to a specific.
1 Protein Structure Prediction (Lecture for CS397-CXZ Algorithms in Bioinformatics) April 23, 2004 ChengXiang Zhai Department of Computer Science University.
Protein Modeling Protein Structure Prediction. 3D Protein Structure ALA CαCα LEU CαCαCαCαCαCαCαCα PRO VALVAL ARG …… ??? backbone sidechain.
Protein Structure Prediction: Homology Modeling & Threading/Fold Recognition D. Mohanty NII, New Delhi.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Proteins Dr. Sumbul Fatma Clinical Chemistry Unit Department of Pathology Tel
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
Objective 7: TSWBAT recognize and give examples of four levels of protein conformation and relate them to denaturation.
Protein Structure and Bioinformatics. Chapter 2 What is protein structure? What are proteins made of? What forces determines protein structure? What is.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Protein backbone Biochemical view:
Levels of Protein Structure. Why is the structure of proteins (and the other organic nutrients) important to learn?
PROTEINS Characteristics of Proteins Contain carbon, hydrogen, oxygen, nitrogen, and sulfur Serve as structural components of animals Serve as control.
PROTEINS L3 BIOLOGY. FACTS ABOUT PROTEINS: Contain the elements Carbon, Hydrogen, Oxygen, and NITROGEN Polymer is formed using 20 different amino acids.
Levels of Protein Structure. Why is the structure of proteins (and the other organic nutrients) important to learn?
Enzymes SADIA SAYED. Enzymes are proteins  All enzymes are proteins  Strings of amino acids folding up into distinct structures  The properties of.
Structural organization of proteins
Functions Enzymes – organic catalysts Structural – skin, hair, muscle Antibodies Hormones.
CHM 708: MEDICINAL CHEMISTRY
Protein Structure BL
Protein Proteins are biochemical compounds consisting of one or more polypeptides typically folded into a globular or fibrous form in a biologically functional.
Chapter 5 Proteins.
Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.
CHAPTER 5 THE STRUCTURE AND FUNCTION OF MACROMOLECULES
© SSER Ltd..
Chemical agents PROTEINS: The Molecular Tools of the Cell
Protein Structure and Properties
Protein Structure Amino Acids Polypeptide Levels of Structure
The Peptide Bond Amino acids are joined together in a condensation reaction that forms an amide known as a peptide bond.
Lecture 5 Protein Structure.
The Peptide Bond Amino acids are joined together in a condensation reaction that forms an amide known as a peptide bond.
Diverse Macromolecules
Protein Structure Chapter 14.
Protein structure prediction.
Computer-Aided Protein Structure Prediction
Computer-Aided Protein Structure Prediction
Proteins.
CHAPTER 5 THE STRUCTURE AND FUNCTION OF MACROMOLECULES
Fig 3.13 Reproduced from: Biochemistry by T.A. Brown, ISBN: © Scion Publishing Ltd, 2017.
Computer-Aided Protein Structure Prediction
Structure of Proteins Chymotrypin Glycine
Protein structure prediction
The Three-Dimensional Structure of Proteins
Presentation transcript:

Protein Structure Prediction Sequence + Dr. G.P.S. Raghava Structure

Protein Structure Prediction Experimental Techniques X-ray Crystallography NMR Limitations of Current Experimental Techniques Protein DataBank (PDB) -> 23000 protein structures SwissProt -> 100,000 proteins Non-Redudant (NR) -> 10,00,000 proteins Importance of Structure Prediction Fill gap between known sequence and structures Protein Engg. To alter function of a protein Rational Drug Design

Different Levels of Protein Structure

Protein Architecture Proteins consist of amino acids linked by peptide bonds Each amino acid consists of: a central carbon atom an amino group a carboxyl group and a side chain Differences in side chains distinguish the various amino acids

Amino Acid Side Chains Vary in: Size Shape Polarity

Peptide Bond

Peptide Bonds

Dihedral Angles

Conformation Flexibility Backbone (main chain of atoms in peptide bonds, minus side chains) conformation: Torsion or rotation angles around: C-N bond () C-C bond () Sterical hinderance: Most – Pro Least - Gly

Ramachandran Plot

Protein Secondary Structure Regular Secondary Structure (-helices, -sheets) Irregular Secondary Structure (Tight turns, Random coils, bulges)

Secondary Structure: Helices ALPHA HELIX : a result of H-bonding between every fourth peptide bond (via amino and carbonyl groups) along the length of the polypeptide chain Individual Amino acid H-bond

Helix formation is local THYROID hormone receptor (2nll)

Secondary Structure: Beta Sheets BETA PLEATED SHEET: a result of H-bonding between polypeptide chains

b-sheet formation is NOT local

Definition of -turn A -turn is defined by four consecutive residues i, i+1, i+2 and i+3 that do not form a helix and have a C(i)-C(i+3) distance less than 7Å and the turn lead to reversal in the protein chain. (Richardson, 1981). The conformation of -turn is defined in terms of  and  of two central residues, i+1 and i+2 and can be classified into different types on the basis of  and . i+1 i+2 i H-bond i+3 D <7Å

Tight turns 2 3 4 5 6 Type No. of residues H-bonding -turn NH(i)-CO(i+1) -turn 3 CO(i)-NH(i+2) -turn 4 CO(i)-NH(i+3) -turn 5 CO(i)-NH(i+4) -turn 6 CO(i)-NH(i+5)

Secondary Structure shortcuts

Tertiary Structure: Hexokinase (6000 atoms, 48 kD, 457 amino acids) polypeptides with a tertiary level of structure are usually referred to as globular proteins, since their shape is irregular and globular in form

Quarternary Structure: Haemoglobin

What determines fold? Anfinsen’s experiments in 1957 demonstrated that proteins can fold spontaneously into their native conformations under physiological conditions. This implies that primary structure does indeed determine folding or 3-D stucture. Some exceptions exist Chaperone proteins assist folding Abnormally folded Prion proteins can catalyze misfolding of normal prion proteins that then aggregate

Levels of Description of Structural Complexity Primary Structure (AA sequence) Secondary Structure Spatial arrangement of a polypeptide’s backbone atoms without regard to side-chain conformations , , coil, turns (Venkatachalam, 1968) Super-Secondary Structure , , /, + (Rao and Rassman, 1973) Tertiary Structure 3-D structure of an entire polypeptide Quarternary Structure Spatial arrangement of subunits (2 or more polypeptide chains)

Techniques of Structure Prediction Computer simulation based on energy calculation Based on physio-chemical principles Thermodynamic equilibrium with a minimum free energy Global minimum free energy of protein surface Knowledge Based approaches Homology Based Approach Threading Protein Sequence Hierarchical Methods

Energy Minimization Techniques Energy Minimization based methods in their pure form, make no priori assumptions and attempt to locate global minma. Static Minimization Methods Classical many potential-potential can be construted Assume that atoms in protein is in static form Problems(large number of variables & minima and validity of potentials) Dynamical Minimization Methods Motions of atoms also considered Monte Carlo simulation (stochastics in nature, time is not cosider) Molecular Dynamics (time, quantum mechanical, classical equ.) Limitations large number of degree of freedom,CPU power not adequate Interaction potential is not good enough to model

Molecular Dynamics Provides a way to observe the motion of large molecules such as proteins at the atomic level – dynamic simulation Newton’s second law applied to molecules Potential energy function Molecular coordinates Force on all atoms can be calculated, given this function Trajectory of motion of molecule can be determined

Knowledge Based Approaches Homology Modelling Need homologues of known protein structure Backbone modelling Side chain modelling Fail in absence of homology Threading Based Methods New way of fold recognition Sequence is tried to fit in known structures Motif recognition Loop & Side chain modelling Fail in absence of known example

Homology Modeling Simplest, reliable approach Basis: proteins with similar sequences tend to fold into similar structures Has been observed that even proteins with 25% sequence identity fold into similar structures Does not work for remote homologs (< 25% pairwise identity)

Homology Modeling Given: A query sequence Q A database of known protein structures Find protein P such that P has high sequence similarity to Q Return P’s structure as an approximation to Q’s structure

Threading Given: Find: sequence of protein P with unknown structure Database of known folds Find: Most plausible fold for P Evaluate quality of such arrangement Places the residues of unknown P along the backbone of a known structure and determines stability of side chains in that arrangement

Hierarcial Methods Intermidiate structures are predicted, instead of predicting tertiary structure of protein from amino acids sequence Prediction of backbone structure Secondary structure (helix, sheet,coil) Beta Turn Prediction Super-secondary structure Tertiary structure prediction Limitation Accuracy is only 75-80 % Only three state prediction

Thanks