Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.

Slides:



Advertisements
Similar presentations
PROTEOMICS 3D Structure Prediction. Contents Protein 3D structure. –Basics –PDB –Prediction approaches Protein classification.
Advertisements

Protein Structure Prediction
Proteins - Many Structures, Many Functions 1.A polypeptide is a polymer of amino acids connected to a specific sequence 2.A protein’s function depends.
Protein Threading Zhanggroup Overview Background protein structure protein folding and designability Protein threading Current limitations.
PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D.
1 Protein Structure, Structure Classification and Prediction Bioinformatics X3 January 2005 P. Johansson, D. Madsen Dept.of Cell & Molecular Biology, Uppsala.
Structural bioinformatics
Structure Prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray crystallography, NMR) [2]
Protein structure (Part 2 of 2).
Structure Prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray crystallography, NMR) [2]
Energetics and kinetics of protein folding. Comparison to other self-assembling systems?
The Protein Data Bank (PDB)
. Protein Structure Prediction [Based on Structural Bioinformatics, section VII]
CISC667, F05, Lec20, Liao1 CISC 467/667 Intro to Bioinformatics (Fall 2005) Protein Structure Prediction Protein Secondary Structure.
1 Protein Structure Prediction Reporter: Chia-Chang Wang Date: April 1, 2005.
Protein Tertiary Structure. Primary: amino acid linear sequence. Secondary:  -helices, β-sheets and loops. Tertiary: the 3D shape of the fully folded.
Molecular modelling / structure prediction (A computational approach to protein structure) Today: Why bother about proteins/prediction Concepts of molecular.
Intro to Bioinformatics Computational Approaches to Receptor Structure Prediction Uğur Sezerman Biological Sciences and Bioengineering Program Sabancı.
1 Protein Structure Prediction Charles Yan. 2 Different Levels of Protein Structures The primary structure is the sequence of residues in the polypeptide.
Computing for Bioinformatics Lecture 8: protein folding.
Roadmap The topics: basic concepts of molecular biology more on Perl
PDB-Protein Data Bank SCOP –Protein structure classification CATH –Protein structure classification genTHREADER–3D structure prediction Swiss-Model–3D.
Protein Tertiary Structure Prediction Structural Bioinformatics.
Bioinformatics Ayesha M. Khan Spring 2013.
Protein Structure Prediction and Analysis
Protein Structural Prediction. Protein Structure is Hierarchical.
Bioinf. Data Analysis & Tools Molecular Simulations & Sampling Techniques117 Jan 2006 Bioinformatics Data Analysis & Tools Molecular simulations & sampling.
Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.
Protein Tertiary Structure Prediction
Chapter 12 Protein Structure Basics. 20 naturally occurring amino acids Free amino group (-NH2) Free carboxyl group (-COOH) Both groups linked to a central.
Forces and Prediction of Protein Structure Ming-Jing Hwang ( 黃明經 ) Institute of Biomedical Sciences Academia Sinica
Tertiary Structure Prediction Methods Any given protein sequence Structure selection Compare sequence with proteins have solved structure Homology Modeling.
Macromolecular structure
Practical session 2b Introduction to 3D Modelling and threading 9:30am-10:00am 3D modeling and threading 10:00am-10:30am Analysis of mutations in MYH6.
Supersecondary structures. Supersecondary structures motifs motifs or folds, are particularly stable arrangements of several elements of the secondary.
Genomics and Personalized Care in Health Systems Lecture 9 RNA and Protein Structure Leming Zhou, PhD School of Health and Rehabilitation Sciences Department.
What are proteins? Proteins are important; e.g. for catalyzing and regulating biochemical reactions, transporting molecules, … Linear polymer chain composed.
CRB Journal Club February 13, 2006 Jenny Gu. Selected for a Reason Residues selected by evolution for a reason, but conservation is not distinguished.
Introduction to Protein Structure
Lecture 10 – protein structure prediction. A protein sequence.
COT 6930 HPC and Bioinformatics Protein Structure Prediction Xingquan Zhu Dept. of Computer Science and Engineering.
 Four levels of protein structure  Linear  Sub-Structure  3D Structure  Complex Structure.
Representations of Molecular Structure: Bonds Only.
PROTEINS PROTEINS Levels of Protein Structure.
1 P9 Extra Discussion Slides. Sequence-Structure-Function Relationships Proteins of similar sequences fold into similar structures and perform similar.
Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.
Department of Mechanical Engineering
Operone lac Principles of protein structure and function Function is derived from structure Structure is derived from amino acid sequence Different.
Protein Classification II CISC889: Bioinformatics Gang Situ 04/11/2002 Parts of this lecture borrowed from lecture given by Dr. Altman.
Part I : Introduction to Protein Structure A/P Shoba Ranganathan Kong Lesheng National University of Singapore.
Protein Structure & Modeling Biology 224 Instructor: Tom Peavy Nov 18 & 23, 2009
Protein Structure (Foundation Block) What are proteins? Four levels of structure (primary, secondary, tertiary, quaternary) Protein folding and stability.
CS790 – BioinformaticsProtein Structure and Function1 Disulfide Bonds  Two cyteines in close proximity will form a covalent bond  Disulfide bond, disulfide.
Last Tuesday and Beyond Common 2° structural elements: influenced by 1° structure –alpha helices –beta strands –beta turns Structure vs. function –Fibrous.
1 Protein Structure Prediction (Lecture for CS397-CXZ Algorithms in Bioinformatics) April 23, 2004 ChengXiang Zhai Department of Computer Science University.
Tertiary Structure Globular proteins (enzymes, molecular machines)  Variety of secondary structures  Approximately spherical shape  Water soluble 
Protein Tertiary Structure. Protein Data Bank (PDB) Contains all known 3D structural data of large biological molecules, mostly proteins and nucleic acids:
Protein Modeling Protein Structure Prediction. 3D Protein Structure ALA CαCα LEU CαCαCαCαCαCαCαCα PRO VALVAL ARG …… ??? backbone sidechain.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Protein Folding & Biospectroscopy Lecture 6 F14PFB David Robinson.
Objective 7: TSWBAT recognize and give examples of four levels of protein conformation and relate them to denaturation.
Protein Structure and Bioinformatics. Chapter 2 What is protein structure? What are proteins made of? What forces determines protein structure? What is.
PROTEIN STRUCTURE (Donaldson, March 10,2003) What are we trying to learn about genes and their proteins: Predict function for unknown protein by comparison.
PROTEINS Characteristics of Proteins Contain carbon, hydrogen, oxygen, nitrogen, and sulfur Serve as structural components of animals Serve as control.
Lecture 10 CS566 Fall Structural Bioinformatics Motivation Concepts Structure Solving Structure Comparison Structure Prediction Modeling Structural.
Forces and Prediction of Protein Structure Ming-Jing Hwang ( 黃明經 ) Institute of Biomedical Sciences Academia Sinica
Protein Structure Prediction. Protein Sequence Analysis Molecular properties (pH, mol. wt. isoelectric point, hydrophobicity) Secondary Structure Super-secondary.
Protein Structure Prediction
Protein structure prediction.
Presentation transcript:

Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily Different families whose structural and functional features suggest common evolutionary origin Fold Different superfamilies having same major secondary structures in same arrangement and with same topological connections Class Secondary structure composition.

Databases of Structural Classification SCOP: Structural Classification of Proteins CATH: Class, Architecture, Topology and Homologous superfamily FSSP: Families of Structurally Similar Proteins

The Protein Folding Problem Given a particular sequence of amino acid residues (primary structure), what will the tertiary/quaternary structure of the resulting protein be? Input: AAVIKYGCAL… Output:  1  1,  2  2 … => backbone conformation (no side chains yet)  But what about the tertiary structure?

Folding intermediates Levinthal’s paradox – Consider a 100 residue protein. If each residue can take only 3 positions, there are = 5  possible conformations. Finding a native folded state among all possible configurations can take an enormously long time Folding must proceed by progressive stabilization of intermediates  for fast folding – Molten globules – most secondary structure formed, but much less compact than “native” conformation. – It is an intermediate between the native state and denatured state.

Forces driving protein folding  It is believed that hydrophobic collapse is a key driving force for protein folding  fast reaction; produces molten globule state  Hydrophobic core  Polar surface interacting with solvent  Minimum volume (no cavities)  Disulfide bond formation stabilizes  Hydrogen bonds  Polar and electrostatic interactions

Forces driving protein folding  Proteins are, in fact, only marginally stable  Native state is typically only 5 to 10 kcal/mole more stable than the unfolded form  Many proteins help in folding  Protein disulfide isomerase – catalyzes shuffling of disulfide bonds  Chaperones – break up aggregates and (in theory) unfold misfolded proteins

Recall: secondary and tertiary structure of proteins, and quaternary structure Determining Protein structure X-ray crystallography Coordinates are determined by X-ray crystallography The interaction of x-rays with electrons arranged in a crystal can produce electron-density map, which can be interpreted to an atomic model. Crystal is very hard to grow. Nuclear magnetic resonance (NMR) Some atomic nuclei have a magnetic spin. Probe the molecule by radio frequency and get the distances between atoms. Only applicable to small molecules.

PDB: Protein Data Bank  Three-dimensional structures of large biological molecules, including proteins and nucleic acids.  Contain sequence details, atomic coordinates, crystallization conditions

Ab initio Structure Prediction An free energy function to describe the protein bond energy bond angle energy dihedral angle energy van der Waals energy electrostatic energy Minimize the function and obtain the structure An algorithm capable of finding the global minimum of the energy function is be used Not practical in general Computationally too expensive Accuracy is poor Deriving structures, approximate or otherwise, from sequence.

Ab initio Structure Prediction contd. It should be kept in mind that native structures exist in a certain solvent environment The native conformation need not necessarily correspond to the global minimum of free energy.

Template based protein structure prediction or also known as: comparative modeling, homology modeling  Used where there is a clear sequence relationship between the target structure and one or more known structures.

Template based…(contd)  The most reliable technique for predicting protein structure  Comparing the sequence of the new protein with the sequences of proteins of known structure Strong similarity No strong similarities  comparative modeling cannot be used.

Components Protein Structure library (Template library) From PDB, FSSP, SCOP Scoring Function Sequence compatibility Structure compatibility Alignment Algorithm Fold Recognition Confidence Assessment

3D Model Building  Programs: MODELLER, SWISS-MOD, SCWRL, etc Similar performances (from CASP)  Best Template? Closest biological function? Environmental factors (pH, ligand, etc) Resolution Choosing family of proteins Using multiple templates  Alignment accuracy is everything! Cannot recover from an incorrect alignment Gap placement Try many plausible alignments, and build multiple models  Build as many models as possible, and determine the best Consensus Using protein structure analysis program

Prediction of protein structure includes: Protein secondary structure prediction Protein Phi-Psi angle prediction Predicting disulphide Bridges Predicting beta-turns Domain recognition Domain boundary detection Protein structural classification Mining structural motifs