Introduction to Bioinformatics II

Slides:



Advertisements
Similar presentations
The Structure and Function of Proteins Bioinformatics Ch 7
Advertisements

Secondary structure prediction from amino acid sequence.
1 Protein Structure, Structure Classification and Prediction Bioinformatics X3 January 2005 P. Johansson, D. Madsen Dept.of Cell & Molecular Biology, Uppsala.
Structure Prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray crystallography, NMR) [2]
1 September, 2004 Chapter 5 Macromolecular Structure.
Protein Secondary Structures
Predicting local Protein Structure Morten Nielsen.
Structure Prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray crystallography, NMR) [2]
Protein Structure Databases Databases of three dimensional structures of proteins, where structure has been solved using X-ray crystallography or nuclear.
IPM-POLYTECHNIQUE-WPI Workshop on Bioinformatics and Biomathematics April 11-21, 2005 IPM School of Mathematics Tehran.
Protein Structure Modeling (1). Protein Folding Problem A protein folds into a unique 3D structure under physiological conditions Lysozyme sequence: KVFGRCELAA.
Protein Secondary Structures Assignment and prediction.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU April 8, 2003Claus Lundegaard Protein Secondary Structures Assignment and prediction.
Protein Secondary Structures Assignment and prediction.
Protein Secondary Structures Assignment and prediction Pernille Haste Andersen
Protein Secondary Structure Prediction Dong Xu Computer Science Department 271C Life Sciences Center 1201 East Rollins Road University of Missouri-Columbia.
Structure Prediction in 1D
1. Primary Structure: Polypeptide chain Polypeptide chain Amino acid monomers Peptide linkages Figure 3.6 The Four Levels of Protein Structure.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU October 29, 2004Claus Lundegaard Protein Secondary Structures Assignment and.
Protein Secondary Structures Assignment and prediction.
Predicting local Protein Structure Morten Nielsen.
Class 7: Protein Secondary Structure
©CMBI 2006 Amino Acids “ When you understand the amino acids, you understand everything ”
Protein Structures.
Proteins Secondary Structure Predictions Structural Bioinformatics.
Prediction to Protein Structure Fall 2005 CSC 487/687 Computing for Bioinformatics.
Proteins. Proteins? What is its How does it How is its How does it How is it Where is it What are its.
©CMBI 2006 Amino Acids “ When you understand the amino acids, you understand everything ”
Protein Secondary Structure Prediction Some of the slides are adapted from Dr. Dong Xu’s lecture notes.
Protein Secondary Structure Prediction. Input: protein sequence Output: for each residue its associated Secondary structure (SS): alpha-helix, beta-strand,
Protein Secondary Structure Prediction
Secondary structure prediction
2 o structure, TM regions, and solvent accessibility Topic 13 Chapter 29, Du and Bourne “Structural Bioinformatics”
CS790 – BioinformaticsProtein Structure and Function1 Review of fundamental concepts  Know how electron orbitals and subshells are filled Know why atoms.
Web Servers for Predicting Protein Secondary Structure (Regular and Irregular) Dr. G.P.S. Raghava, F.N.A. Sc. Bioinformatics Centre Institute of Microbial.
Part I : Introduction to Protein Structure A/P Shoba Ranganathan Kong Lesheng National University of Singapore.
Protein Structure 1 Primary and Secondary Structure.
Protein secondary structure Prediction Why 2 nd Structure prediction? The problem Seq: RPLQGLVLDTQLYGFPGAFDDWERFMRE Pred:CCCCCHHHHHCCCCEEEECCHHHHHHCC.
Protein Secondary Structure Prediction G P S Raghava.
1 Protein Structure Prediction (Lecture for CS397-CXZ Algorithms in Bioinformatics) April 23, 2004 ChengXiang Zhai Department of Computer Science University.
Protein Structure Prediction ● Why ? ● Type of protein structure predictions – Sec Str. Pred – Homology Modelling – Fold Recognition – Ab Initio ● Secondary.
Lecture 5 Web: pollev.com/ucibio Text: To: Type in:
Protein Structure and Bioinformatics. Chapter 2 What is protein structure? What are proteins made of? What forces determines protein structure? What is.
Protein Structure  The structure of proteins can be described at 4 levels – primary, secondary, tertiary and quaternary.  Primary structure  The sequence.
Proteins: 3D-Structure Chapter 6 (9 / 17/ 2009)
Protein backbone Biochemical view:
Levels of Protein Structure. Why is the structure of proteins (and the other organic nutrients) important to learn?
Proteins Structure Predictions Structural Bioinformatics.
3S: Proteins Shireen Rudina. What do proteins do? Structure – Collagen in skin, keratin in hair and nails Signaling between cells Defend against disease.
Lecture 47: Structure II -- Proteins. Today’s Outline The monomers: amino acids – Side chain characteristics – Acid-base equilibria and pK a Peptide backbone.
Predicting Structural Features Chapter 12. Structural Features Phosphorylation sites Transmembrane helices Protein flexibility.
Structural organization of proteins
Functions Enzymes – organic catalysts Structural – skin, hair, muscle Antibodies Hormones.
Structural Bioinformatics Elodie Laine Master BIM-BMC Semester 3, Genomics of Microorganisms, UMR 7238, CNRS-UPMC e-documents:
Mir Ishruna Muniyat. Primary structure (Amino acid sequence) ↓ Secondary structure ( α -helix, β -sheet ) ↓ Tertiary structure ( Three-dimensional.
Amino Acids and Proteins
Protein Structure September 7,
Protein Synthesis and Protein Folding
Lecture 5 Web: pollev.com/ucibio Text: To: 37607
. Nonpolar (hydrophobic) Nonpolar (hydrophobic) Amino Acid Side Chains
Protein Planes Bob Fraser CSCBC 2007.
Yuchun Tang (1), Preeti Singh (1), Yanqing Zhang (1),
The Chemistry of Life Proteins
Protein Structure Prediction
Protein Structures.
Protein Structure Chapter 14.
بیوشیمی : پروتئین ها و لیپیدها
The final product of the DNA blueprint
Protein structure prediction
Four Levels of Protein Structure
Presentation transcript:

Introduction to Bioinformatics II Protein secondary prediction Lecture 16 By Ms. Shumaila Azam

Proteins – Some Basics What Is a Protein? What is an Amino Acid? Linear Sequence of Amino Acids... What is an Amino Acid?

Proteins – Some Basics How many types of Amino Acids? Part I: Introduction Proteins – Some Basics How many types of Amino Acids? 20 Naturally Occurring Amino Acids Differ only in SIDE CHAINS Isoleucine Arginine Tyrosine

Proteins – Some Basics Amino Acids connect via PEPTIDE BOND Part I: Introduction Proteins – Some Basics Amino Acids connect via PEPTIDE BOND

Proteins – Some Basics Backbone can swivel: DIHEDRAL ANGLES Part I: Introduction Proteins – Some Basics Backbone can swivel: DIHEDRAL ANGLES 2 per Amino Acid Proteins can be 100’s of Amino Acids in length! Lots of freedom of movement

Protein Functions What do proteins do? Enzymes Cellular Signaling Part I: Introduction Protein Functions What do proteins do? Enzymes Cellular Signaling Antibodies

Protein Functions How do proteins do so much? Part I: Introduction Protein Functions How do proteins do so much? Proteins FOLD spontaneously Assume a characteristic 3D SHAPE Shape depends on particular Amino Acid Sequence Shape gives SPECIFIC function

Protein Structure Levels of organization Primary Sequence Part I: Introduction Protein Structure Levels of organization Primary Sequence Secondary Structure (Modular building blocks) α-helices β-sheets Tertiary Structure Quartenary Structure

Structures in Protein Language: Letters  Words  Sentences Residues  Secondary Structure Tertiary Structure

a helix Single protein chain (local) Shape maintained by intramolecular H bonding between -C=O and H-N-

b sheet Several protein chains Shape maintained by intramolecular H bonding between chains Non-local on protein sequence

-sheet (parallel, anti-parallel)

Classification of secondary structure Defining features Dihedral angles Hydrogen bonds Geometry

What is secondary structure prediction? Given a protein sequence (primary structure) GHWIATRGQLIREAYEDYRHFSSECPFIP Predict its secondary structure content (C=Coils H=Alpha Helix E=Beta Strands) CEEEEECHHHHHHHHHHHCCCHHCCCCCC

Why secondary structure prediction? An easier problem than 3D structure prediction (more than 40 years of history). Accurate secondary structure prediction can be an important information for the tertiary structure prediction Protein function prediction Protein classification Predicting structural change

Prediction methods Statistical method Nearest neighbors Neural network Chou-Fasman method, GOR I-IV Nearest neighbors NNSSP, SSPAL Neural network PHD, Psi-Pred, J-Pred Support vector machine (SVM) HMM

correctly predicted residues number of residues Accuracy measure Three-state prediction accuracy: Q3 correctly predicted residues number of residues A prediction of all loop: Q3 ~ 40%

Improvement of accuracy 1974 Chou & Fasman ~50-53% 1978 Garnier 63% 1987 Zvelebil 66% 1988 Qian & Sejnowski 64.3% 1993 Rost & Sander 70.8-72.0% 1997 Frishman & Argos <75% 1999 Cuff & Barton 72.9% 1999 Jones 76.5% 2000 Petersen et al. 77.9%

Prediction accuracy

Assumptions The entire information for forming secondary structure is contained in the primary sequence. Side groups of residues will determine structure. Examining windows of 13 - 17 residues is sufficient to predict structure. Basis for window size selection: a-helices 5 – 40 residues long b-strands 5 – 10 residues long

Secondary structure propensity From PDB database, calculate the propensity for a given amino acid to adopt a certain ss-type Example: #Ala=2,000, #residues=20,000, #helix=4,000, #Ala in helix=500 P(a,aai) = 500/20,000, p(a) = 4,000/20,000, p(aai) = 2,000/20,000 P = 500 / (4,000/10) = 1.25

Chou-Fasman algorithm Helix, Strand Scan for window of 6 residues where average score > 1 (4 residues for helix and 3 residues for strand) Propagate in both directions until 4 (or 3) residue window with mean propensity < 1 Move forward and repeat Conflict solution Any region containing overlapping alpha-helical and beta-strand assignments are taken to be helical if the average P(helix) > P(strand). It is a beta strand if the average P(strand) > P(helix). Accuracy: ~50%  ~60% GHWIATRGQLIREAYEDYRHFSSECPFIP

Initiation Identify regions where 4/6 have a P(H) >1.00 “alpha-helix nucleus”

Propagation Extend helix in both directions until a set of four residues have an average P(H) <1.00.