11/09/05 D Dobbs ISU - BCB 444/544X: Protein Structure Databases - cont.1 11/9/05 Protein Structure Databases (continued) Prediction & Modeling.

Presentation transcript:

Slide 1: Protein Structure Databases (continued); Prediction & Modeling (11/9/05)

Slide 2: Bioinformatics Seminars
Nov 10 Thurs 3:40 Com S Seminar in 223 Atanasoff: Computational Epidemiology, Armin R. Mikler, Univ. North Texas
Nov 10 Thurs 4:10 EEOB Seminar in 210 Bessey: Diversity and Evolution of Plant Immunity Genes: Insights from Molecular Population Genetics, Peter Tiffin, Univ. of Minnesota

Slide 3: Bioinformatics Seminars
CORRECTION: Next week - Baker Center/BCB Seminars (seminar abstracts available at above link)
Nov 14 Mon 1:10 PM: Doug Brutlag, Stanford, Discovering transcription factor binding sites
Nov 15 Tues 1:10 PM: Ilya Vakser, Univ Kansas, Modeling protein-protein interactions
Both seminars will be in Howe Hall Auditorium

Slide 4: Protein Structure & Function: Analysis & Prediction
Mon: Protein structure: basics; classification, databases, visualization
Wed: Protein structure databases - cont.
Thurs Lab: Protein structure databases; Protein structure analysis & prediction
Fri: Protein structure prediction; Protein-nucleic acid interactions

Slide 5: Reading Assignment (for Mon-Fri)
Mount Bioinformatics Chp 10 Protein classification & structure prediction pp Ck Errata:
Additional reading assignments for BCB 544:
Gene Prediction: Burge & Karlin 1997 JMB 268:78, Prediction of complete gene structures in human genomic DNA
Structure Prediction: Schueler-Furman…Baker, Science 310:638, Progress in modeling of protein structures and interactions

Slide 6: Review last lecture: Protein Structure: Basics

Slide 7: Protein Structure & Function
Amino acid characteristics
Structural classes & motifs
Protein functions & functional families (not much - more on this later)
Classification
Databases
Visualization

Slide 8: Amino Acids
Each of the 20 different amino acids has a different "R-group" (side chain) attached to Cα

Slide 9: Peptide bond is rigid and planar

Slide 10: Hydrophobic Amino Acids

Slide 11: Charged Amino Acids

Slide 12: Polar Amino Acids

Slide 13: Certain side-chain configurations are energetically favored (rotamers)
Ramachandran plot: "Allowable" psi & phi angles
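The phi and psi angles plotted on a Ramachandran plot are backbone dihedral angles: phi is defined by the atoms C(i-1), N(i), Cα(i), C(i), and psi by N(i), Cα(i), C(i), N(i+1). The following is a minimal sketch of how a signed dihedral angle can be computed from four atom positions. The function name and the toy coordinates are illustrative assumptions, not part of the lecture, and real coordinates would come from a structure file.

```python
import math

def _sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])

def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def _dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

def dihedral(p0, p1, p2, p3):
    """Signed dihedral angle (degrees) about the p1-p2 bond.

    Hypothetical helper for illustration: cis gives 0, trans gives 180.
    """
    b0 = _sub(p1, p0)          # first bond vector
    b1 = _sub(p2, p1)          # central bond vector (rotation axis)
    b2 = _sub(p3, p2)          # last bond vector
    n1 = _cross(b0, b1)        # normal to the plane of p0, p1, p2
    n2 = _cross(b1, b2)        # normal to the plane of p1, p2, p3
    b1_len = math.sqrt(_dot(b1, b1))
    y = b1_len * _dot(b0, n2)
    x = _dot(n1, n2)
    return math.degrees(math.atan2(y, x))

# Toy coordinates (assumed, not from any real structure).
cis = dihedral((1, 0, 0), (0, 0, 0), (0, 0, 1), (1, 0, 1))
trans = dihedral((1, 0, 0), (0, 0, 0), (0, 0, 1), (-1, 0, 1))
quarter = dihedral((1, 0, 0), (0, 0, 0), (0, 0, 1), (0, 1, 1))
print(cis, trans, quarter)
```

Applying such a function to every residue's backbone atoms yields the (phi, psi) pairs that populate the plot.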

Slide 14: Glycine is smallest amino acid
R group = H atom
Glycine residues increase backbone flexibility because they have no R group

Slide 15: Proline is cyclic
Proline residues reduce flexibility of polypeptide chain
Proline cis-trans isomerization is often a rate-limiting step in protein folding
Recent work suggests it may also regulate ligand binding in native proteins - Andreotti

Slide 16: Cysteines can form disulfide bonds
Disulfide bonds (covalent) stabilize 3-D structures
In eukaryotes, disulfide bonds are found only in secreted proteins or extracellular domains

Slide 17: Globular proteins have a compact hydrophobic core
Packing of hydrophobic side chains into interior is main driving force for folding
Problem? Polypeptide backbone is highly polar (hydrophilic) due to polar -NH and C=O in each peptide unit; these polar groups must be neutralized
Solution? Form regular secondary structures, e.g., α-helix, β-sheet, stabilized by H-bonds

Slide 18: Exterior surface of globular proteins is generally hydrophilic
Hydrophobic core formed by packed secondary structural elements provides compact, stable core
"Functional groups" of protein are attached to this framework; exterior has more flexible regions (loops) and polar/charged residues
Hydrophobic "patches" on protein surface are often involved in protein-protein interactions

Slide 19: Protein Secondary Structures
α Helices
β Sheets
Loops
Coils

Slide 20: α helix: stabilized by H-bonds between every ~4th residue in backbone
Figure color key: C = black, O = red, N = blue
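In an α helix the backbone C=O of residue i accepts a hydrogen bond from the backbone N-H of residue i+4. As a small sketch of that pattern, the snippet below enumerates the (i, i+4) pairs for an idealized helical segment. The function name and the 10-residue segment are assumptions chosen for illustration.

```python
def helix_hbond_pairs(n_residues, offset=4):
    """List (acceptor, donor) residue numbers for an ideal alpha helix.

    Residues are numbered from 1; residue i (C=O) pairs with residue
    i + offset (N-H). Hypothetical helper, not from the lecture.
    """
    return [(i, i + offset) for i in range(1, n_residues - offset + 1)]

pairs = helix_hbond_pairs(10)
for acceptor, donor in pairs:
    print(f"C=O of residue {acceptor} ... H-N of residue {donor}")
```

Note that the first four N-H groups and the last four C=O groups of such a segment have no partner within the helix.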

Slide 21: Certain amino acids are "preferred" & others are rare in α helices
Ala, Glu, Leu, Met = good helix formers
Pro, Gly, Tyr, Ser = very poor
Amino acid composition & distribution varies, depending on location of helix in 3-D structure
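The two residue lists on this slide can be turned into a crude tally for any sequence segment. This is only a counting sketch under the assumption that the slide's two lists are used as-is. It is not a secondary structure prediction method, and the function name and example peptide are made up for illustration.

```python
# One-letter codes for the residues named on the slide.
GOOD_HELIX_FORMERS = set("AELM")   # Ala, Glu, Leu, Met
POOR_HELIX_FORMERS = set("PGYS")   # Pro, Gly, Tyr, Ser

def helix_former_counts(segment):
    """Return (good, poor) counts of helix formers in a sequence segment."""
    segment = segment.upper()
    good = sum(1 for residue in segment if residue in GOOD_HELIX_FORMERS)
    poor = sum(1 for residue in segment if residue in POOR_HELIX_FORMERS)
    return good, poor

# Made-up peptide, not a real protein sequence.
example_counts = helix_former_counts("AELMPGYSK")
print(example_counts)
```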

Slide 22: β-sheets - also stabilized by H-bonds between backbone atoms
Figure labels: Anti-parallel, Parallel

Slide 23: Loops
Connect helices and sheets
Vary in length and 3-D configurations
Are located on surface of structure
Are more "tolerant" of mutations
Are more flexible and can adopt multiple conformations
Tend to have charged and polar amino acids
Are frequently components of active sites
Some fall into distinct structural families (e.g., hairpin loops, reverse turns)

Slide 24: Coils
Regions of 2° structure that are not helices, sheets, or recognizable turns
Intrinsically disordered regions appear to play important functional roles

Slide 25: Globular proteins are built from recurring structural patterns
Motifs or supersecondary structures = combinations of 2° structural elements
Domains = combinations of motifs
Independently folding unit (foldon)
Functional unit

Slide 26: Simple motifs combine to form domains

Slide 27: 6 main classes of protein structure
1) α Domains: Bundles of helices connected by loops
2) β Domains: Mainly antiparallel sheets, usually with 2 sheets forming sandwich
3) α/β Domains: Mainly parallel sheets with intervening helices, also mixed sheets
4) α+β Domains: Mainly segregated helices and sheets
5) Multidomain (α and β): Containing domains from more than one class
6) Membrane & cell-surface proteins

Slide 28: α-domain structures: 4-helix bundles

Slide 29: β-sheets: up-and-down sheets & barrels

Slide 30: α/β-domains: leucine-rich motifs can form horseshoes

Slide 31: New today:
Protein Structure Databases
Classification
Visualization
Protein Structure Prediction
Secondary structure
Tertiary structure

Slide 32: Protein sequence databases
UniProt (SwissProt, PIR, EBI)
NCBI Protein
More on these later: protein function prediction

Slide 33: Protein sequence & structure: analysis
Diamond STING Millennium - many useful structure analysis tools, including Protein Dossier
SwissProt (UniProt) protein knowledgebase
InterPRO sequence analysis tools

Slide 34: Protein structure databases
PDB Protein Data Bank (RCSB) - THE protein structure database
MMDB Molecular Modeling Database (NCBI Entrez) - has "added" value
MSD Molecular Structure Database - especially good for interactions, binding sites

Slide 35: Protein structure classification
SCOP = Structural Classification of Proteins - levels reflect both evolutionary and structural relationships
CATH = Classification by Class, Architecture, Topology & Homology
DALI/FSSP (recently moved to EBI & reorganized) - fully automated structure alignments
DALI server http://
DALI Database (fold classification)

Slide 36: Protein structure visualization
Molecular Visualization Freeware: MolviZ.Org
Protein Explorer
RASMOL (& many descendants: Protein Explorer, PyMol, MolMol, etc.)
CHIME
Cn3D
Deep View = Swiss-PDB Viewer

Slide 37: PDB (RCSB)

Slide 38: RCSB PDB - Beta site

Slide 39: RCSB PDB - New Tutorial

Slide 40: NCBI Structure

Slide 41: MMDB

Slide 42: Cn3D

Slide 43: MMDB: Molecular Modeling Data Base
Derived from PDB structure records
Value added to PDB records, including:
Integration with other ENTREZ databases & tools
Conversion to parseable ASN.1 data description language
Correction of numbering discrepancies in structure vs sequence
Validation
Addition of explicit chemical graph information
Structure neighbors determined by Vector Alignment Search Tool (VAST)

Slide 44: Searching MMDB
1CET

Slide 45: MMDB Structure Summary
Cn3D viewer
VAST neighbors
BLAST neighbors

Slide 46: Cn3D: Displaying 2° Structures
Figure label: Chloroquine

Slide 47: Cn3D: Displaying 3° Structures
Figure label: Chloroquine

Slide 48: Cn3D: Structural Alignments
Figure labels: Chloroquine, NADH

Slide 49: Protein Explorer (RasMol/Chime)

Slide 50: Protein Explorer

Slide 51: SCOP - Structure Classification

Slide 52: CATH - Structure Classification

Slide 53: Structural Genomics
~ 30,000 "traditional" genes in human genome (not counting: ???)
~ 3,000 proteins in a typical cell
> 2 million sequences in UniProt
> 33,000 protein structures in the PDB
Experimental determination of protein structure lags far behind sequence determination!
Goal: Determine structures of "all" protein folds in nature, using combination of experimental structure determination methods (X-ray crystallography, NMR, mass spectrometry) & structure prediction
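To make the size of the gap concrete, the two database counts quoted on this slide can be compared directly. Both figures are 2005 lower bounds taken from the slide, so the ratio is only approximate, and the variable names are illustrative.

```python
# Counts quoted on the slide (2005, both given as lower bounds).
uniprot_sequences = 2_000_000
pdb_structures = 33_000

coverage = pdb_structures / uniprot_sequences
sequences_per_structure = uniprot_sequences / pdb_structures

print(f"Structures per known sequence: {coverage:.2%}")
print(f"Known sequences per structure: about {sequences_per_structure:.0f}")
```

On these numbers, fewer than 2% of known sequences had an experimentally determined structure.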

Slide 54: Structural Genomics Projects
TargetDB: database of structural genomics targets
Protein Structure Prediction?

Slide 55: Protein Folding
"Major unsolved problem in molecular biology"
In cells: spontaneous; assisted by enzymes; assisted by chaperones
In vitro: many proteins fold spontaneously & many do not!

Slide 56: Steps in Protein Folding
1- "Collapse" - driving force is burial of hydrophobic aa's (fast - msecs)
2- Molten globule - helices & sheets form, but "loose" (slow - secs)
3- "Final" native folded state - compaction, some 2° structures rearranged
Native state? - assumed to be lowest free energy - may be an ensemble of structures

Slide 57: Protein Dynamics
Protein in native state is NOT static
Function of many proteins depends on conformational changes, sometimes large, sometimes small
Globular proteins are inherently "unstable" (NOT evolved for maximum stability)
Energy difference between native and denatured state is very small (5-15 kcal/mol) (this is equivalent to 1 or 2 H-bonds!)
Folding involves changes in both entropy & enthalpy
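One way to see what a 5-15 kcal/mol stability means is to convert it to an equilibrium population with K = exp(ΔG/RT). The sketch below assumes a simple two-state (folded/unfolded) model and a temperature of 298.15 K; both are assumptions added here, not statements from the lecture, and the function name is hypothetical.

```python
import math

R_KCAL = 1.987e-3  # gas constant in kcal/(mol*K)

def fraction_unfolded(delta_g_unfold, temperature=298.15):
    """Two-state fraction of unfolded molecules at equilibrium.

    delta_g_unfold: free energy of unfolding in kcal/mol (positive when
    the native state is the more stable one).
    """
    k_fold = math.exp(delta_g_unfold / (R_KCAL * temperature))  # [folded]/[unfolded]
    return 1.0 / (1.0 + k_fold)

for dg in (5.0, 10.0, 15.0):
    print(f"dG = {dg:4.1f} kcal/mol -> fraction unfolded = {fraction_unfolded(dg):.2e}")
```

Even at the low end of the range nearly all molecules are folded at equilibrium, yet a change of only a few kcal/mol shifts the unfolded population by orders of magnitude, which is consistent with the slide's point that the native state is only marginally stable.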

Slide 58: Protein Structure Prediction
Structure is largely determined by sequence
BUT:
Similar sequences can assume different structures
Dissimilar sequences can assume similar structures
Many proteins are multi-functional
Protein folding: determination of folding pathways and prediction of tertiary structure are still largely unsolved problems