Protein structure Anne Mølgaard, Center for Biological Sequence Analysis.

Slides:



Advertisements
Similar presentations
The Chemical Nature of Enzyme Catalysis
Advertisements

Review.
Amino Acids PHC 211.  Characteristics and Structures of amino acids  Classification of Amino Acids  Essential and Nonessential Amino Acids  Levels.
Review of Basic Principles of Chemistry, Amino Acids and Proteins Brian Kuhlman: The material presented here is available on the.
Proteins Function and Structure.
Proteins. Copyright © 2005 Pearson Education, Inc. publishing as Benjamin Cummings Concept 5.4: Proteins have many structures, resulting in a wide range.
FUNDAMENTALS OF MOLECULAR BIOLOGY Introduction -Molecular Biology, Cell, Molecule, Chemical Bonding Macromolecule -Class -Chemical structure -Forms Important.
5’ C 3’ OH (free) 1’ C 5’ PO4 (free) DNA is a linear polymer of nucleotide subunits joined together by phosphodiester bonds - covalent bonds between.
1 Protein Structure, Structure Classification and Prediction Bioinformatics X3 January 2005 P. Johansson, D. Madsen Dept.of Cell & Molecular Biology, Uppsala.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modeling Anne Mølgaard, CBS, BioCentrum, DTU.
Protein-a chemical view A chain of amino acids folded in 3D Picture from on-line biology bookon-line biology book Peptide Protein backbone N / C terminal.
1 Levels of Protein Structure Primary to Quaternary Structure.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Programme Introduction to protein structure Break Homology.
Amino Acids and Proteins 1.What is an amino acid / protein 2.Where are they found 3.Properties of the amino acids 4.How are proteins synthesized 1.Transcription.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Thomas Blicher, Center for Biological Sequence Analysis Details of Protein Structure.
Protein Secondary Structure : Kendrew Solves the Structure of Myoglobin “Perhaps the most remarkable features of the molecule are its complexity.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Protein Structure Thomas Blicher, Center for Biological Sequence Analysis.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Protein structure Anne Mølgaard, Center for Biological Sequence Analysis.
Thomas Blicher Center for Biological Sequence Analysis
ProteinStructuralDatabases. Proteins are built from amino-acids. Introduction H | NH2-c-CO2H | R.
Proteins Function and Structure. Proteins more than 50% of dry mass of most cells functions include – structural support – storage, transport – cellular.
Proteins, Pair HMMs, and Alignment. CS262 Lecture 8, Win06, Batzoglou A state model for alignment -AGGCTATCACCTGACCTCCAGGCCGA--TGCCC--- TAG-CTATCAC--GACCGC-GGTCGATTTGCCCGACC.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Homology Modelling Thomas Blicher Center for Biological Sequence Analysis.
Computing for Bioinformatics Lecture 8: protein folding.
Protein structure Classification Ole Lund, Associate professor, CBS, DTU.
CENTER FOR BIOLOGICAL SEQUENCE ANALYSISTECHNICAL UNIVERSITY OF DENMARK DTU Protein Structure Thomas Blicher, Center for Biological Sequence Analysis.
The relative orientation observed for  helices packed on ß sheets.
Protein Structure Elements Primary to Quaternary Structure.
Protein Structure FDSC400. Protein Functions Biological?Food?
You Must Know How the sequence and subcomponents of proteins determine their properties. The cellular functions of proteins. (Brief – we will come back.
Proteins. The central role of proteins in the chemistry of life Proteins have a variety of functions. Structural proteins make up the physical structure.
Proteins and Enzymes Nestor T. Hilvano, M.D., M.P.H. (Images Copyright Discover Biology, 5 th ed., Singh-Cundy and Cain, Textbook, 2012.)
Proteins are polymers of amino acids.
Protein Structural Prediction. Protein Structure is Hierarchical.
Proteins account for more than 50% of the dry mass of most cells
©CMBI 2006 Amino Acids “ When you understand the amino acids, you understand everything ”
Concept 5.4: Proteins have many structures, resulting in a wide range of functions Proteins account for more than 50% of the dry mass of most cells Protein.
Protein structure and modelling ● Orientation ● Protein structure ● Protein modelling Andreas Heger University of Helsinki Bioinformatics Group Slides.
Amino Acids & Side Groups Polar Charged ◦ ACIDIC negatively charged amino acids  ASP & GLU R group with a 2nd COOH that ionizes* above pH 7.02nd COOH.
Genetics in ~1920: 1. Cells have chromosomes Sketch of Drosophila chromosomes (Bridges, C. 1913)
Secondary structure prediction
1 10/26/2015 MOLECULES. 2 10/26/2015 H 2 N-CH-C-OH O R Monomer E.g. protein Monomer vs polymer amino acid monomer R is a side group.
Outline 1.What is an amino acid / protein naturally occurring amino acids 3.Codon – triplet coding for an amino acid 1.How are proteins synthesized.
Part I : Introduction to Protein Structure A/P Shoba Ranganathan Kong Lesheng National University of Singapore.
Protein Structure & Modeling Biology 224 Instructor: Tom Peavy Nov 18 & 23, 2009
Outline What is an amino acid / protein
Macromolecules of Life Proteins and Nucleic Acids
Protein Modeling Protein Structure Prediction. 3D Protein Structure ALA CαCα LEU CαCαCαCαCαCαCαCα PRO VALVAL ARG …… ??? backbone sidechain.
Amino Acids ©CMBI 2001 “ When you understand the amino acids, you understand everything ”
Proteins.
Proteins Structure of proteins Proteins are made of C, H, O and nitrogen and may have sulfur. The monomers of proteins are amino acids An amino acid.
Chapter 3 Proteins.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
The Structure and Function of Macromolecules Chpt. 5 The Structure and Function of Macromolecules.
Doug Raiford Lesson 14.  Reminder  Involved in virtually every chemical reaction ▪ Enzymes catalyze reactions  Structure ▪ muscle, keratins (skin,
PROTEINS FOLDED POLYPEPTIDES © 2007 Paul Billiet ODWSODWS.
Fibrous Proteins Examples 1. a-keratins 2. Silk Fibroin 3. Collagen
Genomics Lecture 3 By Ms. Shumaila Azam. Proteins Proteins: large molecules composed of one or more chains of amino acids, polypeptides. Proteins are.
Introduction to Molecular Neurobiology
Introduction to Protein Structure
Proteins.
Protein Structure FDSC400. Protein Functions Biological?Food?
Haixu Tang School of Inforamtics
Amino acids R-groups non-polar polar acidic basic proteins
Packet #9 Supplement.
Amino Acids Amine group -NH2 Carboxylic group -COOH
Proteins Genetic information in DNA codes specifically for the production of proteins Cells have thousands of different proteins, each with a specific.
Levels of Protein Structure
The Chemical Building Blocks of Life
“When you understand the amino acids,
Presentation transcript:

Protein structure Anne Mølgaard, Center for Biological Sequence Analysis

“Could the search for ultimate truth really have revealed so hideous and visceral-looking an object?” Max Perutz, 1964 on protein structure John Kendrew, 1959 with myoglobin model

Sep. 2001Feb X-ray NMR theoretical3380 total Holdings of the Protein Data Bank (PDB):

Methods for structure determination X-ray crystallography Nuclear Magnetic Resonance (NMR) Modeling techniques

Modeling Only applicable to ~50% of sequences Fast Accuracy poor for low sequence id. There is still need for experimental structure determination!

Structual genomics consortium (SGC) The SGC deposited its 275th structure into the Protein Data Bank in August 2006 currently operating at a pace of 170 structures per year at a cost of USD$125,000 per structure. Scientific highlights include: several (> 1!!) novel structures of protein kinases completing the structural descriptions of the human adenylate kinase and cytosolic sulfotransferase protein families human chromatin modifying enzymes; human inositol phosphate signaling and a significant number of structures from human parasites.

Amino acids

Livingstone & Barton, CABIOS, 9, , 1993 A – Ala C – Cys D – Asp E – Glu F – Phe G – Gly H – His I – Ile K – Lys L – Leu M – Met N – Asn P – Pro Q – Gln R – Arg S – Ser T – Thr V – Val W – Trp Y - Tyr

Primary Secondary Tertiary Quarternary Levels of protein structure

Primary structure MKTAALAPLFFLPSALATTVYLA GDSTMAKNGGGSGTNGWGEYL ASYLSATVVNDAVAGRSAR… (etc)

 -helix  -sheet left-handed  -helix Ramachandran plot

Hydrophobic core Hydrophobic side chains go into the core of the molecule – but the main chain is highly polar The polar groups (C=O and NH) are neutralized through formation of H-bonds

Secondary structure  -helix C=O (n) … HN (n+4)  -sheet (anti-parallel)

… and all the rest 3 10 helices (C=O (n) … HN (n+3) ),  -helices (C=O (n) … HN (n+5) )  -turns and loops (in old textbooks sometimes referred to as random coil)

The  -helix has a dipole moment + - C N

Two types of  -sheet: anti-parallel parallel

Tertiary structure (domains, modules) Rhamnogalacturonan lyase (1nkg) Rhamnogalacturonan acetylesterase (1k7c)

Quaternary structure B.caldolyticus UPRTase (1i5e) B.subtilis PRPP synthase (1dkr)

A. aculeatus RG acetylesterase Protein structure and water

Classification schemes SCOP –Manual classification (A. Murzin) CATH –Semi manual classification (C. Orengo) FSSP –Automatic classification (L. Holm)

Levels in SCOP Class# Folds# Superfamilies # Families All alpha proteins All beta proteins Alpha and beta proteins (a/b) Alpha and beta proteins (a+b) Multi-domain proteins Membrane and cell surface proteins Small proteins Total

Major classes in SCOP Classes –All alpha proteins –Alpha and beta proteins (a/b) –Alpha and beta proteins (a+b) –Multi-domain proteins –Membrane and cell surface proteins –Small proteins

All  : Hemoglobin (1bab)

All  : Immunoglobulin (8fab)

 Triose phosphate isomerase (1hti)

a+b: Lysozyme (1jsf)

Folds * Proteins which have >~50% of their secondary structure elements arranged the in the same order in the protein chain and in three dimensions are classified as having the same fold No evolutionary relation between proteins *confusingly also called fold classes

Superfamilies Proteins which are (remote) evolutionarily related –Sequence similarity low –Share function –Share special structural features Relationships between members of a superfamily may not be readily recognizable from the sequence alone

Families Proteins whose evolutionarily relationship is readily recognizable from the sequence (>~25% sequence identity) Families are further subdivided into Proteins Proteins are divided into Species –The same protein may be found in several species

Links PDB (protein structure database) – SCOP (protein classification database) –scop.berkeley.eduscop.berkeley.edu CATH (protein classification database) – FSSP (protein classification database) –

Why are protein structures so interesting? They provide a detailed picture of interesting biological features, such as active site, substrate specificity, allosteric regulation etc. They aid in rational drug design and protein engineering They can elucidate evolutionary relationships undetectable by sequence comparisons

COOH NH 2 Asp His Ser Topological switchpoint Inferring biological features from the structure 1deo

Inferring biological features from the structure Active site Triose phosephate isomerase (1ag1) (Verlinde et al. (1991) Eur.J.Biochem. 198, 53)

Engineering thermostability in serpins Overpacking Buried polar groups Cavities Im, Ryu & Yu (2004) Engineering thermostability in serine protease inhibitors PEDS, 17,

Evolution... Structure is conserved longer than both sequence and function

Rhamnogalacturonan acetylesterase (A. aculeatus) (1k7c) Platelet activating factor acetylhydrolase (Bos Taurus) (1wab) Serine esterase (S. scabies) (1esc)

Platelet activating factor acetylhydrolase Serine esterase Rhamnogalacturonan acetylesterase Mølgaard, Kauppinen & Larsen (2000) Structure, 8,

"We wish to suggest a structure for the salt of deoxyribose nucleic acid (D.N.A.). This structure has novel features which are of considerable biological interest…. …It has not escaped our notice that the specific pairing we have postulated immediately suggests a possible copying mechanism for the genetic material." J.D. Watson & F.H.C. Crick (1953) Nature, 171, 737.