Transmembrane proteins in the Protein Data Bank: identification and classification Gabor, E. Tusnady, Zsuzanna Dosztanyi and Istvan Simon Bioinformatics,

Slides:



Advertisements
Similar presentations
Transmembrane Protein Topology Prediction Using Support Vector Machines Tim Nugent and David Jones Bioinformatics Group, Department of Computer Science,
Advertisements

Assignment of PROSITE motifs to topological regions: Application to a novel database of well characterised transmembrane proteins Tim Nugent.
Structural Classification and Prediction of Reentrant Regions in Alpha-Helical Transmembrane Proteins: Application to Complete Genomes Håkan Viklunda,
Introduction to protein x-ray crystallography. Electromagnetic waves E- electromagnetic field strength A- amplitude  - angular velocity - frequency.
The amino acids in their natural habitat. Topics: Hydrogen bonds Secondary Structure Alpha helix Beta strands & beta sheets Turns Loop Tertiary & Quarternary.
7.5 Proteins. The 20 different amino acids 7.5.1: Explain the four levels of protein structure, indicating the significance of each level.
Chapter 5: Homeostasis and Cell Transport.  The steady – state physiological condition of the body of a cell.  Cell membranes help organisms maintain.
1 Protein Structure, Structure Classification and Prediction Bioinformatics X3 January 2005 P. Johansson, D. Madsen Dept.of Cell & Molecular Biology, Uppsala.
Structure Prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray crystallography, NMR) [2]
Incorporating additional types of information in structure calculation: recent advances chemical shift potentials residual dipolar couplings.
Structure Prediction. Tertiary protein structure: protein folding Three main approaches: [1] experimental determination (X-ray crystallography, NMR) [2]
Tertiary protein structure viewing and prediction July 5, 2006 Learning objectives- Learn how to manipulate protein structures with Deep View software.
Computational Biology, Part 10 Protein Structure Prediction and Display Robert F. Murphy Copyright  1996, 1999, All rights reserved.
Structures and Structure Descriptions Chapter 8 Protein Bioinformatics.
Protein structure prediction May 30, 2002 Quiz#4 on June 4 Learning objectives-Understand difference between primary secondary and tertiary structure.
High Throughput Processing of the Structural Information of the Protein Data Bank Zoltán Szabadka, Vince Grolmusz Department of Computer Science Eötvös.
Protein Structure Prediction Samantha Chui Oct. 26, 2004.
© Wiley Publishing All Rights Reserved. Biological Sequences.
Bioinformatics for biomedicine Protein domains and 3D structure Lecture 4, Per Kraulis
Protein Structure Prediction Dr. G.P.S. Raghava Protein Sequence + Structure.
Chapter 12 Protein Structure Basics. 20 naturally occurring amino acids Free amino group (-NH2) Free carboxyl group (-COOH) Both groups linked to a central.
Proteins: Secondary Structure Alpha Helix
Proteins. Proteins? What is its How does it How is its How does it How is it Where is it What are its.
Intelligent Systems for Bioinformatics Michael J. Watts
Levels of Protein Structure
Chapter 3 Protein Structure and Function. Key Concepts Most cell functions depend on proteins. Amino acids are the building blocks of proteins. Amino.
 Four levels of protein structure  Linear  Sub-Structure  3D Structure  Complex Structure.
EBI is an Outstation of the European Molecular Biology Laboratory. A web service for the analysis of macromolecular interactions and complexes PDBe Protein.
Question 1: Name, PDB codes Eric Martz (no research lab) Macromolecular visualization I chose*: 3onz HUMAN TETRAMERIC HEMOGLOBIN:
Couple of lab things 1.In calculating rate (  mol/min/mg) from absorbance/time slope, take the absolute value (rates always (usually?) positive) 2.For.
EBI is an Outstation of the European Molecular Biology Laboratory. Annotation Procedures for Structural Data Deposited in the PDBe at EBI.
Function first: a powerful approach to post-genomic drug discovery Stephen F. Betz, Susan M. Baxter and Jacquelyn S. Fetrow GeneFormatics Presented by.
EBI is an Outstation of the European Molecular Biology Laboratory. A web service for the analysis of macromolecular interactions and complexes PDBe Protein.
Module 3 Protein Structure Database/Structure Analysis Learning objectives Understand how information is stored in PDB Learn how to read a PDB flat file.
New Strategies for Protein Folding Joseph F. Danzer, Derek A. Debe, Matt J. Carlson, William A. Goddard III Materials and Process Simulation Center California.
Protein Modeling Protein Structure Prediction. 3D Protein Structure ALA CαCα LEU CαCαCαCαCαCαCαCα PRO VALVAL ARG …… ??? backbone sidechain.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Question 1: Name, PDB codes Eric Martz Keiichi Namba Macromolecular visualization I chose: 3onz HUMAN TETRAMERIC HEMOGLOBIN: PROXIMAL.
X-ray crystallography – an overview (based on Bernie Brown’s talk, Dept. of Chemistry, WFU) Protein is crystallized (sometimes low-gravity atmosphere is.
Haixu Tang School of Informatics
Chemistry XXI Unit 3 How do we predict properties? M1. Analyzing Molecular Structure Predicting properties based on molecular structure. M4. Exploring.
Protein Structure and Bioinformatics. Chapter 2 What is protein structure? What are proteins made of? What forces determines protein structure? What is.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Protein backbone Biochemical view:
Transmembrane receptor (Handout) prof. aza. Transmembrane receptors are integral membrane proteins, which reside and operate typically within a cell's.
Marlou Snelleman 2012 Protein structure. Overview Sequence to structure Hydrogen bonds Helices Sheets Turns Hydrophobicity Helices Sheets Structure and.
Lecture 53: X-ray crystallography. Electrons deflect x-rays We try to recreate electron density from the x-ray diffraction pattern Each point in space.
PDBe Protein Interfaces, Surfaces and Assemblies
Question 1: Name, PDB codes
The heroic times of crystallography
Types of membrane proteins
Protein Structure and Function
PROTEINS AHL Topic 7.5 IB Biology Miss Werba.
Figure 1. The number of unique PDB, UniProt and Pfam accessions represented in the MemProtMD database over time. A selection of landmark structures are.
Protein Structure Prediction
Sudha Chakrapani, Luis G. Cuello, D. Marien Cortes, Eduardo Perozo 
Sebastian Meyer, Raimund Dutzler  Structure 
Marrying structure and genomics
Microtubule Structure at 8 Å Resolution
Microtubule Structure at 8 Å Resolution
How Does a Voltage Sensor Interact with a Lipid Bilayer
Richard C. Page, Sanguk Kim, Timothy A. Cross  Structure 
Richard C. Page, Sanguk Kim, Timothy A. Cross  Structure 
Computational Modeling Reveals that Signaling Lipids Modulate the Orientation of K- Ras4A at the Membrane Reflecting Protein Topology  Zhen-Lu Li, Matthias.
Volume 92, Issue 5, Pages (March 2007)
Volume 111, Issue 1, Pages (July 2016)
Molecular Dynamics Simulations of the Bacterial Outer Membrane Protein FhuA: A Comparative Study of the Ferrichrome-Free and Bound States  José D. Faraldo-Gómez,
Molecular Dynamics Simulations of the Bacterial Outer Membrane Protein FhuA: A Comparative Study of the Ferrichrome-Free and Bound States  José D. Faraldo-Gómez,
Robust Driving Forces for Transmembrane Helix Packing
Mechanism of Anionic Conduction across ClC
Presentation transcript:

Transmembrane proteins in the Protein Data Bank: identification and classification Gabor, E. Tusnady, Zsuzanna Dosztanyi and Istvan Simon Bioinformatics, vol 20, issue 17, pages

PDB contains only a few hundred membrane proteins. Annotation of the entries is poor – location of the lipid bilayer is not indicated in the PDB as the proteins are crystallized without the natural lipid bilayer Currently no method to detect the membrane plane from atomic coordinates

What they’ve done 1 – locating the most likely position of the lipid bilayer 3 – developed a database of all known transmembrane proteins and fragments - (automatically updated) TMDET 2 – a geometrical approach to distinguish between transmembrane and globular proteins using only structural information PDB_TM

Approximately 300 membrane protein structure files in PDB out of > 20,000 Transmembrane proteins are larger than globular proteins and thus harder to determine their structure using NMR Crystallography of membrane proteins is still a challenge Deposited structures have no information on the protein in its native conditions, immersed in a lipid bilayer

Problems in distinguishing between globular and transmembrane proteins: Surface of globular proteins is not always entirely hydrophilic Membrane embedded parts of transmembrane proteins may have polar and charged residues playing a role in ion transport or enzymatic activities Low resolution of many transmembrane proteins -> distorted secondary structure and hydrogen bond network

Method TMDET: search for the most probable position of the membrane planes relative to given atomic coordinates. (written in C, typical run takes a few seconds on a Pentium 4 processor) - measures the fitness of membrane localization with an objective function - proteins are classified based on the best value of the objective function (Q-value) hydrophobicity structure

Algorithm omit viral/pilus proteins and nucleotides Construct biological molecule Calculate water accesible area Calculate objective function hydrophobicity structure factor Q-value

Construction of the BIOMOLECULE crystallization artifacts removed analyzing internal symmetry, may help find membrane axis - If Q-value using rotational axis exceeds preset threshold, rotational axis considered as membrane normal. Otherwise non-identical chains are tested for most likely position of membrane.

Calculating the water-accessible surface area – Lee and Richards, 1971 Only atoms which potentially interact with lipid bilayer considered (for algorithm efficiency) Protein cut into 1Å wide slices along defined axis (axis defined in previous step) Test points are placed around each slice of atoms Atoms lying closest to any test point are defined to be possibly membrane exposed

Objective function Hydrophobicity + structure factor Fitness of a given membrane position to the protein Return value: Q-value (average of the products of the hydrophobic factor and structure factor for each slice) Hydrophobic factor Relative hydrophobic membrane exposed surface area Protein is cut into 1Å wide slices along the normal vector * For each slice: - the membrane exposed ‘water accessible surface area’ of hydrophilic and hydrophobic residues are summed up separately

Structure factor A product of three factors: 1- straightness factor – if the projection of c-alpha atoms of residue i, i-3, i+3 onto the normal vector of the membrane, are in a monotone decreasing or increasing order 2- turn factor – one minus the frequency of the i-th residue being in the centre of a turn (the c-alpha projection of residues i, i-3, and i+3 onto the normal vector is not in a monotone decreasing or increasing order). 3- end-chain factor – one minus the relative frequency of chain end residues in a given slice

Searching for the best membrane plane Simplest case: - internal symmetry analysis leads a rotational axis - if a 15Å slice exceeds a predefined threshold, it is accepted as the normal vector for the membrane Other cases: - exhaustive search carried out – unit vectors are used to sample possible membrane normals. Best vector (highest Q-value) is found by calculating the Q-value of 15Å slices along each unit vector.

Distribution of Q-values for transmembrane (solid line) and globular (dashed line) proteins

Results Classification of proteins was based on the Q-value i.e. if the best Q-value was below a lower selection limit, it was classified as a globular protein. Tested on 489 globular and 254 transmembrane proteins Q-values clearly separated, only 1.8% overlap (9 proteins) TMDET accuracy 98.7% Entire PDB was eventually scanned with TMDET proteins, 324 classified as membrane proteins

Protein Data Bank of Transmembrane Proteins (PDB_TM) Classification of membrane proteins + localization of membrane planes + localization of transmembrane segments in the sequence. Searchable and public for academic users ( TM proteins grouped into structural families (10 Beta- barrel and 29 alpha helical families) Helps the structural analysis of transmembrane proteins and the validation of transmembrane topology prediction algorithms Automatically updated