On the nature of cavities on protein surfaces: Application to the Identification of drug-binding sites Murad Nayal, Barry Honig Columbia University, NY.

Slides:



Advertisements
Similar presentations
Proteins: Structure reflects function….. Fig. 5-UN1 Amino group Carboxyl group carbon.
Advertisements

The Chemical Nature of Enzyme Catalysis
A Ala Alanine Alanine is a small, hydrophobic
Review of Basic Principles of Chemistry, Amino Acids and Proteins Brian Kuhlman: The material presented here is available on the.
BY: SHERENE MINHAS. Agr Glu Thr Ile Glu Ser Leu Ser Ser Ser Glu Glu Ser Ile Pro Glu Tyr Lys Gln Lys Val Glu Lys Val Lys His Glu Asp Gln Gln Gln Gly Thr.
Ligand Binding Site Prediction for HIV-1 Protease using Shape Comparison Techniques Manasi Jahagirdar 1, Vivek K Jalahalli 2, Sunil Kumar 1, A. Srinivas.
Lactate dehydrogenase + 38 ATP + 2 ATP. How does lactate dehydrogenase perform its catalytic function ?
Review: Amino Acid Side Chains Aliphatic- Ala, Val, Leu, Ile, Gly Polar- Ser, Thr, Cys, Met, [Tyr, Trp] Acidic (and conjugate amide)- Asp, Asn, Glu, Gln.
Metabolic fuels and Dietary components Lecture - 2 By Dr. Abdulrahman Al-Ajlan.
5’ C 3’ OH (free) 1’ C 5’ PO4 (free) DNA is a linear polymer of nucleotide subunits joined together by phosphodiester bonds - covalent bonds between.
An overview of amino acid structure Topic 2. Biomacromolecule A naturally occurring substance of large molecular weight e.g. Protein, DNA, lipids etc.
Protein-a chemical view A chain of amino acids folded in 3D Picture from on-line biology bookon-line biology book Peptide Protein backbone N / C terminal.
1 Levels of Protein Structure Primary to Quaternary Structure.
Amino Acids and Proteins 1.What is an amino acid / protein 2.Where are they found 3.Properties of the amino acids 4.How are proteins synthesized 1.Transcription.
Lectures on Computational Biology HC Lee Computational Biology Lab Center for Complex Systems & Biophysics National Central University EFSS II National.
Sequence analysis June 18, 2008 Learning objectives-Understand the concept of sliding window programs. Understand difference between identity, similarity.
Ensemble Results of PIM1 PIM PIM Ensemble Results of GSK3 GSK GSK GSK
©CMBI 2008 Aligning Sequences The most powerful weapon in the bioinformaticist’s armory is sequence alignment. Why? Lets’ think about an alignment. It.
It & Health 2009 Summary Thomas Nordahl Petersen.
©CMBI 2005 Why align sequences? Lots of sequences with unknown structure and function. A few sequences with known structure and function If they align,
©CMBI 2001 A Ala Alanine Alanine is a small, hydrophobic residue. Its side chain, R, is just a methyl group. Alanine likes to sit in an alpha helix,it.
Protein Structure FDSC400. Protein Functions Biological?Food?
Marlou Snelleman 2012 Proteins and amino acids. Overview Proteins Primary structure Secondary structure Tertiary structure Quaternary structure Amino.
Chapter 27 Amino Acids, Peptides, and Proteins. Nucleic Acids.
Chapter Three Amino Acids and Peptides
Protein Synthesis. DNA RNA Proteins (Transcription) (Translation) DNA (genetic information stored in genes) RNA (working copies of genes) Proteins (functional.
©CMBI 2006 Amino Acids “ When you understand the amino acids, you understand everything ”
BIOCHEMISTRY REVIEW Overview of Biomolecules Chapter 4 Protein Sequence.
TRNA Activation (charging) by aminoacyl tRNA synthetases Aminoacyl tRNA synthetase Two important functions: 1.Implement genetic code 2.Activate amino acids.
Mount Mary College Students: Jessica Benson, Amy Ramirez, Nerissa Seward Faculty Advisor: Dr. Colleen Conway Medical College of Wisconsin Research Mentor:
1.Overall amino acid structure 2.Amino acid stereochemistry 3.Amino acid sidechain structure & classification 4.‘Non-standard’ amino acids 5.Amino acid.
AMINO ACIDS.
Amino Acids & Side Groups Polar Charged ◦ ACIDIC negatively charged amino acids  ASP & GLU R group with a 2nd COOH that ionizes* above pH 7.02nd COOH.
Secondary structure prediction
Learning Targets “I Can...” -State how many nucleotides make up a codon. -Use a codon chart to find the corresponding amino acid.
Fig Second mRNA base First mRNA base (5 end of codon) Third mRNA base (3 end of codon)
In silico discovery of inhibitors using structure-based approaches Jasmita Gill Structural and Computational Biology Group, ICGEB, New Delhi Nov 2005.
Amino acids Met dank aan Dr. Detke.
intro-VIRUSES Virus NamePDB ID HUMAN PAPILLOMAVIRUS 161DZL BACTERIOPHAGE GA1GAV L-A virus1M1C SATELLITE PANICUM MOSAIC VIRUS1STM SATELLITE TOBACCO NECROSIS2BUK.
Amino acids structure. Configuration of Amino Acids.
Outline What is an amino acid / protein
Protein Secondary Structure Prediction G P S Raghava.
A program of ITEST (Information Technology Experiences for Students and Teachers) funded by the National Science Foundation Background Session #3 DNA &
1 Protein synthesis How a nucleotide sequence is translated into amino acids.
Amino Acids ©CMBI 2001 “ When you understand the amino acids, you understand everything ”
Marlou Snelleman 2011 Proteins and amino acids. Overview Proteins Primary structure Secondary structure Tertiary structure Quaternary structure Amino.
Proteins.
Proteins Structure of proteins Proteins are made of C, H, O and nitrogen and may have sulfur. The monomers of proteins are amino acids An amino acid.
©2001 Timothy G. Standish Hebrews 12:28 28Wherefore we receiving a kingdom which cannot be moved, let us have grace, whereby we may serve God acceptably.
X-ray detection xray/facilities.html.
Houssam Nassif, Hassan Al-Ali, Sawsan Khuri, Walid Keirouz, and David Page An ILP Approach to Model and Classify Hexose Binding Sites.
Amino Acids  Amino Acids are the building units of proteins. Proteins are polymers of amino acids linked together by what is called “ Peptide bond” (see.
Supplementary Fig. 1 Relative concentrations of amino acids after transamination reaction catalyzed by PpACL1, α- ketoglutarate as the amino acceptor.
Stephen Taylor i-Biology.net Photo credit: Firefly with glow, by Terry Priest on Flickr (Creative Commons)
Fibrous Proteins Examples 1. a-keratins 2. Silk Fibroin 3. Collagen
Genomics Lecture 3 By Ms. Shumaila Azam. Proteins Proteins: large molecules composed of one or more chains of amino acids, polypeptides. Proteins are.
Arginine, who are you? Why so important?. Release 2015_01 of 07-Jan-15 of UniProtKB/Swiss-Prot contains sequence entries, comprising
Cathode (attracts (+) amino acids)
Figure 3.14A–D Protein structure (layer 1)
Haixu Tang School of Inforamtics
Volume 11, Issue 10, Pages (October 2004)
Amino Acids Amine group -NH2 Carboxylic group -COOH
Cytochrome.
The 20 amino acids.
Ching-Ling Teng, Robert G. Bryant  Biophysical Journal 
Volume 25, Issue 5, Pages e3 (May 2017)
The 20 amino acids.
Chapter 18 Naturally Occurring Nitrogen-Containing Compounds
Volume 110, Issue 9, Pages (May 2016)
Fig. 3 Organization of the active site of DHHC20.
Presentation transcript:

On the nature of cavities on protein surfaces: Application to the Identification of drug-binding sites Murad Nayal, Barry Honig Columbia University, NY Proteins: Structure, Function and Bioinformatics, Accepted 15 Nov. 05 Ankur Dhanik

Abstract Identification of drug-binding sites useful for virtual screening and drug design. Small ligands are known to bind proteins at surface cavities. Two tasks: identification of cavities and prediction of their drugabbilities (whether the cavity is suitable for drug binding). The method presented in this paper encoded in program called SCREEN (Surface Cavity REcognition and EvaluatioN).

Abstract SCREEN works by first constructing two molecular surfaces using GRASP: a conventional molecular surface (MS) using a 1.4 A radius and a second low resolution envelope using a large probe sphere, which serves as ‘sea-level’. Depth of each vertex of MS is computed and compared with threshold. For each surface cavity, 408 attributes are computed (physiochemical, structural, and geometric). Random Forests based classifier is used. Training data set is derived from a collection of 100 nonredundant protein ligand complexes.

Results SCREEN predicts drug binding cavities with a balanced error rate of 7.2% and coverage of 88.9%, while a CASTp ( a popular protein cavity detection program) based druggability predictor (using cavity size criteria alone) predicts with a balanced error rate of 15.7% and coverage of 71.7%. SCREEN predicts drug-binding cavities missed by cavity size criteria (three examples). Out 18 attributes out of 408 used, were found to be significant predictors of drug binding cavities. It follows from the above that drug binding cavities are large, deep, have an intricate curvature profile, are rigid, and have a relatively small number of prolines, as well as amino acids with small but negative octanol-to-water transfer free energies (Asn, Gln, Glu).

Results Protein-tyrosine phosphatase 1B, PTP1B (PDB code: 1l8g). The largest surface cavity (colored green: area, 184 Å 2 ; volume, 400 Å 3 ; residues Gln78, Arg79, Ser80, and Pro210) is about 20 Å from the ligand-binding site. The drug binds at the second largest cavity, colored red, as predicted (area, 170 Å 2 ; volume, 259 Å 3 ; residues Gln262, Ala217, Ile219, Val49, and Asp181).

Results Human carbonic anhydrase II (CA II). The largest cavity (area, 281 Å 2 ; volume, 679 Å 3 ; residues Phe213, Tyr7, Gly8, Asp243, and Lys170), shown in green, is rather shallow and is predicted not to bind a drug. Instead, the second largest cavity (area, 194 Å 2 ; volume, 281 Å 3 ; residues Leu198, Thr200, His94, Val121, and His64) is the one predicted correctly to bind the drug.

Results Human factor Xa complexed with inhibitor RPR (PDB code: 1ezq). Four cavities ranked 1, 2, 3, and 9, shown here in red, were predicted to be potential drug-binding cavities. The ligand actually binds at two cavities, 3 (the S1 pocket: area, 274 Å2; volume, 384 Å3; residues Gln192, Trp215, Ser195, Cys191, Gly216, and Asp) and 9 (the S4 pocket: area, 69 Å2; volume, 155 Å3; residues Trp215, Phe174, Thr98, Tyr99, and Ile175).

Results Surface cavity propertyCategory Drug-binding cavities Non drug- binding cavities Cavity rankSize1.89 ± ± 5.4 Number of residuesSize22.8 ± ± 5.4 Number of atomsSize85.0 ± ± 21.2 Smallest moment of inertiaSize/shape1.7 × 10 4 ± 2.5 × × 10 3 ± 8.3 × 10 3 Depth standard deviationSize/shape2.3 ± 1.1 (Å 3 )0.75 ± 0.45 Maximum depthSize/shape10.5 ± 4.0 (Å)4.75 ± 1.67 Average depthSize/shape5.3 ± 1.9 (Å)3.2 ± 0.7 Normalized smallest moment of inertia Shape17.0 ± ± 5.3 Proportion of cavity at depth between [6.5, 6.75) Shape0.02 ± ± Largest moment of inertiaSize/shape1.6 × 10 4 ± 8.4 × × 10 3 ± 1.6 × 10 4 Average side-chains residual entropy Rigidity-0.41 ± 0.18 (kcal) ± 0.25 Average curvatureShape-49.0 ± ± 13.1 Maximum curvednessShape6.4 ± ± 4.9 Maximum mean curvatureShape5.3 ± ± 4.2 Curvedness < 0.5Shape0.35 ± ± 0.08 Proportion of prolineAmino acid composition ± ± 0.09 Proportion of cavity with logP between [-1, 0) Hydrophobicity0.09 ± ± 0.16 Side-chain residual entropy standard deviation Rigidity0.43 ± 0.18 (kcal) 0.55 ± 0.17

Comments The prediction of drug-binding cavities was done without considering the nature of the drug. Physicochemical cavity properties were not found useful. Perhaps they can play an important role when surface cavities that recognize a particular ligand are characterized. Energy-based approach offers a promising alternative to geometry-based methods