Gidalevitz T, Biswas C, Ding H, Schneidman D,

Slides:



Advertisements
Similar presentations
Taking Geometry to its Edge: Fast Rigid (and Hinge-Bent) Docking Algorithms. Haim Wolfson 1, Dina Duhovny 1, Yuval Inbar 1, Vladimir Polak 1, Ruth Nussinov.
Advertisements

Ligand Binding Site Prediction for HIV-1 Protease using Shape Comparison Techniques Manasi Jahagirdar 1, Vivek K Jalahalli 2, Sunil Kumar 1, A. Srinivas.
3D Molecular Structures C371 Fall Morgan Algorithm (Leach & Gillet, p. 8)
A 3-D reference frame can be uniquely defined by the ordered vertices of a non- degenerate triangle p1p1 p2p2 p3p3.
Protein Threading Zhanggroup Overview Background protein structure protein folding and designability Protein threading Current limitations.
Iterative Relaxation of Constraints (IRC) Can’t solve originalCan solve relaxed PRMs sample randomly but… start goal C-obst difficult to sample points.
1 Chapter 8Proteomics 暨南大學資訊工程學系 黃光璿 2004/06/07 2 proteome  the sum total of an organism’s proteins genome  the sum total of an organism’s genetic.
1 PharmID: A New Algorithm for Pharmacophore Identification Stan Young Jun Feng and Ashish Sanil NISSMPDM 3 June 2005.
Improved prediction of protein-protein binding sites using a support vector machine ( James Bradford, et al (2004)) Tapan Patel CISC841 Trypsin (and inhibitor.
Two Examples of Docking Algorithms With thanks to Maria Teresa Gil Lucientes.
Docking Algorithm Scheme Part 1: Molecular shape representation Part 2: Matching of critical features Part 3: Filtering and scoring of candidate transformations.
Protein Docking and Interactions Modeling CS 374 Maria Teresa Gil Lucientes November 4, 2004.
Alignment of Flexible Molecular Structures. Motivation Proteins are flexible. One would like to align proteins modulo the flexibility. Hinge and shear.
Agenda A brief introduction The MASS algorithm The pairwise case Extension to the multiple case Experimental results.
Seminar in BioInformatics A Method for Biomolecular Structural Recognition and Docking Allowing Conformational Flexibility (1998) Bilha Sandak, Ruth Nussinov.
Docking of Protein Molecules
FLEX* - REVIEW.
SiteEngine: Functional Sites Structural Search Engine
Summary Protein design seeks to find amino acid sequences which stably fold into specific 3-D structures. Modeling the inherent flexibility of the protein.
An Integrated Approach to Protein-Protein Docking
BL5203: Molecular Recognition & Interaction Lecture 5: Drug Design Methods Ligand-Protein Docking (Part I) Prof. Chen Yu Zong Tel:
QSD – Quadratic Shape Descriptors Surface Matching and Molecular Docking Using Quadratic Shape Descriptors Goldman BB, Wipke WT. Quadratic Shape Descriptors.
1 Alignment of Flexible Protein Structures Based on: FlexProt: Alignment of Flexible Protein Structures Without a Pre-definition of Hinge Regions / M.
Introduction to molecular networks Sushmita Roy BMI/CS 576 Nov 6 th, 2014.
CAPRI Critical Assessment of Prediction of Interactions.
Unbound Docking of Rigid Molecules. Problem Definition Given two molecules find their correct association: +=
Protein Structure Prediction Samantha Chui Oct. 26, 2004.
Protein-protein and Protein- ligand Docking The geometric filtering.
Model Database. Scene Recognition Lamdan, Schwartz, Wolfson, “Geometric Hashing”,1988.
Identifying similar surface patches on proteins using a spin-image surface representation M. E. Bock Purdue University, USA G. M. Cortelazzo, C. Ferrari,
LSM2104/CZ2251 Essential Bioinformatics and Biocomputing Essential Bioinformatics and Biocomputing Protein Structure and Visualization (3) Chen Yu Zong.
Protein Tertiary Structure Prediction
Molecular Docking S. Shahriar Arab. Overview of the lecture Introduction to molecular docking: Definition Types Some techniques Programs “Protein-Protein.
ClusPro: an automated docking and discrimination method for the prediction of protein complexes Stephen R. Comeau, David W.Gatchell, Sandor Vajda, and.
A genetic algorithm for structure based de-novo design Scott C.-H. Pegg, Jose J. Haresco & Irwin D. Kuntz February 21, 2006.
y-sa/2.0/. Integrating the Data Prof:Rui Alves Dept Ciencies Mediques Basiques, 1st.
CZ5225 Methods in Computational Biology Lecture 4-5: Protein Structure and Structural Modeling Prof. Chen Yu Zong Tel:
Flexible Multi-scale Fitting of Atomic Structures into Low- resolution Electron Density Maps with Elastic Network Normal Mode Analysis Tama, Miyashita,
Ozgur Ozturk, Ahmet Sacan, Hakan Ferhatosmanoglu, Yusu Wang The Ohio State University LFM-Pro: a tool for mining family-specific sites in protein structure.
BL5203: Molecular Recognition & Interaction Lecture 5: Drug Design Methods Ligand-Protein Docking (part II) Prof. Chen Yu Zong Tel:
CZ5226: Advanced Bioinformatics Lecture 8: Molecular Modeling Method Prof. Chen Yu Zong Tel:
SimBioSys Inc.© Slide #1 Enrichment and cross-validation studies of the eHiTS high throughput screening software package.
Altman et al. JACS 2008, Presented By Swati Jain.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Rational Drug Design : HIV Integrase. A process for drug design which bases the design of the drug upon the structure of its protein target. 1.Structural.
Virtual Screening C371 Fall INTRODUCTION Virtual screening – Computational or in silico analog of biological screening –Score, rank, and/or filter.
Hierarchical Database Screenings for HIV-1 Reverse Transcriptase Using a Pharmacophore Model, Rigid Docking, Solvation Docking, and MM-PB/SA Junmei Wang,
AN APPROACH TO SEMI FLEXIBLE DOCKING: A case study of the enzymatic reaction catalysed by terpenoid cyclases DIMACS, 13 June 2005
Discrimination of near-native structures by clustering docked conformations and the selection of the optimal radius D. Kozakov 1, K. H. Clodfelter 2, C.
LSM3241: Bioinformatics and Biocomputing Lecture 6: Fundamentals of Molecular Modeling Prof. Chen Yu Zong Tel:
Introduction to biological molecular networks
R L R L L L R R L L R R L L water DOCKING SIMULATIONS.
Coarse and Reliable Geometric Alignment for Protein Docking Yusu Wang Stanford University Joint Work with P. K. Agarwal, P. Brown, H. Edelsbrunner, J.
FlexWeb Nassim Sohaee. FlexWeb 2 Proteins The ability of proteins to change their conformation is important to their function as biological machines.
Surflex: Fully Automatic Flexible Molecular Docking Using a Molecular Similarity-Based Search Engine Ajay N. Jain UCSF Cancer Research Institute and Comprehensive.
How NMR is Used for the Study of Biomacromolecules Analytical biochemistry Comparative analysis Interactions between biomolecules Structure determination.
Protein-Protein Interactions. A Protein may interact with: –Other proteins –Nucleic Acids –Small molecules Protein Interactions.
A new protein-protein docking scoring function based on interface residue properties Reporter: Yu Lun Kuo (D )
Rong Chen Boston University
APPLICATIONS OF BIOINFORMATICS IN DRUG DISCOVERY
Molecular Docking Profacgen. The interactions between proteins and other molecules play important roles in various biological processes, including gene.
CZ3253: Computer Aided Drug design Lecture 9: Drug Design Methods III Ligand-Protein Docking (part II) Prof. Chen Yu Zong Tel:
Rational Drug Design : HIV Integrase
Ligand Docking to MHC Class I Molecules
An Integrated Approach to Protein-Protein Docking
AnchorDock: Blind and Flexible Anchor-Driven Peptide Docking
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Presentation transcript:

Computer Science meets Biology: Guiding “in vitro” experiments with “in silico” predictions. Gidalevitz T, Biswas C, Ding H, Schneidman D, Wolfson HJ, Stevens F, Radford S, Argon Y.

Agenda Introduction to the docking problem The PatchDock algorithm Biological problem Real experimental results

What is Docking? Given two molecules find their correct association: Complex T Receptor Ligand = We can describe the problem as assembling of a jigsaw puzzle. Given two molecules, referred to as the receptor and the ligand, we attempt to predict the shape of their complex as it appears in nature. From the mathematical view point we are looking for a transformation (rotation and translation) in 3D space that aligns best the surfaces of the molecules without causing collisions. +

Problem Importance Computer aided drug design – a new drug should fit the active site of a specific receptor. Understanding of biochemical pathways - many reactions in the cell occur through interactions between the molecules. Despite the advances in the Structural Genomics initiative, there are no efficient techniques for crystallizing large complexes and finding their structure. Docking is widely used in drug design. It is also important in the understanding of biochemical pathways within the cell. With the growing number of single structures due to the SG initiative, the docking task becomes of significant practical value. Yet, the cost of determining the structures of protein complexes is still huge.

Bound Docking In the bound docking we are given a complex of 2 molecules. After artificial separation the goal is to reconstruct the native complex. No conformational changes are involved. Used as a first test of the validity of the algorithm. There are two main classes of docking problem: test docking, called bound and real docking, called unbound. … Docking Algorithm

Unbound Docking In the unbound docking we are given 2 molecules in their native conformation. The goal is to find the correct association. Problems: conformational changes (side-chain and backbone movements), experimental errors in the structures. Unbound docking is a “real life” problem. We are given two molecules that were separately crystallized. We look for their conformation as it appears in the natural complex. This is especially difficult due to many conformational changes such as side-chain and backbone flexibility. + = ?

Docking Algorithms Brute force enumeration of the transformation space: FFT – Katchalski-Katzir et al. (1992) (Walls & Sternberg, Vakser, Gabb et al., Camacho et al., Chen & Weng) Soft Docking – Jiang & Kim (1991), Palma et al., Randomized algorithms: GA, Monte-Carlo - Jones et al., Gardiner et al. Local shape feature matching: Dock - Kuntz (1982) ‘knobs’ and ‘holes’ – Connolly (1986) Geometric Hashing - Norel et al., Fischer et al. (1994) Flexible docking - Sandak et al. FlexX: hydrogen H-bonding – Rarey et al. Docking algorithms can be classified into two broad categories by the way they explore the transformation space. Brute force algorithms perform an entire 6-dimensional search of all possible rotations and translations. While local shape feature matching algorithms align local shape features, reducing the complexity of the conformational search.

PatchDock … http://bioinfo3d.cs.tau.ac.il/PatchDock is an efficient method for unbound docking of rigid molecules. The molecular shape is used explicitly avoiding the exhaustive search of the 6D transformation space. The algorithm focuses on local surface patches divided into three shape types: concave, convex and flat. The geometric surface complementarity scoring is extremely fast and accurate. It employs advanced data structures for molecular representation: Distance Transform Grid and Multi-resolution Surface. http://bioinfo3d.cs.tau.ac.il/PatchDock Duhovny, D., Nussinov, N Wolfson, H.J. Lecture Notes in Computer Science 2452, pp. 185-200, Springer Verlag, 2002

Surface Representation PatchDock Method PDB files Surface Representation Patch Detection Matching Patches Scoring & Filtering Candidate complexes

Surface Representation Dense MS surface (Connolly) Sparse surface (Shuo Lin et al.) The most common surface representation is the Connolly surface. The probe ball is rolled over the molecule, creating 3 types of points: caps (yellow points – belong to one atom), belts (red points – lie between two atoms), pits (green points – belong to the patches where the probe touches the 3 atoms). This representation is very dense and can be reduced to local minima or maxima of the point patches. This is sparse surface representation by Lin.

Patch Detection Connolly surface representation Sparse surface [2]: local minima and maxima of Connolly surface. The surface topology graph is obtained by connecting neighboring points. Shape representation by patches. PatchDock applies a segmentation algorithm to divide the surface into shape- based patches. PatchDock focuses on sparse surface features, preserving the quality of shape representation. The sparse features reduce the complexity of the matching step.

Matching Patches Receptor patches Ligand patches Matching 2 points and their associated normals is sufficient to compute transformation in 3D space. Transformation Base: 1 critical point with its normal from one patch and 1 critical point with its normal from a neighbor patch. Base signature: distances and angles. Match every base from the receptor patches against all the bases from complementary ligand patches with similar signatures. Geometric Hashing of base signatures is used to speed up the search. dE, dG, α, β, ω

Penetrations Filtering Distance Transform Grid stores the distances from the surface of the molecule. The distance is negative inside the molecule and positive outside. Steric clashes are checked by accessing the receptor grid with ligand surface points. +1 -1

Scoring The surface of the receptor is divided into five shells according to the distance function: S1-S5 The number of ligand surface points in every shell is counted. The geometric score is a weighted sum of the number of ligand surface points inside every shell. Multi-resolution surface data structure was developed to speed up this stage.

Dataset and Results Protein-Protein cases from protein-protein docking benchmark [6]: Enzyme-inhibitor – 22 cases Antibody-antigen – 16 cases Protein-DNA docking: 2 unbound-bound cases Protein-drug docking: tens of bound cases (Estrogen receptor, HIV protease, COX) Performance: Several minutes for large protein molecules and seconds for small drug molecules on standard PC computer. Estrogen receptor Estradiol molecule from complex docking solution DNA endonuclease Estrogen receptor with estradiol (1A52). RMSD 0.9Å, rank 1, running time: 11 seconds docking solution Endonuclease I-PpoI (1EVX) with DNA (1A73). RMSD 0.87Å, rank 2

Results Enzyme-Inhibitor docking Complex Description pen. res.1 geom score time with ACE score PDB receptor/ligand rmsd rank min. 1ACB α-chymotrypsin/Eglin C 0,2 2.0 41 9:37 1.8 55 1AVW Trypsin/Sotbean Trypsin inhibitor 3,4 1.9 913 11:27 319 1BRC Trypsin/APPI 5.0 528 5:20 5.6 66 1BRS Barnase/Barstar 1,3 3.5 115 5:18 2.7 7 1CGI α-chymotrypsinogen/trypsin inhibitor 4,2 2.4 114 6:26 3.0 10 1CHO α-chymotrypsin/ovomucoid 3rd Domain 0,3 3.4 148 5:35 1.2 26 1CSE Subtilisin Carlsberg/Eglin C 3.8 166 6:58 2.3 540 1DFJ Ribonuclease inhibitor/Ribonuclease A 12,8 3.9 1446 11:58 11.9 612 1FSS Acetylcholinesterase/Fasciculin II 8,3 2.5 296 11:42 46 1MAH Mouse Acetylcholinesterase/inhibitor 2,5 436 14:39 57 1PPE* Trypsin/CMT-1 0,0 1 2:34 1STF* Papain/Stefin B 2.2 4 8:15 2.1 13 1TAB* Trypsin/BBI 0,1 1.4 96 3:41 7.2* 104 1TGS Trypsinogen/trypsin inhibitor 5,4 345 5:19 3.6 101 1UDI* Virus Uracil-DNA glycosylase/inhibitor 2.6 3 7:40 1UGH Human Uracil-DNA glycosylase/inhibitor 12 5:45 5 2KAI Kallikrein A/Trypsin inhibitor 10,7 4.2 126 7:15 4.7 42 2PTC β-trypsin/ Pancreatic trypsin inhibitor 2,4 4.4 5:13 2SIC Subtilisin BPN/Subtilisin inhibitor 5,3 129 9:41 21 2SNI Subtilisin Novo/Chymotrypsin inhibitor 2 6,7 8.3 1241 5:08 7.3 450 2TEC* Thermitase/Eglin C 7:58 29 4HTC* α-Thrombin/Hirudin 2,2 3.3 2 3:36 2.8 1 Number of highly penetrating residues in unbound structures superimposed to complex

Results Antibody-Antigen docking Complex Description pen. res. 1 geom score time ACE score PDB receptor/ligand rmsd rank min. 1AHW Antibody Fab 5G9/Tissue factor 3,3 2.5 29 10:12 10 1BQL* Hyhel - 5 Fab/Lysozyme 0,0 13 6:21 1.4 7 1BVK Antibody Hulys11 Fv/Lysozyme 3.8 1301 6:25 3.5 809 1DQJ Hyhel - 63 Fab/Lysozyme 18,7 4.3 773 5:30 5.1 953 1EO8* Bh151 Fab/Hemagglutinin 3,1 1.8 567 9:45 1.6 292 1FBI* IgG1 Fab fragment/Lysozyme 2,5 5.0 536 10:13 2416 1IAI* IgG1 Idiotypic Fab/Igg2A Anti-Idiotypic Fab 5,6 4.8 1302 9:13 3.4 1304 1JHL* IgG1 Fv Fragment/Lysozyme 282 13:15 1.3 143 1MEL* Vh Single-Domain Antibody/Lysozyme 0,1 3 2:40 2.0 2 1MLC IgG1 D44.1 Fab fragment/Lysozyme 8,3 4.0 136 5:29 2.6 123 1NCA* Fab NC41/Neuraminidase 114 17:50 2.8 66 1NMB* Fab NC10/Neuraminidase 2.7 2593 28:10 2.4 1734 1QFU* Igg1-k Fab/Hemagglutinin 44 5:42 23 1WEJ IgG1 E8 Fab fragment/Cytochrome C 232 7:44 87 2JEL* Jel42 Fab Fragment/A06 Phosphotransferase 0,2 4.7 5:02 50 2VIR* Igg1-lamda Fab/Hemagglutinin 3.1 258 7:34 306 1 Number of highly penetrating residues in unbound structures superimposed to complex

The Real Challenge: Can we help biologists? + = ?

Identification of the N-terminal peptide binding site of GRP94 GRP94 - Glucose regulated protein 94 VSV8 peptide - derived from vesicular stomatitis virus Gidalevitz T, Biswas C, Ding H, Schneidman-Duhovny D, Wolfson HJ, Stevens F, Radford S, Argon Y. J Biol Chem. 2004

Biological motivation The complex between the two molecules highly stimulates the response of the T-cells of the immune system. The grp94 protein alone does not have this property. The activity that stimulates the immune response is due to the ability of grp94 to bind different peptides. Characterization of peptide binding site is highly important.

GRP94 molecule There was no structure of grp94 protein. Homology modeling was used to predict a structure using another protein with 52% identity. Recently the structure of grp94 was published. The RMSD between the crystal structure and the model is 1.3A.

Docking PatchDock was applied to dock the two molecules, without any binding site constraints. Docking results were clustered in the two cavities:

GRP94 molecule There is a binding site for inhibitors between the helices. There is another cavity produced by beta sheet on the opposite side.

Experimental Verification Goals: Try to eliminate one of the cavities. Find the positions of the amino acids which are important for peptide binding.

Experimental Verification 1 Experimental data shows that inhibitor and peptide can bind simultaneously. Two residues in the inhibitor binding site were mutated. The mutant did not bind inhibitor, however it could still bind peptide. The binding sites of the inhibitor and peptide are distinct. The abolition of the inhibitor does not affect peptide binding.

Experimental Verification 2 The peptide binding was pH sensitive. Therefore involvement of His residue was suspected. His125 was mutated to Asp and Tyr. The first mutated protein did not bind the peptide at all and the second had only partial activity. Both mutants were soluble and could bind the inhibitor.

Computational Verification 2

Conclusions Computational prediction can help in guiding “in vitro” experiments. Further algorithmic improvements will yield in more reliable predictions.