Presentation is loading. Please wait.

Presentation is loading. Please wait.

CZ5226: Advanced Bioinformatics Lecture 8: Molecular Modeling Method Prof. Chen Yu Zong Tel: 6874-6877

Similar presentations


Presentation on theme: "CZ5226: Advanced Bioinformatics Lecture 8: Molecular Modeling Method Prof. Chen Yu Zong Tel: 6874-6877"— Presentation transcript:

1 CZ5226: Advanced Bioinformatics Lecture 8: Molecular Modeling Method Prof. Chen Yu Zong Tel: 6874-6877 Email: csccyz@nus.edu.sg http://xin.cz3.nus.edu.sg Room 07-24, level 7, SOC1, National University of Singapore csccyz@nus.edu.sg http://xin.cz3.nus.edu.sgcsccyz@nus.edu.sg http://xin.cz3.nus.edu.sg

2 2 References on Modeling of MHC Binding Peptide Protein Sci. 2004 Sep;13(9):2523-32 J Am Chem Soc. 2004 Jul 14;126(27):8515-28 Proteins. 2004 Feb 15;54(3):534-56 Hum Immunol. 2003 Dec;64(12):1123-43 Immunity. 2003 Oct;19(4):595-606 Mol Med. 2003 Sep-Dec;9(9-12):220-5 Nature. 2002 Aug 1;418(6897):552-6 Eur J Immunol. 2002 Aug;32(8):2105-16 Immunol Cell Biol. 2002 Jun;80(3):286-99 Ann N Y Acad Sci. 2002 Apr;958:317-20 Mol Immunol. 2002 May;38(14):1039-49 J Pept Res. 2002 Mar;59(3):115-22 Mol Immunol. 2002 Feb;38(9):681-7 Tissue Antigens. 2002 Feb;59(2):101-12 J Comput Aided Mol Des. 2001 Jun;15(6):573-86 J Mol Biol. 2000 Jul 28;300(5):1205-35 J Comput Aided Mol Des. 2000 Jan;14(1):71-82 J Comput Aided Mol Des. 2000 Jan;14(1):53-69 J Mol Graph Model. 1999 Jun-Aug;17(3-4):180-6, 217

3 3 What is Docking? Given two molecules find their correct association: + = Receptor Ligand T Complex

4 4 General Protein–Ligand Binding Ligand - Molecule that binds with a protein - DNA, drug lead compounds, etc. Protein active site(s) - Allosteric binding - Competitive binding Function of binding interaction - Natural and artificial

5 5 What is Protein-Ligand Docking? Definition: Computationally predict the structures of protein-ligand complexes from their conformations and orientations. The orientation that maximizes the interaction reveals the most accurate structure of the complex. Importance of complexes - structure -> function

6 6 Example: HIV-1 Protease Active Site (Aspartyl groups)

7 7 Example: HIV-1 Protease

8 8 PDB files Surface Representation Patch Detection Matching Patches Scoring & Filtering Candidate complexes Docking Strategy

9 9 Issues Involved in Docking Protein Structure and Active Site - assumed knowledge (PDBs, etc.) - PROCAT database: 3d enzyme active site templates Ligand Structure - pharmacophore (base fragment) in potential drug compound - well known groups Rigid vs. Flexible - solution or vacum - structure

10 10 Algorithmic Approaches to Docking Qualitative –Geometric –shape complementarity and fitting Quantitative –Energy Calculations –determine global minimum energy –free energy measure Hybrid –Geometric and energy complementarity –2 phase process: soft and hard docking

11 11. Design of HIV-1 Protease Inhibitor

12 12. Design of HIV-1 Protease Inhibitor

13 13. Design of HIV-1 Protease Inhibitor

14 14. Design of HIV-1 Protease Inhibitor

15 15 Scoring in Ligand-Protein Docking Potential Energy Description:

16 16 Preprocessing Determine internal representation - convert coordinates of both molecules from PDB files - e.g. Michael Connolly’s MS program (www.biohedron.com)www.biohedron.com - dot surface - AutoGrid - 3d grid (array) with discrete values - often used in rigid docking

17 17 Some techniques Surface representation, that efficiently represents the docking surface and identifies the regions of interest (cavities and protrusions) Connolly surface Lenhoff technique Kuntz et al. Clustered-Spheres Alpha shapes Surface matching that matches surfaces to optimize a binding score: Geometric Hashing

18 18 Surface Representation Dense MS surface (Connolly) Sparse surface (Shuo Lin et al.)

19 19 Surface Representation Each atomic sphere is given the van der Waals radius of the atom Rolling a Probe Sphere over the Van der Waals Surface leads to the Solvent Reentrant Surface or Connolly surface

20 20 Lenhoff technique Computes a “complementary” surface for the receptor instead of the Connolly surface, i.e. computes possible positions for the atom centers of the ligand Atom centers of the ligand van der Waals surface

21 21 Kuntz et al. Clustered-Spheres Uses clustered-spheres to identify cavities on the receptor and protrusions on the ligand Compute a sphere for every pair of surface points, i and j, with the sphere center on the normal from point i Regions where many spheres overlap are either cavities (on the receptor) or protrusions (on the ligand) i j

22 22 Alpha Shapes Formalizes the idea of “shape” In 2D an “edge” between two points is “alpha-exposed” if there exists a circle of radius alpha such that the two points lie on the surface of the circle and the circle contains no other points from the point set

23 23 Alpha Shapes: Example Alpha=infinity Alpha=3.0 Å

24 24 Surface Matching Find the transformation (rotation + translation) that will maximize the number of matching surface points from the receptor and the ligand First satisfy steric constraints… Find the best fit of the receptor and ligand using only geometrical constraints … then use energy calculations to refine the docking Selet the fit that has the minimum energy

25 25. Design of HIV-1 Protease Inhibitor

26 26 Docking Programs More information in: http://www.bmm.icnet.uk/~smithgr/soft.html The programs are: DOCK (I. D. Kuntz, UCSF) AutoDOCK (Arthur Olson, The Scripps Research Institute) RosettaDOCK (Baker, Washington Univ., Gray, Johns Hopkins Univ.) INVDOCK (Y. Z. Chen, NUS)

27 27 DOCK as an Example DOCK works in 5 steps: Step 1 Start with crystal coordinates of target receptor Step 2 Generate molecular surface for receptor Step 3 Generate spheres to fill the active site of the receptor: The spheres become potential locations for ligand atoms Step 4 Matching: Sphere centers are then matched to the ligand atoms, to determine possible orientations for the ligand Step 5 Scoring: Find the top scoring orientation

28 28 DOCK as an Example 12 3 - HIV-1 protease is the target receptor - Aspartyl groups are its active side

29 29 DOCK as an Example 45 Three scoring schemes: Shape scoring, Electrostatic scoring and Force-field scoring Image 5 is a comparison of the top scoring orientation of the molecule thioketal with the orientation found in the crystal structure

30 30 The DOCK Algorithm Two steps in rigid ligand mode: Orienting the putative ligand in the site Guided by matching distances, between pre- defined site points on the target to interatomic distances of the ligand.The RT matrix is used for the transform of the ligand. Scoring the resulting orientation Each orientation is scored for each quality fit. The process is repeated a user-defined number of orientations or maximum orientations

31 31 1.Define the target binding site points. 2.Match the distances. 3.Calculate the transformation matrix for the orientation. 4.Dock the molecule. 5.Score the fit.

32 32 Site Points Generation in DOCK Program SPHGEN identifies the active site, and other sites of interest. Each invagination is characterized by a set of overlapping spheres. For receptors, a negative image of the surface invaginations is created; For a ligand, the program creates a positive image of the entire molecule.

33 33 The Matching Can be directed by 2 additional features: Chemical matching - labeling the site points such that only particular atom types are allowed to be matched to them. Critical cluster - subsets of interest can be defined as critical clusters, so that at least one member of them will be part of any accepted ligand “match”. Increase in efficiency and speed due to elimination of potentially less promising orientations!

34 34 Other Docking programs AutoDock –AutoDock was designed to dock flexible ligands into receptor binding sites –The strongest feature of AutoDock is the range of powerful optimization algorithms available RosettaDOCK –It models physical forces and creates a very large number of decoys –It uses degeneracy after clustering as a final criterion in decoy selection INVDOCK –Docking strategy and algorithm similar to DOCK, but with the capability of finding the receptors to which a molecule can bind to.

35 Conformational Ensembles Docking

36 36 Conformational Ensembles Docking Observations: 1.Generating an orientation of a ligand in a binding site may be separated from calculating a conformation of the ligand in that particular orientation. 2.Multiple conformations of a given ligand usually have some portion in common (internally rigid atoms such as ring systems), and therefore, contain redundancies.

37 37 Conformational Ensemble Docking

38 38 Conformational Ensemble Docking Conformational ensembles are generated by overlaying all conformations of a given molecule onto its largest rigid fragment. Only atoms within this largest rigid fragment are used during the distance matching step. The RT matrix is defined. Each of the conformers is oriented into the site and scored. The score measures steric and electrostatic complementarity. One matching steps - all the conformers are docked and scored in the selected orientation.

39 39 Overview of the Ligand Ensemble Method

40 40 Advantages of Conformational Ensemble Docking Speed increase due to: One matching step for all the conformers. The largest rigid fragment usually has fewer atoms (less potential matches are examined).

41 41 Disadvantages of Conformational Ensemble Docking Loss of information when the orientations are guided only by a subset of the atoms in molecule. Orientations may be missed because potential distance matches from non- rigid portions of the molecule are not considered. The ensemble method will fail for ligands that lack internally rigid atoms. The use of chemical matching and critical clusters is limited.

42 42 Results of Docking Studies The docked (blue) and crystal (yellow) structure of ligands in some PDB ligand-protein complexes. The PDB Id of each structure is shown.

43 43 Protein-Protein cases from protein-protein docking benchmark [6]: Enzyme-inhibitor – 22 cases Antibody-antigen – 16 cases Protein-DNA docking: 2 unbound-bound cases Protein-drug docking: tens of bound cases (Estrogen receptor, HIV protease, COX) Performance: Several minutes for large protein molecules and seconds for small drug molecules on standard PC computer. Dataset and Testing Results Endonuclease I-PpoI (1EVX) with DNA (1A73). RMSD 0.87Å, rank 2 DNA endonuclease docking solution Estrogen receptor Estradiol molecule from complex docking solution Estrogen receptor with estradiol (1A52). RMSD 0.9Å, rank 1, running time: 11 seconds

44 44 Results Enzyme-Inhibitor docking ComplexDescription pen. res. 1 geom scoretimewith ACE score PDBreceptor/ligandrmsdrankmin.rmsdrank 1ACBα-chymotrypsin/Eglin C 0,22.0419:371.855 1AVWTrypsin/Sotbean Trypsin inhibitor 3,41.991311:271.9319 1BRC Trypsin/APPI 0,25.05285:205.666 1BRS Barnase/Barstar 1,33.51155:182.77 1CGIα-chymotrypsinogen/trypsin inhibitor 4,22.41146:263.010 1CHOα-chymotrypsin/ovomucoid 3rd Domain 0,33.41485:351.226 1CSESubtilisin Carlsberg/Eglin C 0,23.81666:582.3540 1DFJRibonuclease inhibitor/Ribonuclease A 12,83.9144611:5811.9612 1FSSAcetylcholinesterase/Fasciculin II 8,32.529611:422.346 1MAHMouse Acetylcholinesterase/inhibitor 2,52.543614:392.357 1PPE* Trypsin/CMT-1 0,02.012:342.01 1STF* Papain/Stefin B 0,02.248:152.113 1TAB* Trypsin/BBI 0,11.4963:417.2*104 1TGSTrypsinogen/trypsin inhibitor 5,42.23455:193.6101 1UDI*Virus Uracil-DNA glycosylase/inhibitor 4,22.637:402.41 1UGHHuman Uracil-DNA glycosylase/inhibitor 8,32.1125:453.85 2KAIKallikrein A/Trypsin inhibitor 10,74.21267:154.742 2PTCβ-trypsin/ Pancreatic trypsin inhibitor 2,44.4665:133.412 2SICSubtilisin BPN/Subtilisin inhibitor 5,32.51299:414.721 2SNISubtilisin Novo/Chymotrypsin inhibitor 2 6,78.312415:087.3450 2TEC* Thermitase/Eglin C 0,13.0667:581.429 4HTC* α-Thrombin/Hirudin 2,23.323:362.82 1 Number of highly penetrating residues in unbound structures superimposed to complex

45 45 Results Antibody-Antigen docking ComplexDescription pen. res. 1 geom scoretimeACE score PDBreceptor/ligandrmsdrankmin.rmsdrank 1AHWAntibody Fab 5G9/Tissue factor 3,32.52910:122.510 1BQL* Hyhel - 5 Fab/Lysozyme 0,02.5136:211.47 1BVK Antibody Hulys11 Fv/Lysozyme 0,03.813016:253.5809 1DQJ Hyhel - 63 Fab/Lysozyme 18,74.37735:305.1953 1EO8* Bh151 Fab/Hemagglutinin 3,11.85679:451.6292 1FBI* IgG1 Fab fragment/Lysozyme 2,55.053610:135.02416 1IAI* IgG1 Idiotypic Fab/Igg2A Anti-Idiotypic Fab 5,64.813029:133.41304 1JHL* IgG1 Fv Fragment/Lysozyme 0,01.628213:151.3143 1MEL* Vh Single-Domain Antibody/Lysozyme 0,11.832:402.02 1MLC IgG1 D44.1 Fab fragment/Lysozyme 8,34.01365:292.6123 1NCA* Fab NC41/Neuraminidase 0,02.611417:502.866 1NMB* Fab NC10/Neuraminidase 0,02.7259328:102.41734 1QFU* Igg1-k Fab/Hemagglutinin 0,02.7445:422.723 1WEJ IgG1 E8 Fab fragment/Cytochrome C 0,04.32327:442.687 2JEL* Jel42 Fab Fragment/A06 Phosphotransferase 0,24.71145:024.750 2VIR* Igg1-lamda Fab/Hemagglutinin 0,03.12587:343.5306 1 Number of highly penetrating residues in unbound structures superimposed to complex

46 46 Quality of INVDOCK Algorithm Proteins. 1999; 36:1 Molecule Docked ProteinPDB Id RMSD Description of Docking Quality Energy ( kcal/mol) IndinavirHIV-1 Protease1hsg1.38 Match - 70.25 Xk263 Of Dupont Merck HIV-1 Protease1hvr2.05 Match - 58.07 VacHIV-1 Protease4phv0.80 Match - 88.46 Folate Dihydrofolate Reductase1dhf6.55 One end match, the other in different orientation - 46.02 5-DeazafolateDihydrofolate Reductase2dhf1.48 Match - 65.49 EstrogenEstrogen Receptor1a521.30 Match - 45.86 4-HydroxytamoxifenEstrogen Receptor 3ert 5.45 Complete overlap, flipped along short axis - 55.15 Guanosine-5'-[B,G- Methylene] Triphosphate H-Ras P21 121p 0.94 Match - 80.20 Glycyl-*L-Tyrosine Carboxypeptidase A  3cpa3.56 Overlap, flipped along short axis - 40.63

47 47 Identification of the N-terminal peptide binding site of GRP94 GRP94 - Glucose regulated protein 94 VSV8 peptide - derived from vesicular stomatitis virus Gidalevitz T, Biswas C, Ding H, Schneidman-Duhovny D, Wolfson HJ, Stevens F, Radford S, Argon Y. J Biol Chem. 2004

48 48 Biological motivation The complex between the two molecules highly stimulates the response of the T-cells of the immune system. The grp94 protein alone does not have this property. The activity that stimulates the immune response is due to the ability of grp94 to bind different peptides. Characterization of peptide binding site is highly important.

49 49 GRP94 molecule There was no structure of grp94 protein. Homology modeling was used to predict a structure using another protein with 52% identity. Recently the structure of grp94 was published. The RMSD between the crystal structure and the model is 1.3A.

50 50 Docking PatchDock was applied to dock the two molecules, without any binding site constraints. Docking results were clustered in the two cavities:

51 51 GRP94 molecule There is a binding site for inhibitors between the helices. There is another cavity produced by beta sheet on the opposite side.


Download ppt "CZ5226: Advanced Bioinformatics Lecture 8: Molecular Modeling Method Prof. Chen Yu Zong Tel: 6874-6877"

Similar presentations


Ads by Google