Chemoinformatics in Drug Design

Slides:



Advertisements
Similar presentations
Analysis of High-Throughput Screening Data C371 Fall 2004.
Advertisements

1 Sequential Screening S. Stanley Young NISS HTS Workshop October 25, 2002.
PhysChem Forum, 29 Nov 2006, Newhouse 1 Memories and the future: From experimental to in silico physical chemistry Han van de Waterbeemd AstraZeneca, DMPK.
3D Molecular Structures C371 Fall Morgan Algorithm (Leach & Gillet, p. 8)
In silico small molecule discovery Sales Target gene Discover hit Hit to lead Optimise lead Clinical Target gene identified with a viable assay High throughput.
PharmaMiner: Geometric Mining of Pharmacophores 1.
Jürgen Sühnel Institute of Molecular Biotechnology, Jena Centre for Bioinformatics Jena / Germany Supplementary Material:
Lipinski’s rule of five
Molecular dynamics refinement and rescoring in WISDOM virtual screenings Gianluca Degliesposti University of Modena and Reggio Emilia Molecular Modelling.
Cheminformatics II Apr 2010 Postgrad course on Comp Chem Noel M. O’Boyle.
M. Wagener 3D Database Searching and Scaffold Hopping Markus Wagener NV Organon.
Quantitative Structure-Activity Relationships (QSAR) Comparative Molecular Field Analysis (CoMFA) Gijs Schaftenaar.
Bioinformatics IV Quantitative Structure-Activity Relationships (QSAR) and Comparative Molecular Field Analysis (CoMFA) Martin Ott.
An Integrated Approach to Protein-Protein Docking
BL5203: Molecular Recognition & Interaction Lecture 5: Drug Design Methods Ligand-Protein Docking (Part I) Prof. Chen Yu Zong Tel:
Design of Small Molecule Drugs Targeted to RNA RNA Ontology Group May
Structural biology and drug design: An overview Olivier Taboureau Assitant professor Chemoinformatics group-CBS-DTU
Bioinformatics Ayesha M. Khan Spring Phylogenetic software PHYLIP l 2.
Structure-based Drug Design
Important Points in Drug Design based on Bioinformatics Tools History of Drug/Vaccine development –Plants or Natural Product Plant and Natural products.
Inverse Kinematics for Molecular World Sadia Malik April 18, 2002 CS 395T U.T. Austin.
Comparative Evaluation of 11 Scoring Functions for Molekular Docking Authors: Renxiao Wang, Yipin Lu and Shaomeng Wang Presented by Florian Lenz.
Pharmacophore and FTrees
Computational Techniques in Support of Drug Discovery October 2, 2002 Jeffrey Wolbach, Ph. D.
Molecular Descriptors
Functional groups / Pharmacological Activity
Combinatorial Chemistry and Library Design
ClusPro: an automated docking and discrimination method for the prediction of protein complexes Stephen R. Comeau, David W.Gatchell, Sandor Vajda, and.
Chapter 13. The Impact of Genomics on Antimicrobial Drug Discovery and Toxicology CBBL - Young-sik Sohn-
Introduction to Chemoinformatics Irene Kouskoumvekaki Associate Professor December 12th, 2012 Biological Sequence Analysis course.
 Four levels of protein structure  Linear  Sub-Structure  3D Structure  Complex Structure.
CZ5225 Methods in Computational Biology Lecture 4-5: Protein Structure and Structural Modeling Prof. Chen Yu Zong Tel:
Faculté de Chimie, ULP, Strasbourg, FRANCE
Use of Machine Learning in Chemoinformatics Irene Kouskoumvekaki Associate Professor December 12th, 2012 Biological Sequence Analysis course.
Function first: a powerful approach to post-genomic drug discovery Stephen F. Betz, Susan M. Baxter and Jacquelyn S. Fetrow GeneFormatics Presented by.
BL5203 Molecular Recognition & Interaction Section D: Molecular Modeling. Chen Yu Zong Department of Computational Science National University of Singapore.
In silico discovery of inhibitors using structure-based approaches Jasmita Gill Structural and Computational Biology Group, ICGEB, New Delhi Nov 2005.
Ligand-based drug discovery No a priori knowledge of the receptor What information can we get from a few active compounds.
SimBioSys Inc.© Slide #1 Enrichment and cross-validation studies of the eHiTS high throughput screening software package.
Altman et al. JACS 2008, Presented By Swati Jain.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Virtual Screening C371 Fall INTRODUCTION Virtual screening – Computational or in silico analog of biological screening –Score, rank, and/or filter.
Bioinformatics MEDC601 Lecture by Brad Windle Ph# Office: Massey Cancer Center, Goodwin Labs Room 319 Web site for lecture:
Computer-aided drug discovery (CADD)/design methods have played a major role in the development of therapeutically important small molecules for several.
Introduction to Chemoinformatics and Drug Discovery Irene Kouskoumvekaki Associate Professor February 15 th, 2013.
Chemistry XXI Unit 3 How do we predict properties? M1. Analyzing Molecular Structure Predicting properties based on molecular structure. M4. Exploring.
Design of a Compound Screening Collection Gavin Harper Cheminformatics, Stevenage.
Use of Machine Learning in Chemoinformatics
Identification of structurally diverse Growth Hormone Secretagogue (GHS) agonists by virtual screening and structure-activity relationship analysis of.
Computational Approach for Combinatorial Library Design Journal club-1 Sushil Kumar Singh IBAB, Bangalore.
Molecular mechanics Classical physics, treats atoms as spheres Calculations are rapid, even for large molecules Useful for studying conformations Cannot.
Elon Yariv Graduate student in Prof. Nir Ben-Tal’s lab Department of Biochemistry and Molecular Biology, Tel Aviv University.
Docking and Virtual Screening Using the BMI cluster
Molecular Modeling in Drug Discovery: an Overview
Julia Salas CS379a Aim of the Study To determine distinguishing features of orally administered drugs –Physical and structural features probed.
Natural products from plants
Structural Bioinformatics Elodie Laine Master BIM-BMC Semester 3, Genomics of Microorganisms, UMR 7238, CNRS-UPMC e-documents:
Page 1 Computer-aided Drug Design —Profacgen. Page 2 The most fundamental goal in the drug design process is to determine whether a given compound will.
Lipinski’s rule of five
APPLICATIONS OF BIOINFORMATICS IN DRUG DISCOVERY
DATA MINING FOR SMALL MOLECULE ALLOSTERIC INHIBITORS
Molecular Docking Profacgen. The interactions between proteins and other molecules play important roles in various biological processes, including gene.
Virtual Screening.
Ligand Docking to MHC Class I Molecules
Important Points in Drug Design based on Bioinformatics Tools
Cheminformatics Basics
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Presentation transcript:

Chemoinformatics in Drug Design Irene Kouskoumvekaki, Associate Professor, Computational Chemical Biology, CBS, DTU-Systems Biology Biological Sequence Analysis, May 6, 2011

Computational Chemical Biology group Tudor Oprea Guest Professor Olivier Taboureau Associate Professor Irene Kouskoumvekaki Associate Professor Sonny Kim Nielsen PhD student Kasper Jensen PhD student Ulrik Plesner master student

Word cloud

Definition: Chemoinformatics Gathering and systematic use of chemical information, and application of this information to predict the behavior of unknown compounds in silico. data prediction

Definition: A drug candidate… ... is a (ligand) compound that binds to a biological target (protein, enzyme, receptor, ...) and in this way either initiates a process (agonist) or inhibits it (antagonist) The structure/conformation of the ligand is complementary to the space defined by the protein’s active site The binding is caused by favorable interactions between the ligand and the side chains of the amino acids in the active site. (electrostatic interactions, hydrogen bonds, hydrophobic contacts...)

In vitro / In silico studies Drug Discovery Animal studies Disease Biological Target Drug candidate In vitro / In silico studies Clinical studies

The Drug Discovery Process Chemoinformatics

The Drug Discovery Process We identify/predict the binding pocket We know the structure of the biological target MKTAALAPLFFLPSALATTVYLA GDSTMAKNGGGSGTNGWGEYL ASYLSATVVNDAVAGRSAR…(etc) Challenge: To design an organic molecule that would bind strong enough to the biological target and modute it’s activity. New drug candidate

Example: – Alzheimer’s disease What is it? Alzheimer's is a disease that causes failure of brain functions and dementia. It starts with bad memory and disability to function in common everyday activities. How do you get it? Alzheimer's disease is the result of malfunctioning neurons at different parts of the brain. This, in turn, is due to an inbalance in the concentration of neurotranmitters.

Example: – Alzheimer’s disease How can we treat it? Acetylkolin neurotransmitter Drug against Alzheimer’s

Old School Drug discovery process HTS Follow-up Hit-to-lead Lead-to-drug Screening collection Lead series Drug candidate Actives Hits 106 cmp. 103 actives 1-10 hits 0-3 lead series 0-1 Clinical trials High rate of false positives !!!

Failures

in vitro in silico + in vitro 4/17/2017 Drug discovery in the 21st Century in vitro in silico + in vitro Diverse set of molecules tested in the lab Computational methods to select subsets (to be tested in the lab) based on prediction of drug-likeness, solubility, binding, pharmacokinetics, toxicity, side effects, ... In silico results can be used to make in vivo methods more efficient. If there are several thousand compounds available for testing, in silico methods are used to identify those that are most likely to be active and these would take priority for screening.

The Lipinski ‘rule of five’ for drug-likeness prediction Octanol-water partition coefficient (logP) ≤ 5 Molecular weight ≤ 500 # hydrogen bond acceptors (HBA) ≤ 10 # hydrogen bond donors (HBD) ≤ 5 If two or more of these rules are violated, the compound might have problems with oral bioavailability. (Lipinski et al., Adv. Drug Delivery Rev., 23, 1997, 3.)

Major Aspects of Chemoinformatics Experimental data Model generation Prediction for unknown compounds

Major Aspects of Chemoinformatics Information Acquisition and Management: Methods for collecting data (mainly experimental). Development of databases for storage and retrieval of information. Information Use: Data analysis, correlation and model building. Information Application: Prediction of molecular properties relevant to chemical and biochemical sciences.

Major Aspects of Chemoinformatics Information Acquisition and Management: Methods for collecting data (mainly experimental). Development of databases for storage and retrieval of information. Information Use: Data analysis, correlation and model building. Information Application: Prediction of molecular properties relevant to chemical and biochemical sciences.

Information Acquisition and Management

Small molecule databases One tricky thing when storing

Growth In PubChem Substances & Compounds Recent count: Substance: 72,156,631 Compound: 28,807,320 Rule of 5: 20,692,980 The PubChem Compound Database contains validated chemical depiction information that is provided to describe substances in PubChem Substance. The PubChem substance database contains chemical structures, synonyms, registration IDs, description, related urls, database cross-reference links to PubMed, protein 3D structures, and biological screening results. If the contents of a chemical sample are known, the description includes links to PubChem Compound.

Searching in PubChem

Structural representation of molecules

Major Aspects of Chemoinformatics Information Acquisition and Management: Methods for collecting data (mainly experimental). Development of databases for storage and retrieval of information. Information Use: Data analysis, correlation and model building. Information Application: Prediction of molecular properties relevant to chemical and biochemical sciences.

Beyond the Lipinski Rule of 5... Chemometrics: The application of mathematical or statistical methods to chemical data (simple, linear methods) e.g. Principal Component Analysis Machine Learning: The design and development of algorithms and techniques that allow computers to learn (complex, non-linear algorithms) e.g. Artificial Neural Networks, K-means clustering

Major Aspects of Chemoinformatics Information Acquisition and Management: Methods for collecting data (mainly experimental). Development of databases for storage and retrieval of information. Information Use: Data analysis, correlation and model building. Information Application: Prediction of molecular properties relevant to chemical and biochemical sciences.

Prediction of Solubility, ADME & Toxicity Membrane transfer Liver extraction Dissolution Solid drug Systemic circulation Drug in solution Absorbed drug Solubility Absorption Metabolism

Prediction of biological activity/selectivity

Prediction models at CBS

Virtual screening Computational techniques for a rapid assessment of large libraries of chemical structures in order to guide the selection of likely drug candidates. Exploit knowledge of the active ligand molecule or the protein target.

Virtual Screening Flavors TARGET-BASED 1D filters e.g. Lipinskis Rule of Five 1D LIGAND-BASED

Molecular similarity on the Chemical Space Similar Property Principle – Molecules having similar structures and properties are expected to exhibit similar biological activity. (Not always true!) Thus, molecules that are located closely together in the chemical space are often considered to be functionally related.

Ligand-based VS: Fingerprints widely used similarity search tool consists of descriptors encoded as bit strings Bit strings of query and database are compared using similarity metric such as Tanimoto coefficient MACCS fingerprints: 166 structural keys that answer questions of the type: Is there a ring of size 4? Is at least one F, Br, Cl, or I present? where the answer is either TRUE (1) or FALSE (0)

Tanimoto Similarity or 90% similarity

Tanimoto Similarity

Ligand-based VS: Pharmacophore

Structure-based Virtual Screening: Docking Binding pocket of target Library of small compounds Given a protein and a database of ligands, docking scores determine which ligands are most likely to bind.

Energy of binding Binding pocket of target Library of small compounds -1 kcal/mol -10 kcal/mol +10 kcal/mol +1 kcal/mol For any spontaneous change in a closed system, the change in [Gibbs] free energy equals the change in enthalpy minus the change in entropy times the temperature. ΔG = ΔH - TΔS Torsional free E vdW Hbond Desolvation E Electrostatic E

“Docking” and “Scoring” Docking involves the prediction of the binding mode of individual molecules Goal: new ligand orientation closest in geometry to the observed X-ray structure (Conformations of ligands in complexes often have very similar geometries to minimum-energy conformations of the isolated ligand) Scoring ranks the ligands using some function related to the free energy of association of the two partners, looking at attractive and repulsive regions and taking into account steric and hydrogen bonding interactions Goal: new ligand score closest in value to the docking score of the X-ray structure

Docking algorithms Most exhaustive algorithms: Accurate prediction of a binding pose Most efficient algorithms Docking of small ligand databases in reasonable time Rapid algorithms Virtual high-throughput screening of millions of compounds

Scoring functions Molecular mechanics force field-based Score is estimated by summing the strength of intermolecular van der Waals and electrostatic interactions between all atoms of the ligand-target complex -CHARMM, AMBER Empirical-based Based on summing various types of interactions between the two binding partners (hydrogen bonds, hydrophobic, …) - ChemScore, GlideScore, AutoDock Knowledge-based Based on statistical observations of intermolecular close contacts from large 3D databases, which are used to derive potentials or mean forces -PMF, DrugScore

Combination of pharmacophore, docking and molecular dynamics (MD) screens Ligand-based VS good enrichment of candidate molecules from the screening of large databases with less computational efforts too coarse to pick up subtle differences induced by small structural variations in the ligands many options for model refinement Structure-based VS better fit for analyzing smaller sets of compounds, especially in retrospective analysis include all possible interactions thus allowing the detection of unexpected binding modes Changing parameters for docking algorithms and scores is demanding Mutants are being developed: pharmacophore methods with information about the target’s binding site docking programs that incorporate pharmacophore constraints

http://www.vcclab.org/lab/edragon/

Public Web Chemoinformatics Tools http://pasilla.health.unm.edu/

ChemSpider www.chemspider.com

Open Babel http://openbabel.org/wiki/Main_page

D. Vidal et al, Ligand-based Approaches to In Silico Pharmacology, Chemoinformatics and Computational Chemical Biology, Ed J. Bajorath, Springer, 2011

Questions?