A Web-Based Computational Tool for Combinatorial Library Design that Simultaneously Optimises Multiple Properties Weifan Zheng, Sunny T. Hung, Joel T.

Slides:



Advertisements
Similar presentations
PhysChem Forum, 29 Nov 2006, Newhouse 1 Memories and the future: From experimental to in silico physical chemistry Han van de Waterbeemd AstraZeneca, DMPK.
Advertisements

Modern Tools for Drug Discovery NIMBUS Biotechnology Modern Tools for Drug Discovery
Improving Candidate Quality Through the Prediction of Clinical Outcome.
A Multiobjective Approach to Combinatorial Library Design Val Gillet University of Sheffield, UK.
Department Author Lead-like Properties, High- throughput Screening and Combinatorial Library Design Andy Davis, Simon Teague, Tudor Oprea, John Steele,
Jürgen Sühnel Institute of Molecular Biotechnology, Jena Centre for Bioinformatics Jena / Germany Supplementary Material:
ABCD Flexsim-R: A new 3D descriptor for combinatorial library design and in-silico screening 2 nd Joint Sheffield Conference on Chemoinformatics: Computational.
Lipinski’s rule of five
High Throughput Computing and Protein Structure Stephen E. Hamby.
Classification and Prediction: Regression Via Gradient Descent Optimization Bamshad Mobasher DePaul University.
Establishing a Successful Virtual Screening Process Stephen Pickett Roche Discovery Welwyn.
Luddite: An Information Theoretic Library Design Tool Jennifer L. Miller, Erin K. Bradley, and Steven L. Teig July 18, 2002.
Prénom Nom Document Analysis: Data Analysis and Clustering Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
Design of Small Molecule Drugs Targeted to RNA RNA Ontology Group May
Active Learning Strategies for Drug Screening 1. Introduction At the intersection of drug discovery and experimental design, active learning algorithms.
1 Ensembles of Nearest Neighbor Forecasts Dragomir Yankov, Eamonn Keogh Dept. of Computer Science & Eng. University of California Riverside Dennis DeCoste.
Active Learning Strategies for Compound Screening Megon Walker 1 and Simon Kasif 1,2 1 Bioinformatics Program, Boston University 2 Department of Biomedical.
Drug-Like Properties: Optimizing Pharmacokinetics and Safety During Drug Discovery Li Di and Edward H. Kerns ACS Short Course.
Drug discovery and development
Computational Techniques in Support of Drug Discovery October 2, 2002 Jeffrey Wolbach, Ph. D.
1 A Combinatorial Toolbox for Protein Sequence Design and Landscape Analysis in the Grand Canonical Model Ming-Yang Kao Department of Computer Science.
Functional groups / Pharmacological Activity
Combinatorial Chemistry and Library Design
Asia’s Largest Global Software & Services Company Genomes to Drugs: A Bioinformatics Perspective Sharmila Mande Bioinformatics Division Advanced Technology.
Leiden University. The university to discover. Enhancing Search Space Diversity in Multi-Objective Evolutionary Drug Molecule Design using Niching 1. Leiden.
VAMOS Visualization of Accessible Molecular Space A new compound filtering and selection interface Spotfire User Conference - Europe - May , 2003.
A genetic algorithm for structure based de-novo design Scott C.-H. Pegg, Jose J. Haresco & Irwin D. Kuntz February 21, 2006.
Introduction to Chemoinformatics Irene Kouskoumvekaki Associate Professor December 12th, 2012 Biological Sequence Analysis course.
Designing a Tri-Peptide based HIV-1 protease inhibitor Presented by, Sushil Kumar Singh IBAB,Bangalore Submitted to Dr. Indira Ghosh AstraZeneca India.
Faculté de Chimie, ULP, Strasbourg, FRANCE
Xiaoxiao Shi, Qi Liu, Wei Fan, Philip S. Yu, and Ruixin Zhu
Custom Spotfire Applications for use in Drug Discovery Chris Louer Team Leader, Cheminformatics © 2001, GlaxoSmithKline, Inc. - All Rights Reserved.
Drug Design Geometrical isomerism Chirality
Use of Machine Learning in Chemoinformatics Irene Kouskoumvekaki Associate Professor December 12th, 2012 Biological Sequence Analysis course.
Using Spotfire DecisionSite to Realize the Full Value of High-Throughput Screening ADME Data Eric Milgram Pfizer Global Research & Development – La Jolla.
Combinatorial Chemistry Advanced Medicinal Chemistry (Pharm 5219): Section A Ref.: An Introduction to Medicinal Chemistry, 3 rd ed. 2005, G.L.Patrick,
In silico discovery of inhibitors using structure-based approaches Jasmita Gill Structural and Computational Biology Group, ICGEB, New Delhi Nov 2005.
Ligand-based drug discovery No a priori knowledge of the receptor What information can we get from a few active compounds.
Xiangnan Kong,Philip S. Yu Multi-Label Feature Selection for Graph Classification Department of Computer Science University of Illinois at Chicago.
Cheminformatics in Drug Discovery and Chemical Genomics Research Weifan Zheng, Ph.D. Associate Professor Department of Pharmaceutical Sciences BRITE Institute,
Virtual Screening C371 Fall INTRODUCTION Virtual screening – Computational or in silico analog of biological screening –Score, rank, and/or filter.
Evolutionary Algorithms for Finding Optimal Gene Sets in Micro array Prediction. J. M. Deutsch Presented by: Shruti Sharma.
PharmaMiner: Geometric Mining of Pharmacophores 1.
Lipophilicity & Permeability 김연수. Chapter 5. Lipophilicity.
Introduction to Chemoinformatics and Drug Discovery Irene Kouskoumvekaki Associate Professor February 15 th, 2013.
Design of a Compound Screening Collection Gavin Harper Cheminformatics, Stevenage.
Predicting patterns of biological performance using chemical substructure features Diego Borges-Rivera 08/04/08.
Use of Machine Learning in Chemoinformatics
김소연 Permeability OverviewPermeability FundamentalsPermeability EffectPermeability Structure Modification StrategiesProblem.
Background. For designing, discovering or developing a therapeutically relevant molecule, potency and selectivity to the target.
Part 2. Physicochemical Properties 1.Rules ( 양혜란 ) 2.Liphophilicity ( 백아름 ) 3.pKa ( 박숙진 ) 4.Solubility ( 전종수, 최영재 ) 5.Permeability ( 김소연, 강경태 )
Do We Need to Optimize Protein Binding in Drug Discovery? NEDMDG Summary Meeting Xingrong Liu, Ph.D. Genentech.
Computational Approach for Combinatorial Library Design Journal club-1 Sushil Kumar Singh IBAB, Bangalore.
We propose an accurate potential which combines useful features HP, HH and PP interactions among the amino acids Sequence based accessibility obtained.
Statistics 350 Lecture 2. Today Last Day: Section Today: Section 1.6 Homework #1: Chapter 1 Problems (page 33-38): 2, 5, 6, 7, 22, 26, 33, 34,
Improving compound–protein interaction prediction by building up highly credible negative samples Toward more realistic drug-target interaction predictions.
Docking and Virtual Screening Using the BMI cluster
Shape2Pose: Human Centric Shape Analysis CMPT888 Vladimir G. Kim Siddhartha Chaudhuri Leonidas Guibas Thomas Funkhouser Stanford University Princeton University.
Physiochemical properties of drugs Some background to the Sirius T3.
Advantages of Good Drug-like Properties 손한표.
Julia Salas CS379a Aim of the Study To determine distinguishing features of orally administered drugs –Physical and structural features probed.
Natural products from plants
Page 1 Computer-aided Drug Design —Profacgen. Page 2 The most fundamental goal in the drug design process is to determine whether a given compound will.
Lipinski’s rule of five
Drug Design Geometrical isomerism Chirality
Selcia Fragment Library
Backpage Chicago
Virtual Screening.
Rules for Rapid Property Profiling from Structure
Multi-Biometrics: Fusing At The Classification Output Level Using Keystroke and Mouse Motion Features Todd Breuer, Paola Garcia Cardenas, Anu George, Hung.
Presentation transcript:

A Web-Based Computational Tool for Combinatorial Library Design that Simultaneously Optimises Multiple Properties Weifan Zheng, Sunny T. Hung, Joel T. Saunders, Stephen R. Johnson, George L. Seibel A short paper:

Outline Library Design - Problem Definition Criteria in Early Computational Techniques Important Developability Parameters Multifactorial Nature of Library Design PICCOLO –Optimisation Protocol –Individual Penalty Terms and Their Definition –Snapshots of the Intranet-Based System Conclusions

Library Design - Problem Definition 10 x 10 => 5 x 5 R1 R2 5x5 full combination ?

Criteria Used in Early Computational Design Techniques Diverse Design: –diversity analysis and void-filling Targeted Design: –similarity to leads –docking to a binding site –predicted activity using QSAR/QualSAR models –Pphore models

Failure of Compounds in Development Poor biopharmaceutical properties, 39% Lack of efficacy, 29% Toxicity, 21% Market reasons, 6% - Venkatesh & Lipper, J. Pharm. Sci. 89, (2000) “an efficacious but non-absorbed agent is no better than a well absorbed but in-efficacious one” - Curatolo W. Pharm Sci Tech Today 1, 387 (1998 )

Developability Should Be Considered in Library Design To avoid serious ADME liabilities as early as possible in the drug discovery process Empirical rules –Lipinski rules of 5 (MW, clogP, #HD, #HA) Drug-likeness –Ajay & Murcko (JMC, 1998, 41, ) –Sadowski & Kubinyi (JMC, 1998, 41, )

Some Fundamental Properties Contributing to Pharmacokinetics (PK) Aqueous solubility Membrane passive permeability Cytochrome P450 activities Plasma protein binding Efflux pumping and active transport...

Factors That Are Optimised  Similarity to leads  Reagent diversity/coverage  Product novelty with respect to the corporate compound inventory  Lipinski parameters  Liabilities against P450 enzymes  Aqueous solubility; [Permeability]  Molecular flexibility; MS redundancy; reagent price

Penalty Scores Iteration Initial Library Better Library Optimal Library Lipinski Properties P450 Activity Diversity PICCOLO: reagent PICking by COmbinatorial Library Optimisation R1R2 R1 R2 R1 R2 R1 R2

The Size of the Solution Space is Huge 50 Amines + 50 carboxylic acids Total number of compounds 50 x 50 = 2500 Total number of solutions for an 8 x 12 library 50!/(8!42!) * 50!/(12!38!) = 6.52 x 10 19

Randomly Pick 5x5 Enumerate Calc penalty scores for the trial solution & save scores Metropolis criteria? Reject trial solution Reagent Pool Swap a Fraction of Reagents N Y Stochastic Optimisation to Sample the Solution Space Save the trial solution

Perturbation Scheme Which R-group to perturb –bias toward the R-groups that need more sampling Which new reagent to pick –uniform sampling by cycling through the selected R-group list Which old reagent to kick out –randomly chosen

Total Penalty Score is the Weighted Sum of Individual Penalty Terms

Similarity to Leads E sim ( S ) = Daylight Tanimoto “distances” between all the compounds in a given library and the lead, averaged over the size of the library In case of multiple leads, the Tanimoto distance between a compound and the leads is defined as the nearest neighbour distance

Reagent Diversity: S-Optimal Criterion E sdiv ( S ) = Reverse S optimal scores for all R- groups averaged over the number of R-groups  S opt d    N 1 y,D-y D yD D: a set of design points (i.e., the selected reagents) d(x, A): minimum TD between point x and set of points A

Product Novelty with Respect to Corporate Collection All S.B. compounds were mapped onto a 6D cell space (PCA, or formed by selected features to distinguish biological activities) E pn (S) = the smoothed average number of S.B. compounds in the neighbouring cells

Developability Penalty Scores Lipinski Parameters –MW < = 500 –ClogP: -1 to 5 –NHD <= 5 –NHA <= 10 P450s - non-inhibitory predicted by the P450 classifiers Solubility - should be higher than a limit Each penalty term is the percentage of library compounds that violate the limits for each term

P450 Classifiers and Solubility Predictor P450s: 2d6, 3a4, 1a2, 2c9 –dataset(2d6): Active: ~3500; Inactive: ~4000 –method: 3 layer ANN –FP: 20%; FN: 10%; Ambiguous % Solubility –N = ~550 –3 layer ANN –rms error ~1.0 log unit

Logon page

Experiment list

New Experiment Page

Spreadsheet Page

Structure Show

MW/ClogP

Conclusions PICCOLO is an in-house library design system that can simultaneously optimise all the factors we care about Important developability parameters are taken into account Expandable to include other criteria A Web based system being used by SB chemists worldwide

Acknowledgements Colleagues in Cheminformatics Department Ken Kopple Jie Liang (now at Univ. Illinois at Chicago) Medicinal Chemists Todd Graybill, Jian Jin, Ronggang Liu, Tom Ku, Dennis Yamashita, Scott Thompson, Jia-Ning Xiang