Shape and Color Clustering with SAESAR Norah E. MacCuish, John D. MacCuish, and Mitch Chapman Mesa Analytics & Computing, Inc.

Slides:



Advertisements
Similar presentations
Slide 1 CheVi Tutorial Aniko Simon, Darryl Reid, Sing Yoong Khew, and Zsolt Zsoldos SimBioSys presents.
Advertisements

1/2/2014 (c) 2000, Ron S. Kenett, Ph.D.1 Statistics for Molecular Biology and Bioinformatics Instructor: Ron S. Kenett
Dynamic web application for drug design research M. Chapman 1, N. MacCuish 1, J. MacCuish 1 J. Bradley 2, J. Blankley 3 1 Mesa Analytics & Computing, Inc.,
Parallel tiered clustering for large data sets using a modified Taylors algorithm J. MacCuish 1 N. MacCuish 1, M. Chapman 1 1 Mesa Analytics & Computing,
1
JKlustor clustering chemical libraries presented by … maintained by Miklós Vargyas Last update: 25 March 2010.
A Novel SAR-Driven Approach for Identifying True High-Throughput Screening Hits S. Frank Yan, Hayk Asatryan, Jing Li, Kaisheng Chen, and Yingyao Zhou Genomics.
The new JKlustor suite Miklós Vargyas Solutions for Cheminformatics.
ChemAxon in 3D Gábor Imre, Adrián Kalászi and Miklós Vargyas Solutions for Cheminformatics.
1 Miklós Vargyas May, 2005 Compound Library Annotation.
1 Miklós Vargyas, Judit Papp May, 2005 MarvinSpace – live demo.
Chapter 7 System Models.
Copyright © 2003 Pearson Education, Inc. Slide 1 Computer Systems Organization & Architecture Chapters 8-12 John D. Carpinelli.
Multicriteria Decision-Making Models
Copyright © 2011, Elsevier Inc. All rights reserved. Chapter 6 Author: Julia Richards and R. Scott Hawley.
Variance Estimation in Complex Surveys Third International Conference on Establishment Surveys Montreal, Quebec June 18-21, 2007 Presented by: Kirk Wolter,
Indiana University School of David Wild – CICC Quarterly Meeting, Jan Page 1 Projects 1-4 update David Wild CICC Quarterly Meeting January 27.
Designing Services for Grid-based Knowledge Discovery A. Congiusta, A. Pugliese, Domenico Talia, P. Trunfio DEIS University of Calabria ITALY
Microsoft Access 2007 Advanced Level. © Cheltenham Courseware Pty. Ltd. Slide No 2 Forms Customisation.
Process a Customer Chapter 2. Process a Customer 2-2 Objectives Understand what defines a Customer Learn how to check for an existing Customer Learn how.
Human Service Providers and Referrals Chapter 5. Human Service Providers and Referrals 5-2 Objectives Demonstrate the process for entering a Human Service.
1 Chapter 12 File Management Patricia Roy Manatee Community College, Venice, FL ©2008, Prentice Hall Operating Systems: Internals and Design Principles,
Pharmaceutical Salt Selection Suzanna Ward BRAINFEST II.
LIBRARY WEBSITE, CATALOG, DATABASES AND FREE WEB RESOURCES.
Solve Multi-step Equations
Break Time Remaining 10:00.
Information Systems Today: Managing in the Digital World
PP Test Review Sections 6-1 to 6-6
Bright Futures Guidelines Priorities and Screening Tables
Hash Tables.
The challenge ahead: Ocean Predictions in the Arctic Region Lars Petter Røed * Presented at the OPNet Workshop May 2008, Geilo, Norway * Also affiliated.
Analysis of High-Throughput Screening Data C371 Fall 2004.
Copyright © 2012, Elsevier Inc. All rights Reserved. 1 Chapter 7 Modeling Structure with Blocks.
CONTROL VISION Set-up. Step 1 Step 2 Step 3 Step 5 Step 4.
MaK_Full ahead loaded 1 Alarm Page Directory (F11)
Understanding Generalist Practice, 5e, Kirst-Ashman/Hull
1 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt 10 pt 15 pt 20 pt 25 pt 5 pt Synthetic.
Artificial Intelligence
Before Between After.
TRIANGULAR MATRICES A square matrix in which all the entries above the main diagonal are zero is called lower triangular, and a square matrix in which.
1 Titre de la diapositive SDMO Industries – Training Département MICS KERYS 09- MICS KERYS – WEBSITE.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 12 View Design and Integration.
Weekly Attendance by Class w/e 6 th September 2013.
Chapter 13 Web Page Design Studio
Tutorial 1: Sensitivity analysis of an analytical function
1 Sequential Screening S. Stanley Young NISS HTS Workshop October 25, 2002.
3D Molecular Structures C371 Fall Morgan Algorithm (Leach & Gillet, p. 8)
PharmaMiner: Geometric Mining of Pharmacophores 1.
1 PharmID: A New Algorithm for Pharmacophore Identification Stan Young Jun Feng and Ashish Sanil NISSMPDM 3 June 2005.
M. Wagener 3D Database Searching and Scaffold Hopping Markus Wagener NV Organon.
SciFinder ® : Part of the process™ 2006 Edition. SciFinder ® : Part of the process™ 2006 Edition SciFinder ® 2006 provides new, powerful capabilities.
Active Learning Strategies for Compound Screening Megon Walker 1 and Simon Kasif 1,2 1 Bioinformatics Program, Boston University 2 Department of Biomedical.
Pharmacophore and FTrees
Computational Techniques in Support of Drug Discovery October 2, 2002 Jeffrey Wolbach, Ph. D.
Topological Summaries: Using Graphs for Chemical Searching and Mining Graphs are a flexible & unifying model Scalable similarity searches through novel.
Use of Machine Learning in Chemoinformatics Irene Kouskoumvekaki Associate Professor December 12th, 2012 Biological Sequence Analysis course.
Open source software and web services for designing therapeutic molecules G. P. S. Raghava, Head Bioinformatics Centre, Institute of Microbial Technology,
QSAR Study of HIV Protease Inhibitors Using Neural Network and Genetic Algorithm Akmal Aulia, 1 Sunil Kumar, 2 Rajni Garg, * 3 A. Srinivas Reddy, 4 1 Computational.
Virtual Screening C371 Fall INTRODUCTION Virtual screening – Computational or in silico analog of biological screening –Score, rank, and/or filter.
Selecting Diverse Sets of Compounds C371 Fall 2004.
December 1, Classification Analysis of HIV RNase H Bioassay Lianyi Han Computational Biology Branch NCBI/NLM/NIH Rocky ‘07.
PharmaMiner: Geometric Mining of Pharmacophores 1.
Use of Machine Learning in Chemoinformatics
Identification of structurally diverse Growth Hormone Secretagogue (GHS) agonists by virtual screening and structure-activity relationship analysis of.
Computational Approach for Combinatorial Library Design Journal club-1 Sushil Kumar Singh IBAB, Bangalore.
Indiana University School of Indiana University ECCR Summary Infrastructure: Cheminformatics web service infrastructure made available as a community resource.
HP-SEE In the search of the HDAC-1 inhibitors. The preliminary results of ligand based virtual screening Ilija N. Cvijetić, Ivan O. Juranić,
Building Hypotheses and Searching Databases
Virtual Screening.
Presentation transcript:

Shape and Color Clustering with SAESAR Norah E. MacCuish, John D. MacCuish, and Mitch Chapman Mesa Analytics & Computing, Inc.

ABSTRACT SAESAR identifies potentially interesting patterns in shape and color space for leads from HTS screening data. Analysis of a several public datasets will be described as well as a discussion of a successful analysis in an industrial setting.

SAESAR Features Data Exploration, Unsupervised and Supervised Learning with Shape, Electrostatics, and 2D Structure and Properties Powerful OpenEye Scientific Software and Mesa Analytics & Computing tools with Visualization and 2D and 3D depictions. Clustering Taylors (symmetric, asymmetric, non-disjoint, disjoint versions) Hierarchical (RNN implementations of Wards, Complete Link, Group Average) Conformer Generation OMEGA User supplied Modeling - Model Builder Classification Linear and Quadratic discrimination KNN Example Tasks Find Key Shapes Find Key Structures Find Key Color Groups Generate Predictive Model with Shape, Electrostatics, Color, 2D Structure, other variables

SAESAR - 2D & 3D Clustering on Shape and Pharmacophore Features 2D Descriptors MACCS drug like keys and public keys from PubChem, 768 key fingerprints* 3D Descriptors OEShape - volume overlap OEColor - hydrogen-bond donors, hydrogen- bond acceptors, hydrophobes, anions, cations, and rings, can be user defined *New key-based molecular fingerprinter for visualization and data analysis in compound clustering, similarity searching, and substructure commonality analysis,N. MacCuish, J.D.MacCuish, 233rd ACS, Chicago,March 25-29, 2007.

Mining Primary Screening Data Three primary screens -JNK3,Rock2,FAK Cluster hits in 3D shape (full, subshape) Cluster in 3D color Identify Key shape clusters Identify Key color clusters Validate with secondary screening data

Datasets DatasetScreenStructuresConformers JNK3Primary Secondary FAKPrimary Secondary Rock-2Primary Secondary

Results Summary DatasetSecondary Screen Matches Expected if random Significant JNK3 Shape Color yes FAK Shape Color yes no Rock-2 Shape Color Marginal No

JNK3 Key Shapes

Jnk3 Color & shape 8 matches

JNK3 Color & Shape Common Hits Secondary screening hits which group both by shape and color

Xray Structures and Key Shapes Rock2 (2H9V) Matches 1st Key shape FAK (2ETM) Matches 1st Key shape JNK3 (2EXC) Sub-shape Match

Lead Hopping For SIRT1 Activators* SIRT1 Actives and Not Actives Input to SAESAR 3D Key Shape Query Potential leads are in a different 2D space, but similar 3D space as the active SIRT1 compounds Available Compounds *See, J. Bemis, Bioorganic Gordon Research Conference, June 2008.

Lead Hopping For SIRT1 Activators SAESAR was used to identify key shapes which encapsulated 3D shape features of SIRT1 active compounds Key shapes were queries in a virtual screened against 3D database of Available compounds Sets of hits were identified: 20 compounds had highest overall shape matching Tanimoto scores 47 compounds had shape Tanimoto scores > compounds had Tversky score > 0.8 Compounds were ordered and screened in SIRT1 assay: one novel scaffold was identified with low micromolar activity optimization lowered SIRT1 activation potency

Acknowledgements Jean Bemis, Sirtris Pharmaceuticals, a GSK Company Evan Bolton, PubChem, NIH Software and Databases: CDK, R, PDB, ZINC, PubChem OpenEye Scientific Software, Inc.