Topological Summaries: Using Graphs for Chemical Searching and Mining Graphs are a flexible & unifying model Scalable similarity searches through novel.

Slides:



Advertisements
Similar presentations
Shape and Color Clustering with SAESAR Norah E. MacCuish, John D. MacCuish, and Mitch Chapman Mesa Analytics & Computing, Inc.
Advertisements

The new JKlustor suite Miklós Vargyas Solutions for Cheminformatics.
Analysis of High-Throughput Screening Data C371 Fall 2004.
Christopher Reynolds Supervisor: Prof. Michael Sternberg Bioinformatics Department Division of Molecular Biosciences Imperial College London.
 Data mining has emerged as a critical tool for knowledge discovery in large data sets. It has been extensively used to analyze business, financial,
1 Sequential Screening S. Stanley Young NISS HTS Workshop October 25, 2002.
3D Molecular Structures C371 Fall Morgan Algorithm (Leach & Gillet, p. 8)
PharmaMiner: Geometric Mining of Pharmacophores 1.
Jürgen Sühnel Institute of Molecular Biotechnology, Jena Centre for Bioinformatics Jena / Germany Supplementary Material:
Mining Graphs.
GraphSig: Mining Significant Substructures in Compound Libraries 1.
Jeffery Loo NLM Associate Fellow ’03 – ’05 chemicalinformaticsforlibraries.
M. Wagener 3D Database Searching and Scaffold Hopping Markus Wagener NV Organon.
Pharmacophore-based Molecular Docking Bert E. Thomas, Diane Joseph- McCarthy, Juan C.Avarez.
Design of Small Molecule Drugs Targeted to RNA RNA Ontology Group May
Active Learning Strategies for Drug Screening 1. Introduction At the intersection of drug discovery and experimental design, active learning algorithms.
Active Learning Strategies for Compound Screening Megon Walker 1 and Simon Kasif 1,2 1 Bioinformatics Program, Boston University 2 Department of Biomedical.
1 BrainWave Biosolutions Limited Accelerating Life Science Research through Technology.
Data Mining – Intro.
Predicting Highly Connected Proteins in PIN using QSAR Art Cherkasov Apr 14, 2011 UBC / VGH THE UNIVERSITY OF BRITISH COLUMBIA.
eHiTS Score Darryl Reid, Zsolt Zsoldos, Bashir S. Sadjad, Aniko Simon, The next stage in scoring function evolution: a new statistically.
Pharmacophore and FTrees
Computational Techniques in Support of Drug Discovery October 2, 2002 Jeffrey Wolbach, Ph. D.
Cédric Notredame (30/08/2015) Chemoinformatics And Bioinformatics Cédric Notredame Molecular Biology Bioinformatics Chemoinformatics Chemistry.
Graph Indexing Techniques Seoul National University IDB Lab. Kisung Kim
Similarity Methods C371 Fall 2004.
VAMOS Visualization of Accessible Molecular Space A new compound filtering and selection interface Spotfire User Conference - Europe - May , 2003.
Faculté de Chimie, ULP, Strasbourg, FRANCE
1 PowerMV Chemical Data Mining Environment S. Stanley Young Jun Feng and Jack Liu NISS MPDM, McMaster University 4 June 2005.
Use of Machine Learning in Chemoinformatics Irene Kouskoumvekaki Associate Professor December 12th, 2012 Biological Sequence Analysis course.
Text Mining: Fast Phrase-based Text Indexing and Matching Khaled Hammouda, Ph.D. Student PAMI Research Group University of Waterloo Waterloo, Ontario,
Xiangnan Kong,Philip S. Yu Department of Computer Science University of Illinois at Chicago KDD 2010.
Ozgur Ozturk, Ahmet Sacan, Hakan Ferhatosmanoglu, Yusu Wang The Ohio State University LFM-Pro: a tool for mining family-specific sites in protein structure.
In silico discovery of inhibitors using structure-based approaches Jasmita Gill Structural and Computational Biology Group, ICGEB, New Delhi Nov 2005.
Ligand-based drug discovery No a priori knowledge of the receptor What information can we get from a few active compounds.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
QSAR Study of HIV Protease Inhibitors Using Neural Network and Genetic Algorithm Akmal Aulia, 1 Sunil Kumar, 2 Rajni Garg, * 3 A. Srinivas Reddy, 4 1 Computational.
SimFinder: A Unique Topology- based Approach to Similarity Searching 1.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
University at BuffaloThe State University of New York Lei Shi Department of Computer Science and Engineering State University of New York at Buffalo Frequent.
Efficient EMD-based Similarity Search in Multimedia Databases via Flexible Dimensionality Reduction / 16 I9 CHAIR OF COMPUTER SCIENCE 9 DATA MANAGEMENT.
Virtual Screening C371 Fall INTRODUCTION Virtual screening – Computational or in silico analog of biological screening –Score, rank, and/or filter.
Bioinformatics MEDC601 Lecture by Brad Windle Ph# Office: Massey Cancer Center, Goodwin Labs Room 319 Web site for lecture:
Computer-aided drug discovery (CADD)/design methods have played a major role in the development of therapeutically important small molecules for several.
PharmaMiner: Geometric Mining of Pharmacophores 1.
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Catalyst TM What is Catalyst TM ? Structural databases Designing structural databases Generating conformational models Building multi-conformer databases.
PubChem: An Open Repository for Chemical Structure and Biological Activity Information Steve Bryant The NIH Biowulf Cluster: 10 Years of Scientific Supercomputing.
Use of Machine Learning in Chemoinformatics
Bioinformatics Research Overview Li Liao Develop new algorithms and (statistical) learning methods > Capable of incorporating domain knowledge > Effective,
Identification of structurally diverse Growth Hormone Secretagogue (GHS) agonists by virtual screening and structure-activity relationship analysis of.
Computational Approach for Combinatorial Library Design Journal club-1 Sushil Kumar Singh IBAB, Bangalore.
Subgraph Search Over Uncertain Graphs Erşan Demircioğlu.
Molecular Modeling in Drug Discovery: an Overview
1 Substructure Similarity Search in Graph Databases R 陳芃安.
Ning Jin, Wei Wang ICDE 2011 LTS: Discriminative Subgraph Mining by Learning from Search History.
Indiana University School of Indiana University ECCR Summary Infrastructure: Cheminformatics web service infrastructure made available as a community resource.
HP-SEE In the search of the HDAC-1 inhibitors. The preliminary results of ligand based virtual screening Ilija N. Cvijetić, Ivan O. Juranić,
Natural products from plants
Mining Coherent Dense Subgraphs across Multiple Biological Networks Vahid Mirjalili CSE 891.
Jump to first page Relational Data. Jump to first page Inductive Logic Programming (ILP) n Can use ILP to find a set of rules capturing a property that.
Page 1 Computer-aided Drug Design —Profacgen. Page 2 The most fundamental goal in the drug design process is to determine whether a given compound will.
Introduction to PubChem BioAssay
Immunological Library
Analysis of bio-molecular networks through RANKS (RAnking of Nodes
Building Hypotheses and Searching Databases
Virtual Screening.
Data Warehousing and Data Mining
جستجو در وب عميق ارائه‌دهنده: حسين شريفي‌پناه
Graph Classification SEG 5010 Week 3.
Presentation transcript:

Topological Summaries: Using Graphs for Chemical Searching and Mining Graphs are a flexible & unifying model Scalable similarity searches through novel index structure Mining of significant fragments in collections Classification of compounds based on significant fragments Development of SAR models through geometric mining Kekule structureGraph Graph with abstract nodes Protein-ligand interaction

Feature/Benefit Summary FeatureBenefitsNovel technology Similarity Searching (SimFinder tool)  Improved hit identification at lower cost  Better comparison to known drugs  Topology-based searching  Scalable index structure Active substructure identification (SigFinder tool)  Identification of active substructures for lead identification and optimization  Mining of protein- ligand complexes  Significance model based on graph analysis Pharmacophore analysis & development (PharmaMiner tool)  Better pharmacophore analysis & models  Fragment based lead discovery  Selective queries on multiple targets  Joint 3D pharmacophore space

SimFinder (Similarity Searching) Accuracy and scalability validated through retrospective studies Ongoing assays on beta-secretase (Alzheimer’s) Comparison of hits with alternative methods such as fingerprints Database of compounds Database of graphs Graph index structure Abstract nodes & distance matrix Substructure & similarity queries transformed to indexed using user-provided parameters accurate & unique results

SigFinder (Mining Significant Substructures) Compound library Database of feature vectors Significant substructures Prediction of biological activity Background statistical model Training data Statistics-based modeling of substructure significance Validation by identification of substructures specific to biological activity transformed to Pictorial view of substructure mining

PharmaMiner (Joint 3D Analysis of Pharmacophore Space) We define a Joint Pharmacophore Space by extracting 3D descriptors from diverse libraries/targets Analysis of pharmacophores in the Joint Space – Classification and clustering for understanding biological activity – Screening for compounds based on presence and/or absence of activity Fragment-based discovery and scaffold hopping Flexible definition of pharmacophoric features (donor, acceptor, hydrophobic core, etc.) 5

References Huahai He; Ambuj K. Singh; Closure-tree: An Index Structure for Graph Queries. Proceedings of the 22nd International Conference on Data Engineering (ICDE), April, 2006, pp DOI : /ICDE DOI : /ICDE Huahai He; Ambuj K. Singh; GraphRank: Statistical Modeling and Mining of Significant Subgraphs in the Feature Space. Proceedings of the 6th IEEE International Conference on Data Mining (ICDM), December, 2006, doi: /ICDM doi: /ICDM Sayan Ranu and Ambuj Singh, GraphSig: A Scalable Approach to Mining Significant Subgraphs in Large Graph Databases in 25th International Conference on Data Engineering (ICDE), 2009.GraphSig: A Scalable Approach to Mining Significant Subgraphs in Large Graph Databases Sayan Ranu; Ambuj Singh; Mining Statistically Significant Molecular Substructures for Efficient Molecular Classification., J. Chem. Inf. Model., 2009, 49(11), pp 2537–2550 DOI : /ci900035zDOI : /ci900035z Huahai He; Ambuj K. Singh; Graphs-at-a-time: Query Language and Access Methods for Graph Databases, SIGMOD, June, 2008, pp ISBN: Graphs-at-a-time: Query Language and Access Methods for Graph Databases