Building Hypotheses and Searching Databases

Slides:



Advertisements
Similar presentations
Scientific & technical presentation JChem Cartridge for Oracle
Advertisements

1 Sequential Screening S. Stanley Young NISS HTS Workshop October 25, 2002.
INTRODUCTION TO THE BEILSTEIN AND GMELIN DATABASES Margarete Bower Chemistry Library.
3D Molecular Structures C371 Fall Morgan Algorithm (Leach & Gillet, p. 8)
PharmaMiner: Geometric Mining of Pharmacophores 1.
M. Wagener 3D Database Searching and Scaffold Hopping Markus Wagener NV Organon.
The ConQuest Interface query space perform Boolean algebra with queries Boolean algebra with hit lists 2D & 3D browser for results.
Chapter 1 Organic Chemistry. Organic chemistry: Organic chemistry: the study of the compounds of carbon.  Organic compounds are made up of carbon and.
eHiTS Score Darryl Reid, Zsolt Zsoldos, Bashir S. Sadjad, Aniko Simon, The next stage in scoring function evolution: a new statistically.
Pharmacophore and FTrees
Module 2: Structure Based Ph4 Design
Molecular Descriptors
Topological Summaries: Using Graphs for Chemical Searching and Mining Graphs are a flexible & unifying model Scalable similarity searches through novel.
Chapter 10 Homework:10.13, 10.17, 10.18, 10.19, 10.24, 10.25,10.27, 10.29, 10.30, 10.32, 10.34, 10.35,
Chapter 2 – Chemical Composition of the Body “Because living things, including humans, are composed only of chemicals, it is absolutely essential for a.
Introduction to Organic Chemistry Bettelheim, Brown, Campbell and Farrell Chapter 10.
Use of Machine Learning in Chemoinformatics Irene Kouskoumvekaki Associate Professor December 12th, 2012 Biological Sequence Analysis course.
In silico discovery of inhibitors using structure-based approaches Jasmita Gill Structural and Computational Biology Group, ICGEB, New Delhi Nov 2005.
Virtual Screening C371 Fall INTRODUCTION Virtual screening – Computational or in silico analog of biological screening –Score, rank, and/or filter.
Carbon and the Molecular Diversity of Life Chapter 4.
Carbon Compounds VERSITILE CARBON  Carbon has a valence of 4 which makes it capable of entering into 4 covalent bonds.
PharmaMiner: Geometric Mining of Pharmacophores 1.
Catalyst TM What is Catalyst TM ? Structural databases Designing structural databases Generating conformational models Building multi-conformer databases.
Generating Synthetically Accessible Ligands by De Novo Design Synthetic Sprout A Peter Johnson Krisztina Boda Attilla Ting Jon Baber.
Lecture 3 Chemical building blocks: water amino acids, proteins.
Use of Machine Learning in Chemoinformatics
Lesson Overview Lesson Overview Carbon Compounds Lesson Overview 2.3 Carbon Compounds.
Organic Compounds Organic molecules are a class of molecules which contain CARBON. Organic molecules are composed of C, H, O, N, P, and S. They are large.
Protein Proteins are biochemical compounds consisting of one or more polypeptides typically folded into a globular or fibrous form in a biologically functional.
Chapter 4. The Chemistry of Carbon
Lecture 13 February 16, 2016 Biotech 3.
Amino Acid & Basic Classification
Lesson Overview 2.3 Carbon Compounds.
Chapter 4. The Chemistry of Carbon
Lesson Overview 2.3 Carbon Compounds.
Chapter 24From Petroleum to Pharmaceuticals
PDBemotif A web based integrated search service to understand ligand binding and secondary structure properties in macromolecular structures.
Lesson Overview Lesson Overview Carbon Compounds Lesson Overview 2.3 Carbon Compounds.
Chapter 3. Amino Acids and Peptides
CHAPTER 4 CARBON AND THE MOLECULAR DIVERSITY OF LIFE
Protein Structure September 7,
AP Biology Chapter 4 P58-64 Modeling Organic Molecules
Lesson Overview 2.3 Carbon Compounds.
Macromolecules & Functional Groups
BIOCHEMISTRY.
Chemistry of Carbon and
Organic Compounds Organic molecules are a class of molecules which contain CARBON. Organic molecules are composed of C, H, O, N, P, and S. They are large.
Chapter 4. The Chemistry of Carbon
Protein Structure and Function
Functional Groups Unit 2.
Functional Groups Unit 3.
Virtual Screening.
Building Blocks of Life
Carbon and the Molecular Diversity of Life
Lesson Overview 2.3 Carbon Compounds.
CARBON Organic compounds
Study Question: What are enzymes?
Lesson Overview 2.3 Carbon Compounds.
Amines AH Chemistry, Unit 3(b).
Chapter 10 Organic Chemistry
Lesson Overview 2.3 Carbon Compounds.
Lesson Overview 2.3 Carbon Compounds.
Lesson Overview 2.3 Carbon Compounds.
Lesson Overview 2.3 Carbon Compounds.
Aim: How are carbon compounds named and drawn?
Lesson Overview 2.3 Carbon Compounds.
The Chemistry of Carbon
Lesson Overview 2.3 Carbon Compounds.
Organic Structure Among neutral (uncharged) organic compounds
PROTEINS.
Presentation transcript:

Building Hypotheses and Searching Databases

Two ways of creating an hypothesis Automatically create a chemical feature-based hypothesis from a set of compounds with respect to a type of activity. Build an hypothesis by assembling substructures and chemical functions and specifying the geometric constraints between them.

Components of an Hypothesis Assemble the components from known data such as the atomic coordinates available from X-ray crystallographic data. Express the characteristics of the hypothesis as a collection of particular chemical substructures or a collection of chemical functions such as hydrogen bond donors and hydrophilic groups, or a combination of substructures and chemical functions.

Chemical Substructures Available The feature dictionary contains a large library of chemical functional groups such as primary, secondary, tertiary amines, hydroxyl, carbonyl, acridyl, acetoamido, 1-beta-glucopyranosyl, amino acids etc, etc.

Chemical Functions Available The chemical functions available include HB ACCEPTOR, HB ACCEPTOR lipid, HB DONOR, HYDROPHOBIC, HYDROPHOBIC aliphatic, HYDROPHOBIC aromatic, NEG CHARGE, NEG IONIZABLE, POS CHARGE, POS IONIZABLE, RING AROMATIC

Chemical Functions Available The distances, angles, and/or torsions between items in an hypothesis, the preferred location of a chemical feature, and a range of elements per atom position may be specified within the hypothesis or a substructure of the hypothesis. Excluded volumes may also be specified.

Using the hypothesis? Having built the hypothesis databases may then be searched with the hypothesis to find compounds within the databases that match the hypothesis.

Building a Substructure Hypotheses and Searching Databases

Hydrogen count set to “anything”

Specifying atom range per atom position

Specifying atom range per atom position

Searching a Database with an Hypothesis Once the hypothesis has been designed and built it may be used to search a database for compounds that contain the defined features.

Results of Database search. Default 300 hits.

Hit example 1

Hit example 1

Compound data

Compound sort. Handling large datasets.

Compound property report.

Managing Databases: Coping with extremely large numbers. Are scientists going to manage in the new world without an in depth knowledge of mathematics and statistics? Are we training scientists to cope with tools such as, cluster analysis, discriminate analysis, cross validation techniques, neural networks, Fourier transformations etc, etc. Do we need courses in the design, building and interrogation of databases.

Building a Feature Based Hypotheses and Searching Databases

Setting distance constraints.

Setting distance constraints.

Hybrid hypothesis of chemical functions and fragments.

Using the generic b-adrenergic agonist to search a database.

Databases? Global structural databases e.g. CCSD Structure specific databases. Therapeutic area based databases. Multi-conformer databases. Composite databases that encapsulate the information of multi-gigabyte files. QSAR based databases. Commercially available versus problem specific databases.