Chemoinformatics P. Baldi, J. Chen, and S. J. Swamidass School of Information and Computer Sciences Institute for Genomics and Bioinformatics University.

Slides:



Advertisements
Similar presentations
VARUNA – Towards a Grid- based Molecular Modeling Environment CICC/MACE – Meeting May 22, 2006 Mookie Baik Department of Chemistry & School of Informatics.
Advertisements

Introduction to Computational Chemistry NSF Computational Nanotechnology and Molecular Engineering Pan-American Advanced Studies Institutes (PASI) Workshop.
Computers in Chemistry Dr John Mitchell & Rosanna Alderson University of St Andrews.
Amber: How to Prepare Parameters for Non-standard Residues
Quantum Mechanics Calculations II Apr 2010 Postgrad course on Comp Chem Noel M. O’Boyle.
. Chapter 1. Introduction, perspectives, and aims. On the science of simulation and modelling. Modelling at bulk, meso, and nano scale. (2 hours). Chapter.
Introduction to Molecular Orbitals
Chemistry 6440 / 7440 Semi-Empirical Molecular Orbital Methods.
Computational Chemistry
Case Studies Class 5. Computational Chemistry Structure of molecules and their reactivities Two major areas –molecular mechanics –electronic structure.
Molecular Modeling of Crystal Structures molecules surfaces crystals.
Lecture 3 – 4. October 2010 Molecular force field 1.
Molecular Simulation. Molecular Simluation Introduction: Introduction: Prerequisition: Prerequisition: A powerful computer, fast graphics card, A powerful.
One assumes: (1) energy, E  (- ℏ /i)  /  t (2) momentum, P  ( ℏ /i)  (3) particle probability density,  (r,t)  = i  /  x + j  /  y + k  / 
Jeffery Loo NLM Associate Fellow ’03 – ’05 chemicalinformaticsforlibraries.
Nuclear Radiation Basics Empirically, it is found that --
Exploring Chemical Space with Computers—Challenges and Opportunities Pierre Baldi UCI.
3. Chemical Data and Data Bases. 2 Datasets and Databases Many small datasets are available Several commercial databases of compounds and reactions (e.g.
Computers in Chemistry Dr John Mitchell University of St Andrews.
Lecture4: relationship between physics and other fields of sciences.
From T. MADHAVAN, & K.Chandrasekaran Lecturers in Zoology.. EXIT.
Ch 9 pages ; Lecture 21 – Schrodinger’s equation.
Density Functional Theory And Time Dependent Density Functional Theory
Computational Chemistry. Overview What is Computational Chemistry? How does it work? Why is it useful? What are its limits? Types of Computational Chemistry.
An Introduction to Molecular Orbital Theory. Levels of Calculation Classical (Molecular) Mechanics quick, simple; accuracy depends on parameterization;
Calculation of Molecular Structures and Properties Molecular structures and molecular properties by quantum chemical methods Dr. Vasile Chiş Biomedical.
Homology Modeling David Shiuan Department of Life Science and Institute of Biotechnology National Dong Hwa University.
Computational Chemistry
Cédric Notredame (30/08/2015) Chemoinformatics And Bioinformatics Cédric Notredame Molecular Biology Bioinformatics Chemoinformatics Chemistry.
Introduction. What is Computational Chemistry?  Use of computer to help solving chemical problems Chemical Problems Computer Programs Physical.
1 The Discovery Informatics Framework Pat Rougeau President and CEO MDL Information Systems, Inc. Delivering the Integration Promise American Chemical.
Molecular Modeling Fundamentals: Modus in Silico C372 Introduction to Cheminformatics II Kelsey Forsythe.
By: Lea Versoza. Chemistry  A branch of physical science, is the study of the composition, properties and behavior of matter.  Is concerned with atoms.
Computational Science jsusciencesimulation Principles of Scientific Simulation Spring Semester 2005 Geoffrey Fox Community.
Knowledgebase Creation & Systems Biology: A new prospect in discovery informatics S.Shriram, Siri Technologies (Cytogenomics), Bangalore S.Shriram, Siri.
Development of Bioinformatics and its application on Biotechnology
Lectures Introduction to computational modelling and statistics1 Potential models2 Density Functional.
1 Bio + Informatics AAACTGCTGACCGGTAACTGAGGCCTGCCTGCAATTGCTTAACTTGGC An Overview پرتال پرتال بيوانفورماتيك ايرانيان.
Computational Biology BS123A/MB223 UC-Irvine Ray Luo, MBB, BS.
Computational Chemistry, WebMO, and Energy Calculations
Introduction to Pharmacoinformatics
Chemistry I Spring – Understand what CSM is – Be able to apply WebMO in learning chemistry.
Chem 1140; Molecular Modeling Molecular Mechanics Semiempirical QM Modeling CaCHE.
Use of Machine Learning in Chemoinformatics Irene Kouskoumvekaki Associate Professor December 12th, 2012 Biological Sequence Analysis course.
Molecular Mechanics Studies involving covalent interactions (enzyme reaction): quantum mechanics; extremely slow Studies involving noncovalent interactions.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
1.1 – What is Science?. What is Science? Science is … Knowledge – what we know A process – how we discover new things Driven by curiosity Asking questions.
Coordinate Systems for Representing Molecules : Cartesian (x,y,z) – common in MM 2. Internal coordinates (Z-matrix) – common in QM ** It is easy.
MODELING MATTER AT NANOSCALES 3. Empirical classical PES and typical procedures of optimization Classical potentials.
NCN nanoHUB.org Wagner The basics of quantum Monte Carlo Lucas K. Wagner Computational Nanosciences Group University of California, Berkeley In collaboration.
Information Technology in the Natural Sciences Biology – Chemistry – Physics.
TURBOMOLE Lee woong jae.
An overview of Bioinformatics. Cell and Central Dogma.
Javier Junquera Introduction to atomistic simulation methods in condensed matter Alberto García Pablo Ordejón.
H. L. Janardhan Materials science division PPISR, Bangalore
Developing a Force Field Molecular Mechanics. Experimental One Dimensional PES Quantum mechanics tells us that vibrational energy levels are quantized,
Physics Lecture 11 3/2/ Andrew Brandt Monday March 2, 2009 Dr. Andrew Brandt 1.Quantum Mechanics 2.Schrodinger’s Equation 3.Wave Function.
Use of Machine Learning in Chemoinformatics
Advanced methods of molecular dynamics 1.Monte Carlo methods 2.Free energy calculations 3.Ab initio molecular dynamics 4.Quantum molecular dynamics 5.Trajectory.
Comp. Mat. Science School Electrons in Materials Density Functional Theory Richard M. Martin Electron density in La 2 CuO 4 - difference from sum.
Dynamical Systems Modeling
Computational Chemistry:
Computational Chemistry
Chapter 2 Molecular Mechanics
Chapter 1 Section 1 Chemistry Is a Physical Science
Molecular Docking Profacgen. The interactions between proteins and other molecules play important roles in various biological processes, including gene.
Computational Analysis
Prof. Sanjay. V. Khare Department of Physics and Astronomy,
Modern Physics Photoelectric Effect Bohr Model for the Atom
Computational Nanotechnology
Presentation transcript:

Chemoinformatics P. Baldi, J. Chen, and S. J. Swamidass School of Information and Computer Sciences Institute for Genomics and Bioinformatics University of California, Irvine

2 Overall Outline 1.Introduction 2.Molecular Representations 3.Chemical Data and Databases 4.Molecular Similarity 5.Chemical Reactions 6.Machine Learning and Other Predictive Methods 7.Molecular Docking and Drug Discovery

3 1. Introduction What is Chemoinformatics Resources Brief Historical Perspective Chemical Space: Small Molecules Overview of Problems and Methods

4 What is Chemoinformatics? chemoinformatics encompasses the design, creation, organisation, management, retrieval, analysis, dissemination, visualization and use of chemical information

5 What is Chemoinformatics? "the mixing of information resources to transform data into information and information into knowledge, for the intended purpose of making better decisions faster in the arena of drug lead identification and optimizaton"

6 What is Chemoinformatics? “the set of computer algorithms and tools to store and analyse chemical data in the context of drug discovery and design projects” However: drug design/discovery is to chemoinformatics like DNA/RNA/ protein sequencing is to bioinformatics

7 Resources Books: J. Gasteiger, T. E. and Engel, T. (Editors) (2003). Chemoinformatics: A Textbook. Wiley. A.R. Leach and V. J. Gillet (2005). An Introduction to Chemoinformatics. Springer. Journal: Journal of Chemical Information and Modeling Web: and many more………

8 Brief Historical Perspective Historical perspective: physics, chemistry and biology Theorem: computers/biology or computers/physics>> computers/chemistry Proof: Genbank, Swissprot, PDB, Web (CERN), etc..

9 Caveat: Long Tradition Quantum Mechanics Docking Beilstein ACS Etc… Gasteiger, J. (2006). "Chemoinformatics: a new field with a long tradition." Anal Bioanal Chem(384):

10 Possible Causes Alchemy Industrial age and early commercial applications of chemistry Concurrent development of modern computers and modern biology Scientific differences (theory/process) Psychological perceptions (life/inert) ACM

11 Chemical Space: Small Molecules in Organic Chemistry Understanding chemical space Small molecules: –chemical synthesis –drug design – chemical genomics, –systems biology – nanotechnology –etc

12 “A mathematician is a machine that converts coffee into theorems” P. Erdos

13 Cholesterol

14 Aspirin

15 “A chemoinformatician is a machine …..…”

16 Chemical Space StarsSmall Mol. Existing Virtual (?) Mode RealVirtual Access Difficult“Easy”

17 Chemoinformatics Historical perspective: physics, chemistry and biology Understanding chemical space Small molecules (chemical synthesis, drug design, chemical genomics, systems biology, nanotechnology) Predict physical, chemical, biological properties (classification/regression) Build filters/tools to efficiently navigate chemical space to discover new drugs, new reactions, new “galaxies”, etc.

18 Chemo/Bio Informatics Two Key Ingredients 1. Data 2. Similarity Measures Bioinformatics analogy and differences: –Data (GenBank, Swissprot, PDB) –Similarity (BLAST)

19 Computational/Predictive Methods Spetrum of methods: –Quantum Mechanics – …. –Molecular Mechanics – …. –Machine Learning

20 Quantum Mechanics Schrodinger’s Equation (time independent) Hψ=Eψ H=(-h 2 /8π 2 m)∂ 2 +V = Hamiltonian Operator E= Energy V =external potential (time independent) ψ= ψ(x,t) =(complex) wave function = ψ(x)T(t) (time independent case) Ψ 2 = Ψ* Ψ =probability density function (particle at position x)

21 Schrodinger Equation Partial differential eigenvalue equation Where are the electrons and nuclei of a molecule in space? Uncer a given set of conditions, what are their energies? Difficult to solve exactly as number of particle grows (electron-electron interactions, etc) Approximate methods –Ab initio –Semi empirical 3D structures Reaction mechanisms, rates

22 Ab Initio Limited to tens of atoms and best performed using a cluster or supercomputer Can be applied to organics, organo-metallics, and molecular fragments (e.g. catalytic components of an enzyme) Vacuum or implicit solvent environment Can be used to study ground, transition, and excited states (certain methods) Specific implementations include: GAMESS, GAUSSIAN, etc.

23 Semiempirical Methods Semiempirical methods use parameters that compensate for neglecting some of the time consuming mathematical terms in Schrodinger's equation, whereas ab initio methods include all such terms. The parameters used by semiempirical methods can be derived from experimental measurements or by performing ab initio calculations on model systems.Limited to hundreds of atoms Can be applied to organics, organo-metallics, and small oligomers (peptide, nucleotide, saccharide) Can be used to study ground, transition, and excited states (certain methods). Specific implementations include: AMPAC, MOPAC, and ZINDO.

24 Molecular Mechanics Force field approximation Ignore electrons Calculate energy of a system as a function of nuclear positions

25 Molecular Mechanics Energy = Stretching Energy + Bending Energy + Torsion Energy + Non-Bonded Interactions Energy

26 Stretching Energy

27 Bending Energy

28 Torsion Energy

29 Non-Bonded Energy

30 Statistical/Machine Learning Methods NNs and recursive NNs GA SGs Graphical Models Kernels ……… Representations are essential. Must either (1) deal with non-standard data structures of variable size; or (2) represent the data in a standard vector format.