05/02/2008 Jae Hyun Kim Genome scale enzyme-metabolite and drug-target interaction predictions using the signature molecular descriptor Faulon, J. L.,

Slides:



Advertisements
Similar presentations
Enzymes What are enzymes?
Advertisements

Protein Threading Zhanggroup Overview Background protein structure protein folding and designability Protein threading Current limitations.
Structural bioinformatics
Enzymes. What is an enzyme? globular protein which functions as a biological catalyst, speeding up reaction rate by lowering activation energy without.
Protein Homology Detection Using String Alignment Kernels Jean-Phillippe Vert, Tatsuya Akutsu.
Enzymes.
Enzymes, Coenzymes, And Energy Chapter 5. Nutrients Nutrients are molecules required by organisms for growth, reproduction, or repair. Nutrients are a.
 Predicting interactions between small molecules and proteins › Vital to the drug discovery process › Key to understanding biological processes  3 classes.
Systematic Analysis of Interactome: A New Trend in Bioinformatics KOCSEA Technical Symposium 2010 Young-Rae Cho, Ph.D. Assistant Professor Department of.
Cédric Notredame (30/08/2015) Chemoinformatics And Bioinformatics Cédric Notredame Molecular Biology Bioinformatics Chemoinformatics Chemistry.
Biomolecules: Nucleic Acids and Proteins
1 II. Enzymes Proteins Organic catalysts that speed up the rate of a reaction, but are not used up Lower energy of activation Are specific in action, i.e.,
Using reaction mechanism to measure enzyme similarity Noel M. O'Boyle, Gemma L. Holliday, Daniel E. Almonacid and John B.O. Mitchell Unilever Centre for.
Enzymes (B7).
Overview Enzymes are specialized proteins that function as catalysts to increase the rate of biochemical reactions. By interacting with substrates (reactant.
Enzymes grouped in 6 major classes: (p. 643) 1. Oxidoreductases: Double-barreled name catalyze the reduction or oxidation of a molecule. 2. Transferases:
 Four levels of protein structure  Linear  Sub-Structure  3D Structure  Complex Structure.
Enzyme Activity Lab 13 AP Biology
Chemical Reactions & Enzymes. I. Chemistry A. We already know that all living things are made up of chemical compounds. What are they again? Which give.
Pairwise alignment of DNA/protein sequences I519 Introduction to Bioinformatics, Fall 2012.
Polymer Molecule made of many monomers bonded together
Discovering the Correlation Between Evolutionary Genomics and Protein-Protein Interaction Rezaul Kabir and Brett Thompson
Construction of Substitution Matrices
Intel Confidential – Internal Only Co-clustering of biological networks and gene expression data Hanisch et al. This paper appears in: bioinformatics 2002.
MACROMOLECULE REVIEW. Carbon Compounds Most matter in your body that is not water is made of organic compounds Organic compounds contain carbon atoms.
Lesson Overview Lesson Overview Chemical Reactions and Enzymes Lesson Overview 2.4 Chemical Reactions and Enzymes.
Proteomics Session 1 Introduction. Some basic concepts in biology and biochemistry.
Video Questions What do the following prefixes mean? Mono, Poly, Exo, and End What do you need to have “LIFE”? Draw a picture of an atom. Why do atoms.
Question and Answer Samples and Techniques
341- INTRODUCTION TO BIOINFORMATICS Overview of the Course Material 1.
Themes: Structure meets Function
Name Date Hour Notes: Unit 1--Protein. (1) What is a protein? Type of Biomolecule Nitrogen Based Molecule.
Final Report (30% final score) Bin Liu, PhD, Associate Professor.
Macromolecules Protein. Proteins Probably the most diverse group of macromolecules is the proteins.
PROTEIN STRUCTURE (Donaldson, March 10,2003) What are we trying to learn about genes and their proteins: Predict function for unknown protein by comparison.
 Contain carbon, hydrogen, oxygen, nitrogen, and sulfur  Serve as structural components of animals  Serve as control molecules (enzymes)  Serve.
SRI International Bioinformatics Selected PathoLogic Refining Tasks Creation of Protein Complexes Assignment of Modified Proteins Operon Prediction.
PROTEINS.
Modeling Cell Proliferation Activity of Human Interleukin-3 (IL-3) Upon Single Residue Replacements Majid Masso Bioinformatics and Computational Biology.
Title: Lesson 4 B.2 Enzymes Learning Objectives: – Describe the structure and the function of an enzyme – Identify and explain the factors that affect.
Organic Macromolecules: Proteins and Nucleic Acids.
Bioinformatics Overview
Preview Science Concepts Using Science Graphics Math Skills.
© SSER Ltd..
Sample Problem 20.1 The Enzyme Active Site
Amino Acids, Proteins & Enzymes Chapter 16
Enzymes Worksheet catalyst amino acids different function
Proteins & Enzymes.
Enzymes Learning Outcome B11.
Protein Structure and Function
Proteins!.
Enzymes Biology 9(C).
Chapter 2: Macromolecules
Biochemistry Enzymes.
Lesson 2.4: Chemical Reactions & Enzymes
Study Question: What are enzymes?
Biological Catalysts - Enzymes
Enzymes and Temperature
Enzymes.
Annotation Presentation
Do Now Get out homework for checking
Amino Acids An amino acid is any compound that contains an amino group (—NH2) and a carboxyl group (—COOH) in the same molecule.
Proteins and Enzymes 2:3.
Biomolecules Enzymes.
. All of the organic molecules are based on which element?
Enzyme digesting a molecule
Proteins and Enzymes 2:3.
Reactions Enzymes More Enzymes! It’s Elemental Random Stuff 100 pt
Presentation transcript:

05/02/2008 Jae Hyun Kim Genome scale enzyme-metabolite and drug-target interaction predictions using the signature molecular descriptor Faulon, J. L., M. Misra, et al. (2008), Bioinformatics 24(2):

Terminology Motivation Method  Molecular Signature  Signature Kernel  Signature Product Kernel Results Conclusion 2 Contents

Catalyst  Increases the rate of chemical reaction / biological process  Remains unchanged Enzyme  Biomolecules that catalyze chemical reactions  Usually proteins Metabolite  Intermediates & products of metabolism  Restricted to small molecules 3 Terminology (1) Reference:

Inhibitor  Molecules that decrease enzyme activity  Compete with substrates  Most of drugs/poisons 4 Terminology (2) Reference:

EC Number  Numerical Classification scheme for Enzyme- catalyzed reactions  Four levels of hierarchy Example: EC : tripeptide aminopeptidases  EC 3 : hydrolases (enzymes that use water to break up some other molecules )  EC 3.4 : hydrolases that act on peptide bonds  EC : hydrolases that cleave off the amino- terminal amino acid from polypeptide  EC : hydrolases that cleave off the amino- terminal end from a tripeptide 5 Enzyme Commission (EC) Number Reference:

Genome scale enzyme-metabolite and drug-target interaction predictions using the signature molecular descriptor 6 Motivation Protein-Chemical Interaction Large-scale Machine-learning Technique

G=(V,E) : Molecular Graph  V : vertex (atom) set  E : edge (bond) set Atomic Signature  Canonical representation of subgraph surrounding a particular atom  include atoms and bonds up to a predefined distance (height) Molecular Signature of G : h  (G)  h  G (x) : atomic signature in G rooted at x of height h  Height Chemicals : 0~6 Protein: 6~18 (amino acid residue 1~7) 7 Molecular Signature

Molecular Signature: Example 8 (Leucine) (Isoleucine)(Glycine) Depth First Search up to “height” deep ‘(‘ going down, ‘)’ going back up c_, n_: sp3 carbon/nitrogen atom c=, o= : sp2 (double-bond) carbon/oxygen atom h_: hydrogen

General form of enzymatic reaction R  s 1 S 1 +s 2 S 2 +…+s n S n  p 1 P 1 +p 2 P 2 +…+p m P m Height h signature of reaction R 9 Reaction Signature

To predict/classify protein-protein interactions  To measure similarity between two pairs of proteins  Kernel Function K( (X 1,X 2 ), (X’ 1,X’ 2 ) ) How to measure similarity between pairs? 10 Pairwise Kernel

Pairwise similarity by component similarity  If X 1 ~X 1 ’ and X 2 ~X 2 ’ then (X 1,X 2 )~(X 1 ’,X 2 ’) Assess directly similarity between pairs  x 12 = (x 1i x 2j + x 2i x 1j ): pairwise representation of (X 1, X 2 ) Similarity inside the pair  Similarity between pairs 11 Kernel Types From Ben-Hur, A. and W. S. Noble (2005). "Kernel methods for predicting protein-protein interactions." Bioinformatics 21 Suppl 1: i38-46.

Definition  Apply to chemicals, proteins, reactions 12 Signature Kernel

P: Protein, C: Chemical Definition : Signature of Complex P  C Two pairs of P-C interaction (P,C) & (Q,D) 13 Signature Product Kernel (1/2)

Similarly, Therefore, 14 Signature Product Kernel (2/2)

Signature Kernel : Example (height 1) 15 # of occurrence

Signature Product Kernel : Example 16

Signature Similarity VS. Sequence Alignment Scores 17 Computed for every pair of amino acids Correlation : Chemically similar  high BLOSUM62 score

Positive Examples  download from KEGG  more than 50, max 500 Negative Examples:  Equal Number, Random Selection Signature Kernel, 5-fold CV 18 EC Number Classification Using only reactions Using only protein sequences

EC Classification 19 Class 1Class 1.1 Class 1.1.1Class Using both sequences & reactions Signature Product Kernel

Comparison with other Methods 20 Accuracy = (TP+TN)/ (TP+TN+FP+FN) Auc = Area Under Curve Precision = TP/(TP+FP) Sensitivity=TP/(TP+FN) Specificity=TN/(TN+FP) Jaccard Coefficient = TP/(TP+FP+FN) A larger number indicates better results

Prediction  EC No. accepted in September 2006 : Test Set  Predict whether or not a given enzyme will catalyze a given reaction Signature Product Kernel 21 Predicting New Enzyme Interactions

Predict DRUGBANK Using KEGG 22 Area under ROC = 0.74 Signature Product Kernel Class I : Both in training set Class II: Different Partners Class III: Only Target Class IV: Only Drug Class V: None

Unified method for predicting protein- chemical interactions Atomistic structure representation of proteins encompasses information stored in substitution matrices. 23 Conclusion