Cysteine Oxidation Prediction Program (COPP): A New Software Program That Predicts Reversible Protein Cysteine Thiol Oxidation Reactions Ricardo Sanchez,

Slides:



Advertisements
Similar presentations
Scientific & technical presentation Structure Visualization with MarvinSpace Oct 2006.
Advertisements

SOMA2 – Drug Design Environment. Drug design environment – SOMA2 The SOMA2 project Tekes (National Technology Agency of Finland) DRUG2000 program.
1 Miklós Vargyas, Judit Papp May, 2005 MarvinSpace – live demo.
Florida International University COP 4770 Introduction of Weka.
The Effect of New Parameters and Increased Database Size on the Cysteine Oxidation Prediction Program Megan Riddle with Ricardo Sanchez and Dr. Jamil Momand.
11/9/99ICTAI-99, Chicago1 Protein Secondary Structure Prediction Using Data Mining Tool C5 Meiliu Lu †, Du Zhang †, Hongjun Xu †, Ken Tse-yau Lau ‡, and.
Weka & Rapid Miner Tutorial By Chibuike Muoh. WEKA:: Introduction A collection of open source ML algorithms – pre-processing – classifiers – clustering.
Ligand Binding Site Prediction for HIV-1 Protease using Shape Comparison Techniques Manasi Jahagirdar 1, Vivek K Jalahalli 2, Sunil Kumar 1, A. Srinivas.
MitoInteractome : Mitochondrial Protein Interactome Database Rohit Reja Korean Bioinformation Center, Daejeon, Korea.
MS-Viewer – A Web Based Spectral Viewer For Database Search Results Peter R. Baker 1, Alma L. Burlingame 1 and Robert J. Chalkley 1 1 Mass Spectrometry.
Case Studies Class 5. Computational Chemistry Structure of molecules and their reactivities Two major areas –molecular mechanics –electronic structure.
Training a Neural Network to Recognize Phage Major Capsid Proteins Author: Michael Arnoult, San Diego State University Mentors: Victor Seguritan, Anca.
Prediction of NO Oxidizable Cysteines Larry Grant Dr. Jamil Momand Cal State L.A.
Automatic Information Retrieval from Bioinformatics Websites Kang Peng.
Alpha Shapes and molecular Representations Research Project CSC/MATH 870 SPRING 2007 Philipp Richter.
Computational Biology, Part 10 Protein Structure Prediction and Display Robert F. Murphy Copyright  1996, 1999, All rights reserved.
The Protein Data Bank (PDB)
Comparing protein structure and sequence similarities Sumi Singh Sp 2015.
Inverse Kinematics for Molecular World Sadia Malik April 18, 2002 CS 395T U.T. Austin.
Computational Chemistry. Overview What is Computational Chemistry? How does it work? Why is it useful? What are its limits? Types of Computational Chemistry.
Protein Interfaces, Surfaces and Assemblies
Discussion and conclusion The OGC SOS describes a global standard for storing and recalling sensor data and the associated metadata. The standard covers.
Erice 2008 Introduction to PDB Workshop From Molecules to Medicine: Integrating Crystallography in Drug Discovery Erice, 29 May - 8 June Peter Rose
Analysis of Molecular and Clinical Data at PolyomX Adrian Driga 1, Kathryn Graham 1, 2, Sambasivarao Damaraju 1, 2, Jennifer Listgarten 3, Russ Greiner.
M ULTIFRAME P OINT C ORRESPONDENCE By Naseem Mahajna & Muhammad Zoabi.
Marc Mansfield Stevens Institute Eunhee Kang (Stevens doctoral student) Jack F. Douglas Polymers Division, NIST Zeno Site- Website Providing a Computational.
Protein Secondary Structure Prediction with inclusion of Hydrophobicity information Tzu-Cheng Chuang, Okan K. Ersoy and Saul B. Gelfand School of Electrical.
Increasing the Value of Crystallographic Databases Derived knowledge bases Knowledge-based applications programs Data mining tools for protein-ligand complexes.
EMBL-EBI Adel Golovin MSDsite The project is funded by the European Commission as the TEMBLOR, contract-no. QLRI-CT under the RTD programme.
SMART Teams: Students Modeling A Research Topic Jmol Training 101!
PROTEIN STRUCTURE CLASSIFICATION SUMI SINGH (sxs5729)
EBI is an Outstation of the European Molecular Biology Laboratory. Annotation Procedures for Structural Data Deposited in the PDBe at EBI.
Molecular visualization
Molecular Dynamics Simulation of Bacterial [FeNi]-Hydrogenase Found in Desulfovibrio fructosovorans Solvated in a Water Sphere Introduction Hydrogenase.
Protein Folding and Modeling Carol K. Hall Chemical and Biomolecular Engineering North Carolina State University.
QSAR Study of HIV Protease Inhibitors Using Neural Network and Genetic Algorithm Akmal Aulia, 1 Sunil Kumar, 2 Rajni Garg, * 3 A. Srinivas Reddy, 4 1 Computational.
Biological Signal Detection for Protein Function Prediction Investigators: Yang Dai Prime Grant Support: NSF Problem Statement and Motivation Technical.
Associating Biomedical Terms: Case Study for Acetylation Aaron Buechlein Indiana University School of Informatics Advisor: Dr. Predrag Radivojac.
Virtual Screening C371 Fall INTRODUCTION Virtual screening – Computational or in silico analog of biological screening –Score, rank, and/or filter.
New Strategies for Protein Folding Joseph F. Danzer, Derek A. Debe, Matt J. Carlson, William A. Goddard III Materials and Process Simulation Center California.
Identification of amino acid residues in protein-protein interaction interfaces using machine learning and a comparative analysis of the generalized sequence-
Self-organized Models of Selectivity in Ca and Na Channels Bob Eisenberg 1, Dezső Boda 2, Janhavi Giri 1,3, James Fonseca 1, Dirk Gillespie 1, Douglas.
Introduction to Protein Structure Prediction BMI/CS 576 Colin Dewey Fall 2008.
Visually Demonstrating the Principles of Protein Folding Bill McClung, Jeff Schwehm, Greg Wolffe.
Structural classification of Proteins SCOP Classification: consists of a database Family Evolutionarily related with a significant sequence identity Superfamily.
Biochemistry Jeopardy Chem IS TRYEnzymesMacromoleculesPotpourriMore Potpourri
Improvement of Apriori Algorithm in Log mining Junghee Jaeho Information and Communications University,
Integrating Bioinformatics and Biochemistry Research into Middle School Physical Sciences Lessons Ricardo Sanchez 1, Brianna Rojas 2, and Dr. Jamil A Momand.
We propose an accurate potential which combines useful features HP, HH and PP interactions among the amino acids Sequence based accessibility obtained.
A new protein-protein docking scoring function based on interface residue properties Reporter: Yu Lun Kuo (D )
Autodock tutorial VINA with UCSF Chimera José R. Valverde CNB/CSIC © José R. Valverde, 2014 CC-BY-NC-SA.
Computational Docking Experiments to Find a Ligand that Will Bind to Xanthine Oxidase Lysengkeng Her and Thao Yang Department of Chemistry, University.
Madhavi Ganapathiraju Graduate student Carnegie Mellon University
Protein Structure Prediction and Protein Homology modeling
Extra Tree Classifier-WS3 Bagging Classifier-WS3
Support Vector Machine (SVM)
Zhiyu Li, Alfonso Mondragón, Russell J DiGate  Molecular Cell 
Virtual Screening.

Prediction of protein structure
Tamas Yelland, Snezana Djordjevic  Structure 
Influence of Protein Scaffold on Side-Chain Transfer Free Energies
Chemical Dissection of an Essential Redox Switch in Yeast
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
Mr.Halavath Ramesh 16-MCH-001 Dept. of Chemistry Loyola College University of Madras-Chennai.
S-Nitrosylation at the Interface of Autophagy and Disease
The Critical Role of Non-catalytic Cysteine Residues in Proteins
A Proposed Mechanism for Neurodegeneration in Movement Disorders Characterized by Metal Dyshomeostasis and Oxidative Stress  Benjamin Guy Trist, Dominic.
Presentation transcript:

Cysteine Oxidation Prediction Program (COPP): A New Software Program That Predicts Reversible Protein Cysteine Thiol Oxidation Reactions Ricardo Sanchez, Larry Grant, and Jamil A. Momand Chemistry and Biochemistry, California State University, Los Angeles Abstract References Li, H., Robertson, A.D., and Jensen, J.H Very fast empirical prediction and rationalization of protein pKa values. Proteins 61, 704–721. Miteva, M.A., Tuffery, P., and Villoutreix, B.O PCE: Web tools to compute protein continuum electrostatics. Nucleic Acids Res. 33: W372–W375. doi: /nar/gki365. Parente, A., Merrifield, B., Geracy, G., and D’Alessio, G Molecular basis of superreactivity of cysteiene residues 31 and 32 of seminal ribonuclease. Biochemistry 24, Quinlan, J.R C4.5: Programs for machine learning. Morgan Kaufmann Publishers, San Francisco,CA. Richmond, T. J Solvent accessible surface area and excluded volume in proteins. Analytical equations for overlapping spheres and implications for the hydrophobic effect. J. Mol. Biol. 178, Sanchez, R., Riddle, M., Woo, J., Momand, J Prediction of reversibly Oxidized Protein Cysteine Thiols Using Protein Structure Properties. Protein Sci. 17, Schreiter, E.R., Rodríguez, M.M., Weichsel, A., Montfort, W.R., and Bonaventura, J S-nitrosylation- induced conformational change in blackfin tuna myoglobin. J. Biol. Chem. 282, A web interface implementation of the COPA algorithm is presented here and it is available at copa.calstatela.edu. COPP is able to predict S-nitrosylation sites at 66.7% accuracy on this dataset. COPP is able to predict disulfide bond formation with 91.8% accuracy on this dataset. COPP is an exceptional tool for predicting reversible disulfide bond formation. Acknowledgements This work is supported by the NSF GK-12 Program (Award No ) and NSF-NIH Award number Travel funds provided by the FASEB MARC Program. A software program called the Cysteine Oxidation Prediction Program (COPP) was created and is accessible by internet ( through a Graphical User Interface. COPP uses molecular structure data from the Protein Data Bank to predict reversible cysteine thiol oxidation susceptibility in non- membrane proteins. COPP was created from an algorithm generated via the decision tree output of the J48 machine learning program. COPP was shown to be 80% accurate by jackknife analysis of a database that contained 161 oxidation susceptible cysteines and 161 oxidation non-susceptible cysteines. Of the cysteine thiols in the oxidation-susceptible portion of the database, 61% formed intramolecular disulfides and 39% formed ligands with oxygen, glutathione, or intermolecular disulfides to cysteine residue thiols. Protein S- nitrosylation is a common event but predicting the target cysteine thiols that become S-nitrosylated remains problematic. We used 29 known sites of S- nitrosylation to test the accuracy of our COPP program and found that 20 were predicted correctly, giving an accuracy of 69%. Out of the 20 that were predicted accurately, 19 were predicted to occur because the calculated target cysteine thiol pKa was equal to or less than 9.05 and there was greater than 1.3 A2 calculated surface exposure of the thiol group. Cysteine Oxidation Prediction Algorithm (COPA) Distance to next thiol ≤ 6.2Å Solvent accessibility of thiol ≥ 1.3Å 2 Thiol pKa ≤ 9.05 Oxidation Susceptible Not Oxidation Susceptible YES NO YES C 4.5/J48COPA Protein Properties Unknown Protein Data Prediction Rules Known Protein Data Prediction 80% Prediction Accuracy Methods Oxidative stress has been linked to aging and disease in the human body. For many proteins, cysteine oxidation is the means for regulation of protein activity, thus, serving as an important molecular nanoswitch. In this research, we identified protein structural features that allow us to predict the oxidation propensity of cysteine amino acids in proteins, and created COPP, a web interface program. Anyone with a PDB file can quickly predict the oxidation susceptibilities of cysteine thiols in their proteins of interest. Introduction Figure 1. COPA was converted into a Java computer program that was integrated into the COPP Web Interface. COPP Interface PDB OR WEB DATABASE PROpKaNACCESS2.1.1 Internet COPP Program Remote User COPP Web Interface Scheme Methods Cont… Figure 2. 1) The remote user issues a request for PDB file. 2) The COPP Interface retrieves a PDB file from the users machine or from a specified website. 3) The COPP Interface runs the COPP Program. 4)The COPP Program calculates atomic distances, retrieves the calculated thiol pKa from PROpKa, retrieves the calculated ASA from Naccess2.1.1., and produces a results file. 5) The COPP Interface returns a results page to the remote user. Results Figure 3. COPP Web Interface request page. Figure 4. COPP Prediction Results page. Results Cont… Figure 5. COPP predicts that myoglobin from blackfin tuna (PDB 2NRL) is susceptible to nitrosylation at CYS10 due to its low pKa and exposure to solvent (see Figure 4). Structure B (PDB 2NRM) shows that CYS10 is indeed S-nitrosylated. We used 29 known sites of S-nitrosylation to test the accuracy of the COPP program and found that 20 were predicted correctly, giving an accuracy of 69%. On this nitrosylation database, COPP achieved an overall 66.7% accuracy, with a precision of 40.6, a specificity of 64.2 %, and sensitivity of 74.3%. Observed OxidizedObserved Reduced Predicted Oxidized404 Predicted Reduced227 Conclusion Table 2. Disulfide Bond Prediction Results. Future Work Collect more oxidation susceptible protein data to create a new model to improve COPP’s prediction accuracy. Incorporate new physicochemical parameters that may improve COPP’s prediction accuracy-especially for non-disulfide bond forming oxidations. Incorporate visual features to the COPP web interface. Observed OxidizedObserved Reduced Predicted Oxidized2638 Predicted Reduced968 Table 1. Nitrosylation Prediction Results. COPP was tested on 42 protein thiol groups that can form intramolecular disulfide bonds and 31 thiols that remain reduced within the oxidized protein. COPP achieved an overall accuracy of 91.8%, a precision of 90.9%, a specificity of 87.1%, and a sensitivity of 95.2%. reduced thiol oxidized to S-nitrosothiol AB CYS10 Abstract # 274