SRI International Bioinformatics 1 Orphan Enzymes Alexander Shearer, Tomer Altman, Anamika Kothari, Christian Ngo, Shahrzad Zarafshar.

Presentation transcript:

SRI International Bioinformatics 1 Orphan Enzymes Alexander Shearer, Tomer Altman, Anamika Kothari, Christian Ngo, Shahrzad Zarafshar

SRI International Bioinformatics 2 The problem – disconnected data

SRI International Bioinformatics 3 The problem – disconnected data

SRI International Bioinformatics 4 An orphan enzyme is an activity that has been extensively characterized in the lab…

SRI International Bioinformatics 5 …but for which no sequence is available in major databases

SRI International Bioinformatics 6 How many orphans?
Resource                Total sequenced   Total orphans
Enzyme DB               2,461             1,783
UniProtKB/Swiss-Prot    2,462             1,782
UniProtKB/TrEMBL        2,949             1,295
BioCyc – Proteins       3,102             1,142
BioCyc – Reactions      3,119             1,125
Orenza                  3,122             1,122
NCBI Protein            3,128             1,116
Final tally             3,128             1,116
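The arithmetic behind the "How many orphans?" figures can be checked with a short sketch. It assumes the two columns partition a fixed set of activities, which the slide does not state outright; the per-row total and the orphan fraction are derived here, not quoted from the deck. The names `ROWS` and `orphan_fractions` are illustrative only.

```python
# Rows transcribed from the slide: (resource, total sequenced, total orphans).
ROWS = [
    ("Enzyme DB", 2461, 1783),
    ("UniProtKB/Swiss-Prot", 2462, 1782),
    ("UniProtKB/TrEMBL", 2949, 1295),
    ("BioCyc - Proteins", 3102, 1142),
    ("BioCyc - Reactions", 3119, 1125),
    ("Orenza", 3122, 1122),
    ("NCBI Protein", 3128, 1116),
]


def orphan_fractions(rows):
    """Return (resource, total activities, orphan fraction) for each row.

    Assumption: sequenced + orphans is the full set of activities considered.
    """
    return [
        (name, sequenced + orphans, orphans / (sequenced + orphans))
        for name, sequenced, orphans in rows
    ]


if __name__ == "__main__":
    for name, total, fraction in orphan_fractions(ROWS):
        print(f"{name:<22} {total:>5}  {fraction:6.1%}")
```

Under that assumption every row sums to the same total, so the orphan count falls only because each additional resource supplies sequences for activities that earlier resources missed.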

SRI International Bioinformatics 7 Project goals
– Resolve as many orphans as possible
– Help others resolve orphans

SRI International Bioinformatics 8 How are orphans resolved?
The sequence is out there!
– Hidden in papers
– Disconnected, in databases
– “Sequence” is present in easy protein package…
Never been sequenced
– Purify and ID in the lab
– May have been IDed for a different activity
Dubious E.C. numbers…

SRI International Bioinformatics 9 Validating orphans

SRI International Bioinformatics 10 Validating orphans

SRI International Bioinformatics 11 Validating orphans

SRI International Bioinformatics 12

SRI International Bioinformatics 13

SRI International Bioinformatics 14

SRI International Bioinformatics 15 Ranking orphans
“Easy” ranking requires all of these:
1 – Purification protocol using standard affinity methods
2 – Source organism readily culturable
3 – Assay uses off-the-shelf substrates
“Moderate” ranking requires any two of these:
1 – Purification protocol using standard affinity methods
2 – Source organism readily culturable
3 – Assay uses off-the-shelf substrates
4 – Known molecular weight
Otherwise, it’s “Hard.”
Activities can be bumped a level (e.g. “Moderate” to “Hard”) if the enzyme is labile, protease-sensitive, or otherwise hard to work with.
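The ranking rules above can be written as a small decision function. This is a minimal sketch, not the authors' tooling: the function name, parameter names, and the single `hard_to_work_with` flag (standing in for "labile, protease-sensitive, or otherwise hard to work with") are all hypothetical, and it assumes a bump moves an activity exactly one level toward "Hard".

```python
LEVELS = ["Easy", "Moderate", "Hard"]


def rank_orphan(
    affinity_purification,
    culturable_source,
    off_the_shelf_assay,
    known_molecular_weight,
    hard_to_work_with=False,
):
    """Rank an orphan activity as Easy, Moderate, or Hard per the slide's rules."""
    core = [affinity_purification, culturable_source, off_the_shelf_assay]
    if all(core):
        # "Easy" requires all three core criteria.
        level = 0
    elif sum(core + [known_molecular_weight]) >= 2:
        # "Moderate" requires any two of the four criteria.
        level = 1
    else:
        level = 2
    if hard_to_work_with:
        # Assumed: a bump moves one level toward "Hard" and stops there.
        level = min(level + 1, len(LEVELS) - 1)
    return LEVELS[level]


if __name__ == "__main__":
    print(rank_orphan(True, True, True, False))
    print(rank_orphan(True, False, False, True))
    print(rank_orphan(True, True, True, True, hard_to_work_with=True))
```

Checking "Easy" first matters because any activity that meets all three core criteria also meets the "any two" test for "Moderate".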

SRI International Bioinformatics 16 How many true orphans?
Resolved – 19%
True orphans – 81%

SRI International Bioinformatics 17 How do the rankings spread out?
Easy – 23%
Moderate – 42%
Hard – 35%

SRI International Bioinformatics 18 Curious cases
– Perillyl alcohol
– Stereoisomers
– Cofactors
– ADH – when an instance is a class

SRI International Bioinformatics 19 Lab evaluation – ‘super-easy’ targets

SRI International Bioinformatics 20 Lab evaluation – general
General method
There is no unified lab identification process. The general plan is to use modern methods to update older protocols. For example, older protocols that require multiple steps, gravity columns, difficult cell lysis methods, and swapping between dialysis tubing arrangements can be updated to use HPLC/FPLC, kit-based steps, and other modern tools. Similarly, older assays can be updated to use modern sensors, new specialty substrates, and, more generally, methods that work with either a lower total concentration of the enzyme or a somewhat less purified form, cutting purification time and cost.
This example combination of purification and assay is from a paper that was published over 40 years ago.

SRI International Bioinformatics 21 Future considerations
– What is a “good” annotation?
– Fixing issues with E.C. activities
– When does precision become overprecision?
– How do we find all the other misannotations?
– Other kinds of “PGDB-ready” orphans?

SRI International Bioinformatics 22 Catching NCBI misses…

SRI International Bioinformatics 23 Catching NCBI misses…