Bringing the Cyc to BioCyc

Slides:



Advertisements
Similar presentations
Enzyme Nomenclature & Classification
Advertisements

Provenance in a Collaborative Bio-database RAASWiki Donald Dunbar & Jon Manning Queen’s Medical Research Institute University of Edinburgh Use Cases for.
1 Enzymes Enzymes are biological catalysts. Recall that a catalyst speeds up the rate of a reaction by decreasing the activation energy needed for the.
EBI is an Outstation of the European Molecular Biology Laboratory. IntEnz Integrated relational Enzyme database 23 May 2015.
Enzymes. What is an enzyme? globular protein which functions as a biological catalyst, speeding up reaction rate by lowering activation energy without.
Enzyme Structure, classification and mechanism of action
Archives and Information Retrieval
Readings for this week Gogarten et al Horizontal gene transfer….. Francke et al. Reconstructing metabolic networks….. Sign up for meeting next week for.
Metabolic Biochemistry BIBC 102 Summer Lecture 1 August 1, 2005 Lehninger (4 th Edition), Chapter 3, 4, 6.
陳虹瑋 國立陽明大學 生物資訊學程 Genome Engineering Lab. Genome Engineering Lab The Newest.
Enzymes: Remarkable Biological Catalysts
Enzymatic Function Module (KEGG, MetaCyc, and EC Numbers)
Integration of E. Coli Data (E. coli Pathway and Genomic Data from BioCyc) Jesse Walsh.
Introduction to metabolism: Compounds, Reactions, Enzymes and Pathways Kristian Axelsen, Alan Bridge Elisabeth Coudert & Anne Morgat SIB Swiss Institute.
Enzymes as Biological Catalysts Enzymes are proteins that increase the rate of reaction by lowering the energy of activation They catalyze nearly all.
Overview Enzymes are specialized proteins that function as catalysts to increase the rate of biochemical reactions. By interacting with substrates (reactant.
SRI International Bioinformatics 1 Recent Developments in Pathway Tools GMOD Workshop November ‘07 Suzanne Paley Bioinformatics Research Group SRI International.
Enzymes: Basic concepts
1 SRI International Bioinformatics GO Term Integration and Curation in Pathway Tools and EcoCyc Ingrid M. Keseler Bioinformatics Research Group SRI International.
SRI International Bioinformatics 1 Submitting pathway to MetaCyc Ron Caspi.
Copyright OpenHelix. No use or reproduction without express written consent1.
Computer Science Ph. D. Seminar Gene Ontology (GO) Based Search for Protein Structure Similarity Clustering Metrics Ph.D. Candidate Steve Johnson Committee.
Writing Informative Grades College and Career Readiness Standards for Writing Text Types and Purposes arguments 1.Write arguments to support a substantive.
SRI International Bioinformatics 1 Editing Pathway/Genome Databases Ron Caspi.
Classification of enzymes. Units of enzyme activity.
Classification and Nomenclature of Enzymes
Biochemistry (Lippincott's Illustrated Reviews Series), 6E
Argumentative Writing Grades College and Career Readiness Standards for Writing Text Types and Purposes arguments 1.Write arguments to support a.
SRI International Bioinformatics 1 Pathway Tools Features Available Only in the Desktop Version PathoLogic.
Enzymes: Basic concepts
SRI International Bioinformatics Selected PathoLogic Refining Tasks Creation of Protein Complexes Assignment of Modified Proteins Operon Prediction.
6.1 A Brief Look at Enzyme Energetics and Enzyme Chemistry Converting substrates to product requires intermediate states – Intermediates are less stable.
1 Enzymology INTRODUCTION 2006/09/18 Downloaded from
Enzymes General properties Enzyme definition Factors affecting enzyme activity. Enzyme Inhibition. Application of enzyme inhibition. Isoenzymes.
Editing Pathway/Genome Databases
Why Create a PGDB? Perform pathway analyses as part of a genome project Analyze omics data Create a central public information resource for the organism,
Building a community for genome and proteome annotation
Distinguishing scholarly from nonscholarly periodicals
Enzyme Structure, classification and mechanism of action
The Pathway Tools Schema
Integrated relational Enzyme database
How to Administer a PGDB
Software Documentation
SowiDataNet - A User-Driven Repository for Data Sharing and Centralizing Research Data from the Social and Economic Sciences in Germany Monika Linne, 30.
20.2 Classification of Enzymes
United Nations Statistics Division
Lesson 9 Sharing Documents
Enzyme resources —Creative Enzymes. Focusing on life science research areas, Creative Enzymes has been committed to provide professional information for.
Section II Molecules of Life Universities Press
Site classifications, definitions, and updates to Landnet
Chapter Three: Enzymes
Enzymes What are enzymes?
There are four levels of structure in proteins
Enzyme Structure, Classification and Mechanism of Action
Enzymes as Biological Catalysts Enzymes are proteins that increase the rate of reaction by lowering the energy of activation They catalyze nearly all.
Clinical Enzymology Introduction.
Overview of Microbial Pathway and Genome Databases
Digital Stewardship Curriculum
Biomedical Importance of Enzymes Basic concepts about Enzymes Classification of Enzymes as per IUB.
4 th SEMESTER – BOTANY KARNATAKA UNIVERSITY, DHARWAD Modified from various internet resources by Dr. Jayakara Bhandary Associate Professor of Botany Government.
EC Numbers Are Everywhere
Annotation Presentation
Lesson 14 Sharing Documents
Enzymes & Chemical Reactions.
Enzymes & Chemical Reactions.
Amino Acids, Proteins, and Enzymes
ROLE OF «electronic virtual enhanced research-engaged student teams» WEB PORTAL IN SOLUTION OF PROBLEM OF COLLABORATION INTERNATIONAL TEAMS INSIDE ONE.
Classification of Enzymes
Implementation of the NISO Presentation and Identification of E-Journals (PIE-J) Recommendations PIE-J best practices are recommendations for content providers:
Overview of the Pathway Tools Software and Pathway/Genome Databases
Presentation transcript:

Bringing the Cyc to BioCyc Editing Pathway/Genome Databases Bringing the Cyc to BioCyc MetaCyc is an ambitious project, aiming to catalog all known information about metabolism A metabolic encyclopedia for people and information systems Foundation for all other BioCyc databases SRI International

Naming enzymes is not always simple [α-D-Glc-(1→2)-α-D-Glc-(1→3)-α-D-Glc-(1→3)-α-D-Man-(1→2)-α-D-Man-(1→2)-α-D-Man-(1→3)-{α-D-Man-(1→2)-α-D-Man-(1→3)-[α-D-Man-(1→2)-α-D-Man-(1→6)]-α-D-Man-(1→6)}-β-D-Man-(1→4)-β-D-GlcNAc-(1→4)-β-D-GlcNAc]-N-Asn-[protein] + H2O = [α-D-Glc-(1→3)-α-D-Glc-(1→3)-α-D-Man-(1→2)-α-D-Man-(1→2)-α-D-Man-(1→3)-{α-D-Man-(1→2)-α-D-Man-(1→3)-[α-D-Man-(1→2)-α-D-Man-(1→6)]-α-D-Man-(1→6)}-β-D-Man-(1→4)-β-D-GlcNAc-(1→4)-β-D-GlcNAc]-N-Asn-[protein] + β-D-glucopyranose Glc3Man9NAc2 oligosaccharide glucosidase; trimming glucosidase I; CWH41; MOGS; mannosyl-oligosaccharide glucohydrolase or - EC 3.2.1.106

EC Numbers Are Everywhere

Historical Background

Back in the 1950s The number of known enzymes was increasing rapidly No guiding authority The same enzymes became known by several different names, and The same name was sometimes given to different enzymes Names often conveyed little or no idea of the nature of the reactions catalyzed

The Situation Was Chaotic… Editing Pathway/Genome Databases The Situation Was Chaotic… Catalase (also known as equilase, caperase, optidase…) Diaphorase (dehydrogenase) Zwischenferment (glucose-6-phosphate dehydrogenase) methyl viologen-nitrite reductase old yellow enzyme “Diaphoros” in Greek is “different” “Zwischen” in German is “in between” Zwischenferment Catalase diaphorase SRI International

The International Enzyme Commission To The Rescue! 1955: The International Union of Biochemistry (IUB) sets up an International Enzyme Commission to tackle the problems (yes – it’s even older than PDB!) The Basic Concept: Enzymes should be classified and named by the reactions they catalyze

The EC Number In addition to an established name, each enzyme is given a unique four-digit code, known as the Enzyme Commission, or EC, number, that classifies it EC 1.1.1.1 main class subclass sub-subclass serial number

The Six Main Classes of Enzymes EC 1.1.1.1 Class Name Reaction catalyzed 1 Oxidoreductases AH2 + B = A + BH2 2 Transferases AX + B = A + BX 3 Hydrolases A–B + H2O = AH + BOH 4 Lyases A–B = A + B 5 Isomerases A = B 6 Ligases NTP + A + B = NDP + P + A–B or NTP + A + B = NMP + PP + A–B

The Early Enzyme Commissions Editing Pathway/Genome Databases The Early Enzyme Commissions 1961: The first EC list (712 entries) was presented at the General Assembly of the IUB in Moscow 1964: The second EC list (875 entries) 1969: The Expert Committee on Enzymes 1972: the third EC list (1770 entries) SRI International

1977: Move to NC-IUB A more permanent solution was needed In 1977 a new nomenclature committee were formed: The IUPAC-IUB Joint Commission on Biochemical Nomenclature (JCBN) A subcommittee of JCBN, the new “Nomenclature Committee of IUB” (NC-IUB), assumed responsibility for the EC list

Editing Pathway/Genome Databases Current Status Ongoing curation by the NC-IUBMB since 1977 Last printed version (6th edition) published in 1992 (3196 entries) Transition from print to online content Currently there are 6087 entries Current Members: K.F. Tipton, Ireland (Trinity College Dublin) – historical perspective A. McDonald, Ireland (Trinity College Dublin) – Database and website development G.P. Moss, UK (Queen Mary University of London) – chemical nomenclature K. Axelsen, Denmark (UniProt) – active curation I. Schomburg, Germany (BRENDA) – active curation R. Caspi, USA (MetaCyc) – active curation SRI International

EC Curation

An EC Entry Example

The DraftEnz Portal

Which Enzymes Are Classified? Existing entries in pathway/enzyme databases Curators encounter new enzymes during their regular work Users requests via websites Requirements: The enzyme must have been characterized in a way that leaves no doubt as to its activity. The enzyme must have been described in a paper accepted by a peer-reviewed publication

Reaction Direction EC entries do not imply a direction For consistency, all reactions in a given class are written in the same direction The systematic names are derived from the written direction, even if only the reverse direction has been demonstrated experimentally. Ideally, a comment would indicate that…

EC Numbers Define Enzymes, Not Reactions! More accurately, an EC number stands for an active site. Enzymes with multiple active sites (e.g. if several genes fuse to encode a single polypeptide) receive multiple EC numbers. On the other hand, a reaction may be associated with multiple EC numbers

Difficulties No enzyme can be tested with all potential substrates… Enzymes that perform very complex reactions Enzymes with a very broad substrate range (liver alcohol dehydrogenase) Old enzymes with a single reference - are they real?

Using the EC System

Where Is the EC List? The primary source is A daily-updated MySQL database available at http://www.enzyme-database.org/ Another database, prepared by Gerry Moss, is available at http://www.chem.qmul.ac.uk/iubmb/enzyme/ A copy of the EC list is available via the ENZYME DB (SIB) at http://www.expasy.ch/enzyme/ Yet another one is IntEnz at (EBI-SIB) http://www.ebi.ac.uk/intenz/index.jsp The EC list is also included in databases such as MetaCyc, BRENDA, KEGG etc.

Obtaining the EC Data Is Easy

Growth of the EC List In the last 7 years the Enzyme Commission has created or revised 2753 entries, a significant fraction of the current 6087 total entries.

Propagation of New EC Numbers (data gathered in March 2017) Introduced EC Number MetaCyc Uniprot KEGG MicroScope KBASE EC2GO 2013 3.2.1.185 + - 6.3.4.22 2014 1.1.1.373 5.1.3.29 2015 1.5.1.49 5.1.1.20 2016 1.2.5.3

EC Numbers Representation in MetaCyc

Partial and Provisory EC Numbers Partial EC numbers look like EC numbers except one or more numbers are replaced by a dash, e.g. 2.1.1.- Partial EC numbers can describe partial knowledge (e.g. 2.1.1.- for an uncharacterized methyltransferase) Provisory numbers are given to characterized enzymes that have not been classified yet, and should use the 2.3.4.Nx (or similar) format BRENDA MetaCyc UniProt

Blatant Advertisement In case you were wondering where KEGG and MetaCyc are heading…

Conclusions and Thanks EC Numbers are still very useful All annotation pipelines should use them Propagation of new EC numbers to annotation pipelines could be improved Expansion of the EC list is still slower than desired Thanks to IUBMB for funding the Enzyme Commission