Curation of the EcoCyc Database: The EcoCyc Update Project Martha Arnaud Scientific Database Curator Bioinformatics Research Group SRI International

SRI International Bioinformatics

EcoCyc Organization
EcoCyc collects information about multiple types of database objects:
- Pathway *
- Reaction *
- Compound *
- Protein
- Gene *
- Transcription Unit
* hierarchies
[Diagram labels: Proteins, Compounds, Genes, Pathway, Reactions]

EcoCyc Statistics
- 176 pathways
- 992 enzymes
- 1006 enzymatic reactions
- 169 transporters
- 828 transcription units
- 1929 proteins have a comment (598 > 300 characters)

EcoCyc Pathway Information

EcoCyc Pathway Information

…viewed with “More Detail”

EcoCyc Protein Information
[Screenshot labels: comment, citations, reaction]

EcoCyc Gene Information

EcoCyc Metabolic Overview
Static or animated views of expression data

EcoCyc Curation
- names and synonyms
- gene classes
- subunit composition of protein complexes
- location of gene product
- protein or complex molecular weight
- enzyme activity name
- enzyme properties (activators, inhibitors, cofactors)
- comment fields
- evidence
- citations
- reactions catalyzed
- pathway information

Build a new MOD or add a “Pathway Module”!
Pathway Tools Software
- Takes annotated genome
- Generates database, including pathway predictions
Freely available (academics/non-profits)
Pathway Tools: software environment for creation, curation, analysis, and Web publishing of MODs
Current Pathway Tools Users
- Saccharomyces cerevisiae: SGD, Stanford University
- Arabidopsis thaliana: Carnegie Institution of Washington
- Plasmodium falciparum: Stanford University
- Mycobacterium tuberculosis: Stanford University
- Synechocystis: Carnegie Institution of Washington
- Methanococcus jannaschii: EBI

EcoCyc Strengths
- Metabolism
- Transport
- Transcription regulation

EcoCyc into the Future:
“EcoCyc is not just metabolism anymore!”
…an integrated, review-level information resource on E. coli genomics and biochemistry…

The EcoCyc Update Project:
- What do we need to do? Goals
- Can we possibly get it done? Quantification
- Where do we start? Priorities
- How is it going? Progress

EcoCyc Update: Curation Goals
- Expand database scope beyond metabolism, transporters, and transcription
- Curate associated reactions and pathways
- Stay current with the latest papers
- Curate every gene product:
  - literature-based descriptions
  - comprehensive reference lists

EcoCyc Update: Quantification
Genes:
   4405 genes
  - 175 transcription factors
  - 168 transporters
  = 4062 genes to curate
Curation time:
- Full-time curator: 4 days/week on curation
- + Part-time curator (70%), years 2-4
- Year 1: 1600 hours
- Year 2: 3000 hours
- Year 3: 3000 hours
- Year 4: 3000 hours
Total: 10,600 hours / 4062 genes = 2.6 hours per gene
Curation of abstracts
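
The arithmetic on this slide can be checked with a minimal Python sketch. All figures are copied from the slide; the variable names are illustrative assumptions, not part of EcoCyc or Pathway Tools.

```python
# Figures copied from the "EcoCyc Update: Quantification" slide.
# Variable names are hypothetical, chosen only for this illustration.
TOTAL_GENES = 4405
TRANSCRIPTION_FACTORS = 175
TRANSPORTERS = 168
HOURS_PER_YEAR = [1600, 3000, 3000, 3000]  # Years 1 to 4

# Genes left once transcription factors and transporters are excluded.
genes_to_curate = TOTAL_GENES - TRANSCRIPTION_FACTORS - TRANSPORTERS

# Total curation hours over the four-year plan.
total_hours = sum(HOURS_PER_YEAR)

# Average time budget per gene.
hours_per_gene = total_hours / genes_to_curate

print(f"{genes_to_curate} genes to curate")
print(f"{total_hours} hours in total")
print(f"{hours_per_gene:.1f} hours per gene")
```

The results agree with the slide: 4062 genes, 10,600 hours, and roughly 2.6 hours per gene.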

EcoCyc Update: Priorities
1. Problems raised by users and advisors
2. Gene products that have new characterizations published in the literature
3. Gene products that have not yet been thoroughly curated
4. Gene products that have been curated, but have not been updated lately

Where are we now?
- 807 gene products curated
- 807/4062 = 19.9% of the total (excluding transport and transcription factors)
- 4-year plan: Curate 615 genes in Year 1
- We are meeting our goal!
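
A short Python sketch of the progress check follows. The counts come from the slides. Deriving the Year 1 target as 1600 hours divided by 2.6 hours per gene is an assumption: it reproduces the 615 figure, but the slide does not say how that number was obtained. Variable names are illustrative.

```python
# Figures copied from the slides; names are hypothetical.
GENES_TO_CURATE = 4062
GENES_CURATED = 807
YEAR1_HOURS = 1600
HOURS_PER_GENE = 2.6  # rounded per-gene budget from the Quantification slide

# Share of the curation workload completed so far.
percent_done = 100 * GENES_CURATED / GENES_TO_CURATE

# ASSUMPTION: Year 1 target = Year 1 hours / rounded hours per gene,
# truncated to a whole number of genes.
year1_target = int(YEAR1_HOURS / HOURS_PER_GENE)

print(f"{percent_done:.1f}% of gene products curated")
print(f"Year 1 target: {year1_target} genes")
print("Goal met" if GENES_CURATED >= year1_target else "Goal not yet met")
```

Under that assumption, 807 curated gene products exceeds the 615-gene Year 1 target, consistent with the slide's claim.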

The EcoCyc Collaboration
SRI
- Peter Karp, PI
- Suzanne Paley, Software Engineer
- John Pick, Software Engineer
- Martha Arnaud, Curator
UCD
- John Ingraham, Project Leader
MBL
- Monica Riley, Editor Emerita
UNAM
- Julio Collado-Vides, Project Leader
- Socorro Gama-Castro, Curator
- Martin Peralta, Curator
TIGR
- Ian Paulsen, Project Leader
- Mark Hance, Curator
UCSD
- Milton Saier, Project Leader
- Can Tran, Curator
Funding: NIH National Center for Research Resources

Pathway/Genome DBs Created by External Users
Saccharomyces cerevisiae, Stanford University
- pathway.yeastgenome.org/biocyc/
Plasmodium falciparum, Stanford University
- plasmocyc.stanford.edu
Mycobacterium tuberculosis, Stanford University
- BioCyc.org
Arabidopsis thaliana and Synechocystis, Carnegie Institution of Washington
- Arabidopsis.org:1555
Methanococcus jannaschii, EBI
- Maine.ebi.ac.uk:1555
Other PGDBs in progress by 40 other users
Software freely available
Each PGDB owned by its creator