SRI International Bioinformatics 1 The consistency Checker, or Overhauling a PGDB By Ron Caspi.

Slides:



Advertisements
Similar presentations
Editing Pathway/Genome Databases. SRI International Bioinformatics Pathway Tools Paradigm Separate database from user interface Navigator provides one.
Advertisements

The Pathway/Genome Navigator (These slides are a guide as you experiment with the Navigator)
1 SRI International Bioinformatics The Ocelot Frame Knowledge Representation System Peter D. Karp, Ph.D. Bioinformatics Research Group SRI International.
SRI International Bioinformatics Data Import / Export Markus Krummenacker Bioinformatics Research Group SRI, International Q
Unbalanced Reactions by Markus Krummenacker Q
SRI International Bioinformatics Comparative Analysis Q
Understanding Correlation In HP LoadRunner >>>>>>>>>>>>>>>>>>>>>>
Chapter 2 Creating a Research Paper with Citations and References
Curation of the EcoCyc Database: The EcoCyc Update Project Martha Arnaud Scientific Database Curator Bioinformatics Research Group SRI International
The Pathway Tools Schema. SRI International Bioinformatics Motivations for Understanding Schema Pathway Tools visualizations and analyses depend upon.
陳虹瑋 國立陽明大學 生物資訊學程 Genome Engineering Lab. Genome Engineering Lab The Newest.
Reference Manager Making your life easier! Updated September 2007.
SRI International Bioinformatics 1 Gene Ontology in Pathway Tools: Internals.
PathoLogic Pathway Predictor. SRI International Bioinformatics Inference of Metabolic Pathways Pathway/Genome Database Annotated Genomic Sequence Genes/ORFs.
COMPREHENSIVE Excel Tutorial 8 Developing an Excel Application.
SRI International Bioinformatics 1 Searching BioCyc Ron Caspi.
Integration of E. Coli Data (E. coli Pathway and Genomic Data from BioCyc) Jesse Walsh.
Chapter 3 Working with Text and Cascading Style Sheets.
September 5, 2015 Office Setup. Lesson Overview: Office Setup  In this lesson we will cover:  Adding new offices to COM  Individual office setup 
Overviews, Omics Viewers, and Object Groups. SRI International Bioinformatics Introduction Each overview is a genome-scale diagram of cellular machinery.
Working with a Database
Ensure that the Field Day Call Sign is correct.
Gadgets & More…. “Date Range” Gadgets Allows you to choose a specific date, before or after a date or a range of dates using the Workflows calendar.
Overviews and Omics Viewers. SRI International Bioinformatics Introduction Each overview is a genome-scale diagram of cellular machinery l Cellular Overview.
Chapter 6 Generating Form Letters, Mailing Labels, and a Directory
SRI International Bioinformatics 1 Recent Developments in Pathway Tools GMOD Workshop November ‘07 Suzanne Paley Bioinformatics Research Group SRI International.
A lesson approach © 2011 The McGraw-Hill Companies, Inc. All rights reserved. a lesson approach Microsoft® Excel 2010 © 2011 The McGraw-Hill Companies,
The Pathway/Genome Navigator (These slides are a guide as you experiment with the Navigator)
← Select Exchange Once logged in. ↓ click Join Course Icon.
SRI International Bioinformatics 1 Advanced Editing of Pathway/Genome Databases Ron Caspi.
Lesson 12: Creating a Manual and Using Mail Merge.
Chapter 17 Creating a Database.
The consistency Checker, or Overhauling a PGDB By Ron Caspi.
Diagnostic Pathfinder for Instructors. Diagnostic Pathfinder Local File vs. Database Normal operations Expert operations Admin operations.
SRI International Bioinformatics 1 Submitting pathway to MetaCyc Ron Caspi.
The Pathway Tools Schema. SRI International Bioinformatics Motivations for Understanding Schema Pathway Tools visualizations and analyses depend upon.
Copyright 2007, Paradigm Publishing Inc. ACCESS 2007 Chapter 3 BACKNEXTEND 3-1 LINKS TO OBJECTIVES Modify a Table – Add, Delete, Move Fields Modify a Table.
Cellular Overview and Omics Viewer. SRI International Bioinformatics The Cellular Overview Diagram A way to quickly visualize an organism’s metabolism.
SRI International Bioinformatics 1 SmartTables & Enrichment Analysis Peter Karp SRI Bioinformatics Research Group September 2015.
The Pathway/Genome Navigator. SRI International Bioinformatics Overview Data page types General query strategies Web queries Desktop Pathway Tools User.
Chapter 3 Automating Your Work. It is frustrating when you have to type the same passage of text repeatedly. For example your name and address. Word includes.
SRI International Bioinformatics 1 The Structured Advanced Query Page Mario Latendresse Tomer Altman Bioinformatics Research Group SRI International March,
Editing Pathway/Genome Databases Compounds, Reactions and Pathways Ron Caspi.
SRI International Bioinformatics Update your computers! To install a patch: Tools => Instant Patch => Download and Activate All Patches.
SRI International Bioinformatics 1 Editing Pathway/Genome Databases Ron Caspi.
Microsoft Word 2010 Chapter 2 Creating a Research Paper with Citations and References.
Reconstructing the metabolic network of a bacterium from its genome: the construction of LacplantCyc Christof Francke In silico reconstruction of the metabolic.
SRI International Bioinformatics 1 Pathway Tools Features Available Only in the Desktop Version PathoLogic.
SRI International Bioinformatics 1 The Structured Advanced Query Page Tomer Altman Mario Latendresse Bioinformatics Research Group SRI International April.
SRI International Bioinformatics Selected PathoLogic Refining Tasks Creation of Protein Complexes Assignment of Modified Proteins Operon Prediction.
Lesson 7: Using Mail Merge
Recent Developments and Future Directions in Pathway Tools Peter D. Karp SRI International.
© 2015 Ex Libris | Confidential & Proprietary Yoel Kortick Senior Librarian Cataloging introductory flow.
XP Creating Web Pages with Microsoft Office
Shelly Cashman: Microsoft Word 2016
Excel Tutorial 8 Developing an Excel Application
NOODLETOOLS SIGN-IN Student ID #
Editing Pathway/Genome Databases
Microsoft Word Illustrated
by Markus Krummenacker June 2011
The Pathway Tools Schema
Building Metabolic Models
How to Administer a PGDB
Bioinformatics Research Group
Chapter 2 Creating a Research Paper with References and Sources
Incremental PathoLogic
Propagating Changed Annotation and Pathway Information
Approving Time in Kronos Manager/Supervisor Reference Guide
SRI Bioinformatics Research Group
Unbalanced Reactions by Markus Krummenacker Q
Presentation transcript:

SRI International Bioinformatics 1 The consistency Checker, or Overhauling a PGDB By Ron Caspi

SRI International Bioinformatics 2 What To Do If Your PGDB Looks Like This?

SRI International Bioinformatics 3 It’s time for an overhaul! Update genome annotation Propagate updates from Reference DB (MetaCyc) Re-run the name matcher Rescore pathways Re-run the transcription unit predictor Run the consistency checker Create protein complexes Re-run the Transport Inference Parser

SRI International Bioinformatics 4 The Consistency Checker Consistency Checking should be performed routinely (every few months), and problems should be addressed

SRI International Bioinformatics 5 Automatic and Manual Tasks I recommend running the automatic tasks first I recommend running individual tasks, one at a time. When you mouse over a task’s name, you will see documentation for that particular task in the bottom window pane

SRI International Bioinformatics 6 Consistency Checker Output The output appears on the right pane, but is also saved into a text file in the reports directory. The name and location of the file are printed at the end of the output.

SRI International Bioinformatics 7 Automatic Tasks: Check all links This tool looks at: Inverse links (compound- reaction, gene-protein, etc) Pathway links Ghost reactions in pathways Pathways included in other pathways

SRI International Bioinformatics 8 Automatic Tasks: Check all links Warnings are not necessarily errors, but should be checked. For example, PWY-21 is completely redundant to P142-PWY and should be deleted.

SRI International Bioinformatics 9 More Automatic Tasks Verify pathways for duplicate reactions Verify replicon components and positions: ensures all genes exist, sorts based on position. Validate GO terms: updates the GO terms, removes obsolete ones. Change compound names to string IDs: mostly applies to legacy data, where enzyme regulators may have been entered as text strings.

SRI International Bioinformatics 10 Yet More Automatic Tasks Run miscellaneous checks: formatting glitches in names, sanity checks for superpathways, clears values of computed slots, deletes temporary frames created by the pathway editor Update proteins: molecular weights Check compound structures for redundant bonds

SRI International Bioinformatics 11 Automatic Tasks: Recompute database statistics Its the only way to change the numbers on the home page

SRI International Bioinformatics 12 Manual Tasks: Run Constraint Checker This tool usually requires the most time and effort for correcting the problems. Flags constraints issues. For example, if a slot is supposed to contain only compound frames, but a different type of frame is listed among its values, the constraint checker identifies and flags the offensive value. The opposite is true as well: the checker will flag that compound as present in a slot of a frame that is not suppose to have such a value. (this means errors are often listed multiple times, under different frames) The checker also flags cardinality violations. For example, cases where more than one value is present in a slot that is only allowed to have a single value.

SRI International Bioinformatics 13 Run Constraint Checker Error Reports: Example 1 Obviously, this frame used to be classified as a protein, but has been converted at some point to a chemical compound. Thus, it should no longer contain a Modified-Protein slot.

SRI International Bioinformatics 14 Fixing The Problem The problematic slot shows up in blue. To solve the problem, highlight the attached value and remove it.

SRI International Bioinformatics 15 Constraint Error Reports: Example 2 The problem here is that CPLX-2, a modified form of CPLX-1, has not been classified as a modified protein. The solution is to open CPLX-2 in the Ontology Editor and add a link to the parent Modified-Proteins.

SRI International Bioinformatics 16 More Manual Tasks Verify all reactions and compounds: finds defective enzymatic reaction frames (missing a protein, a reaction, or both); finds orphan reactions that are not associated with any other objects, looks for duplicate compounds. Generate reaction balance report

SRI International Bioinformatics 17 Frame References Error Report Example Looking at that pathway’s comment, we find that the FRAME construct is missing the last bar.

SRI International Bioinformatics 18 More Manual Tasks Fix references between polypeptide and genes: adds the gene value to modified proteins that miss it, adds a capitalized gene name to the synonyms list, and scans it for duplicates, flags orphan genes and proteins. Check pathway reactions and validate EC numbers: checks the PREDECESSORS slot of pathway frames, flags deleted and transferred EC numbers. Check transcription units: looks for invalid frames, Tus with no genes, with genes in different directions, etc.

SRI International Bioinformatics 19 Even More Manual Tasks Check citations: tries to find formatting problems, reports pubmed citations that have not been imported, provides statistics. Check external database link IDs: flags frames that are linked to the same external DB entry by links that are supposed to be unique.

SRI International Bioinformatics 20 And When You Finish, take pride at your newly renovated PGDB!