Propagating Changed Annotation and Pathway Information

Slides:



Advertisements
Similar presentations
Windows XP System Restore July 22 nd, 2006 CAEUG Meeting.
Advertisements

SRI International Bioinformatics Data Import / Export Markus Krummenacker Bioinformatics Research Group SRI, International Q
Unbalanced Reactions by Markus Krummenacker Q
CPIT 102 CPIT 102 CHAPTER 1 COLLABORATING on DOCUMENTS.
SRI International Bioinformatics 1 Orthology-Based Multi-PGDB Curation Tools Suzanne Paley Pathway Tools Workshop 2010.
Creating Rout Paths Using CAMMaster. Step 1 Import Gerber File. Import Gerber File. User Ctrl+W to window around data. User Ctrl+W to window around data.
SRI International Bioinformatics 1 The consistency Checker, or Overhauling a PGDB By Ron Caspi.
Introduction to the Pathway Tools Software David Walsh and Simon Eng bigDATA Workshop—May 29, 2010.
陳虹瑋 國立陽明大學 生物資訊學程 Genome Engineering Lab. Genome Engineering Lab The Newest.
WebCT CE-6 Assignment Tool. Assignment Tool and Assignment Drop Box Use “Assignment” button under Course Tools (your must be in “Build” mode) to: –Modify.
SRI International Bioinformatics 1 Gene Ontology in Pathway Tools: Internals.
PathoLogic Pathway Predictor. SRI International Bioinformatics Inference of Metabolic Pathways Pathway/Genome Database Annotated Genomic Sequence Genes/ORFs.
Integration of E. Coli Data (E. coli Pathway and Genomic Data from BioCyc) Jesse Walsh.
Chapter Seven Advanced Shell Programming. 2 Lesson A Developing a Fully Featured Program.
Transport Inference Parser: Inferring Transport Reactions from Protein Data for PGDBs.
SRI International Bioinformatics 1 Recent Developments in Pathway Tools GMOD Workshop November ‘07 Suzanne Paley Bioinformatics Research Group SRI International.
The consistency Checker, or Overhauling a PGDB By Ron Caspi.
Diagnostic Pathfinder for Instructors. Diagnostic Pathfinder Local File vs. Database Normal operations Expert operations Admin operations.
May 7, We manage documents and their changes with versioning and check out/check in procedures.
Top Four Essential TAIR Resources Debbie Alexander Metabolic Pathway Databases for Arabidopsis and Other Plants Peifen Zhang.
SRI International Bioinformatics 1 Submitting pathway to MetaCyc Ron Caspi.
Refactoring and Synchronization with the StarTeam Plug-in for Eclipse  Jim Wogulis  Principal Architect, Borland Software Corporation.
FY 2011 Public Libraries Survey WebPLUS Software Training.
Chapter 23. Copyright 2003, Paradigm Publishing Inc. CHAPTER 23 BACKNEXTEND 23-2 LINKS TO OBJECTIVES Record, Run, Pause, and Delete Macros Record, Run,
XP Tutorial 8 New Perspectives on Microsoft Windows XP 1 Microsoft Windows XP Object Linking and Embedding Tutorial 8.
SRI International Bioinformatics 1 Editing Pathway/Genome Databases Ron Caspi.
1 AraCyc Metabolic Pathway Annotation. 2 AraCyc – An overview  AraCyc is a metabolic pathway database for Arabidopsis thaliana;  Computational prediction.
SRI International Bioinformatics 1 Pathway Tools Features Available Only in the Desktop Version PathoLogic.
Click Author. Log into the system. Click on the title of the manuscript to make changes Your manuscript is being edited.
SRI International Bioinformatics Selected PathoLogic Refining Tasks Creation of Protein Complexes Assignment of Modified Proteins Operon Prediction.
Recent Developments and Future Directions in Pathway Tools Peter D. Karp SRI International.
Lesson 13 PROTECTING AND SHARING DOCUMENTS
PathoLogic Pathway Predictor
Core LIMS Training: Project Management
Networks and Interactions
Do-more Technical Training
Comparative Analysis in BioCyc
Why Create a PGDB? Perform pathway analyses as part of a genome project Analyze omics data Create a central public information resource for the organism,
by Markus Krummenacker June 2011
Single Sample Registration
Java Beans Sagun Dhakhwa.
Bioinformatics Research Group
PathoLogic: More about Matching Enzyme Names to Reactions
Lecture 25 More Synchronized Data and Producer/Consumer Relationship
Building Metabolic Models
LCGAA nightlies infrastructure
How to Administer a PGDB
Lesson 13 PROTECTING AND SHARING DOCUMENTS
FAST Administration Training
How to Import an Excel File
Data Exchange Java API and Perl API : read & modify
Use Cases & User Mocks Customer Call –
Maintaining the EcoCyc
LAB # 2 – MESSAGE AUTHENTICATION STUDENTS MANUAL
NORMA Lab. 7 Generating Reports More Display Options
Running the Transport Inference Parser
PROJECT MANAGEMENT Bill Biddle.
Weka Package Weka package is open source data mining software written in Java. Weka can be applied to your dataset from the GUI, the command line or called.
Comparative Analysis Q
Version Control Version Control: The process of managing changes to your files over time (aka, revision control.
Overview of Microbial Pathway and Genome Databases
BioCyc Update Notifications Suzanne Paley Pathway Tools Workshop 2018
Incremental PathoLogic
Advanced PGDB Editing: Gene Ontology (GO) Terms
Chapter 9: Managing Groups, Folders, Files, and Object Security
Working Knowledge Training
Creating Additional Input Items
Unbalanced Reactions by Markus Krummenacker Q
Selenium IDE Installation and Use.
Macro theory © EIT, Author Gay Robertson, 2019.
Presentation transcript:

Propagating Changed Annotation and Pathway Information

Revised Annotation Data Revised annotations may contain new genes updated gene properties updated functional descriptions Curators don’t want to rebuild PGDB, don’t want to lose manual curation work Curators may have spent a lot of time pruning out false positive pathway predictions – don’t want pathway prediction algorithm to reimport them Curators want manual oversight over many kinds of changes

Incremental Pathologic Utility Build->Update Build for Revised Annotation reads revised annotation file (in .pf or .gbk format) One per genetic element (required if creating new genes), OR One file of updates for whole PGDB compares w/ existing PGDB presents summary of changes, GUI for applying updates Curator can: Apply a set of changes en masse (e.g. create all new genes) Examine each change in a group and decide individually which to apply Save progress and return later Generate report of changes to import into spreadsheet

Updates Applied in Two Phases Create new genes/monomers Apply changes to slot values Revise functional assignments Phase 2 Rerun pathway prediction

Phase 1 Summary Dialog

Assign Selected Reactions Dialog

Phase 2 Rescoring Pathways Rescore Pathways after desired annotation changes have been made Software remembers which pathways were inferred last time If a pathway has since been deleted, the software only considers it if there is now additional evidence for it Summary lists: Previously deleted pathways now w/ more evidence Previously inferred pathways that should now be pruned Newly inferred pathways Pathways not in MetaCyc For each list, curator can quickly check off pathways that should be deleted

Incorporating New MetaCyc Data New version of MetaCyc Revised pathways, reactions, compounds New pathways Tools->Propagate MetaCyc Data Updates Shows differences in compounds, reactions, pathways Curator can update all or selected for each class of difference Does not create or delete pathways PathoLogic->Refine->Rescore Pathways Rescores Pathways and brings up Phase 2 dialog May also want to re-run name matcher first

Notes/Caveats Some updates only make sense in conjunction w/ others, e.g. pathway and its reactions. If you can’t figure out what’s different, default action should be to propagate from MetaCyc. Don’t intersperse edits from update tool w/ edits using standard editors. Can choose to propagate from other reference PGDB instead.