Importing GO terms from UniProt to a PGDB

Slides:



Advertisements
Similar presentations
How to Grade Wikis Ways to look for and grade evidence of collaboration & build strong partnerships.
Advertisements

1 SRI International Bioinformatics The Ocelot Frame Knowledge Representation System Peter D. Karp, Ph.D. Bioinformatics Research Group SRI International.
Before we start Login to the laptop: user: crgcomu Password: crgcomu Login to the network: Wifi: carretwifi Password : Login to galaxy (ldap):
SRI International Bioinformatics Data Import / Export Markus Krummenacker Bioinformatics Research Group SRI, International Q
SRI International Bioinformatics Comparative Analysis Q
Integration of Protein Family, Function, Structure Rich Links to >90 Databases Value-Added Reports for UniProtKB Proteins iProClass Protein Knowledgebase.
Modeling Functional Genomics Datasets CVM Lesson 3 13 June 2007Fiona McCarthy.
SRI International Bioinformatics 1 The consistency Checker, or Overhauling a PGDB By Ron Caspi.
5 EBI is an Outstation of the European Molecular Biology Laboratory. Master title Molecular Interactions – the IntAct Database Sandra Orchard EMBL-EBI.
Introduction to RefWorks John Anderson WCAS Writing Program Northwestern University November 12, 2002.
EcoliWiki and GONUTS Wiki-based Systems for Community Annotation Jim Hu Dept. of Biochemistry and Biophysics Texas A&M University.
MARS: Microarray analysis, retrieval, and storage system Albert F. Cervantes.
SRI International Bioinformatics 1 Gene Ontology in Pathway Tools: Internals.
Integration of E. Coli Data (E. coli Pathway and Genomic Data from BioCyc) Jesse Walsh.
Overviews, Omics Viewers, and Object Groups. SRI International Bioinformatics Introduction Each overview is a genome-scale diagram of cellular machinery.
RLIMS-P: A Rule-Based Literature Mining System for Protein Phosphorylation Hu ZZ 1, Yuan X 1, Torii M 2, Vijay-Shanker K 3, and Wu CH 1 1 Protein Information.
An Introduction to Designing and Executing Workflows with Taverna Katy Wolstencroft University of Manchester.
SRI International Bioinformatics 1 Recent Developments in Pathway Tools GMOD Workshop November ‘07 Suzanne Paley Bioinformatics Research Group SRI International.
What's True For E. coli… Enlisting The Community In Ongoing Genome Annotation Jim Hu EcoliHub/EcoliWiki Texas A&M University.
Galaxy for Bioinformatics Analysis An Introduction TCD Bioinformatics Support Team Fiona Roche, PhD Date: 31/08/15.
Adding GO for Large Datasets COST Functional Modeling Workshop April, Helsinki.
SRI International Bioinformatics 1 Advanced Editing of Pathway/Genome Databases Ron Caspi.
SRI International Bioinformatics 1 Object Groups & Enrichment Analysis Suzanne Paley Pathway Tools Workshop 2010.
Transcriptome Analysis
Welcome to DNA Subway Classroom-friendly Bioinformatics.
An Introduction to Designing and Executing Workflows with Taverna Aleksandra Pawlik materials by: Katy Wolstencroft University of Manchester.
Pathway Interaction Database (PID) Market Research BioPortals Tiger Team Meeting Mervi Heiskanen January 31, 2013.
The consistency Checker, or Overhauling a PGDB By Ron Caspi.
1 SRI International Bioinformatics GO Term Integration and Curation in Pathway Tools and EcoCyc Ingrid M. Keseler Bioinformatics Research Group SRI International.
Introduction to the GO: a user’s guide Iowa State Workshop 11 June 2009.
SRI International Bioinformatics 1 Submitting pathway to MetaCyc Ron Caspi.
Production Priorities. Genome protein sets User Support Production systems change Database changes On-the-fly species gene associations.
1 SRI International Bioinformatics And now for our ‘Feature’ presentation: Automatic Loading of Protein Sequence Annotation Data from UniProt to Pathway.
Data provenance in biomedical discovery Donald Dunbar Queen’s Medical Research Institute University of Edinburgh Workshop on Principles of Provenance in.
SRI International Bioinformatics 1 SmartTables & Enrichment Analysis Peter Karp SRI Bioinformatics Research Group September 2015.
Introduction to the Gene Ontology GO Workshop 3-6 August 2010.
Introduction to the GO: a user’s guide NCSU GO Workshop 29 October 2009.
EBI is an Outstation of the European Molecular Biology Laboratory. UniProtKB Sandra Orchard.
SRI International Bioinformatics Update your computers! To install a patch: Tools => Instant Patch => Download and Activate All Patches.
SRI International Bioinformatics 1 Editing Pathway/Genome Databases Ron Caspi.
Getting GO: how to get GO for functional modeling Iowa State Workshop 11 June 2009.
An example of GO annotation from a primary paper Rebecca E. Foulger (UniProt Curator) GO Annotation Camp, June 2005 PMID:
Getting GO annotation for your dataset
Why Create a PGDB? Perform pathway analyses as part of a genome project Analyze omics data Create a central public information resource for the organism,
Improving Georeferencing Workflow with Python
Genome Sequence Annotation Server
Bioinformatics Research Group
Bioinformatics Research Group
A Community Effort to Model the Human Microbiome
Bioinformatics Research Group
Reachability Analysis Bioinformatics Research Group
by Markus Krummenacker June 2011
Reachability Analysis Bioinformatics Research Group
This presentation document has been prepared by Vault Intelligence Limited (“Vault") and is intended for off line demonstration, presentation and educational.
Incremental PathoLogic
Propagating Changed Annotation and Pathway Information
This presentation document has been prepared by Vault Intelligence Limited (“Vault") and is intended for off line demonstration, presentation and educational.
This presentation document has been prepared by Vault Intelligence Limited (“Vault") and is intended for off line demonstration, presentation and educational.
This presentation document has been prepared by Vault Intelligence Limited (“Vault") and is intended for off line demonstration, presentation and educational.
Advanced PGDB Editing: Gene Ontology (GO) Terms
Yating Liu July 2018 G-OnRamp workshop
Welcome to the MaxTRAQ Tour
Welcome to the MaxTRAQ Tour
This presentation document has been prepared by Vault Intelligence Limited (“Vault") and is intended for off line demonstration, presentation and educational.
Overview of the Pathway Tools FBA Module
EXP file structure.
SRI Bioinformatics Research Group
This presentation document has been prepared by Vault Intelligence Limited (“Vault") and is intended for off line demonstration, presentation and educational.
An Introduction to Designing and Executing Workflows with Taverna
Reachability Analysis
Presentation transcript:

Importing GO terms from UniProt to a PGDB Markus Krummenacker Bioinformatics Research Group SRI International kr@ai.sri.com

GO in EcoCyc Introduction GO (http://geneontology.org) is used widely to annotate gene products with functions, processes, and cellular locations Manual curation of GO annotations in EcoCyc:

UniProtKB GO annotations GO consortium hosts UniProtKB annotations file Big, several GB. grep file for E. coli taxon ID Import code maps UniProtKB IDs to EcoCyc gene products, via DBLINKs of the products Most imported GO annots have comp. evidence Comp. ev. annots get timestamps bumped up (because they expire after 1 yr.) Suppress comp. ev. annots if redundant with an existing exp. ev. annot Prune comp. ev. annots if a more specific annot of the same kind exists (several dozens)

EcoliWiki – EcoCyc collaboration Collaboration with Jim Hu / EcoliWiki Workflow: GO UniProtKB  EcoCyc EcoCyc exports GO annots file EcoCyc GO annots  EcoliWiki Merging of EcoCyc and additional EcoliWiki annots EcoliWiki  GO consortium, deposit file for E. coli Annots are absorbed into UniProtKB Repeat in half a year

Open Issues Round-trip problem of deleted annots EcoCyc curator deletes an annot, because wrong EcoliWiki should detect this. Protocol not clear yet. For now: UniProtKB import into EcoCyc checks history logs, to prevent annot addition if that annot was deleted in the past No EcoCyc support yet for some qualifiers: NOT Contributes_to No easy user interface yet for annot import

Do it Yourself Disclaimer: Has never been tried outside of EcoCyc Prepare input file (using grep). DBLINKS need to exist on gene products. (add-go-terms-to-monomers (incorporate-ecocyc-go-terms-from-GOAFF-file :filename “…../gene_association.goa” :db-type ‘UNIPROT) ) (save-kb) (loop for p in (all-frames-that-could-contain-go-annots) do (prune-unnecessary-go-terms p :destructively-prune! t))