Advanced PGDB Editing: Gene Ontology (GO) Terms

Slides:



Advertisements
Similar presentations
The Pathway/Genome Navigator (These slides are a guide as you experiment with the Navigator)
Advertisements

Prokaryotic Gene Regulation:
SRI International Bioinformatics Data Import / Export Markus Krummenacker Bioinformatics Research Group SRI, International Q
SRI International Bioinformatics Comparative Analysis Q
SRI International Bioinformatics 1 Genome Browser Markus Krummenacker Bioinformatics Research Group SRI, International Q
Overviews and Omics Viewers. SRI International Bioinformatics Introduction Each overview is a genome-scale diagram of a different aspect of the cellular.
Modeling Functional Genomics Datasets CVM Lesson 3 13 June 2007Fiona McCarthy.
1 Configuring Internet- related services (April 22, 2015) © Abdou Illia, Spring 2015.
BsubCyc – A Model-Organism Database for Bacillus subtilis Ing Ingrid M. Keseler SRI International.
Overview of the Pathway Tools Software and Pathway/Genome Databases.
The IntAct Database Sandra Orchard & Birgit Meldal.
Gene Ontology John Pinney
1 Configuring Web services (Week 15, Monday 4/17/2006) © Abdou Illia, Spring 2006.
Creating a … Community Database Organism-Specific Database Model-Organism Database.
1 SRI International Bioinformatics Advanced PGDB Editing: Regulation GO Terms Ingrid M. Keseler Bioinformatics Research Group SRI International
Overviews, Omics Viewers, and Object Groups. SRI International Bioinformatics Introduction Each overview is a genome-scale diagram of cellular machinery.
Erice 2008 Introduction to PDB Workshop From Molecules to Medicine: Integrating Crystallography in Drug Discovery Erice, 29 May - 8 June Peter Rose
Viewing & Getting GO COST Functional Modeling Workshop April, Helsinki.
Overviews and Omics Viewers. SRI International Bioinformatics Introduction Each overview is a genome-scale diagram of cellular machinery l Cellular Overview.
Copyright OpenHelix. No use or reproduction without express written consent1.
SRI International Bioinformatics 1 Recent Developments in Pathway Tools GMOD Workshop November ‘07 Suzanne Paley Bioinformatics Research Group SRI International.
NoodleBib is available at the Jericho Middle School Library Website
SRI International Bioinformatics 1 Object Groups & Enrichment Analysis Suzanne Paley Pathway Tools Workshop 2010.
Copyright OpenHelix. No use or reproduction without express written consent1.
1 SRI International Bioinformatics GO Term Integration and Curation in Pathway Tools and EcoCyc Ingrid M. Keseler Bioinformatics Research Group SRI International.
Motifs BCH364C/391L Systems Biology / Bioinformatics – Spring 2015 Edward Marcotte, Univ of Texas at Austin Edward Marcotte/Univ. of Texas/BCH364C-391L/Spring.
SRI International Bioinformatics 1 Submitting pathway to MetaCyc Ron Caspi.
1 SRI International Bioinformatics And now for our ‘Feature’ presentation: Automatic Loading of Protein Sequence Annotation Data from UniProt to Pathway.
SRI International Bioinformatics 1 SmartTables & Enrichment Analysis Peter Karp SRI Bioinformatics Research Group September 2015.
Regulatory Genomics Lab Saurabh Sinha Regulatory Genomics | Saurabh Sinha | PowerPoint by Casey Hanson.
The Pathway/Genome Navigator. SRI International Bioinformatics Overview Data page types General query strategies Web queries Desktop Pathway Tools User.
Primary vs. Secondary Databases Primary databases are repositories of “raw” data. These are also referred to as archival databases. -This is one of the.
SRI International Bioinformatics 1 Editing Pathway/Genome Databases Ron Caspi.
Copyright OpenHelix. No use or reproduction without express written consent1.
Tools in Bioinformatics Genome Browsers. Retrieving genomic information Previous lesson(s): annotation-based perspective of search/data Today: genomic-based.
SRI International Bioinformatics 1 Pathway Tools Features Available Only in the Desktop Version PathoLogic.
Copyright OpenHelix. No use or reproduction without express written consent1.
Recent Developments and Future Directions in Pathway Tools Peter D. Karp SRI International.
The Pathway/Genome Navigator
Overviews, Omics Viewers, Pathway Collages
Getting GO annotation for your dataset
Editing Pathway/Genome Databases
Comparative Analysis in BioCyc
Why Create a PGDB? Perform pathway analyses as part of a genome project Analyze omics data Create a central public information resource for the organism,
GSEA-Pro Tutorial Anne de Jong University of Groningen.
Bioinformatics Research Group
The Pathway Tools Schema
PathoLogic: More about Matching Enzyme Names to Reactions
Product Retrieval Statistics Canada / Statistique Canada Title page
The webinar will start at 12:10pm.
Transition from Classic Interface Phoenix Interface to
Department of Genetics • Stanford University School of Medicine
Control of Gene Expression
Bioinformatics Research Group
Data Content of BioCyc Ingrid Keseler 1.
Comparative Analysis Q
Configuring Internet-related services
Data Content of BioCyc Ingrid Keseler 1.
Advanced PGDB Editing: Regulation GO Terms
What is an Ontology An ontology is a set of terms, relationships and definitions that capture the knowledge of a certain domain. (common ontology ≠ common.
Ensembl Genome Repository.
Electronic Field Study Advanced User Training
Bioinformatics Research Group SRI International
Welcome to the Quantitative Trait Loci (QTL) Tutorial
Entering New Employees (Without Vacancy Permit)
The MultiOmics Explainer
SRI Bioinformatics Research Group
Community-Engaged Partnership Database: VCU’s Commitment to Community Engagement
Slide Set I: PARS Overview
NatQuery An End-User Perspective On Using To Extract Data From ADABAS
Presentation transcript:

Advanced PGDB Editing: Gene Ontology (GO) Terms Ingrid M. Keseler Bioinformatics Research Group SRI International keseler@ai.sri.com Presented by Peter E. Midford http://bioinformatics.ai.sri.com/ptools/tutorial/sessions/curation/Curation-of-Regulatory-Information-and-GO-Terms/Editing&ImportingGo.ppt

Gene Ontology (GO) Terms

Motivation: Why GO Terms? For example: Standardization of annotation helps others Data mining across genomes Genome annotation by similarity (e.g. via InterPro, Pfam, TIGRFAM, COG mappings) It helps you: Omics data clustering Cross-checking with different annotation source

A Word (Or Two) About GO Learn what you can about using GO Surf the geneontology.org web site Attend a GO Annotation Camp Ask questions on the GO mailing lists Request new GO terms if appropriate Useful for everybody Have input when it counts A new GO database can be incorporated into Pathway Tools; request help with setting up the process Computational GO term assignments may be available for your genome via UniProt

GO Classification Editor Accessible via the Protein Editor Expand/contract, select/deselect by clicking on +/- and the actual terms Selected items move to “Selections” section Search feature: can search by name/substring or GO id (in the full format only, e.g. “GO:0007165”, not just the number) For example, search for “arginine” yields many options Click an option to highlight in hierarchy Hovering over a term in the hierarchy brings up its definition in the middle panel Must still click on entry in hierarchy to select the term for annotating the protein

Caveats Implementation of GO in Pathway Tools is incomplete, e.g. Only one qualifier, “with”, is implemented No annotation extensions If you are interested in a more complete implementation of the GO ontology and curation capabilities, please let us know

Importing GO Terms via UniProt IDs (1) Download annotation file from geneontology.org: http://geneontology.org/page/download-annotations You probably want the UniProt [multispecies] file Use the ftp: address and either an ftp app (e.g., FileZilla, CyberDuck) or the command line curl tool to download the uniport file e.g., curl ftp://ftp.ebi.ac.uk/pub/databases/GO/goa/UNIPROT/goa_uniport_all.gaf.gz > goa_uniport_all.gaf.gz

Importing Go Terms via UniProt IDs (2) Unzip the file and use grep, or use zgrep directly, to extract annotations for your organism by taxID zgrep ‘taxon:85962’ goa_uniport_all.gaf.gz > goa_uniport_85962 Beware that these are large files and may take a long time to download (e.g. 7 hrs), a long time to unzip, and a long time to grep through (i.e. don’t try this here)

Importing GO Terms May at some point be made into a GUI tool within Pathway Tools; at this time, it is a command line operation At the lisp prompt, with your database opened: (update-uniprot-go-annots ”filepath”) Where filepath is replaced with e.g. /Users/midford/Documents/demo/goa_uniport_85962 Some log files are written into the same directory; do some sanity checking of the files and the database before saving Import citations

Advanced PGDB Editing: Regulation Ingrid M. Keseler Bioinformatics Research Group SRI International keseler@ai.sri.com Presented by Peter E. Midford

Motivation: Why Regulation? For example: Genome and regulatory overview Global perspective Omics data Data sets for promoter prediction etc.

Omics Viewer: Regulatory Overview Data from J Bacteriol. 2010 Feb;192(3):870-82. A comprehensive proteomics and transcriptomics analysis of Bacillus subtilis salt stress adaptation.

Defining a New Transcription Unit Gene > New Operon Key elements – gene names in order PTools will prompt you for a citation for the TU Specify promoter Can use absolute or relative position of transcription start site PTools will calculate the other value for you PTools will prompt you for a citation for the TSS Specify sigma factor (if appropriate) It may be necessary to first classify sigma factors under |Sigma-Factors|

Adding Transcription Factor Binding Sites Click on TU name – Edit > Create Regulatory Interaction Select type of regulatory interaction Can put in a protein name, or select a defined TF Indicate whether it activates, represses or both Define relative distance from transcription start site Draws DNA footprint from feature defined in TF Can edit TF binding sites by clicking on site name Edit > Regulatory Interaction Editor Can add summaries and citations This builds the transcriptional regulatory network

More Regulatory Interactions Attenuation Regulation of translation RNA-mediated Protein-mediated Small molecule-mediated Regulated protein or mRNA degradation (planned) If you have suggestions for types of regulation you would like to represent, or for improvements on what is there, please let us know. Tools for genome-scale datasets?