July 2015 CSHL Data analysis: GO tools and YeastMine, use-case examples.

Slides:



Advertisements
Similar presentations
Bioinformatics growth curves Medline records Computer power DNA sequences 3-D structures.
Advertisements

SRI International Bioinformatics 1 Genome Browser Markus Krummenacker Bioinformatics Research Group SRI, International Q
The Maize Inflorescence Project Website Tutorial Nov 7, 2014.
Integration of Protein Family, Function, Structure Rich Links to >90 Databases Value-Added Reports for UniProtKB Proteins iProClass Protein Knowledgebase.
Global Mapping of the Yeast Genetic Interaction Network Tong et. al, Science, Feb 2004 Presented by Bowen Cui.
Rama Balakrishnan Saccharomyces Genome Database Stanford University
1 Welcome to the Protein Database Tutorial This tutorial will describe how to navigate the section of Gramene that provides collective information on proteins.
Gene Ontology John Pinney
Yeast - why it simply has a lot to say about human disease.
Peter Tsai, Bioinformatics Institute.  University of California, Santa Cruz (UCSC)  A rapid and reliable display of any requested portion of genomes.
Research Methodology of Biotechnology: Protein-Protein Interactions Yao-Te Huang Aug 16, 2011.
Open Day 2006 From Expression, Through Annotation, to Function Ohad Manor & Tali Goren.
Work Process Using Enrich Load biological data Check enrichment of crossed data sets Extract statistically significant results Multiple hypothesis correction.
A Guide to SQL, Seventh Edition. Objectives Understand, create, and drop views Recognize the benefits of using views Grant and revoke user’s database.
DEMO CSE fall. What is GeneMANIA GeneMANIA finds other genes that are related to a set of input genes, using a very large set of functional.
A Study of Cystic Fibrosis Using Web-Based Tools Anuradha Datta Murphy Graduate Student, Dept. of Molecular and Integrative Physiology, University of Illinois.
NGS Analysis Using Galaxy
Viewing & Getting GO COST Functional Modeling Workshop April, Helsinki.
Copyright OpenHelix. No use or reproduction without express written consent1.
Copyright OpenHelix. No use or reproduction without express written consent1.
DAY 14: ACCESS CHAPTER 1 Tazin Afrin October 03,
Outline Quick review of GS Current problems with GS Our solutions Future work Discussion …
Basic features for portal users. Agenda - Basic features Overview –features and navigation Browsing data –Files and Samples Gene Summary pages Performing.
GENOME-CENTRIC DATABASES Daniel Svozil. NCBI Gene Search for DUT gene in human.
Fission Yeast Computing Workshop -1- Searching, querying, browsing downloading and analysing data using PomBase Basic PomBase Features Gene Page Overview.
Use cases for Tools at the Bovine Genome Database Apollo and Bovine QTL viewer.
® Microsoft Office 2010 Access Tutorial 3 Maintaining and Querying a Database.
SRI International Bioinformatics 1 Object Groups & Enrichment Analysis Suzanne Paley Pathway Tools Workshop 2010.
Key Applications Module Lesson 21 — Access Essentials
Computer Literacy BASICS: A Comprehensive Guide to IC 3, 5 th Edition Lesson 23 Getting Started with Access Essentials 1 Morrison / Wells / Ruffolo.
Browsing the Genome Using Genome Browsers to Visualize and Mine Data.
Copyright OpenHelix. No use or reproduction without express written consent1.
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
Alterations to Mendel Incomplete or partial dominance Codominance
Introduction to the GO: a user’s guide Iowa State Workshop 11 June 2009.
PIRSF Classification System PIRSF: Evolutionary relationships of proteins from super- to sub-families Homeomorphic Family: Homologous proteins sharing.
® Microsoft Office 2013 Access Maintaining and Querying a Database.
SRI International Bioinformatics 1 SmartTables & Enrichment Analysis Peter Karp SRI Bioinformatics Research Group September 2015.
Annotator Interface Sharon Diskin GUS 3.0 Workshop June 18-21, 2002.
Copyright OpenHelix. No use or reproduction without express written consent1.
SRI International Bioinformatics 1 Genome Browser Markus Krummenacker Bioinformatics Research Group SRI, International Q
Help: Strain Page Header Yeast ORF deletion: _d suffix : dubious ORF _p suffix : putative (uncharacterized) ORF Gene/Protein: The established name for.
Copyright OpenHelix. No use or reproduction without express written consent1.
Basic Local Alignment Search Tool BLAST Why Use BLAST?
Heredity Chapter 20 part 2. Genetic terms Trait - is a feature of an organism, i.e. hair colour Locus - a fixed position on a chromosome, such as the.
This tutorial will describe how to navigate the section of Gramene that provides descriptions of alleles associated with morphological, developmental,
SRI International Bioinformatics 1 Genome Browser Tomer Altman Bioinformatics Research Group SRI, International August 19th, 2009.
Copyright OpenHelix. No use or reproduction without express written consent1.
Copyright OpenHelix. No use or reproduction without express written consent1.
Copyright OpenHelix. No use or reproduction without express written consent1.
Introduction to the GO: a user’s guide NCSU GO Workshop 29 October 2009.
Excel part 5 Working with Excel Tables, PivotTables, and PivotCharts.
Genetics and Genomics Forward genetics Reverse genetics
Tools in Bioinformatics Genome Browsers. Retrieving genomic information Previous lesson(s): annotation-based perspective of search/data Today: genomic-based.
The Genome Genome Browser Training Materials developed by: Warren C. Lathe, Ph.D. and Mary Mangan, Ph.D. Part 2.
Visualizing data from Galaxy
C Copyright © 2009, Oracle. All rights reserved. Using SQL Developer.
Data analysis: GO tools, YeastMine, and use case examples
A hands on tour of the Saccharomyces Genome Database (SGD)
The Human-Mouse: Disease Connection in MGI (BETA)
Unit 3.
Department of Genetics • Stanford University School of Medicine
Pick a Gene Assignment 4 Requirements
What is an Ontology An ontology is a set of terms, relationships and definitions that capture the knowledge of a certain domain. (common ontology ≠ common.
Cancer therapeutics in yeast
Explore Evolution: Instrument for Analysis
Alterations to Mendel Incomplete or partial dominance Codominance
SRI Bioinformatics Research Group
Welcome - webinar instructions
Annotator Interface GUS 3.0 Workshop June 18-21, 2002.
Presentation transcript:

July 2015 CSHL Data analysis: GO tools and YeastMine, use-case examples

July 2015 CSHL How to leverage data rich SGD! About 93,000 GO annotations (manual, HTP and computational) About 135,000 phenotype annotations (manual and HTP) About 338,000 physical (130K) and genetic (208K) interactions

July 2015 CSHL Analysis entry points Home page links GO term pages (e.g. autophagy +/- child terms) Phenotype observable-qualifier pairs (e.g. cell size: increased, or just observable Interaction pages (Physical, Genetic, Intersection, All)

July 2015 CSHL GO Slim Mapper Maps annotations of a group of genes to more general terms; bin them into broad categories You complete a screen looking for mutants with altered sensitivity to a new drug (see: List2 GSM SGD P) and want to know based on the mutants identified what process is being affected.

July 2015 CSHL GO Slim Mapper

July 2015 CSHL GO Slim Mapper: Results

July 2015 CSHL GO Term Finder Searches for significant shared GO terms or parents of these terms, so you can figure out what these genes have in common.

July 2015 CSHL GO Term Finder Optional: Step 3 Specify background set of genes Step 4 Refine the Annotations used for calculation Step 5 Select a p-value cutoff and/or toggle FDR

July 2015 CSHL Batch GO Term Finder Advantages: process multiple gene lists in parallel handles longer gene lists large number of available organisms Example, list of genes with a defect in endomembrane system morphology (List4_GTF_Princeton_P)

July 2015 CSHL Batch GTF Results

July 2015 CSHL YeastMine A multifaceted search and retrieval environment that provides access to diverse data types. Initiate searches, with a gene, list of genes. Results can be combined for further analysis and saved or downloaded in customizable file formats.

July 2015 CSHL Basic features Templates: predefined queries. Filter by category or keyword: –genome –protein –function –interactions –regulation –homology –expression –literature or keyword –“intron” –“sequence”

July 2015 CSHL Template results page (1) Add, re-arrange order, sort-order Edit, add, remove a filter/constraint (e.g. just null vs filter) Create new or add to existing list Download results in various formats

July 2015 CSHL Template results page (2) Navigation aids Column sort Remove column Toggle column visibility Column summary

July 2015 CSHL Lists and list operations List creation –Create/add –Select genes –Name, describe, rename and share (MyMine) display List operations Intersection (DNA replication AND DNA repair or genes on ChrIV, that are inviable when deleted) Union (DNA replication and/or DNA repair, two sets of interactions, etc.) Subtract (DNA replication or DNA repair) Asymmetric diff. (DNA replication minus repair; DNA repair minus replication)

July 2015 CSHL Use case: finding novel mitoribosomal proteins I’m interested in the mitochondrial ribosome. Does it have any as-yet-undiscovered subunits? 1. Find the known mitochondrial ribosomal proteins using YeastMine

July 2015 CSHL 2. Create a list of the results (90 genes) 3. Look for genes/proteins that interact with mt_ribosomal proteins 4. Create a list of 1,051 interacting genes/proteins.

July 2015 CSHL Are any of the interacting genes uncharacterized? create a list of uncharacterized genes find the intersection between the list of uncharacterized genes and the list of mitochondrial ribosome-interacting genes 34 genes are uncharacterized

July 2015 CSHL 34 uncharacterized ORFs interact genetically or physically with known mitochondrial ribosomal proteins. Mutation of a mt ribosomal subunit would block respiratory growth. Do any of these 34 genes exhibit this mutant phenotype? create list of genes that confer a respiratory phenotype find the intersection with the list of 34 uncharacterized ORFs

July 2015 CSHL Three uncharacterized ORFs exhibit genetic or physical interactions with known mt ribosomal proteins AND block respiratory growth when mutated Systematic name Gene nameName Description ORF lengthDescription YBL095WMRX3 Mitochondrial organization of gene expression813 Protein that associates with mitochondrial ribosome; likely functions in cristae junction formation; the authentic, non-tagged protein is detected in highly purified mitochondria in high-throughput studies YDL157C 357 Putative protein of unknown function; the authentic, non-tagged protein is detected in highly purified mitochondria in high- throughput studies YPR109W 885 Predicted membrane protein; diploid deletion strain has high budding index

July 2015 CSHL Explore a gene PRP8 1.Identify PRP8 interactors 2.Use OMIM to identify yeast genes orthologous to human genes involved in retinitis pigmentosa 3.Intersect the two lists to identify PRP8 interactors with orthologs involved in RP

July 2015 CSHL 1. Select template2. Enter gene and show results 3. Remove columns, select manual annotations, and save list of interacting genes/proteins 4. View enrichment

July 2015 CSHL 5. Now go backwards from human disease to human genes to yeast orthologs. Select “OMIM Disease Phenotype -> human gene(s) -> yeast homolog(s)” and enter “retinitis pigmentosa” 6. Perform an inverse selection using column summary to remove “LEBER CONGENITAL …“ 7.Create a second list of yeast orthologs of human genes associated with RP

July 2015 CSHL 8. Now intersect the two lists (PRP8 interactors and RP orthologs) In fact there is evidence that these 3 proteins are associated with Prp8p, as part of the U4/U6-U5 tri-snRNP spliceosome complex!

July 2015 CSHL Prp8p inhibits Brr2p but not RP mutants.

July 2015 CSHL Make your own query Start in QueryBuilder: –select a data type: ORF –constrain to a chromosome: chrI –constrain qualifier: not dubious –show: standard name, systematic name, primary DBID Can run query (show results) Export XML (share with another) Save Query (name it for later run, edit or export in MyMine)