Stephen Taylor, CBRG GMOD Europe 2010

Slides:



Advertisements
Similar presentations
Introductory to database handling Endre Sebestyén.
Advertisements

Functional Genomics with Next-Generation Sequencing
BioPivot: Applying Microsoft Live Labs’ Pivot to Problems in Bioinformatics Stephen Taylor, CBRG GMOD Europe 2010.
Silverlight PivotViewer in Scientific Discovery Jennifer Lin Sr. SDET Lead Developer Division Microsoft Corporation February 2011 Jennifer Lin Sr. SDET.
Visualization of genomic data Genome browsers. UCSC browser Ensembl browser Others ? Survey.
UCSC Archaeal genome browser Advanced browsing September 19, 2006 David Bernick, Aaron Cozen and Todd Lowe September 19, 2006 David Bernick, Aaron Cozen.
Genomic Database - Ensembl Ka-Lok Ng Department of Bioinformatics Asia University.
Algorithm Animation for Bioinformatics Algorithms.
Visualization of genomic data Genome browsers. UCSC browser Ensembl browser Others ? Survey.
Technology and Methods Seminar
© 2010 OpenLink Software, All rights reserved. Linked Data Visualization Using HTML5 based PivotViewer By Kingsley IdehenKingsley Idehen Twitter
WebGBrowse A Web Server for GBrowse Configuration Ram Podicheti B.V.Sc. & A.H. (D.V.M.), M.S. Staff Scientist – Bioinformatics Center for Genomics and.
Genomics Virtual Lab: analyze your data with a mouse click Igor Makunin School of Agriculture and Food Sciences, UQ, April 8, 2015.
Galaxy for Bioinformatics Analysis An Introduction TCD Bioinformatics Support Team Fiona Roche, PhD Date: 31/08/15.
Regulatory Genomics Lab Saurabh Sinha Regulatory Genomics Lab v1 | Saurabh Sinha1 Powerpoint by Casey Hanson.
Next Generation Sequencing Bioinformatics Stephen Taylor Computational Biology Research Group.
Browsing the Genome Using Genome Browsers to Visualize and Mine Data.
The generic Genome Browser (GBrowse) A combination database and interactive web page for manipulating and displaying annotations on genomes Developed by.
University of Illinois at Urbana-Champaign BeeSpace Navigator v4.0 and Gene Summarizer beespace.uiuc.edu `
Bulk data files // TeraGrid uses for Genome Databases GMOD meet, June 2006 Don Gilbert,
Sackler Medical School
Google Refine for Data Quality / Integrity. Context BioVeL Data Refinement Workflow Synonym Expansion / Occurrence Retrieval Data Selection Data Quality.
Analysis of GEO datasets using GEO2R Parthav Jailwala CCR Collaborative Bioinformatics Resource CCR/NCI/NIH.
Alistair Chalk, Elisabet Andersson Stem Cell Biology and Bioinformatic Tools, DBRM, Karolinska Institutet, September Day 5-2 What bioinformatics.
Regulatory Genomics Lab Saurabh Sinha Regulatory Genomics | Saurabh Sinha | PowerPoint by Casey Hanson.
How do we represent the position specific preference ? BID_MOUSE I A R H L A Q I G D E M BAD_MOUSE Y G R E L R R M S D E F BAK_MOUSE V G R Q L A L I G.
A collaborative tool for sequence annotation. Contact:
Cool BaRC Web Tools Prat Thiru. BaRC Web Tools We have.
Bioinformatics for biologists Dr. Habil Zare, PhD PI of Oncinfo Lab Assistant Professor, Department of Computer Science Texas State University Presented.
UCSC Genome Browser Zeevik Melamed & Dror Hollander Gil Ast Lab Sackler Medical School.
Tools in Bioinformatics Genome Browsers. Retrieving genomic information Previous lesson(s): annotation-based perspective of search/data Today: genomic-based.
Accessing and visualizing genomics data
Visualization of genomic data Genome browsers. How many have used a genome browser ? UCSC browser ? Ensembl browser ? Others ? survey.
HOMER – a one stop shop for ChIP-Seq analysis
Introduction to Exome Analysis in Galaxy Carol Bult, Ph.D. Professor Deputy Director, JAX Cancer Center Short Course Bioinformatics Workshops 2014 Disclaimer…I.
Visualizing data from Galaxy
JBrowse Mitch Skinner Ian Holmes lab UC Berkeley
BLAST: Basic Local Alignment Search Tool Robert (R.J.) Sperazza BLAST is a software used to analyze genetic information It can identify existing genes.
Galaxy for analyzing genome data Hardison October 05, 2010
IRI/LDEO Climate Data Library M.Benno Blumenthal, Michael Bell, and John del Corral International Research Institute for Climate and Society Columbia University.
Konstantin Okonechnikov Qualimap v2: advanced quality control of
Lab 7.2.
Using command line tools to process sequencing data
Nebula : A public web-server for advanced ChIP-seq data analysis
Simon v1.0 Motif Searching Simon v1.0.
Advanced ChIP-seq Identification of consensus binding sites for the LEAFY transcription factor Explain that you can use your own data Explain that data.
Using RNA-seq data to improve gene annotation
Regulatory Genomics Lab
Short Read Sequencing Analysis Workshop
Epigenomics study – ChIP-Seq analysis
Genome Browsers.
Bioinformatics for biologists
EPConDB: Endocrine Pancreas Consortium Database
University of Pittsburgh
TSS Annotation Workflow
GEP Annotation Workflow
Visualization of genomic data
GBrowse-related work at ApiDB
Introduction to G-OnRamp
Ensembl Genome Repository.
Bioinformatics Research Group SRI International
Vector NTI Introduction
Simon V Motif Searching Simon V
Yating Liu July 2018 G-OnRamp workshop
Follow-up from last night: XSEDE credits
Regulatory Genomics Lab
New Round of Regional data collections Deltares
Creating Datasets & Using Data Flows
Introduction to RNA-Seq & Transcriptome Analysis
Regulatory Genomics Lab
Presentation transcript:

Stephen Taylor, CBRG GMOD Europe 2010 BioPivot: Applying Microsoft Live Labs’ Pivot to Problems in Bioinformatics Stephen Taylor, CBRG GMOD Europe 2010

Introduction Visualization of large numbers of genome regions Querying and filtering properties of genome regions Pivot and BioPivot tools Open discussion of other applications of technology

CBRG

Over 50 different GBrowse databases Many labs started wanting GBrowse Human Mouse Bacterial Time series Arrays ChIP-Seq RNA-Seq

Next Generation Sequencing Histone modification Data ChIP-Seq Interaction cis/trans data PCR amplified regions RNA-Seq Exome Sequencing/SNP detection

ChIP-Seq example Map NGS reads Peak pick Extract sequences from features Motif extract Weblogo

ChIP-Seq NGS reads Map Peakfind

Problems Experimental conditions Antibody Peak finders give false positives Lots of parameters Must choose a suitable cut-off Eyeballing lots of peaks

Further Analysis Which of my peaks overlap with: Genes Exons Promoters CpG Islands Areas of conservation etc

SLOW! BORING! Traditionally Make spreadsheet of data with links to Gbrowse/UCSC regions of interest Click/Filter various parameters Add data to spreadsheet each new analysis SLOW! BORING!

Deep Zoom Tech Blaise Aguera y Arcas (TED 2007) Seadragon/Photosynth http://www.seadragon.com/showcase/ Microsoft Live Labs’ Pivot http://www.getpivot.com/

Wouldn’t it be cool... To use this in bioinformatics...? Take thousands of regions of interest of genome View and Filter seamlessly on metadata

BioPivot Tools GFF3 of ROI Ninth column contains ‘facets’ Choose your GBrowse or UCSC Browser View Run the command: gff32pivot my.gff3 –dzi –generateimages –conf mytypes.cfg -o my.cxml –browser gbrowse2

BioPivot Tools Parsers for peakfinders Annotate a GFF3 file nearest gene exons, introns, intergenic, intragenic TSS/TES up and down stream regions Overlaps of GFF3/BED features

Open Source Zoomable User Interfaces (ZUIs) OpenZoom http://openzoom.org/ SDK for Flash, Flex & AIR APIs

Deep Zoom Image

Deep Zoom Collection Groups of tiled images

CXML File

To Do Installation scripts Deploy in a web browser using Silverlight RNA-Seq parsers e.g. cufflinks, DESeq Get feedback from the community What else can we do with this tech? http://www.cbrg.ox.ac.uk/data/biopivot

Acknowledgements - Code OpenZoom http://openzoom.org/ Cisgenome http://www.biostat.jhsph.edu/~hji/cisgenome/ BEDtools http://code.google.com/p/bedtools/

Acknowledgements - People Jim Hughes CBRG Team GMOD Team