Visualizing data from Galaxy

Slides:



Advertisements
Similar presentations
Welcome to eDMR This PowerPoint presentation is designed to show eDMR users how to login and begin using the eDMR system.
Advertisements

A Toolbox for Blackboard Tim Roberts
Microsoft Excel 2003 Illustrated Complete Excel Files and Incorporating Web Information Sharing.
The Maize Inflorescence Project Website Tutorial Nov 7, 2014.
Windows XP Basics OVERVIEW Next.
Variant Calling Workshop Chris Fields Variant Calling Workshop v2 | Chris Fields1 Powerpoint by Casey Hanson.
Microsoft®Office Outlook Web Access Training: Getting Started
UNESCO ICTLIP Module 4. Lesson 3 Database Design, and Information Storage and Retrieval Lesson 3. Information storage and retrieval using WinISIS.
Genome Browsers Carsten O. Daub Omics Science Center RIKEN, Japan May 2008.
Copyright OpenHelix. No use or reproduction without express written consent1.
1 of 6 This document is for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS DOCUMENT. © 2007 Microsoft Corporation.
1 of 6 This document is for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS DOCUMENT. © 2007 Microsoft Corporation.
RIMS II Online Order and Delivery System Tutorial on Downloading and Viewing Multipliers.
1 of 7 This document is for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS DOCUMENT. © 2007 Microsoft Corporation.
New School Websites Teacher Pages. Visit the SCUSD Website for videos tutorials: For more information.
Bacterial Genome Assembly | Victor Jongeneel Radhika S. Khetani
Lawson System Foundation 9.0
Creating a MagicInfo Pro Screen Template
NGS Analysis Using Galaxy
taskbar Notification area Start To change size of taskbar: RMC- uncheck Lock the taskbar Changing Properties : RMC- Properties.
Introduction to RNA-Seq and Transcriptome Analysis
Customized cloud platform for computing on your terms !
Guideline for ClinLabGeneticist tool Jinlian Wang
Download Dropbox Download should start immediately Save download file:
Network Management Tool Amy Auburger. 2 Product Overview Made by Ipswitch Affordable alternative to expensive & complicated Network Management Systems.
LINGNAN UNIVERSITY LIBRARY MAR 2013 LCE102 Library Workshop 1
Introduction to RNA-Seq & Transcriptome Analysis
Instructors begin using McGraw-Hill’s Homework Manager by creating a unique class Web site in the system. The Class Homepage becomes the entry point for.
Galaxy for Bioinformatics Analysis An Introduction TCD Bioinformatics Support Team Fiona Roche, PhD Date: 31/08/15.
Regulatory Genomics Lab Saurabh Sinha Regulatory Genomics Lab v1 | Saurabh Sinha1 Powerpoint by Casey Hanson.
NGS data analysis CCM Seminar series Michael Liang:
1 OrderPro Point of Sale (POS) Training Prepared by Christina Van Metre Independent Educational Consultant CTO, Business Development Team © Training Version.

Browsing the Genome Using Genome Browsers to Visualize and Mine Data.
For additional assistance, please call the Help Desk Searching 1. If a Search window does not appear after logging into the system, click the Search icon.
ChrGeneticist introduction for reviewer Jinlian Wang 10/8/2014.
GISMO/GEBndPlan Overview Geographic Information System Mapping Object.
IPlant Collaborative Discovery Environment RNA-seq Basic Analysis Log in with your iPlant ID; three orange icons.
Using the AccuGlobe Software with the IndianaMap Using the AccuGlobe Software.
The material contained in this document is proprietary to Triniti Corporation (Triniti). This material may not be disclosed, duplicated or otherwise revealed,
1 EndNote X2 Your Bibliographic Management Tool 29 September 2009 Humanities and Social Sciences Resource Teams.
The UCSC Table Browser & Custom Tracks Advanced searching and discovery using the UCSC Table Browser and Custom Tracks Osvaldo Graña CNIO Bioinformatics.
Regulatory Genomics Lab Saurabh Sinha Regulatory Genomics | Saurabh Sinha | PowerPoint by Casey Hanson.
Copyright OpenHelix. No use or reproduction without express written consent1.
MBAT User Workflows View an Atlas Open Data Upload Data Run a Query –Search Data Further Examination Microarray Data Further Examination of 2D Data –Search.
Introduction to KE EMu Unit objectives: Introduction to Windows Use the keyboard and mouse Use the desktop Open, move and resize a.
Mapping local community assets online Read this if you want to learn how to: 1)Create online maps of local community assets using Google Maps 2)Allow other.
Chapter 3 Device Monitor Screen Otasuke GP-EX! Chapter 3 Device Monitor Screen Chapter 3 Device Monitor Screen.
A user guide to accessing, reviewing and contributing to the Online Registry System.
Introduction to KE EMu Unit objectives: Introduction to Windows Use the keyboard and mouse Use the desktop Open, move and resize a.
® IBM Software Group © 2007 IBM Corporation Module 3: Creating UML Diagrams Essentials of Modeling with IBM Rational Software Architect, V7.5.
Tools in Bioinformatics Genome Browsers. Retrieving genomic information Previous lesson(s): annotation-based perspective of search/data Today: genomic-based.
LINGNAN UNIVERSITY LIBRARY SEP 2011 LIB170: Using RefWorks 1.
Integrating Components and Dynamic Text Boxes with the Animated Map– Lesson 101 Integrating Components and Dynamic Text Boxes with the Animated Map Lesson.
{ How to Create a Flipchart in ActivInspire Lara Daniel.
TRACKSTER &CIRCSTER DEMO Slides: /g/funcgen/trainings/visualization/Demos/Trackster+Circster.ppt Galaxy: Galaxy Dev:
Introduction to Exome Analysis in Galaxy Carol Bult, Ph.D. Professor Deputy Director, JAX Cancer Center Short Course Bioinformatics Workshops 2014 Disclaimer…I.
© Execview Ltd 2015: all rights reserved Execview Scorecard Training/Reference Guide 2013 Key functions guide for Scorecard administrators.
IGV Demo Slides:/g/funcgen/trainings/visualization/Demos/IGV_demo.ppt Galaxy Dev: 0.
IT Desktop Management GoToMeeting – Online Meeting Service Organizer/Presenter Guide May 2007 Prepared by Angela Mars IT Education and Training.
Canadian Bioinformatics Workshops
AdisInsight User Guide July 2015
Day 5 Mapping and Visualization
Canadian Bioinformatics Workshops
Integrative Genomics Viewer (IGV)
Introduction to Basic Interface of
Data Visualization Web Application
InnovationQ Plus Quick Start Guide
Regulatory Genomics Lab
Presentation transcript:

Visualization in Galaxy Trackster,Circster,IGV

Visualizing data from Galaxy

Visualization options in Galaxy Several datasets in Galaxy can be visualized directly on UCSC, IGB, IGV, Ensembl. You can also click on the fourth icon below the dataset name to ‘Visualize’ your dataset in Trackster, Circster or Scatterplot. why is this good to have visualization in galaxy? Because it allows you to check your data directly and easily right after mapping (bam files visu) or at each step of processing ie "do I have signal in my positive controls".

What is Trackster? Galaxy's visualization and visual analysis environment. Genome browser for high throughput sequencing data. It lets you visualize your SAM/BAM, BED, GFF/GTF, WIG, bigWig, bigBed, bedGraph, and VCF datasets from within Galaxy. Works with large NGS/HTS datasets. More details can be found on the Galaxy Wiki Tutorial

Visualizing various formats in Trackster Shown here are datasets of different formats visualized in Trackster. All of them are available in the data library: Training->visualizationTutorial-RNA-Seq gff3 vcf BedGraph Bed Bam Gff3- coverage Vcf- coverage Bedgraph-Histogram Bed-Auto/Squish Bam- Auto/Squish Wig/Bigwig- Histogram Gtf- auto/squish wig Bigwig gtf Overflow beyond listed maximum/minimum value indicated by red region (Histogram) or red dot (Line)

Visualizing data from your Galaxy History in Trackster Datasets in Galaxy can be easily visualized in Trackster by clicking on the ‘Visualize’ icon under the dataset name. Shown here is the ‘mapped4.vcf’ dataset from Shared Data->Data Libraries->Training->visualizationTutorial-RNA-Seq. History Panel in Galaxy

Viewing data in a New or Saved Visualization

Viewing data in a New Visualization A ‘Browser name’ should be entered and the Reference genome build selected.

Navigating in Trackster Selecting a chromosome Typing in the desired location The Floppy Disc icon in the upper right corner is used to save the visualization.

Adding Tracks to the saved visualization The first icon in the upper-right corner of the Trackster window is used to ‘Add Tracks’. Datasets can be added from any of your Histories or Data Libraries.

Display modes in Trackster Visualization by two track classes: - Line Tracks: display quantitative data along a y-axis - wig, bigWig, bigBed,BedGraph -Drawing modes: Histogram, Line, Filled, Intensity Intensity: show the data point as a gradient of the current y-axis range (darker is more). Filled: fill region from the bottom of the track to the data point. Line: connect the data points with a line. Red dot indicates overflow beyond the listed maximum/minimum value. Histogram: fill region between x-axis and the data point. Red region indicates overflow beyond the listed maximum/minimum value.

Display modes in Trackster(2) FeatureTracks: display feature interval data - BED,BAM/SAM, gff, gtf, vcf - Drawing modes: Automatic, Coverage, Dense, Squish, Pack Display modes for a gff3 track shown here. -Squish- draw exons but not strand/labels -Pack- draw exons, strand, and labels Dense- Everything stacked on top of one another Coverage/Histogram- Show no. of features in binned regions

Changing track settings Max/min values set automatically if not specified by user via ‘Edit settings’. Colour change of tracks also possible via ‘Edit settings’.

Other Features in Trackster Zooming in and out on tracks Dynamic filters based on feature attribute values. Tool parameter settings and Tool Parameter space visualization for Cufflinks datasets. Composite tracks.

Visualizing in Circster Clicking on the fourth (circular) icon in the upper-right corner of the Trackster window will visualize the same datasets in Circster. Adding tracks from the ‘Additional’ folder in the training data library will result in the visualization shown below. The interactions given by the ‘chrint’ format make up the arcs in the centre.

What is Circster? Circos-style viewer for rendering genome-wide data on the Web. Circos - software package for visualizing data and information in a circular layout. Non-genomic data visualized by Circos

How is data visualized with Circster? Position-based data (e.g., binding affinity, gene expression) laid out in concentric circles representing chromosome position. Chromosome interaction data (e.g., three-dimensional interactions, gene fusions) denoted as arcs on the inside of the position data. Datasets can be added easily. Ideal for exploring relationships between objects or positions. Better visualization for whole genome data.

Circster visualization Example Circster visualization with mapped RNA-seq reads and interaction data. A Zoomed–in image of the visualization. Initially, Circster shows the complete genome, and all data is visible for all datasets. Circster uses the D3 framework to render genomic data as SVG. Using SVG paths to display complex objects such as histograms (e.g. coverage data) and line data (e.g. conservation data) limits the number of objects created and makes the visualization scalable to a large number of datasets. Figure 3 shows a Circster visualization for mapped RNA-seq reads from both cancer cell lines and normal tissue, as well as potential chimeric transcripts from ChimerDB 2.0 [18]. Panning and zooming around the view results in more detailed data for visible regions. Goecks et al.

Saving and exiting Save the visualization by clicking on the floppy icon at the upper-right corner. Go back to the Galaxy main window by clicking on ‘Analyze Data’ in the Galaxy menu. Analyze Data Save

Caution with Circster Though Circster is a good way to visualize data in a circular format, we have noted some difficulties while using it. Changing track settings, deleting tracks not possible directly in Circster. Unexpected visualization glitches seen. We would advise you to be cautious with you data while using Circster.

What is IGV? Elaborated as ‘Integrative Genomics Viewer’. Integrated visualization tool for multiple data types and genome annotations. Data from NGS analysis visualized .

Why use IGV? Handles large datasets Interactive and speedy visualization. Run locally on desktop View multiple datasets in separate panels on the same pane. Several track management options (filtering, grouping, sorting) and in-built tools (index, sort, motif finder…) Direct visualization from Galaxy : bed, wig, gff, gff3, Bam, bigWig, bigBed, vcf formats .

Visualizing Galaxy datasets in IGV 2 modes to open Galaxy datasets in IGV: Web current : Download IGV on your system and open dataset in new window. Local : To visualize in the already running IGV

Visualizing Galaxy datasets in IGV(2) 1.Select the ‘web current’ option for IGV 2.Open the downloaded jnlp file with ‘Java Web Start’ 3. You will be asked for authentication. Please enter your embl username and password.

Visualizing Galaxy datasets in IGV(3) This is what you should be able to visualize at first. Select a particular chr or type in the desired location

Whole genome view in IGV The whole genome can be visualized by selecting the ‘All’ option from the Chromosome drop-down menu in the Tool Bar. When you load genomic data, IGV displays the data in horizontal rows called tracks. Typically, each track represents one sample or experiment. For each track, IGV displays the track identifier, one or more attributes, and the data. Attribute names are listed at the top of the attribute panel. Colored blocks represent attribute values, where each unique value is assigned a unique color. Hover over a colored block to see the attribute value. Click an attribute name to sort tracks based on that attribute value.

Save and Exit

Session File A session file (XML) contains information about the genome, locus, data files opened in IGV when the session was saved. It also contains detailed information about the tracks which were saved. Sessions can be easily restored in IGV at a later point.

Other IGV features Changing track settings igvtools- index, sort, count Sorting, filtering, grouping tracks Motif finder Specific search

Comparison between Trackster and IGV Feature Trackster (Circster) IGV Visualize datasets from Galaxy without downloading. Yes Visualize multiple datasets using circular layout No Running location Server Local Work sessions can be saved Yes* Visualize data from Galaxy without installing any software beforehand. Yes/No In-built tools to create index, manage tracks (sort,group,filter) and find motifs. Limited. Running location- Trackster runs on the server and uses the server resources.IGV – rins on your system. No sharing of resources. Work sessions- In Trackster, sometimes the order of tracks is not preserved. But, IGV sessions are stored as XML files on your system and all the settings are preserved. In-built tools: Trackster- Only changing track colours and setting histogram values possible. Also filtering.

Join us for a Galaxy Visualization Training Thu, 17 October – 13h (max 2 hours) Venue: V 207 Register by email : gbcs@embl.de

THANK YOU!