RNA-seq workshop ALIGNMENT

Slides:



Advertisements
Similar presentations
IMGS 2012 Bioinformatics Workshop: RNA Seq using Galaxy
Advertisements

Fast and accurate short read alignment with Burrows–Wheeler transform
Transcriptome Sequencing with Reference
Peter Tsai Bioinformatics Institute, University of Auckland
RNA-seq data analysis Project
TOPHAT Next-Generation Sequencing Workshop RNA-Seq Mapping
Xiaole Shirley Liu STAT115, STAT215, BIO298, BIST520
Transcriptomics Jim Noonan GENE 760.
MCB Lecture #21 Nov 20/14 Prokaryote RNAseq.
Lab 3.41 Demo: Exploiting the UCSC Genome Browser Stefanie Butland UBC Bioinformatics Centre
RNA-seq Analysis in Galaxy
mRNA-Seq: methods and applications
Bacterial Genome Assembly | Victor Jongeneel Radhika S. Khetani
Before we start: Align sequence reads to the reference genome
NGS Analysis Using Galaxy
Next generation sequencing Xusheng Wang 4/29/2010.
Introduction to RNA-Seq and Transcriptome Analysis
Li and Dewey BMC Bioinformatics 2011, 12:323
Expression Analysis of RNA-seq Data
Genomics Virtual Lab: analyze your data with a mouse click Igor Makunin School of Agriculture and Food Sciences, UQ, April 8, 2015.
Transcriptome analysis With a reference – Challenging due to size and complexity of datasets – Many tools available, driven by biomedical research – GATK.
Introduction to RNA-Seq & Transcriptome Analysis
Next Generation DNA Sequencing
TopHat Mi-kyoung Seo. Today’s paper..TopHat Cole Trapnell at the University of Washington's Department of Genome Sciences Steven Salzberg Center.
Transcriptome Analysis
Next Generation Sequencing. Overview of RNA-seq experimental procedures. Wang L et al. Briefings in Functional Genomics 2010;9: © The Author.
RNA-Seq in Galaxy Igor Makunin QAAFI, Internal Workshop, April 17, 2015.
Introductory RNA-seq Transcriptome Profiling. Before we start: Align sequence reads to the reference genome The most time-consuming part of the analysis.
The iPlant Collaborative
RNA-seq workshop COUNTING & HTSEQ Erin Osborne Nishimura.
Introductory RNA-seq Transcriptome Profiling. Before we start: Align sequence reads to the reference genome The most time-consuming part of the analysis.
RNA-Seq Primer Understanding the RNA-Seq evidence tracks on the GEP UCSC Genome Browser Wilson Leung08/2014.
RNA-Seq Transcriptome Profiling. Before we start: Align sequence reads to the reference genome The most time-consuming part of the analysis is doing the.
Introduction to RNAseq
Bioinformatics for biologists Dr. Habil Zare, PhD PI of Oncinfo Lab Assistant Professor, Department of Computer Science Texas State University Presented.
The iPlant Collaborative
RNA-seq: Quantifying the Transcriptome
The iPlant Collaborative
No reference available
RNA-Seq in Galaxy Igor Makunin DI/TRI, March 9, 2015.
Case study: Saccharomyces cerevisiae grown under two different conditions RNAseq data plataform: Illumina Goal: Generate a platform where the user will.
Canadian Bioinformatics Workshops
Canadian Bioinformatics Workshops
Canadian Bioinformatics Workshops
Canadian Bioinformatics Workshops
Reliable Identification of Genomic Variants from RNA-seq Data Robert Piskol, Gokul Ramaswami, Jin Billy Li PRESENTED BY GAYATHRI RAJAN VINEELA GANGALAPUDI.
Canadian Bioinformatics Workshops
RNA Seq Analysis Aaron Odell June 17 th Mapping Strategy A few questions you’ll want to ask about your data… - What organism is the data from? -
Introductory RNA-seq Transcriptome Profiling of the hy5 mutation in Arabidopsis thaliana.
Practice:submit the ChIP_Streamline.pbs 1.Replace with your 2.Make sure the.fastq files are in your GMS6014 directory.
Canadian Bioinformatics Workshops
Canadian Bioinformatics Workshops
Canadian Bioinformatics Workshops
RNA-Seq Primer Understanding the RNA-Seq evidence tracks on
Short Read Sequencing Analysis Workshop
Dr. Christoph W. Sensen und Dr. Jung Soh Trieste Course 2017
Short Read Sequencing Analysis Workshop
RNA-Seq analysis in R (Bioconductor)
S1 Supporting information Bioinformatic workflow and quality of the metrics Number of slides: 10.
High-Throughput Analysis of Genomic Data [S7] ENRIQUE BLANCO
Canadian Bioinformatics Workshops
Canadian Bioinformatics Workshops
From: TopHat: discovering splice junctions with RNA-Seq
Maximize read usage through mapping strategies
Sequence Analysis - RNA-Seq 2
Introduction to RNA-Seq & Transcriptome Analysis
Schematic representation of a transcriptomic evaluation approach.

RNA-Seq Data Analysis UND Genomics Core.
Presentation transcript:

RNA-seq workshop ALIGNMENT Erin Osborne Nishimura http:onish.web.unc/HTSF-workshop-2015

Alignment with Tophat2

There are many alignment tools available Nuno A. Fonseca et al. Bioinformatics 2012;28:3169-3177 © The Author 2012. Published by Oxford University Press.

Which aligner is best? What type of data do you have? What is your research question?

Methods of alignment Splice awareness: What will be matched first? Splice unaware (Bowtie, BWA) Faster Splice aware (Tophat, MapSplice, SpliceMap) Slower Yields more information on splice junctions What will be matched first? Whole genome? Known transcriptome? A short segment of each read first?

Why tophat? Popular Splice aware de novo or sequenced genome modes Transcriptome or whole genome assembly Lots of options for customization Drawbacks Lots of parameters to set & optimize

Tophat2 – how does it work? Kim et al., 2013 Genome Biology http://www.genomebiology.com/2013/14/4/r36

Tophat2 versus Tophat1

The good news… … choice of aligner does not have a major impact on genes identified as differentially expressed, compared to other choices. Fonseca, 2014

Switch to Tophat2 Tutorial https://github.com/erinosb/HTSF_workshop/blob/master/01_RNAseq_alignment.md

Generating genome browser tracks

Ah, those beautiful browser tracks… Brooks and Yang et al., 2011, Genome Research

Today’s simple analysis pipeline .fastq file trimmomatic/bbduk.sh _trim.fastq file TOPHAT2 .bam/.sam file HTseq bedtools genomecov counts.txt file .bg file bedGraphToBigWig DESeq2/R .bw file Differentially Abundant genes IGV/UCSC Pretty browser shots

I have included an example script for Script05_makeBrowserTracks.sh Requires bedGraphToBigWig Requires bedtools Performs Normalization Normalize to read depth One option Scale = (#bps in genome) (#bp per read) x (# mapped reads)

Two most common platforms IGV https://www.broadinstitute.org/igv/ Locally installed UCSC Genome Browser http://genome.ucsc.edu/ Upload required

Visual inspection of each normalized replicate is critical… http://genome.ucsc.edu/cgi-bin/hgTracks?hgS_doOtherUser=submit&hgS_otherUserName=Erin%20Osborne&hgS_otherUserSessionName=hg19_female_v_male_demo http://bit.ly/1SJrQcq