RNA-seq workshop COUNTING & HTSEQ Erin Osborne Nishimura.

Slides:



Advertisements
Similar presentations
Functional Genomics with Next-Generation Sequencing
Advertisements

IMGS 2012 Bioinformatics Workshop: RNA Seq using Galaxy
12/04/2017 RNA seq (I) Edouard Severing.
Exploring the Human Transcriptome
Simon v2.3 RNA-Seq Analysis Simon v2.3.
Peter Tsai Bioinformatics Institute, University of Auckland
DEG Mi-kyoung Seo.
RNA-seq: the future of transcriptomics ……. ?
RNAseq analysis Bioinformatics Analysis Team
RNA-seq data analysis Project
Xiaole Shirley Liu STAT115, STAT215, BIO298, BIST520
Transcriptomics Jim Noonan GENE 760.
MCB Lecture #21 Nov 20/14 Prokaryote RNAseq.
RNA-seq Analysis in Galaxy
RNA-Seq Visualization
Introduction to RNA-Seq and Transcriptome Analysis
Li and Dewey BMC Bioinformatics 2011, 12:323
Expression Analysis of RNA-seq Data
Bioinformatics and OMICs Group Meeting REFERENCE GUIDED RNA SEQUENCING.
Transcriptome analysis With a reference – Challenging due to size and complexity of datasets – Many tools available, driven by biomedical research – GATK.
Introduction to DESeq and edgeR packages Peter A.C. ’t Hoen.
RNAseq analyses -- methods
Introduction to RNA-Seq & Transcriptome Analysis
Next Generation DNA Sequencing
RNA-Seq Analysis Simon V4.1.
Transcriptome Analysis
Next Generation Sequencing. Overview of RNA-seq experimental procedures. Wang L et al. Briefings in Functional Genomics 2010;9: © The Author.
RNA-seq workshop ALIGNMENT
Eran Yanowski, Eran Hornstein’s: Monitor drug impact on the transcriptome of mouse beta cells (primary and cell-line) using Transeq/RNA-Seq Report.
Introduction to RNA-Seq
Erin Osborne Nishimura onish.web.unc.edu University of North Carolina at Chapel Hill.
RNA surveillance and degradation: the Yin Yang of RNA RNA Pol II AAAAAAAAAAA AAA production destruction RNA Ribosome.
1 Global expression analysis Monday 10/1: Intro* 1 page Project Overview Due Intro to R lab Wednesday 10/3: Stats & FDR - * read the paper! Monday 10/8:
Introduction to RNAseq
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop RNA-Seq visualization with cummeRbund.
RNA-Seq data analysis Xuhua Xia University of Ottawa
Analyzing digital gene expression data in Galaxy Supervisors: Peter-Bram A.C. ’t Hoen Kostas Karasavvas Students: Ilya Kurochkin Ivan Rusinov.
The iPlant Collaborative
The iPlant Collaborative
No reference available
Lecture 12 RNA – seq analysis.
RNA-Seq visualization with CummeRbund
Canadian Bioinformatics Workshops
Canadian Bioinformatics Workshops
Bioinformatics for biologists (2) Dr. Habil Zare, PhD PI of Oncinfo Lab Assistant Professor, Department of Computer Science Texas State University Presented.
Canadian Bioinformatics Workshops
RNA-Seq with the Tuxedo Suite Monica Britton, Ph.D. Sr. Bioinformatics Analyst September 2015 Workshop.
RNA-Seq Xiaole Shirley Liu STAT115, STAT215, BIO298, BIST520
Statistics Behind Differential Gene Expression
Simon v RNA-Seq Analysis Simon v
RNA Quantitation from RNAseq Data
An Introduction to RNA-Seq Data and Differential Expression Tools in R
Placental Bioinformatics
Gene expression from RNA-Seq
RNA-Seq analysis in R (Bioconductor)
S1 Supporting information Bioinformatic workflow and quality of the metrics Number of slides: 10.
Outline Overview of RNA-Seq Quality control and read trimming Mapping RNA-Seq reads Transcriptome assembly Additional training resources on RNA-Seq.
Gene expression estimation from RNA-Seq data
Learning to count: quantifying signal
Assessing changes in data – Part 2, Differential Expression with DESeq2
Comparison of observed and predicted coverage patterns.
NMDS clustering of sample types and differential expression analysis.
Additional file 2: RNA-Seq data analysis pipeline
Quantitative analyses using RNA-seq data
Sequence Analysis - RNA-Seq 2
Expression of metabolic gene families and ABC transporters in the midgut. Expression of metabolic gene families and ABC transporters in the midgut. Expression.
Fig. 1 A time-course RNA-seq analysis reveals prominent IRF8 expression in the spinal cord during the recovery phase after SCI. A time-course RNA-seq analysis.

RNA-Seq Data Analysis UND Genomics Core.
Presentation transcript:

RNA-seq workshop COUNTING & HTSEQ Erin Osborne Nishimura

_trim.fastq file.bam/.sam file.bw file counts.txt file TOPHAT2 bedGraphToBigWig bedtools genomecov.bg file HTseq DESeq2/R Differentially Abundant genes IGV/UCSC Pretty browser shots Today’s simple analysis pipeline.fastq file trimmomatic/bbduk.sh

Quantification with htseq

The problem

Counting reads What will we count? Genes? Exons? Isoforms? What are some of the issues we need to account for when counting reads? Paralogs? Overlap? Isoforms? Errors? How to count? Raw counts RPKM -- Reads aligned kilobase per million mapped reads FPKM -- Fragments per kilobase per million mapped reads

htseq-count Manual: – huber.embl.de/users/anders/HTSeq/doc/count.htmlhttp://www- huber.embl.de/users/anders/HTSeq/doc/count.html Paper –

The problem

The three htseq-count modes

Switch to hands on tutorial p/blob/master/02_RNAseq_count.mdhttps://github.com/erinosb/HTSF_worksho p/blob/master/02_RNAseq_count.md

Assessing differential abundance

Assessing pairwise differential abundance, relatively simple Anders and Huber, 2010

Identifying genes with shared patterns across multiple samples, complex

For today… Anders and Huber, 2010

Many publications report performance comparisons of the of different packages Seyednasrollah et al., 2013 – 6/1/59.full.pdf+htmlhttp://bib.oxfordjournals.org/content/1 6/1/59.full.pdf+html Soneson et al., /14/91http:// 2105/14/91 Rapaport et al., 2013 – 14/9/r95http:// 14/9/r95

Why is this hard? Why is this different from other types of data? Your question The data –Discretness –Small numbers of replicates –Large dynamic range –Outliers –Data is overdispersed Variance does not scale linearly with mean Breaks the assumptions of some inference tests Anders and Huber, 2010

Why DESeq? Original paper DESeq2 paper Bioconductor mlhttp://bioconductor.org/packages/release/bioc/html/DESeq2.ht ml Vignette DESeq2/inst/doc/DESeq2.pdfhttps:// DESeq2/inst/doc/DESeq2.pdf

A final word about the fate of your data You will need to submit your raw and processed files in a repository PRIOR to submitting your paper for publication. Keep track of what you did! –Module Versions –Conversion & transformation steps –Settings/Options

Switch to hands-on tutorial p/blob/master/02_RNAseq_count.mdhttps://github.com/erinosb/HTSF_worksho p/blob/master/02_RNAseq_count.md

Key Quality Control Metrics 20 Gene coverage –CEAS Over-amplification –FASTQC Complexity –TOPHAT output Reproducibilitybility