NGS Transcriptomic Workflows Hugh Shanahan & Jamie al-Nasir Royal Holloway, University of London.

Slides:



Advertisements
Similar presentations
An Introduction to Studying Expression Data Through RNA-seq
Advertisements

Walk-thru of CAGE exercise Also at /tag_analysis/ /tag_analysis/
Transcriptome Sequencing with Reference
Genome organization Lesk, Ch 2 (Lesk, 2008). Genomes and proteomes Genome of a typical bacterium comes as a single DNA molecule of about 5 million characters.
Peter Tsai Bioinformatics Institute, University of Auckland
Xiaole Shirley Liu STAT115, STAT215, BIO298, BIST520
Transcriptomics Jim Noonan GENE 760.
Alignment of mRNAs to genomic DNA Sequence Martin Berglund Khanh Huy Bui Md. Asaduzzaman Jean-Luc Leblond.
Class 02: Whole genome sequencing. The seminal papers ``Is Whole Genome Sequencing Feasible?'' ``Whole-Genome DNA.
mRNA-Seq: methods and applications
TRNA. Transfer RNA (tRNA) is a small molecule, existing as a single- strand that is folded into a clover-leaf shape.
Beyond Laboratory Notebooks: Next Era Biological Data Hurdles for Information Storage, Access and Distribution Richard Slayden, PhD. Microbiology, Immunology.
Before we start: Align sequence reads to the reference genome
Next generation sequencing Xusheng Wang 4/29/2010.
1 of 34 Ensembl use of RNASeq Steve Searle. 2 of 34 Ways we use RNASeq data in Ensembl: Build complete gene set from scratch for individual or pooled.
Whole Exome Sequencing for Variant Discovery and Prioritisation
20.1 – 1 Look at the illustration of “Cloning a Human Gene in a Bacterial Plasmid” (Figure 20.4 in the orange book). If the medium used for plating cells.
DNA Fingerprinting of Bacterial Communities. Overview Targets gene for ribosomal RNA (16S rDNA) Make many DNA copies of the gene for the entire community.
20.1 – 1 Look at the illustration of “Cloning a Human Gene in a Bacterial Plasmid” (Figure 20.4 in the orange book). If the medium used for plating cells.
Todd J. Treangen, Steven L. Salzberg
Protein Synthesis Chapter 13. Protein Synthesis  How does your DNA eventually lead to your different phenotypes (hair color, eye color, etc)
Next Generation Sequencing and its data analysis challenges Background Alignment and Assembly Applications Genome Epigenome Transcriptome.
Next Generation DNA Sequencing
Practically Genomic A hands-on bioinformatics IAP Course Materials: Instructors: Paola Favaretto, Sebastian Hoersch,
Introductory RNA-seq Transcriptome Profiling. Before we start: Align sequence reads to the reference genome The most time-consuming part of the analysis.
1 The Interrupted Gene. Ex Biochem c3-interrupted gene Introduction Figure 3.1.
RNA-Seq Assembly 转录组拼接 唐海宝 基因组与生物技术研究中心 2013 年 11 月 23 日.
De Novo Genome Assembly - Introduction Henrik Lantz - BILS/SciLife/Uppsala University.
RNA Sequence Assembly WEI Xueliang. Overview Sequence Assembly Current Method My Method RNA Assembly To Do.
Introductory RNA-seq Transcriptome Profiling. Before we start: Align sequence reads to the reference genome The most time-consuming part of the analysis.
Transcriptomics Sequencing. over view The transcriptome is the set of all RNA molecules, including mRNA, rRNA, tRNA, and other non coding RNA produced.
GENE SEQUENCING. INTRODUCTION CELL The cells contain the nucleus. The chromosomes are present within the nucleus.
The Tree of Life How do we select a gene sequence for comparison?
Introduction to RNAseq
DNA LIBRARIES Dr. E. What Are DNA Libraries? A DNA library is a collection of DNA fragments that have been cloned into a plasmid and the plasmid is transformed.
The iPlant Collaborative
De Novo Genome Assembly - Introduction
Chap 18 The Genetics of Viruses and Bacteria. Structure of Virus Approximately 20 nm in diameter Their genome can contain DNA or RNA. Enclosed by a.
Manuel Holtgrewe Algorithmic Bioinformatics, Department of Mathematics and Computer Science PMSB Project: RNA-Seq Read Simulation.
Ke Lin 23 rd Feb, 2012 Structural Variation Detection Using NGS technology.
-1- Module 3: RNA-Seq Module 3 BAMView Introduction Recently, the use of new sequencing technologies (pyrosequencing, Illumina-Solexa) have produced large.
Transcriptome What is it - genome wide transcript abundance How do you obtain it - Arrays + MPSS What do you do with it when you have it - ?
RNA Sequencing and transcriptome reconstruction Manfred G. Grabherr.
Canadian Bioinformatics Workshops
Introductory RNA-seq Transcriptome Profiling of the hy5 mutation in Arabidopsis thaliana.
Canadian Bioinformatics Workshops
Introductory RNA-seq Transcriptome Profiling
Lesson: Sequence processing
Cancer Genomics Core Lab
Dr. Christoph W. Sensen und Dr. Jung Soh Trieste Course 2017
Canadian Bioinformatics Workshops
Canadian Bioinformatics Workshops
Introduction to NGS.
Chapter 4 “DNA Finger Printing”
Transcriptome Assembly
Henrik Lantz - NBIS/SciLife/Uppsala University
Do You Want to Build a Transcriptome?
CHAPTER 12 DNA Technology and the Human Genome
Get out a scratch piece of paper.
Genome organization and Bioinformatics
Protein Synthesis Lecture 5
RNA sequencing (RNA-Seq) and its application in ovarian cancer
COMPOUNDS OF LIVING THINGS
The Structure of the Genome
COMPOUNDS OF LIVING THINGS
COMPOUNDS OF LIVING THINGS
COMPOUNDS OF LIVING THINGS
COMPOUNDS OF LIVING THINGS
Sequence Analysis - RNA-Seq 2
Schematic representation of a transcriptomic evaluation approach.
Presentation transcript:

NGS Transcriptomic Workflows Hugh Shanahan & Jamie al-Nasir Royal Holloway, University of London

 Transcriptome – total sequence and abundance of RNA generated by a cell  RNA is transcribed from DNA  Genome is fixed for a organism  Transcriptome is dynamic  Variation between tissues  Variation over time  RNA transcripts are 1,000’s-10,000 bases in length Setting the scene

 Interested in  How many copies of a particular transcript are there  What is the sequence  - sequence comes from genome but alternative splicing means a transcript may not just be a contiguous block of DNA

Size of transcriptome will vary between species

 Fragment transcripts into shorter pieces (reads)  bases longs  Have many overlapping reads  Amplify (make lots of copies of) the short reads  Can sequence these short reads and then assemble them to reconstruct transcripts.  Size of data set depends on size of transcriptome but also amount of fragmentation (sequencing depth)  Can either assemble with a reference genome or de novo (very hard) Sequencing steps

NGS Workflow

 File formats have been updated to binary – used to use flat text so sizes were huge (Reference Genome – 39 Gbyte -> 0.8 Gybte)  Raw image data is actually discarded  Discussions focusses on assembly and down-stream analysis  Much of this data is deposited in the Sequence Read Archive (SRA)  We’ve papered over everything that happens before sequencing – i.e. the biochemical steps carried out  This is highly variable  These steps are not properly annotated Final points