Transcriptomics Jiri Zavadil, PhD Molecular Mechanisms and Biomarkers

Slides:



Advertisements
Similar presentations
Functional Genomics with Next-Generation Sequencing
Advertisements

The Past, Present, and Future of DNA Sequencing
An Introduction to Studying Expression Data Through RNA-seq
Vanderbilt Center for Quantitative Sciences Summer Institute Sequencing Analysis Yan Guo.
Data integration across omics landscapes Bing Zhang, Ph.D. Department of Biomedical Informatics Vanderbilt University School of Medicine
Peter Tsai Bioinformatics Institute, University of Auckland
Microarray for DNA & RNA Mosa Alzowelei BME 11/12/2014.
MiRNA Platform Overview The Agilent miRNA Microarray System A New Microarray-based Tool for Profiling Human miRNAs.
The 454 and Ion PGM at the Genomics Core Facility Dr. Deborah Grove, Director for Genetic Analysis Genomics Core Facility Huck Institutes of the Life Sciences.
Canadian Bioinformatics Workshops
Bioinformatics Dr. Aladdin HamwiehKhalid Al-shamaa Abdulqader Jighly Lecture 1 Introduction Aleppo University Faculty of technical engineering.
Next-generation sequencing and PBRC. Next Generation Sequencer Applications DeNovo Sequencing Resequencing, Comparative Genomics Global SNP Analysis Gene.
Xiaole Shirley Liu STAT115, STAT215, BIO298, BIST520
Transcriptomics Jim Noonan GENE 760.
Epigenetics of Celiac Disease MEDICEL Malta 2011.
Introduction of Cancer Molecular Epidemiology Zuo-Feng Zhang, MD, PhD University of California Los Angeles.
Master in Advanced Genetics
STAT115 STAT215 BIO512 BIST298 Introduction to Computational Biology and Bioinformatics Spring 2015 Xiaole Shirley Liu Please Fill Out Student Sign In.
Office hours Wednesday 3-4pm 304A Stanley Hall Review session 5pm Thursday, Dec. 11 GPB100.
Genomics I: The Transcriptome RNA Expression Analysis Determining genomewide RNA expression levels.
High Throughput Sequencing
HL7 Clinical Sequencing Symposium Oncology Use Cases Ellen Beasley, Ph.D.September 14, 2011 VP, Ion Bioinformatics.
mRNA-Seq: methods and applications
Multi-dimensional Genomic Profiling of Acute Leukemias Characterized by MLL gene rearrangements Eunice S. Wang MD (Medicine) and Norma J. Nowak PhD (Cancer.
Department of Bioinformatics and Computational Biology
Paola CASTAGNOLI Maria FOTI Microarrays. Applicazioni nella genomica funzionale e nel genotyping DIPARTIMENTO DI BIOTECNOLOGIE E BIOSCIENZE.
with an emphasis on DNA microarrays
‘Omics’ - Analysis of high dimensional Data
A cell and its population of genes :. DNA forms double strands by a process called hybridization:
The virochip (UCSF) is a spotted microarray. Hybridization of a clinical RNA (cDNA) sample can identify specific viral expression.
Microarray Technology
The Center for Medical Genomics facilitates cutting-edge research with state-of-the-art genomic technologies for studying gene expression and genetics,
Next Generation Sequencing and its data analysis challenges Background Alignment and Assembly Applications Genome Epigenome Transcriptome.
Verna Vu & Timothy Abreo
Doug Brutlag 2011 Genomics, Bioinformatics & Medicine Doug Brutlag Professor Emeritus of.
Microarrays and Gene Expression Analysis. 2 Gene Expression Data Microarray experiments Applications Data analysis Gene Expression Databases.
Eukaryotic Genomes  The Organization and Control of Eukaryotic Genomes.
1 Global expression analysis Monday 10/1: Intro* 1 page Project Overview Due Intro to R lab Wednesday 10/3: Stats & FDR - * read the paper! Monday 10/8:
Control of Gene Expression Chapter Proteins interacting w/ DNA turn Prokaryotic genes on or off in response to environmental changes  Gene Regulation:
RNA-Seq Primer Understanding the RNA-Seq evidence tracks on the GEP UCSC Genome Browser Wilson Leung08/2014.
Introduction to RNAseq
Overview of Microarray. 2/71 Gene Expression Gene expression Production of mRNA is very much a reflection of the activity level of gene In the past, looking.
High-throughput omic datasets and clustering
ANALYSIS OF GENE EXPRESSION DATA. Gene expression data is a high-throughput data type (like DNA and protein sequences) that requires bioinformatic pattern.
Trends Biomedical In silico. “Omics” a variety of new technologies help explain both normal and abnormal cell pathways, networks, and processes simultaneous.
Microarrays and Other High-Throughput Methods BMI/CS 576 Colin Dewey Fall 2010.
TOX680 Unveiling the Transcriptome using RNA-seq Jinze Liu.
CBioPortal Web resource for exploring, visualizing, and analyzing multidimentional cancer genomics data.
DNA Microarray Overview and Application. Table of Contents Section One : Introduction Section Two : Microarray Technique Section Three : Types of DNA.
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS) LECTURE 13 ANALYSIS OF THE TRANSCRIPTOME.
Transcriptome What is it - genome wide transcript abundance How do you obtain it - Arrays + MPSS What do you do with it when you have it - ?
Gene expression  Introduction to gene expression arrays Microarray Data pre-processing  Introduction to RNA-seq Deep sequencing applications RNA-seq.
Genomics A Systematic Study of the Locations, Functions and Interactions of Many Genes at Once.
NCode TM miRNA Analysis Platform Identifies Differentially Expressed Novel miRNAs in Adenocarcinoma Using Clinical Human Samples Provided By BioServe.
INTERPRETING GENETIC MUTATIONAL DATA FOR CLINICAL ONCOLOGY Ben Ho Park, M.D., Ph.D. Associate Professor of Oncology Johns Hopkins University May 2014.
Microarray: An Introduction
Agenda  Epigenetics and microRNAs – Update –What’s epigenetics? –Preliminary results.
Different microarray applications Rita Holdhus Introduction to microarrays September 2010 microarray.no Aim of lecture: To get some basic knowledge about.
Canadian Bioinformatics Workshops
STAT115 STAT215 BIO512 BIST298 Introduction to Computational Biology and Bioinformatics Spring 2016 Xiaole Shirley Liu.
Canadian Bioinformatics Workshops
Canadian Bioinformatics Workshops
Molecular Biology of Cancer AND Cancer Informatics (omics) David Boone.
Arrays How do they work ? What are they ?. WT Dwarf Transgenic Other species Arrays are inverted Northerns: Extract target RNA YFG Label probe + hybridise.
RNA Quantitation from RNAseq Data
Genomics A Systematic Study of the Locations, Functions and Interactions of Many Genes at Once.
Gene expression.
Diverse abnormalities manifest in RNA
Peiyong Jiang, K.C. Allen Chan, Y.M. Dennis Lo
Pan-cancer genome and transcriptome analyses of 1,699 paediatric leukaemias and solid tumours By: Anh Pham.
Presentation transcript:

Transcriptomics Jiri Zavadil, PhD Molecular Mechanisms and Biomarkers International Agency for Research on Cancer, Lyon

Transcriptomics - Definitions Transcriptome - the complete set of RNA transcripts produced by the genome at a given time Transcriptome is highly dynamic and complex in comparison to the relatively stable genome Transcriptomics - the global study of gene expression at the RNA level - can include genes for ncRNAs (microRNAs etc)

Biospecimens in I4C Mother-Child and Infant-Child Cohorts Blood spots Cord blood Whole blood Genetic, epigenetic, transcriptomic analyses (nucleic acids) Proteomic analysis, serological and chemical analyses Urine Chemical, proteomic and nucleic acid analysis Tumor cells, tissues

Case for Integrated Omics Analyses The prospective biospecimen collection and retrospective case analysis will yield interconnected results Epigenetics  gene regulation  RNA and protein markers DNA methylation Histone modification Studied by transcriptomics

Transcriptomics – Applications for I4C Specific gene expression Genes and signatures determined by particular genetic, epigenetic regulatory factors, environmental exposures Exploratory approaches Not hypothesis driven, e.g global gene expression in tumors versus healthy tissues, differential responses to distinct environmental exposures Disease etiology and classification Patterns/signatures rather than single markers can improve knowledge about etiology and diagnosis

DNA Microarray Platforms Illumina BeadArray Affymetrix GeneChip Workflow Reverse transcription, IVT with labeled nucleotides, array hybridization, staining, washing scanning Pros/Cons Rapid and streamlined protocols, standardized analysis; biased target collection, levels but limited sequence information

MicroRNA - TaqMan Low Density Array Quantile Normalization Total RNA Sample ABI 7900 SDS HT miRNA TLDA Array 742 total target miRs Quantile Normalization Pros/Cons Quantitative abundance analysis; biased target collection

Integrated Molecular Profiling By MPS Cancer Genome Sequencing Massively Parallel Sequencing (MPS) - powerful nucleic acid analysis tool providing base-pair resolution information at the genome scale Stratton, MR. Science 331, 1553 (2011)

Massively Parallel Sequencing emPCR ABI SOLiD 5500 Accuracy < 99.99% Throughput/Day <10–15 Gb Throughput/Run <90 Gb or >1.4 B reads (paired-end or mate-paired runs) Samples/Run • 1 genome • 12 exomes • 6 transcriptomes

Massively Parallel Sequencing Bridge amplification, clonal expansion Illumina HiSeq2000/2500 6 human genomes at 30x 64 transcriptomes at 20M mapped reads/sample

mRNA Abundance Analysis RPKM (Reads Per Kilobase per Million mapped reads) FPKM (Fragments Per Kilobase per Million mapped reads) Methods of quantifying gene expression levels from RNA-seq data by normalizing for total read length and the number of sequencing reads or fragments (PE reads). Unnormalized data Scaling Normalization Quantile Normalization -4 -2 0 2 4 -4 -2 0 2 4 -4 -2 0 2 4 log2(RPKM) A1 A2 A3 A1 A2 A3 A1 A2 A3 Equivalent distribution Identical distribution (spread, range and median)

Differential mRNA Abundance Analysis ACSL5 – normalized differential abundance ratio = 8.4

Single Nucleotide Variant Analysis DNA RNA Non-syn SNV/mutation identified at both DNA/RNA levels

Acceptor Splice Sites Mutated in UUC mRNA Splicing Aberrations 5’ Exon N GU------A-----AG Exon N+1 3’ Tumor RNA Tumor DNA The answer is yes, it does although the numbers vary. It is important to control these analyses for DNA contents as we can see here. Normal DNA

Stage-Specific RNA Aberrations in ALL Mutant Allele Frequency 6 matched diagnosis/relapse pediatric ALL samples (n=12) RNA-seq to discover novel mutations specific to relapse disease Targeted amplicon resequencing at ultra-deep coverage Patient ID Mutation Mutant Allele Frequency (17,000-50,000x coverage)   Diagnosis Relapse 719515 p.R238W 0.01% 27% 737185 18% 761159 31% 756421 p.R367Q 0.02% 25% 716996 p.K404KD 55% 763368 p.S408R 50% 769886 p. S445F 728610 p. L626F 49% 726584 p. E274Q 19% p. S171I 43% p. M244L 0.38% 47% p. A53V 20%

Solutions for Low Yield Samples Microarray and RNA-seq Transcriptome Profiling Possible with >10 picograms total RNA Degraded samples, RIN scores >2.0 Formalin-fixed, paraffin-embedded (FFPE) samples Whole blood Direct cell lysate from the equivalent of a single or a few cells microRNA Profiling megaplex amplification protocols - 1-350 ng total RNA non- amplification based for 350 – 1000 ng total RNA

The Future of MPS-based OMICS The Economist, 2011 MPS cost goes down, technologies become more advanced and powerful, platforms develop rapidly – a strong case for transcriptomics within integrated omics approaches applied to large cohorts such as I4C.

Considerations for I4C Transcriptomics Low yield samples (blood spots, extracellular microRNAs) might require application of amplification methods Tissue and cell specificity of gene expression (e.g. cord blood vs leukemic clone) – need for carefully matched controls Only genes and RNAs expressed at the time of sampling are detected Depth of coverage needs for RNAseq affect cost-related decisions Specific disease progression stages might mask etiology-associated aberrations Bioinformatics – limited standards for complex data processing and analysis (RNAseq), more benchmarking studies needed using data from consortia-like efforts (FDA’s SEQC). Data storage and access solutions.

Thank you….