A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb. 2006 MPSS Massively Parallel.

Slides:



Advertisements
Similar presentations
RNA-seq library prep introduction
Advertisements

Walk-thru of CAGE exercise Also at /tag_analysis/ /tag_analysis/
Analysis of SAGE Data: An Introduction Kevin R. Coombes Section of Bioinformatics.
Transcriptomics Breakout. Topics Discussed Transcriptomics Applications and Challenges For Each Systems Biology Project –Host and Pathogen Bacteria Viruses.
Transcriptome Sequencing with Reference
Peter Tsai Bioinformatics Institute, University of Auckland
RNA-seq: the future of transcriptomics ……. ?
Canadian Bioinformatics Workshops
Darlene Goldstein A Comparison of Microarray Platforms NUS – IMS Workshop 7 January 2004.
Microarray technology and analysis of gene expression data Hillevi Lindroos.
Clustering short time series gene expression data Jason Ernst, Gerard J. Nau and Ziv Bar-Joseph BIOINFORMATICS, vol
Xiaole Shirley Liu STAT115, STAT215, BIO298, BIST520
Transcriptomics Jim Noonan GENE 760.
Gene Expression And Regulation Bioinformatics January 11, 2006 D. A. McClellan
Microarrays and Cancer Segal et al. CS 466 Saurabh Sinha.
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
mRNA-Seq: methods and applications
Gene Expression Analysis by SAGE and MPSS Amanda Sitterly.
STAT 572: Bootstrap Project Group Members: Cindy Bothwell Erik Barry Erhardt Nina Greenberg Casey Richardson Zachary Taylor.
Genome of the week - Deinococcus radiodurans Highly resistant to DNA damage –Most radiation resistant organism known Multiple genetic elements –2 chromosomes,
A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb Affymetrix GeneChips Oligonucleotide.
(2) Ratio statistics of gene expression levels and applications to microarray data analysis Bioinformatics, Vol. 18, no. 9, 2002 Yidong Chen, Vishnu Kamat,
Li and Dewey BMC Bioinformatics 2011, 12:323
Chapter 14 Genomes and Genomics. Sequencing DNA dideoxy (Sanger) method ddGTP ddATP ddTTP ddCTP 5’TAATGTACG TAATGTAC TAATGTA TAATGT TAATG TAAT TAA TA.
Transcriptome analysis With a reference – Challenging due to size and complexity of datasets – Many tools available, driven by biomedical research – GATK.
Gene expression and DNA microarrays Old methods. New methods based on genome sequence. –DNA Microarrays Reading assignment - handout –Chapter ,
A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb Sensitivity A Simple Method.
RNAseq analyses -- methods
ModENCODE August 20-21, 2007 Drosophila Transcriptome: Aim 2.2.
Verna Vu & Timothy Abreo
Tag profiling is dead... October 2009 Claudia Voelckel Patrick Biggs...long live mRNA-Seq!
A Quantitative Overview to Gene Expression Profiling in Animal Genetics Antonio (Toni) Reverter Bioinformatics Group CSIRO Livestock Industries Queensland.
Copy Number Variation Eleanor Feingold University of Pittsburgh March 2012.
Genomics.
1 Global expression analysis Monday 10/1: Intro* 1 page Project Overview Due Intro to R lab Wednesday 10/3: Stats & FDR - * read the paper! Monday 10/8:
RNA-Seq Primer Understanding the RNA-Seq evidence tracks on the GEP UCSC Genome Browser Wilson Leung08/2014.
A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb Analysis of (cDNA) Microarray.
A Simple Method for Computationally Inferring Microarray Sensitivity Toni Reverter and Brian Dalrymple Bioinformatics Group CSIRO Livestock Industries.
A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb Co-Expression Reverter et.
While gene expression data is widely available describing mRNA levels in different cancer cells lines, the molecular regulatory mechanisms responsible.
Introduction to RNAseq
A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb GP3xCLI GenePix Post-Processing.
Cluster validation Integration ICES Bioinformatics.
Proteome and Gene Expression Analysis Chapter 15 & 16.
ANALYSIS OF GENE EXPRESSION DATA. Gene expression data is a high-throughput data type (like DNA and protein sequences) that requires bioinformatic pattern.
A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb Analysis of (cDNA) Microarray.
No reference available
Lecture 12 RNA – seq analysis.
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS) LECTURE 13 ANALYSIS OF THE TRANSCRIPTOME.
A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb Analysis of (cDNA) Microarray.
A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb Final Remarks Genetical.
Lecture 26 GWAS Based on chapter 9 Functional and Comparative Genomics Copyright © 2010 Pearson Education Inc.
A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb Analysis of (cDNA) Microarray.
Intelligent DNA Chips: Logical Operation of Gene Expression Profiles on DNA Computers Ji Yoon Park Dept. of Biochem Hanyang University Yasubumi Sakakibara,
Biotechnology and Bioinformatics: Bioinformatics Essential Idea: Bioinformatics is the use of computers to analyze sequence data in biological research.
Canadian Bioinformatics Workshops
RNA Quantitation from RNAseq Data
The Transcriptional Landscape of the Mammalian Genome
Dr. Christoph W. Sensen und Dr. Jung Soh Trieste Course 2017
Gene expression from RNA-Seq
Kallisto: near-optimal RNA seq quantification tool
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
A novel counting algorithm to detect common fetal trisomies
EXTENDING GENE ANNOTATION WITH GENE EXPRESSION
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Sequence Analysis - RNA-Seq 2
KEY CONCEPT Entire genomes are sequenced, studied, and compared.
Presentation transcript:

A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb MPSS Massively Parallel Signature Sequencing Lynx Therapeutics, Inc, Hayward, CA

A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb MPSS  Invented by Sydney Brenner  Alternative to microarray: Counts all mRNAs in a sample  Designed to capture the complete transcriptome  High sensitivity to detect low abundance transcripts -- typical analysis involves about 1 million transcripts  Digital data that is amenable to developing large relational databases  Excellent dynamic range in excess of 100X up or down regulation  Can be applied to any organism Introduction

A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb MPSS Process Overview

A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb MPSS  MPSS detects virtually all mRNAs in a sample, while microarrays are limited to gene elements on the array  MPSS has greater sensitivity for routine detection of low level expressed transcripts; microarray sensitivity influenced by many factors that can be difficult to rigorously control  “Digital” data output of MPSS makes it possible to readily import data into complex relational databases; microarray data provides a ratio between an experimental and control fluorescence that is difficult to convert into values for quantitative expression levels  MPSS can be used to conduct quantitative and in-depth expression analysis on any organism, including those with a genome that has not been sequenced or studied in great detail  Microarrays have the advantage of being a high-throughput technology for analyzing large numbers of samples Comparison to Microarray

A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb MPSS Comparison to SAGE Serial Analysis of Gene Expression  Signature sequence of SAGE is 14 nucleotides compared with 17 nucleotides with MPSS:  Less ambiguity with MPSS when mapping to the mammalian genome  Easier to connect MPSS tags with known genes  Typical SAGE data set is 20,000-60,000 tags compared to over a million signatures sequences for MPSS  Lynx cloning and MPSS sequencing done on a miniaturized platform that is amenable to high-through  SAGE conducted with standard cloning and sequencing that are expensive, time consuming and labor intensive  Larger MPSS data sets provide enhanced depth of analysis

A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb MPSS My First Impressions Technology $$$ cDNA 100 Affymetrix 1,000 MPSS20,000 NB: Conservative Figures

A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb MPSS Mycroarrayvs MPSS N Genes ~10,000 ~25,000 % DE Genes 2 – – 25 N DE Genes 200 – 1,000 3,750 – 6,250 Biochemist  My First Impressions NB: Conservative Figures

A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb MPSS Sample 1Sample2p_value SIGNATURETPMStdevTPMStdev GATCAAATTCATCTCTA GATCAAATTGACCGCTT GATCAAATTGGTGGGGG GATCAAATTGTACTAGT GATCAAATTGTGCAGTA GATCCCGGTGTGAGGTA GATCTGCCGGTGAGGTA My First Impressions Distribution Sensitivity Analysis

A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb MPSS MPSS Paper PNAS 03, 100:4702 tpmN Tags % > 1(0.0)27, (0.7)15, (1.0)10, (1.7) 3, (2.0) 1, (2.7) ,000(3.0) ,000(3.7) ,000(4.0) MPSS Test Data No Tags = 25,503 S 1 S cDNA Noise Paper PNAS 02, 99: Empirical Distribution of Tags My First Impressions

A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb MPSS My First Impressions Empirical Distribution of Tags

A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb MPSS Sensitivity Adapted from Reverter et al., tpm tpm My First Impressions

A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb MPSS Annotation of Signatures No Tags: 25,503 Informatives Tags Genes hit by 1 Tag:7,492 Genes hit by 2 Tags:2,232 Genes hit by >= 3 Tags:1,775 Non-Informatives Hitting a Chromosome:1,470 “REPEAT” (>100 hits):1,468 MultiGenome:1,488 No Hits at all:1,439 23% P Values for genes >= 20 tags  2 w/  2 b = 0.92 Annotation Analysis

A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb MPSS Statistical Analysis Categorical Data Normal approximation for Binomial proportions

A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb MPSS Categorical Data Normal approximation for Binomial proportions a la SAGE data: Man et al., 2000 Vencio et al., 2003 Sample 1Sample 2 Gene 1 n11 n12N1. Others n21 n22N2. N.1 N.2N.. No Hypothesis testing Fisher’s exact (?) test: Audic & Claverie’s test: Statistical Analysis

A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb MPSS Differential Gene Expression from MPSS data based on Bootstrap Percentile Confidence Intervals Statistical Analysis ……an alternative Presented at the International Conference on Bioinformatics 2004 – Auckland, NZ

A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb MPSS MPSS Test Data tan Issues: 1.Equivalence with M-A plots 2.Geometry

A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb MPSS MPSS Test Data

A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb MPSS Binomial 5,137 DE Genes MPSS Test Data

A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb MPSS Algorithm for Bootstrap 1. Read transcripts for the i-th signature: 2. Sort MA i by a i (x-axis) 3. Define b Bins:(Same width or Same size) 4. Define r BR (Bootstrap Replicates), …enough for CI (eg. r = 200) Define  (Significance) 5. For each B j collect 5.1. Compute: 5.2. Identify: 6. Stop

A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb MPSS Bins of equal width

A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb MPSS Bins of equal size Merits: 1.Accuracy stabilisation 2.Variance stabilisation

A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb MPSS MPSS Test Data Bootstrap CI 497 DE Genes

A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb MPSS MPSS Test Data

A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb MPSS  Compared to microarray, the analysis of MPSS should be trivial  Standard parametric (binomial) methods likely to generate a large number of differentially expressed elements.  Trade-off: Biological vs Statistically significant  The proposed method possesses a number of advantages:  Very easy to implement  Very fast to generate  Operates on total transcripts as opposed to proportions  Accommodates the inherent heteroskedasticity  More research ($) is needed to assess:  The impact of MPSS in expression studies  The (possible) annotation gap (non-sequenced species) Conclusions

A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb MPSS References Jongeneel, C.V., C. Iseli, B.J. Stevenson, et al. (2003) Comprehensive sampling of gene expression in human cell lines with massively parallel signature seequencing. PNAS, USA, 100: Brenner, S., M. Johnson, J. Bridgham, et al. (2000) Gene expression analysis by massively parallel signature sequencing (MPSS) on microbead arrays. Nature Biotechnology 18: Man M.Z., X. Wang, and Y. Wang (2000) POWER_SAGE: comparing statistical tests for SAGE experiments. Bioinformatics, 16: Tu. Y., G. Stolovitzky, and U. Klein (2002) Quantitative noise analysis for gene expression microarray experiments. PNAS, USA, 99: Vencio, R.Z.N., H. Brentani, and C.A.B. Pereira (2003) Using credibility intervals instead of hypothesis tests in SAGE analysis. Reverter, A., S. McWilliam, W. Barris, and B. Dalrymple (2004) A rapid method for computationally inferring transcriptome coverage and microarray sensitivity. Bioinformatics (in press).