Image Quantitation in Microarray Analysis More tomorrow...

Slides:



Advertisements
Similar presentations
MicroArray Image Analysis Robin Liechti
Advertisements

Pre-processing in DNA microarray experiments Sandrine Dudoit PH 296, Section 33 13/09/2001.
MicroArray Image Analysis
MicroArray Image Analysis Robin Liechti
Introduction to Microarray
Microarray Normalization
Filtering and Normalization of Microarray Gene Expression Data Waclaw Kusnierczyk Norwegian University of Science and Technology Trondheim, Norway.
Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical.
Mathematical Statistics, Centre for Mathematical Sciences
Microarray technology and analysis of gene expression data Hillevi Lindroos.
Image Quantitation in Microarray Analysis More tomorrow...
Normalization of Microarray Data - how to do it! Henrik Bengtsson Terry Speed
Microarray Data Analysis Stuart M. Brown NYU School of Medicine.
TIGR Spotfinder: a tool for microarray image processing
Sandrine Dudoit1 Microarray Experimental Design and Analysis Sandrine Dudoit jointly with Yee Hwa Yang Division of Biostatistics, UC Berkeley
Normalization for cDNA Microarray Data Yee Hwa Yang, Sandrine Dudoit, Percy Luu and Terry Speed. SPIE BIOS 2001, San Jose, CA January 22, 2001.
DNA Microarray Bioinformatics - #27612 Normalization and Statistical Analysis.
Normalization Class web site: Statistics for Microarrays.
Low-Level Analysis and QC Regional Biases Mark Reimers, NCI.
Figure 1: (A) A microarray may contain thousands of ‘spots’. Each spot contains many copies of the same DNA sequence that uniquely represents a gene from.
Gene Expression Data Analyses (2)
Low Level Statistics and Quality Control Javier Cabrera.
Sample preparation 1. Design experiment Question? Replicates? Test? 2. Perform experiment 4. Label RNA Amplification? Direct or indirect? Label? wild.
Normalization of 2 color arrays Alex Sánchez. Dept. Estadística Universitat de Barcelona.
Microarray Technology Types Normalization Microarray Technology Microarray: –New Technology (first paper: 1995) Allows study of thousands of genes at.
Image Analysis Class web site: Statistics for Microarrays.
Microarray Analysis Jesse Mecham CS 601R. Microarray Analysis It all comes down to Experimental Design Experimental Design Preprocessing Preprocessing.
Introduce to Microarray
Scanning and image analysis Scanning -Dyes -Confocal scanner -CCD scanner Image File Formats Image analysis -Locating the spots -Segmentation -Evaluating.
Analysis of microarray data
B IOINFORMATICS Dr. Aladdin HamwiehKhalid Al-shamaa Abdulqader Jighly Lecture 8 Analyzing Microarray Data Aleppo University Faculty of technical.
Filtering and Normalization of Microarray Gene Expression Data Waclaw Kusnierczyk Norwegian University of Science and Technology Trondheim, Norway.
1 Normalization Methods for Two-Color Microarray Data 1/13/2009 Copyright © 2009 Dan Nettleton.
CDNA Microarrays Neil Lawrence. Schedule Today: Introduction and Background 18 th AprilIntroduction and Background 25 th AprilcDNA Mircoarrays 2 nd MayNo.
Introduction to microarray technology and analysis
Affymetrix vs. glass slide based arrays
The following slides have been adapted from to be presented at the Follow-up course on Microarray Data Analysis.
Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research.
Introduction to DNA Microarray Technology Steen Knudsen Uma Chandran.
CDNA Microarrays MB206.
Panu Somervuo, March 19, cDNA microarrays.
Applying statistical tests to microarray data. Introduction to filtering Recall- Filtering is the process of deciding which genes in a microarray experiment.
WORKSHOP SPOTTED 2-channel ARRAYS DATA PROCESSING AND QUALITY CONTROL Eugenia Migliavacca and Mauro Delorenzi, ISREC, December 11, 2003.
Probe-Level Data Normalisation: RMA and GC-RMA Sam Robson Images courtesy of Neil Ward, European Application Engineer, Agilent Technologies.
Analysis of Microarray Data Analysis of images Preprocessing of gene expression data Normalization of data –Subtraction of Background Noise –Global/local.
Microarray - Leukemia vs. normal GeneChip System.
Scenario 6 Distinguishing different types of leukemia to target treatment.
What Is Microarray A new powerful technology for biological exploration Parallel High-throughput Large-scale Genomic scale.
Statistical Methods for Identifying Differentially Expressed Genes in Replicated cDNA Microarray Experiments Presented by Nan Lin 13 October 2002.
Henrik Bengtsson Mathematical Statistics Centre for Mathematical Sciences Lund University, Sweden Plate Effects in cDNA Microarray Data.
Idea: measure the amount of mRNA to see which genes are being expressed in (used by) the cell. Measuring protein might be more direct, but is currently.
Microarray Technology. Introduction Introduction –Microarrays are extremely powerful ways to analyze gene expression. –Using a microarray, it is possible.
MICROARRAYS D’EXPRESSIÓ ESTUDI DE REGULADORS DE LA TRANSCRIPCIÓ DE LA FAMILIA trxG M. Corominas:
(1) Normalization of cDNA microarray data Methods, Vol. 31, no. 4, December 2003 Gordon K. Smyth and Terry Speed.
Overview of Microarray. 2/71 Gene Expression Gene expression Production of mRNA is very much a reflection of the activity level of gene In the past, looking.
Microarray analysis Quantitation of Gene Expression Expression Data to Networks BIO520 BioinformaticsJim Lund Reading: Ch 16.
Henrik Bengtsson Mathematical Statistics Centre for Mathematical Sciences Lund University Plate Effects in cDNA Microarray Data.
The microarray data analysis Ana Deckmann Carla Judice Jorge Lepikson Jorge Mondego Leandra Scarpari Marcelo Falsarella Carazzolle Michelle Servais Tais.
Distinguishing active from non active genes: Main principle: DNA hybridization -DNA hybridizes due to base pairing using H-bonds -A/T and C/G and A/U possible.
Statistical Analysis for Expression Experiments Heather Adams BeeSpace Doctoral Forum Thursday May 21, 2009.
Microarray: An Introduction
Microarray Technology and Data Analysis Roy Williams PhD Sanford | Burnham Medical Research Institute.
Microarray Data Analysis Xuming He Department of Statistics University of Illinois at Urbana-Champaign.
2.2 - Microarray Data Modeling Lab Sébastien Lemieux Elitra Canada Ltd.
Microarray - Leukemia vs. normal GeneChip System.
Normalization Methods for Two-Color Microarray Data
Image Processing for cDNA Microarray Data
The Basics of Microarray Image Processing
Normalization for cDNA Microarray Data
Microarray Data Analysis
Presentation transcript:

Image Quantitation in Microarray Analysis More tomorrow...

Microarray analysis n Array construction, hybridisation, scanning n Quantitation of fluorescence signals n Data visualisation n Meta-analysis (clustering) n More visualisation

Technical probe (on chip) sample (labelled) pseudo-colour image [image from Jeremy Buhler]

Experimental design n Track what’s on the chip which spot corresponds to which gene n Duplicate experimental spots reproducibility n Controls –DNAs spotted on glass positive probe (induced or repressed) negative probe (bacterial genes on human chip) –oligos on glass or synthesised on chip (Affymetrix) point mutants (hybridisation plus/minus)

Imagesfrom scanner n Resolution –standard 10  m [currently, max 5  m] –100  m spot on chip = 10 pixels in diameter n Image format –TIFF (tagged image file format) –can be compressed (eg. Lempel-Ziv-Welch: ~ 5x compression) –1cm x 1cm image at 16 bit = 2Mb (uncompressed) –other formats exist eg. SCN (used at Stanford University) n Separate image for each fluorescent sample –channel 1, channel 2, etc.

Images in analysis software n Typical experiment: –“normal” state, Cy3-labelled sample (green) –“perturbed” state, Cy5-labelled sample (red) –hybridisation, then scanning –overlay images  pseudo-colour image –qualitative representation of results Image spot colourSignal strengthGene expression yellow:normal = perturbedunchanged green:normal > perturbedrepressed red:normal < perturbedinduced

Quantitation process (1) Accurate representation of signal for each spot and determine ratio channel1:channel2 n Determine spot boundary –construct grid (dimensions of array / spot size) –iterative process to find spots n Measure signal –fluorescence 8 bit = 256 shades 16 bit = 65’536 shades –absolute output values vary from system to system

Quantitation process (2) n Measure background –local (usually best) –selected region –selected spots / probes from different species n Quality control –eg. fraction of pixels greater than background (ScanAlyze) –flag aberrant spots n Determine ratio of signal strengths for each spot Ch1/Ch2 = (Ch1 I -Ch1 B )/Ch2 I -Ch2 B )

Normalisation n Eliminate systematic variation –correct for dye incorporation print-tip effects hybridisation efficiencies etc. n How? n Use STATISTICS!

Normalisation n Simple example: green vs red Normgreen i = green i TotRed - TotRedB TotGreen - TotGreenB

Normalisation n More complexe: Within slide global (constant over the slide) spot intensity dependent (spot by spot) within print-tip group (group by group) scale (outlayers spread) Between slides paired-slides (dye swap) Multiple slides scale (slide outlayers spread)

Normalisation

n Example: Within slide print-tip group scaling n Before scaling n After scaling

Normalisation references Normalization for cDNA microarray data Yang et al. (2001) In preparation Statistical methods for identifying differentially expressed genes in replicated cDNA microarray experiments Dudoit et al. (2000) Technical report #578 Berkeley Statistics Dept. Both PDFs are available from the web site of the course [statistics]

Quantitation - problems n Reference signal is close zero –channels ratio (Ch1/Ch2) tends to infinity n Non-uniform background –“mean” background sometimes non-representative bright particles streaks on image –safer to use “median” (middle value) less contribution by extreme values

Background problems

Background removal

Quantitation à la ScanAlyze signal background

ScanAlyze output CH1Ich1 intensity CH2Ich2 intensity SPIXnumber of pixels in spot CH1Bmedian intensity of the local background (recommended) CH2Bmedian intensity of the local background (recommended) CH1BAmean intensity of the local background CH2BAmean intensity of the local background BGPIXnumber of background pixels Ch1 CH1I - CH1B Thus to calculate channel ratios:Ch1 CH1I - CH1B --- = Ch2 CH2I - CH2B Quality control: CH1GTB1fraction of pixels in spot greater than background (CH1B) CH2GTB1fraction of pixels in spot greater than background (CH2B) CH1GTB2fraction of pixels in spot greater than 1.5 X background (CH1B) CH2GTB2fraction of pixels in spot greater than 1.5 X background (CH2B) CH1EDGEAmean magnitude of the horizontal and vertical Sobel edge vectors within spot 1 CH2EDGEAmean magnitude of the horizontal and vertical Sobel edge vectors within spot 2

Input files for Cluster n Minimal table: n Extended table: n Table: tab delimited text, 1 line/gene, 1 column/experiment

Prepare data for Cluster n Exp1 n Exp n ExpN

Software packages - quantitation n ScanAlyze –by Michael Eisen (Stanford University) –quantitation of images –no data visualisation –free from n ImaGene –BioDiscovery Inc. –quantitation and some data visualisation –demo from n plus many others - explore!

Making sense of raw data n Difficult to see results in tabulated data n Represent in graphical form Data visualisation examples from ImaGene and others...

Data visualisation - scatter plot

Data visualisation - M vs A

Data visualisation - pie chart

ScanAlyze quick demo