Image Processing for cDNA Microarray Data

Slides:



Advertisements
Similar presentations
MicroArray Image Analysis Robin Liechti
Advertisements

QR Code Recognition Based On Image Processing
Pre-processing in DNA microarray experiments Sandrine Dudoit PH 296, Section 33 13/09/2001.
MicroArray Image Analysis
MicroArray Image Analysis Robin Liechti
Measuring Gene Expression Part 3
Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical.
Introduction to Microarray Analysis and Technology Dave Lin - November 5, 2001.
Mathematical Statistics, Centre for Mathematical Sciences
Microarray technology and analysis of gene expression data Hillevi Lindroos.
Image Quantitation in Microarray Analysis More tomorrow...
Normalization of Microarray Data - how to do it! Henrik Bengtsson Terry Speed
TIGR Spotfinder: a tool for microarray image processing
Introduction to DNA Microarray Technology Steen Knudsen, April 2005.
Normalization for cDNA Microarray Data Yee Hwa Yang, Sandrine Dudoit, Percy Luu and Terry Speed. SPIE BIOS 2001, San Jose, CA January 22, 2001.
DNA Microarray Bioinformatics - #27612 Normalization and Statistical Analysis.
Figure 1: (A) A microarray may contain thousands of ‘spots’. Each spot contains many copies of the same DNA sequence that uniquely represents a gene from.
Gene Expression Data Analyses (2)
Low Level Statistics and Quality Control Javier Cabrera.
Sample preparation 1. Design experiment Question? Replicates? Test? 2. Perform experiment 4. Label RNA Amplification? Direct or indirect? Label? wild.
Microarray Technology Types Normalization Microarray Technology Microarray: –New Technology (first paper: 1995) Allows study of thousands of genes at.
Image Analysis Class web site: Statistics for Microarrays.
HWW Gene Expression Experiments: How? Why? What’s the problem?
Scanning and image analysis Scanning -Dyes -Confocal scanner -CCD scanner Image File Formats Image analysis -Locating the spots -Segmentation -Evaluating.
SPH 247 Statistical Analysis of Laboratory Data. Two-Color Arrays Two-color arrays are designed to account for variability in slides and spots by using.
Image Quantitation in Microarray Analysis More tomorrow...
Scanning and Image Processing -by Steve Clough. GSI Lumonics cDNA microarrays use two dyes with well separated emission spectra such as Cy3 and Cy5 to.
CDNA Microarrays Neil Lawrence. Schedule Today: Introduction and Background 18 th AprilIntroduction and Background 25 th AprilcDNA Mircoarrays 2 nd MayNo.
Hybridization and data acquisition –Hybridization –Scanning –Image analysis –Background correction and filtering –Data transformation Methods for normalization.
The following slides have been adapted from to be presented at the Follow-up course on Microarray Data Analysis.
Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research.
Introduction to DNA Microarray Technology Steen Knudsen Uma Chandran.
CDNA Microarrays MB206.
Panu Somervuo, March 19, cDNA microarrays.
1 Two Color Microarrays EPP 245/298 Statistical Analysis of Laboratory Data.
WORKSHOP SPOTTED 2-channel ARRAYS DATA PROCESSING AND QUALITY CONTROL Eugenia Migliavacca and Mauro Delorenzi, ISREC, December 11, 2003.
Analysis of Microarray Data Analysis of images Preprocessing of gene expression data Normalization of data –Subtraction of Background Noise –Global/local.
- 1 - LSM 510 META AIM Carl Zeiss - PMLSM 510 META EF-Guided Tour April 2002 Guided tour through:  Configuration  Defining spectral range  Scan control.
Microarray - Leukemia vs. normal GeneChip System.
What Is Microarray A new powerful technology for biological exploration Parallel High-throughput Large-scale Genomic scale.
Measuring Gene Expression Part 3 David Wishart Bioinformatics 301
ImArray - An Automated High-Performance Microarray Scanner Software for Microarray Image Analysis, Data Management and Knowledge Mining Wei-Bang Chen and.
Statistical Methods for Identifying Differentially Expressed Genes in Replicated cDNA Microarray Experiments Presented by Nan Lin 13 October 2002.
- 1 - LSM 510 META AIM Carl Zeiss - PMLSM 510 META EF-Guided Tour April 2002 Guided tour through:  Configuration  Defining spectral range  Scan control.
Henrik Bengtsson Mathematical Statistics Centre for Mathematical Sciences Lund University, Sweden Plate Effects in cDNA Microarray Data.
Microarray Data Pre-Processing
Microarray Technology. Introduction Introduction –Microarrays are extremely powerful ways to analyze gene expression. –Using a microarray, it is possible.
Microarray hybridization Usually comparative – Ratio between two samples Examples – Tumor vs. normal tissue – Drug treatment vs. no treatment – Embryo.
Autonomous Robots Vision © Manfred Huber 2014.
MICROARRAYS D’EXPRESSIÓ ESTUDI DE REGULADORS DE LA TRANSCRIPCIÓ DE LA FAMILIA trxG M. Corominas:
(1) Normalization of cDNA microarray data Methods, Vol. 31, no. 4, December 2003 Gordon K. Smyth and Terry Speed.
The second-simplest cDNA microarray data analysis problem Terry Speed, UC Berkeley Bioinformatic Strategies For Application of Genomic Tools to Environmental.
Henrik Bengtsson Mathematical Statistics Centre for Mathematical Sciences Lund University Plate Effects in cDNA Microarray Data.
Statistical Analysis for Expression Experiments Heather Adams BeeSpace Doctoral Forum Thursday May 21, 2009.
©2003/04 Alessandro Bogliolo Analysis of gene expression by means of Microarrays.
Course : T Computer Vision
Microarray - Leukemia vs. normal GeneChip System.
CDNA-Project cDNA project Julia Brettschneider (UCB Statistics)
The Basics of cDNA Microarray Technology
Volume 1, Issue 1, Pages (February 2002)
Normalization Methods for Two-Color Microarray Data
The Basics of Microarray Image Processing
Gene Chips.
Image Processing, Lecture #8
DNA Chip Data Interpretation Tools: Genmapp & Dragon View
Image Processing, Lecture #8
Getting the numbers comparable
Optimal gene expression analysis by microarrays
Normalization for cDNA Microarray Data
Microarray Data Analysis
Presentation transcript:

Image Processing for cDNA Microarray Data Prepared with massive assistance from Yee Hwa Yang (Berkeley, WEHI), and reporting on work done jointly with her, Sandrine Dudoit (Stanford) and Mike Buckley (CSIRO, Sydney). References : M Eisen and P Brown, Methods in Enzymology vol 303, 1999; Chapter 2, DNA Microarrays (ed M Schena, OUP 1999) by Mack J Schermer; Chapter 13, Microarray Biochip Technology (ed M Schena, Eaton 2000) by Basarsky et al.

Scanner Process A/D Convertor Laser PMT Dye Photons Electrons Signal excitation amplification Filtering Time-space averaging

GenePix 4000a Microarray Scanner Protocol 1. Turn on scanner. 2. Slide scanner door open. Insert chip hyp side down and clip chip holder easily around the slide 3 Set PMTs to 600 in both 635nm (Cy3) and 532 (Cy5) channels. 4. Perform low resolution “PREVIEW SCAN” to determine location of spots and initial hyb intensities 5. Once scan location determined, draw a “SCAN AREA” marquis around the array 6. Perform quick visual inspection of hyb and make initial adjustments to PMTs 7. For gene expression hybs, raise or lower the red and green PMTs to achieve color balance 8. Before you perform your data scan, change “LINES TO AVERAGE” to 2. 9. Perform a high-resolution “DATA-SCAN”……(ctd)

GenePix 4000a Microarray Scanner Protocol, ctd 10. Observe the histograms and make adjustments to PMTs. 11. Once the PMT level has been set so that the Intensity Ratio is near 1.00 perform a “DATA SCAN” over “SCAN AREA” and save the results. 12. To save your image, select “SAVE IMAGES”. 13. Save as type=Multi-image TIFF files. 14. Once scanned and saved, you are ready to assign spot identities and calculate results. Note: For us, normalization is performed later during data analysis, see next lecture.

Scanner PMT Laser Pinhole Detector lens Beam-splitter Objective Lens Dye Glass Slide

How to adjust for PMT? Cy3 Cy5 1 600 600 2 650 600 3 650 650 4 700 650 Very weak Cy3 Cy5 1 600 600 2 650 600 3 650 650 4 700 650 5 650 700 6 700 700 7 750 750 saturated

After normalisation In addition, the ranking of the genes stays pretty much the same.

Practical Problems 1 Comet Tails Likely caused by insufficiently rapid immersion of the slides in the succinic anhydride blocking solution.

Practical Problems 2

Practical Problems 3 High Background Weak Signals 2 likely causes: Insufficient blocking. Precipitation of the labeled probe. Weak Signals

Practical Problems 4 Spot overlap: Likely cause: too much rehydration during post - processing.

Practical Problems 5 Dust

Steps in Images Processing 1. Addressing: locate centers 2. Segmentation: classification of pixels either as signal or background. using seeded region growing). 3. Information extraction: for each spot of the array, calculates signal intensity pairs, background and quality measures.

Steps in Image Processing 3. Information Extraction Spot Intensities mean (pixel intensities). median (pixel intensities). Pixel variation (IQR of log (pixel intensities). Background values Local Morphological opening Constant (global) None Quality Information Signal Background

Addressing This is the process of assigning coordinates to each of the spots. Automating this part of the procedure permits high throughput analysis. 4 by 4 grids 19 by 21 spots per grid

Addressing Registration Registration

Problems in automatic addressing Misregistration of the red and green channels Rotation of the array in the image Skew in the array Rotation

Segmentation methods Fixed circles Adaptive Circle Adaptive Shape Edge detection. Seeded Region Growing. (R. Adams and L. Bishof (1994) :Regions grow outwards from the seed points preferentially according to the difference between a pixel’s value and the running mean of values in an adjoining region. Histogram Methods Adaptive threshold.

Examples of algorithms and software implementation

Limitation of fixed circle method SRG Fixed Circle

Limitation of circular segmentation Small spot Not circular Results from SRG

Information Extraction Spot Intensities mean (pixel intensities). median (pixel intensities). Background values Local Morphological opening Constant (global) None Quality Information Take the average

Local Backgrounds

Information Quality Area Circularity Signal to Noise ratio

Quality Measurements Array Correlation between spot intensities. Percentage of spots with no signals. Distribution of spot signal area. Spot Signal / Noise ratio. Variation in pixel intensities. Identification of “bad spot” (spots with no signal). Ratio (2 spots combined) Circularity

Quality of Array Distribution of areas. - Judge by eye - Look at variation. (e.g, SD) Cy5 area mean 59 median 57 SD 24.34 Cy3 area mean 57 median 56 SD 20.67

Does the image analysis matter? Spot.nbg Spot.morph Spot.valley ScanAlyze

Background makes a difference Background method Segmentation method Exp1 Exp2 S.nbg 6 6 Gp.nbg 7 6 SA.nbg 6 6 No background QA.fix.nbg 7 6 QA.hist.nbg 7 6 QA.adp.nbg 14 14 S.valley 17 21 GP 11 11 Local surrounding SA 12 14 QA.fix 18 23 QA.hist 9 8 QA.adp 27 26 Others S.morph 9 9 S.const 14 14 Medians of the SD of log2(R/G) for 8 replicated spots multiplied by 100 and rounded to the nearest integer.