Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research.

Slides:



Advertisements
Similar presentations
AFGC Damares C Monte Carnegie Institution of Washington,
Advertisements

MicroArray Image Analysis Robin Liechti
Pre-processing in DNA microarray experiments Sandrine Dudoit PH 296, Section 33 13/09/2001.
MicroArray Image Analysis
MicroArray Image Analysis Robin Liechti
Measuring Gene Expression Part 3
Filtering and Normalization of Microarray Gene Expression Data Waclaw Kusnierczyk Norwegian University of Science and Technology Trondheim, Norway.
Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical.
Introduction to Microarray Analysis and Technology Dave Lin - November 5, 2001.
Mathematical Statistics, Centre for Mathematical Sciences
Microarray technology and analysis of gene expression data Hillevi Lindroos.
Image Quantitation in Microarray Analysis More tomorrow...
Normalization of Microarray Data - how to do it! Henrik Bengtsson Terry Speed
TIGR Spotfinder: a tool for microarray image processing
Introduction to DNA Microarray Technology Steen Knudsen, April 2005.
Getting the numbers comparable
The second-simplest cDNA microarray data analysis problem Terry Speed, UC Berkeley Fred Hutchinson Cancer Research Center March 9, 2001.
Normalization for cDNA Microarray Data Yee Hwa Yang, Sandrine Dudoit, Percy Luu and Terry Speed. SPIE BIOS 2001, San Jose, CA January 22, 2001.
DNA Microarray Bioinformatics - #27612 Normalization and Statistical Analysis.
Microarray Data Preprocessing and Clustering Analysis
Figure 1: (A) A microarray may contain thousands of ‘spots’. Each spot contains many copies of the same DNA sequence that uniquely represents a gene from.
Gene Expression Data Analyses (2)
Low Level Statistics and Quality Control Javier Cabrera.
Sample preparation 1. Design experiment Question? Replicates? Test? 2. Perform experiment 4. Label RNA Amplification? Direct or indirect? Label? wild.
Microarray Technology Types Normalization Microarray Technology Microarray: –New Technology (first paper: 1995) Allows study of thousands of genes at.
Image Analysis Class web site: Statistics for Microarrays.
HWW Gene Expression Experiments: How? Why? What’s the problem?
Introduce to Microarray
Corrections and Normalization in microarrays data analysis
Scanning and image analysis Scanning -Dyes -Confocal scanner -CCD scanner Image File Formats Image analysis -Locating the spots -Segmentation -Evaluating.
Filtering and Normalization of Microarray Gene Expression Data Waclaw Kusnierczyk Norwegian University of Science and Technology Trondheim, Norway.
Image Quantitation in Microarray Analysis More tomorrow...
Scanning and Image Processing -by Steve Clough. GSI Lumonics cDNA microarrays use two dyes with well separated emission spectra such as Cy3 and Cy5 to.
CDNA Microarrays Neil Lawrence. Schedule Today: Introduction and Background 18 th AprilIntroduction and Background 25 th AprilcDNA Mircoarrays 2 nd MayNo.
Affymetrix vs. glass slide based arrays
The following slides have been adapted from to be presented at the Follow-up course on Microarray Data Analysis.
Statistical Analyses of Microarray Data Rafael A. Irizarry Department of Biostatistics
Introduction to DNA Microarray Technology Steen Knudsen Uma Chandran.
CDNA Microarrays MB206.
Panu Somervuo, March 19, cDNA microarrays.
1 Two Color Microarrays EPP 245/298 Statistical Analysis of Laboratory Data.
WORKSHOP SPOTTED 2-channel ARRAYS DATA PROCESSING AND QUALITY CONTROL Eugenia Migliavacca and Mauro Delorenzi, ISREC, December 11, 2003.
Analysis of Microarray Data Analysis of images Preprocessing of gene expression data Normalization of data –Subtraction of Background Noise –Global/local.
Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research.
Microarray - Leukemia vs. normal GeneChip System.
An Automated Segmentation Method for Microarray Image Analysis Wei-Bang Chen 1, Chengcui Zhang 1 and Wen-Lin Liu 2 1 Department of Computer and Information.
Microarrays and Gene Expression Analysis. 2 Gene Expression Data Microarray experiments Applications Data analysis Gene Expression Databases.
What Is Microarray A new powerful technology for biological exploration Parallel High-throughput Large-scale Genomic scale.
Measuring Gene Expression Part 3 David Wishart Bioinformatics 301
ImArray - An Automated High-Performance Microarray Scanner Software for Microarray Image Analysis, Data Management and Knowledge Mining Wei-Bang Chen and.
Statistical Methods for Identifying Differentially Expressed Genes in Replicated cDNA Microarray Experiments Presented by Nan Lin 13 October 2002.
Henrik Bengtsson Mathematical Statistics Centre for Mathematical Sciences Lund University, Sweden Plate Effects in cDNA Microarray Data.
Microarray Data Pre-Processing
Microarray Technology. Introduction Introduction –Microarrays are extremely powerful ways to analyze gene expression. –Using a microarray, it is possible.
Microarray hybridization Usually comparative – Ratio between two samples Examples – Tumor vs. normal tissue – Drug treatment vs. no treatment – Embryo.
MICROARRAYS D’EXPRESSIÓ ESTUDI DE REGULADORS DE LA TRANSCRIPCIÓ DE LA FAMILIA trxG M. Corominas:
(1) Normalization of cDNA microarray data Methods, Vol. 31, no. 4, December 2003 Gordon K. Smyth and Terry Speed.
A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb GP3xCLI GenePix Post-Processing.
The second-simplest cDNA microarray data analysis problem Terry Speed, UC Berkeley Bioinformatic Strategies For Application of Genomic Tools to Environmental.
Henrik Bengtsson Mathematical Statistics Centre for Mathematical Sciences Lund University Plate Effects in cDNA Microarray Data.
Distinguishing active from non active genes: Main principle: DNA hybridization -DNA hybridizes due to base pairing using H-bonds -A/T and C/G and A/U possible.
Statistical Analysis for Expression Experiments Heather Adams BeeSpace Doctoral Forum Thursday May 21, 2009.
Other uses of DNA microarrays
©2003/04 Alessandro Bogliolo Analysis of gene expression by means of Microarrays.
Course : T Computer Vision
Microarray - Leukemia vs. normal GeneChip System.
Image Processing for cDNA Microarray Data
The Basics of Microarray Image Processing
Getting the numbers comparable
Normalization for cDNA Microarray Data
Presentation transcript:

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research Image Processing for cDNA Microarray Data Prepared with massive assistance from Yee Hwa Yang (Berkeley, WEHI), and reporting on work done jointly with her, Sandrine Dudoit (Stanford) and Mike Buckley (CSIRO, Sydney). References : M Eisen and P Brown, Methods in Enzymology vol 303, 1999; Chapter 2, DNA Microarrays (ed M Schena, OUP 1999) by Mack J Schermer; Chapter 13, Microarray Biochip Technology (ed M Schena, Eaton 2000) by Basarsky et al.

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research Scanner Process Dye Photons ElectronsSignal LaserPMT A/D Convertor excitation amplification Filtering Time-space averaging

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research GenePix 4000a Microarray Scanner Protocol 1. Turn on scanner. 2. Slide scanner door open. Insert chip hyp side down and clip chip holder easily around the slide 3 Set PMTs to 600 in both 635nm (Cy3) and 532 (Cy5) channels. 4. Perform low resolution “PREVIEW SCAN” to determine location of spots and initial hyb intensities 5. Once scan location determined, draw a “SCAN AREA” marquis around the array 6. Perform quick visual inspection of hyb and make initial adjustments to PMTs 7. For gene expression hybs, raise or lower the red and green PMTs to achieve color balance 8. Before you perform your data scan, change “LINES TO AVERAGE” to Perform a high-resolution “DATA-SCAN”……(ctd)

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research 10. Observe the histograms and make adjustments to PMTs. 11. Once the PMT level has been set so that the Intensity Ratio is near 1.00 perform a “DATA SCAN” over “SCAN AREA” and save the results. 12. To save your image, select “SAVE IMAGES”. 13. Save as type=Multi-image TIFF files. 14. Once scanned and saved, you are ready to assign spot identities and calculate results. Note: For us, normalization is performed later during data analysis, see next lecture. GenePix 4000a Microarray Scanner Protocol, ctd

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research Scanner Laser PMT Dye Glass Slide Objective Lens Detector lens Pinhole Beam-splitter

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research How to adjust for PMT? Cy3Cy saturated Very weak

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research After normalisation In addition, the ranking of the genes stays pretty much the same.

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research Practical Problems 1 Comet Tails Likely caused by insufficiently rapid immersion of the slides in the succinic anhydride blocking solution.

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research Practical Problems 2

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research Practical Problems 3 High Background 2 likely causes: –Insufficient blocking. –Precipitation of the labeled probe. Weak Signals

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research Practical Problems 4 Spot overlap: Likely cause: too much rehydration during post - processing.

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research Practical Problems 5 Dust

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research Steps in Images Processing 1. Addressing: locate centers 2. Segmentation: classification of pixels either as signal or background. using seeded region growing). 3. Information extraction: for each spot of the array, calculates signal intensity pairs, background and quality measures.

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research Steps in Image Processing Spot Intensities –mean (pixel intensities). –median (pixel intensities). –Pixel variation ( IQR of log (pixel intensities ). Background values –Local –Morphological opening –Constant (global) –None Quality Information Signal Background 3. Information Extraction

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research Addressing This is the process of assigning coordinates to each of the spots. Automating this part of the procedure permits high throughput analysis. 4 by 4 grids 19 by 21 spots per grid

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research Addressing Registration

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research Problems in automatic addressing Misregistration of the red and green channels Rotation of the array in the image Skew in the array Rotation

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research Segmentation methods Fixed circles Adaptive Circle Adaptive Shape –Edge detection. –Seeded Region Growing. (R. Adams and L. Bishof (1994) :Regions grow outwards from the seed points preferentially according to the difference between a pixel’s value and the running mean of values in an adjoining region. Histogram Methods –Adaptive threshold.

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research Examples of algorithms and software implementation

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research Limitation of fixed circle method SRGFixed Circle

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research Limitation of circular segmentation —Small spot —Not circular Results from SRG

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research Information Extraction —Spot Intensities —mean (pixel intensities). —median (pixel intensities). —Background values —Local —Morphological opening —Constant (global) —None —Quality Information Take the average

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research Local Backgrounds

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research Information Quality –Area –Circularity –Signal to Noise ratio

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research Quality Measurements Array –Correlation between spot intensities. –Percentage of spots with no signals. –Distribution of spot signal area. Spot –Signal / Noise ratio. –Variation in pixel intensities. –Identification of “bad spot” (spots with no signal). Ratio (2 spots combined) –Circularity

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research Quality of Array Distribution of areas. - Judge by eye - Look at variation. (e.g, SD) Cy3 area mean 57 median 56 SD Cy5 area mean 59 median 57 SD 24.34

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research Does the image analysis matter? Spot.nbg Spot.morph Spot.valley ScanAlyze

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research Background makes a difference Background methodSegmentation methodExp1 Exp2 S.nbg66 Gp.nbg76 SA.nbg66 No backgroundQA.fix.nbg76 QA.hist.nbg76 QA.adp.nbg1414 S.valley1721 GP1111 Local surroundingSA1214 QA.fix1823 QA.hist98 QA.adp2726 OthersS.morph99 S.const1414 Medians of the SD of log 2 (R/G) for 8 replicated spots multiplied by 100 and rounded to the nearest integer.