Normalization of 2 color arrays Alex Sánchez. Dept. Estadística Universitat de Barcelona.

Slides:



Advertisements
Similar presentations
Improved normalisation of microarray data by optimised iterative local regression Matthias E. Futschik Department of Information Science University of.
Advertisements

Statistical tests for differential expression in cDNA microarray experiments (2): ANOVA Xiangqin Cui and Gary A. Churchill Genome Biology 2003, 4:210 Presented.
Pre-processing in DNA microarray experiments Sandrine Dudoit PH 296, Section 33 13/09/2001.
Microarray Normalization
Filtering and Normalization of Microarray Gene Expression Data Waclaw Kusnierczyk Norwegian University of Science and Technology Trondheim, Norway.
Normalization of microarray data
Mathematical Statistics, Centre for Mathematical Sciences
Microarray technology and analysis of gene expression data Hillevi Lindroos.
Normalization of Microarray Data - how to do it! Henrik Bengtsson Terry Speed
Getting the numbers comparable
Statistics for Microarrays
Normalization for cDNA Microarray Data Yee Hwa Yang, Sandrine Dudoit, Percy Luu and Terry Speed. SPIE BIOS 2001, San Jose, CA January 22, 2001.
DNA Microarray Bioinformatics - #27612 Normalization and Statistical Analysis.
Preprocessing Methods for Two-Color Microarray Data
Microarray Data Preprocessing and Clustering Analysis
Normalization Class web site: Statistics for Microarrays.
CDNA Microarray Design and Pre-processing By H. Bjørn Nielsen.
Gene Expression Data Analyses (2)
GCB/CIS 535 Microarray Topics John Tobias November 8th, 2004.
Normalization Review and Cluster Analysis Class web site: Statistics for Microarrays.
Microarray Data Analysis Data quality assessment and normalization for affymetrix chips.
Microarray Data Analysis Data quality assessment and normalization for affymetrix chips.
Making Sense of Complicated Microarray Data
A robust neural networks approach for spatial and intensity-dependent normalization of cDNA microarray data A.L. Tarca, J.E.K. Cooke and J. MacKay Presented.
Corrections and Normalization in microarrays data analysis
Filtering and Normalization of Microarray Gene Expression Data Waclaw Kusnierczyk Norwegian University of Science and Technology Trondheim, Norway.
1 Normalization Methods for Two-Color Microarray Data 1/13/2009 Copyright © 2009 Dan Nettleton.
(4) Within-Array Normalization PNAS, vol. 101, no. 5, Feb Jianqing Fan, Paul Tam, George Vande Woude, and Yi Ren.
Preprocessing of cDNA microarray data Lecture 19, Statistics 246, April 1, 2004.
The following slides have been adapted from to be presented at the Follow-up course on Microarray Data Analysis.
CDNA Microarrays Neil Lawrence. Schedule Today: Introduction and Background 18 th AprilIntroduction and Background 25 th AprilcDNA Mircoarrays 2 nd MayNo.
Practical Issues in Microarray Data Analysis Mark Reimers National Cancer Institute Bethesda Maryland.
DATA TRANSFORMATION and NORMALIZATION Lecture Topic 4.
CDNA Microarrays MB206.
Panu Somervuo, March 19, cDNA microarrays.
Applying statistical tests to microarray data. Introduction to filtering Recall- Filtering is the process of deciding which genes in a microarray experiment.
WORKSHOP SPOTTED 2-channel ARRAYS DATA PROCESSING AND QUALITY CONTROL Eugenia Migliavacca and Mauro Delorenzi, ISREC, December 11, 2003.
Probe-Level Data Normalisation: RMA and GC-RMA Sam Robson Images courtesy of Neil Ward, European Application Engineer, Agilent Technologies.
Analysis of Microarray Data Analysis of images Preprocessing of gene expression data Normalization of data –Subtraction of Background Noise –Global/local.
Fishing expeditions in gloomy waters: Detecting differential expression in microarray data Matthias E. Futschik Institute for Theoretical Biology Humboldt-University,
Bioinformatics Expression profiling and functional genomics Part II: Differential expression Ad 27/11/2006.
1 Pre-processing - Normalization Databases Statistics for Microarray Data Analysis – Lecture 2 The Fields Institute for Research in Mathematical Sciences.
Lecture Topic 5 Pre-processing AFFY data. Probe Level Analysis The Purpose –Calculate an expression value for each probe set (gene) from the PM.
Statistical Principles of Experimental Design Chris Holmes Thanks to Dov Stekel.
Introduction to Statistical Analysis of Gene Expression Data Feng Hong Beespace meeting April 20, 2005.
Statistical Methods for Identifying Differentially Expressed Genes in Replicated cDNA Microarray Experiments Presented by Nan Lin 13 October 2002.
Pre-processing in DNA microarray experiments Sandrine Dudoit, Robert Gentleman, Rafael Irizarry, and Yee Hwa Yang Bioconductor short course Summer 2002.
Statistics for Differential Expression Naomi Altman Oct. 06.
ABC D EF GH I JKL. Supplementary figure S1: Exemplary overview of the quality assessment plots generated by Robin. All plots have been generated using.
Henrik Bengtsson Mathematical Statistics Centre for Mathematical Sciences Lund University, Sweden Plate Effects in cDNA Microarray Data.
A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb Analysis of (cDNA) Microarray.
A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb Analysis of (cDNA) Microarray.
Idea: measure the amount of mRNA to see which genes are being expressed in (used by) the cell. Measuring protein might be more direct, but is currently.
Microarray hybridization Usually comparative – Ratio between two samples Examples – Tumor vs. normal tissue – Drug treatment vs. no treatment – Embryo.
(1) Normalization of cDNA microarray data Methods, Vol. 31, no. 4, December 2003 Gordon K. Smyth and Terry Speed.
Pre-processing DNA Microarray Data Sandrine Dudoit, Robert Gentleman, Rafael Irizarry, and Yee Hwa Yang Bioconductor Short Course Winter 2002 © Copyright.
Henrik Bengtsson Mathematical Statistics Centre for Mathematical Sciences Lund University Plate Effects in cDNA Microarray Data.
The microarray data analysis Ana Deckmann Carla Judice Jorge Lepikson Jorge Mondego Leandra Scarpari Marcelo Falsarella Carazzolle Michelle Servais Tais.
Distinguishing active from non active genes: Main principle: DNA hybridization -DNA hybridizes due to base pairing using H-bonds -A/T and C/G and A/U possible.
Statistical Analysis for Expression Experiments Heather Adams BeeSpace Doctoral Forum Thursday May 21, 2009.
Microarray: An Introduction
Microarray Data Analysis Xuming He Department of Statistics University of Illinois at Urbana-Champaign.
Lecture 2 – Pre-processing and Normalization José Luis Mosquera Computational Lab on Microarrays Data Analysis Special Topics in Computer Science Institute.
DNA Microarray. Microarray Printing 96-well-plate (PCR Products) 384-well print-plate Microarray.
Micro Array Error Analysis Protein Interaction Map Integration & Visualization Dr. Werner Van Belle Department.
Functional Genomics in Evolutionary Research
Normalization Methods for Two-Color Microarray Data
New normalisation methods for microarrays
Getting the numbers comparable
Normalization for cDNA Microarray Data
Presentation transcript:

Normalization of 2 color arrays Alex Sánchez. Dept. Estadística Universitat de Barcelona

2 Microarray studies life cycle Here we are

3 What is normalization? The word normalization describes techniques used to suitably transform the data before they are analysed. Goal is to correct for systematic differences  between samples on the same slide, or  between slides, which do not represent true biological variation between samples.

4 The origin of systematic differences Systematic differences may be due to …  Dye biases which vary with spot intensity,  Location on the array,  Plate origin,  Printing quality which may vary between Pins Time of printing  Scanning parameters,…

5 How to know if it’s necessary? Option 1: to perform self-self normalization  If we hibridize a sample with itself instead of sample vs control intensities should be the same in both channels  All deviations from this equality means there is systematic bias that needs correction Option 2: Look at diagnostic plots for dye, slide or spatial effects

6 False color overlayBoxplots within pin-groupsScatter (MA-)plots Self-self hybridizations

7 Some non self-self hybridizations From the NCI60 data setEarly Ngai lab, UC Berkeley Early PMCRI, Melbourne AustraliaEarly Goodman lab, UC Berkeley

8 Normalization methods & issues Methods  Global adjustment  Intensity dependent normalization  Within print-tip group normalization  And many other… Selection of spots for normalization

9 Global normalization Based on a global adjustment log 2 R/G  log 2 R/G - c = log 2 R/(kg) Choices for k or c = log 2 k are  c = median or mean of log ratios for a particular gene set All genes or control or housekeeping genes.  Total intensity normalization, where K = ∑R i / ∑G i.

10 Example: (Callow et al 2002) Global median normalization.

11 Intensity-dependent normalization Run a line through the middle of the MA plot, shifting the M value of the pair (A,M) by c=c(A), i.e. log 2 R/G  log 2 R/G - c (A) = log 2 R/(k(A)G). One estimate of c(A) is made using the LOWESS function of Cleveland (1979): LOcally WEighted Scatterplot Smoothing.

12 Example: (Callow et al 2002) loess vs median normalization.

13 Example: (Callow et al 2002) Global median normalization. Global normalization performs a global correction but it cannot account for spatial effects   See next slide boxplots for the same situations in only one mouse, showing all sectors

14 Global normalisation does not correct spatial bias (print-tip-sectors)

15 Within print-tip group normalization To correct for spatial bias produced by hybridization artefacts or print-tip or plate effects during the construction of arrays. To correct for both print-tip and intensity- dependent bias perform LOWESS fits to the data within print-tip groups, i.e.  Log 2 R/G  log 2 R/G - c i (A) = log 2 R/(k i (A)G), where c i (A) is the LOWESS fit to the MA-plot for the ith grid only.

16 Local print-tip normalisation corrects spatial bias (print-tip-sectors)

17 Normalization, which spots to use? LOWESS can be run through many different sets of points,  All genes on the array.  Constantly expressed genes (housekeeping).  Controls.  Spiked controls (genes from distant species).  Genomic DNA titration series.  Rank invariant set.

18 Strategies for selecting a set of spots for normalization Use of a global LOWESS approach can be justified by supposing that, when stratified by mRNA abundance, a) Only a minority of genes expected to be differentially expressed, b) Any differential expression is as likely to be up- regulation as down-regulation. Pin-group LOWESS requires stronger assumptions: that one of the above applies within each pin-group.

19 Summary Microarray experiments have many “hot spots” where errors or systematic biases can apper Visual and numerical quality control should be performed Usually intensities will require normalisation  At least global or intensity dependent normalisation should be performed  More sophisticated procedures rely on stronger assumptions  Must look for a balance