The Analysis of Microarray data using Mixed Models David Baird Peter Johnstone & Theresa Wilson AgResearch.

Slides:



Advertisements
Similar presentations
Randomized Complete Block and Repeated Measures (Each Subject Receives Each Treatment) Designs KNNL – Chapters 21,
Advertisements

Improved normalisation of microarray data by optimised iterative local regression Matthias E. Futschik Department of Information Science University of.
Optimal designs for one and two-colour microarrays using mixed models
© Department of Statistics 2012 STATS 330 Lecture 32: Slide 1 Stats 330: Lecture 32.
Limma: Linear Models for Microarray Data R user group 21 June 2005 Judith Boer.
1 Chapter 4 Experiments with Blocking Factors The Randomized Complete Block Design Nuisance factor: a design factor that probably has an effect.
Chapter 4 Randomized Blocks, Latin Squares, and Related Designs
Dahlia Nielsen North Carolina State University Bioinformatics Research Center.
Factorial and Mixed Factor ANOVA and ANCOVA
Microarray Normalization
Filtering and Normalization of Microarray Gene Expression Data Waclaw Kusnierczyk Norwegian University of Science and Technology Trondheim, Norway.
Microarray technology and analysis of gene expression data Hillevi Lindroos.
Getting the numbers comparable
DNA Microarray Bioinformatics - #27612 Normalization and Statistical Analysis.
Preprocessing Methods for Two-Color Microarray Data
Low-Level Analysis and QC Regional Biases Mark Reimers, NCI.
Gene Expression Data Analyses (2)
Normalization of 2 color arrays Alex Sánchez. Dept. Estadística Universitat de Barcelona.
Making Sense of Complicated Microarray Data
Statistical Analysis of Microarray Data
Business Statistics - QBM117 Statistical inference for regression.
Filtering and Normalization of Microarray Gene Expression Data Waclaw Kusnierczyk Norwegian University of Science and Technology Trondheim, Norway.
1 Normalization Methods for Two-Color Microarray Data 1/13/2009 Copyright © 2009 Dan Nettleton.
(4) Within-Array Normalization PNAS, vol. 101, no. 5, Feb Jianqing Fan, Paul Tam, George Vande Woude, and Yi Ren.
The following slides have been adapted from to be presented at the Follow-up course on Microarray Data Analysis.
CDNA Microarrays Neil Lawrence. Schedule Today: Introduction and Background 18 th AprilIntroduction and Background 25 th AprilcDNA Mircoarrays 2 nd MayNo.
Proteomics Informatics – Data Analysis and Visualization (Week 13)
Practical Issues in Microarray Data Analysis Mark Reimers National Cancer Institute Bethesda Maryland.
One-Factor Experiments Andy Wang CIS 5930 Computer Systems Performance Analysis.
DNA microarray technology allows an individual to rapidly and quantitatively measure the expression levels of thousands of genes in a biological sample.
DATA TRANSFORMATION and NORMALIZATION Lecture Topic 4.
1 Two Color Microarrays EPP 245/298 Statistical Analysis of Laboratory Data.
Applying statistical tests to microarray data. Introduction to filtering Recall- Filtering is the process of deciding which genes in a microarray experiment.
Model Comparison for Tree Resin Dose Effect On Termites Lianfen Qian Florida Atlantic University Co-author: Soyoung Ryu, University of Washington.
Bioinformatics Expression profiling and functional genomics Part II: Differential expression Ad 27/11/2006.
A Short Overview of Microarrays Tex Thompson Spring 2005.
Constrained ordinations Dependence of multivariate response on one or many predictors.
Lecture Topic 5 Pre-processing AFFY data. Probe Level Analysis The Purpose –Calculate an expression value for each probe set (gene) from the PM.
Introduction to Statistical Analysis of Gene Expression Data Feng Hong Beespace meeting April 20, 2005.
Statistical Methods for Identifying Differentially Expressed Genes in Replicated cDNA Microarray Experiments Presented by Nan Lin 13 October 2002.
Statistics for Differential Expression Naomi Altman Oct. 06.
Henrik Bengtsson Mathematical Statistics Centre for Mathematical Sciences Lund University, Sweden Plate Effects in cDNA Microarray Data.
A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb Analysis of (cDNA) Microarray.
Design of Micro-arrays Lecture Topic 6. Experimental design Proper experimental design is needed to ensure that questions of interest can be answered.
CPE 619 One Factor Experiments Aleksandar Milenković The LaCASA Laboratory Electrical and Computer Engineering Department The University of Alabama in.
Microarray hybridization Usually comparative – Ratio between two samples Examples – Tumor vs. normal tissue – Drug treatment vs. no treatment – Embryo.
(1) Normalization of cDNA microarray data Methods, Vol. 31, no. 4, December 2003 Gordon K. Smyth and Terry Speed.
For a specific gene x ij = i th measurement under condition j, i=1,…,6; j=1,2 Is a Specific Gene Differentially Expressed Differential expression.
Analyzing Expression Data: Clustering and Stats Chapter 16.
Spatial Smoothing and Multiple Comparisons Correction for Dummies Alexa Morcom, Matthew Brett Acknowledgements.
Multi session analysis using FEAT David Field Thanks to…. Tom Johnstone, Jason Gledhill, FMRIB.
Hybridization Design for 2-Channel Microarray Experiments Naomi S. Altman, Pennsylvania State University), NSF_RCN.
Henrik Bengtsson Mathematical Statistics Centre for Mathematical Sciences Lund University Plate Effects in cDNA Microarray Data.
Variability & Statistical Analysis of Microarray Data GCAT – Georgetown July 2004 Jo Hardin Pomona College
A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb Analysis of (cDNA) Microarray.
Distinguishing active from non active genes: Main principle: DNA hybridization -DNA hybridizes due to base pairing using H-bonds -A/T and C/G and A/U possible.
Statistical Analysis for Expression Experiments Heather Adams BeeSpace Doctoral Forum Thursday May 21, 2009.
G Lecture 71 Revisiting Hierarchical Mixed Models A General Version of the Model Variance/Covariances of Two Kinds of Random Effects Parameter Estimation.
Microarray Data Analysis Xuming He Department of Statistics University of Illinois at Urbana-Champaign.
Micro array Data Analysis. Differential Gene Expression Analysis The Experiment Micro-array experiment measures gene expression in Rats (>5000 genes).
Design Lecture: week3 HSTS212.
The array comparative genomic hybridization (aCGH/CMA) technology
Normalization Methods for Two-Color Microarray Data
An overview of the design and analysis of experiments
Getting the numbers comparable
Randomized Complete Block and Repeated Measures (Each Subject Receives Each Treatment) Designs KNNL – Chapters 21,
Regression Assumptions
Normalization for cDNA Microarray Data
One-Factor Experiments
Regression Assumptions
Presentation transcript:

The Analysis of Microarray data using Mixed Models David Baird Peter Johnstone & Theresa Wilson AgResearch

David Baird 2002 Raw data – background (log 2 scale)  Truncation of Red/Green at high levels  Limitation of Red & Green at low levels to stabilise ratio

David Baird 2002 Dye Bias Differential binding or scanning levels of the two dyes

David Baird 2002 Visualization of Slide Log 2 (Red/Green) on Blue-Red spectrum

David Baird 2002 Spatial Effects (pins = 8 x 4)  Pins  Rows  Columns  Spatial auto- correlation

David Baird 2002 GenStat Spatial Model Analysis Fixed effects (could be random) Pins Rows and columns within the slide Random effects Cubic Smoothing Spline for Intensity (log 2 (Red*Green)/2) AR1 autocorrelation process across rows and columns within pins (removes carry over effects, local trends)

David Baird 2002 Cubic Smoothing Spline B Spline Basis  Basis vectors fitted as random effects in a REML analysis Alternatives:  Fixed knot cubic spline  Polynomial

David Baird 2002 Dye Bias Fits within Slide  Similar performance of two splines  Poor performance of polynomial (as expected)  Smoothing spline less responsive in left tail

David Baird 2002 Splines per Pin possible  Only 3-5% extra variation explained

David Baird 2002 Pin Effects for First Slide Large trend from one corner to the opposite diagonal

David Baird 2002 Pin Effects for 2 nd Slide

David Baird 2002 Slide Row & Column Effects Effects were highly significant (P<0.001) In addition, within each pin there was a significant autocorrelation for both columns  = , and rows  = (Spots printed within columns)

David Baird 2002 Row/Column Effects Slide 2

David Baird 2002 Residuals from Analysis  Unequal variances  Variance stablises for log 2 Intensity > 9  EST effects calculated from residuals

David Baird 2002 Background too low

David Baird 2002 Background too high

David Baird 2002 Rescaling of Residuals  Possible need for a weighted analysis

David Baird 2002 Dye Swaps for EST Dye Bias Some ESTs preferentially bind to one of the dyes Important to swap dyes between treatments to detect and adjust for this Extreme ratio caused by Red dye always binding to this EST

David Baird 2002 Q-Q Plot of EST Ratios  A large number of under expressed ESTs

David Baird 2002 Differenced Q-Q Plot Blank Removed  Departures from Normal distribution occur after Normal Score of ~ 2.5 (0.6% = 60 ESTs)  No significant departures in positive ratios

David Baird 2002 Volcano Plot  Developed an Index based on combination of T, mean Ratio and Intensity  Plot coloured by Index  Usually Y = –log(p)

David Baird 2002 Design of Experiments Side by side duplicate spots are not useful Repeat printing of EST library with randomisation on same slide is useful Importance of balancing dyes with treatment effects Incomplete Block Designs of size 2 used Replicate slides required due to slide- slide variation (3 - 4 reps per treatment comparison)

David Baird 2002 Time Course Experiment Initial state, and then state 1, 2 & 3 weeks after treatment (8 reps) Time 0Time 1Time 2Time 3 Treatment Treatment Contrast Variances Time Time Time Time 0 Time 1 Time 2

David Baird 2002 Time Course Experiment Standard Treatment (8 reps) Time 0Time 1Time 2Time 3 Treatment Treatment Contrast Variances Time Time Time Time 0 Time 1 Time 2

David Baird 2002 Time Course Experiment Loop Design (6 reps) Time 0Time 1Time 2Time 3 Treatment Treatment Contrast Variances Time Time Time Time 0 Time 1 Time 2

David Baird 2002 Time Course Experiment Full incomplete block design (4 reps) Time 0Time 1Time 2Time 3 Treatment Treatment Contrast Variances Time Time Time Time 0 Time 1 Time 2

David Baird 2002 Analysis of Spot Shape

David Baird 2002 Analysis of pin accuracy