Download presentation
Presentation is loading. Please wait.
1
cDNA Microarray Design and Pre-processing By H. Bjørn Nielsen
2
Why Experimental Design 1.To enable statistical hypothesis verification/falsification 2.To balance the effects from undesired controllable effects 3.To ensure sufficient statistical power
3
1. To enable statistical hypothesis verification/falsification Typically, we want to identify differential expressed genes between a set of conditions using t-test or ANOVA like statistics. This implies that we replicate sampling from a set of fixed conditions. Control vs. Treatment Treatment 1, Treatment 2, Treatment 3 Multi factorial ControlTreatment Mutant,Mutant Treated
4
Replications is essential The length of the series or the sampling density may be most important 1. To enable statistical hypothesis verification/falsification But we may also fit to a trend using alternative statistics (Bayesian fit, Boot strapping, ANOVA etc.) Control vs. Treatment Treatment 1, Treatment 2, Treatment 3 Multi factorial ControlTreatment Mutant,Mutant Treated Series T0, T1, T2,.... Tn
5
2. To balance the effects from undesired controllable effects Typical controllable effects Labeling dye Microarray slide Sampling time Growth conditions Typical uncontrollable effects Random effects Unintended deviations in sample handling, growth conditions, etc. Minimize and Balance
6
2. To ensure sufficient statistical power An appropriate number of replicates are required for distinguishing noise from 'effect' Gene expression studies typically requires +3 replicates t = Make sure to replicate over the most important sources of variance Typical order of noise contributions are: Biological variation Sample preparation batch Hybridization/slide effect Dye effect/Spot effect
7
An example Aim: Identify differentially expressed genes between ill and healthy patients. Samples: 4 ill and 4 healthy patients Using a two channel cDNA array. How should we do? SlideDyeCondition Slide 1Cy3ill Slide 1Cy5...
8
Another example Aim: Identify differentially expressed genes between ill and healthy patients. Samples: 4 ill (2xM +2xF) and 4 healthy (2xM +2F) Using a two channel cDNA array. How should we do? SlideDyeSexCondition Slide 1Cy3Mill Slide 1Cy5...
9
Yet another example Aim Identify genes differentially affected by starving in obese and lean people Samples: 4 obese (2x starving + 2x not starving) and 4 lean (2x starving +2x not starving) Using a one channel GeneChip. How should we do? Chip #BMIFood 1OS 2LN...
10
cDNA pre-processing Background correction Normalization –Within slide –Between slide
11
Background correction Is it meaningful? Methods: –subtraction –movingmin (3x3) –normexp –none Ritchie et al. 2007, Bioinformatics
12
Normalization within array Correct for any bias that follow an undesired uncontrollable effect –Print tip –Microtiter plate –Printing order –Spatial trends (uneven hybridization) As well as intensity dependent biases
13
Normalization between array Correction for intensity dependent biases –Lowess –Qspline –Quantiles –And more M A
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.