Download presentation
Presentation is loading. Please wait.
Published byElwin Riley Modified over 9 years ago
1
Techniques for Analysing Microarrays Which genes are involved in ovarian and prostate cancer?
2
Common Questions (1)Which genes are “up” or “down” in different conditions Cancer patient versus Normal Non-invasive cancer versus invasive cancer (2) Which genes can differentiate between cancer sub-types? (3) Which genes relate to the survival of the patient? (4) Which genes may be in the same pathway as a gene of interest?
3
EOS chips Use Affymetrix GeneChip technology 25mers 8 probes in a probe set 59,000 probe sets ~ 46,000 gene clusters (all human expressed sequences known at time) Normalised distributions of all chips to each other (gamma distribution) Single measure of intensity for each probe set (Tukey’s trimean)
4
Variance (linear scale) Variance (log scale) mean After the “fix”….. (Add constant and log2) Data after “normalisation” Variance increases with mean
5
Which genes are differentially expressed between ovarian cancer and normal ovaries? 6 normal ovaries 38 ovarian cancers o 3 mucinous o 5 endometriod o 30 serous
6
Statistical techniques ranked t-statistics (unequal variance) quantile-quantile plots against normal distribution Westfall and Young permutation test http://stat-www.berkeley.edu/users/terry/zarray/Html/ S. Dudoit, Y.H. Yang, M. J. Callow and T.P.Speed. Statistical methods for identifying differentially expressed genes in replicated cDNA microarray experiments. August 2000Statistical methods for identifying differentially expressed genes in replicated cDNA microarray experiments. Ratios of Cancer/Normal.
7
t statistic The tstat gets more extreme as Difference in means The standard deviation of each of the two samples The size of the samples 0+ve -ve tstats ranked
8
Quantile-Quantile Plot R library(sma) or R library(base)
9
Westfall and Young Permutation tpWY program: http://www.cbil.upenn.edu/tpWY / 6 normal ovaries, 38 ovarian cancers Randomise labels (OvCa, N) Compute tstats 100,000 iterations Unadjusted p value: Proportion of iterations where p value adjusted for multiple testing
10
How many genes were “statistically” significant? Ovarian Cancer Normal (Candidates for antibody therapy?) 110 candidates (adjusted p<0.01) 181 candidates (adjusted p <0.05) Ovarian CancerNormal (Candidates for tumor suppressor genes?) 7 candidates (adjusted p<0.01) 15 candidates (adjusted p<0.05)
11
High in cancer Excel
12
Low in cancer Excel How can we deal with (a) Biological variation? (b) More than one cause for cancer?
13
Which genes are differentially expressed between non-invasive and invasive ovarian cancer? No. samples. Non-invasiveInvasive Mucinous54 Endometriod17 Serous233 Future: Model all variables together Now: ranked t-stats, qqplots
14
Assume equal variance for t-stats? S 2 non-invasive (n=5) S 2 invasive (n=4) Theoretical quantiles (F distribution) Ratio variances eg.mucinous cancer
15
What to do when n=2? Assume equal variance? Error model?
16
Limitations of Westfall & Young permutation method No. samples.No. Permut. Non-invasiveInvasive Mucinous54126 Endometriod17--- Serous233595 Not enough power when small sample sizes?
17
Mucinous: non-invasive versus invasive R library(base)
18
Which genes relate to prognosis of patients with prostate cancer? Methods: R survival package & SAS 72 patients with prostate cancer Treatment: Radical prostatectomy 17 relapsed: PSA rise >0.4ng/ml
19
Baseline hazard: (Independent of gene expression or PSA) Exponential: (Involves Gene & PSA Independent of Time) Cox Proportional Hazards Model
20
A B relapsed
21
B Survival Curves: Gene +PSA model High ( >= 25 th percentile) Low (< 25 th percentile). S(t) Time(disease free months)
22
Probe setHazards Ratiounadjusted p value A 0.26 (95% CI: 0.12 to 0.54)0.000351 B0.32 (95% CI : 0.16 to 0.67)0.002151 * False discovery rate for top 50 candidates is 20% (SAM) Hazard Ratio: 75 th /25 th percentile
23
Summary (1)Which genes are “up” or “down” in different conditions? - ranked t-statistics - qq plots (normal distribution) - Westfall & Young permutations (multiple testing) (2) Which genes relate to the survival of the patient? - Cox proportional hazards - SAM multiple testing
24
Acknowledgements Garvan –Sue Henshall, Rob Sutherland, Patricia Vanden Bergh EOS –Jordan Hiller, Daniel Afar, Kurt Gish, David Mack Royal Hospital for Women –Nigel Hacker ANU/John Curtin –John Maindonald –Yvonne Pittelkow Walter and Elisa Hall Institute –Terry Speed, Natalie Thorne University of Queensland –Jessica Marr
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.