Brad Windle, Ph.D Unsupervised Learning and Microarrays Web Site: Link to Courses and then lecture for this class
Gene Expression Profiling Unsupervised Learning Cluster Analysis and Applications Good review of microarray data analysis is Computational analysis of microarray data. Quackenbush J. Nat Rev Genet 2001 Jun;2(6):
Reductionism versus Systems Approach Why generate global analyses? as opposed to picking a gene/protein and hoping you get lucky and it has great significance to the big picture or to mankind’s health.
Normalizing Data Northern blot For normalizing samples, you would divide experimental values by the mean of the values thought to be constant through the samples
Sample values are typically normalized by dividing by the mean of the reference values or mean of all values
What about normalizing gene values across all the samples? Rationale for normalizing samples does not apply to genes One strategy is to subtract the mean (mean centering).
Log transformation //
Gene to Gene Variability
Cluster Analysis Goal - puts items (genes) together in clusters based on similarity of expression across various conditions, either similarity of absolute expression levels or overall similarity in pattern
Pearson
Hierarchical Clustering
Divisive Agglomerative (Aggregative) Clustering Methods
Cluster Linkage Methods Nearest Neighbor or Single Linkage Furthest Neighbor or Complete Linkage Average Neighbors or Average Linkage
X Y Z
1 2 3 K-Means Clustering and it’s relative Self-Organizing Maps (SOM)
Ranking Order Clustering
Cluster Playground 3
Applications of Gene Expression Profiling and Cluster Analysis Tissue or Tumor Classification Gene Classification Drug Classification Drug Target Identification
B-Cell Lymphoma NATURE 403, , 2000 Indistinguishable by histology Yet half responded well to therapy and half did not Where there differences in gene expression that correlate with drug response? Gene expression profiles showed half the lymphomas were of GC B-Cell lineage and the other of Activated B-Cell lineage A subset of genes predicts therapeutic outcome
M1 M2 M3 M4 M5 M6 M7 M8 M9 M10 M11 M12 M13 M14 M15 M16 M17 M18 D1D2D3D4D5D6 D7D8D9D100D11D12 D13D14D15D16D17D18 Gene Expression Profiling of Yeast Mutants and Drugs Cell 102, 109–126, 2000 Mutants Drugs M4 D17 Erg2Dyclonine Human sigma receptor
Validation of cdc28 Kinase Target Inhibition SCIENCE 281, , 1998 cdc28 - D1D2 } Cdc28-regulated genes } Phosphate metabolism genes Nucleotide analogs that block cdc28p D1 and D2 Pho85
Drug Cells A B C D E COMPARE Clustering Drugs Based on Cell Line Sensitivities Nature Genetics 24: , 2000
T1 T2 A7 T2 A7 T1
Profiling
Clustering NCI 60 Cancer Cell Lines Nature Genetics 24: Genes 9 Types of Tissues/Tumors Breast CNS Colon Leukemia Lung Melanoma Ovarian Prostate Renal
Filtering Data Filter out data with the program Cluster, based on SD cuts