Cancer Genomics and Class Discovery
2/20/17 – Chuck Perou (Department of Genetics) – Introduction to Genomics and Big Data, and Cancer Subtype Class Discovery using gene expression data
- Katie Hoadley (Department of Genetics) – Introduction to TCGA Data Portal
2/27/17 – Katie Hoadley (Department of Genetics) – Multi-platform Data Analysis and Across Technology Data Integration
3/6/17 – Steve Marron (Department of Statistics and Operations Research) – Methods for Addressing Data Heterogeneity and Integration
3/13/17 – No Class
3/20/17 – Andrew Nobel (Department of Statistics and Operations Research) – Exploratory Analysis of Genomic Data
3/27/17 – Joel Parker (Department of Genetics) – Methods and Challenges in the Analysis of NextGen Sequence Data for DNAseq and RNAseq
4/3/17 – No Class
4/10/17 – In Class Student Presentations (70%) and 2-3 page Written Report (30%) covering a unique analysis performed on TCGA Cancer Genomics Data
2/20/17 – Chuck Perou (Department of Genetics) – Introduction to Genomics and Big Data, and Cancer Subtype Class Discovery using gene expression data
- Katie Hoadley (Department of Genetics) – Introduction to TCGA Data Portal
Reading list
Eisen et al., PNAS 1998 (PMID: )
Perou et al., NATURE 2000 (PMID: )
Parker et al., JCO 2009 (PMID: )
TCGA Breast Cancer Genomic Data Sites
TCGA data (http://cancergenome.nih.gov/)
10,000 individual tumors
33 diverse tumor types
Clinical and Pathology data
Molecular assays performed:
  DNA exomes (mutations)
  mRNA-seq (gene expression)
  microRNA-seq (microRNAs)
  DNA methylation arrays
  AFFY SNP arrays (genotypes and DNA copy number)
  RPPA protein data on ~60%
H&E images of each tumor
Homework
Reading list
Eisen et al., PNAS 1998 (PMID: 9843981)
Perou et al., NATURE 2000 (PMID: )
Parker et al., JCO 2009 (PMID: )
TCGA Breast Cancer Genomic Data Sites
(all open access TCGA Breast Cancer Data)
(all 20,000 gene expression values)
(~2000 gene “classification list”)
Take the “classification list” data and make a hierarchical cluster. I recommend using “Cluster 3.0”: under “adjust data”, select log2 transform and median center the genes. Next, go to the “hierarchical” tab, select cluster for both genes and arrays, set the similarity metric to “centered”, and set the cluster method to “centroid linkage”. The data can then be viewed and explored using “Java Treeview”. Look for and find the “proliferation cluster”, which was shown earlier in this presentation.
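For anyone who wants to see what the Cluster 3.0 settings above are doing, here is a minimal pure-Python sketch of the same three steps: log2 transform, median centering of each gene, and agglomerative clustering with a centered (Pearson) correlation metric and centroid linkage. It is an illustration under assumptions, not a reimplementation of Cluster 3.0: the function names and the toy matrix are made up for this example, missing values are not handled, and Cluster 3.0 may differ in details such as tie breaking.

```python
import math
from statistics import median


def log2_median_center(rows):
    """Log2-transform each gene (row), then subtract that gene's median."""
    centered = []
    for row in rows:
        logged = [math.log2(v) for v in row]  # assumes strictly positive values
        m = median(logged)
        centered.append([v - m for v in logged])
    return centered


def centered_correlation(x, y):
    """Pearson correlation: each vector has its own mean subtracted first."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    dx = [a - mx for a in x]
    dy = [b - my for b in y]
    denom = math.sqrt(sum(a * a for a in dx) * sum(b * b for b in dy))
    return sum(a * b for a, b in zip(dx, dy)) / denom if denom else 0.0


def centroid_linkage(vectors):
    """Repeatedly merge the two clusters whose centroids are closest,
    using distance = 1 - centered correlation.
    Returns a list of (members_a, members_b, distance), one per merge."""
    clusters = {i: ([i], list(v)) for i, v in enumerate(vectors)}
    merges = []
    next_id = len(vectors)
    while len(clusters) > 1:
        ids = sorted(clusters)
        best = None
        for pos, a in enumerate(ids):
            for b in ids[pos + 1:]:
                d = 1.0 - centered_correlation(clusters[a][1], clusters[b][1])
                if best is None or d < best[0]:
                    best = (d, a, b)
        d, a, b = best
        members_a, cen_a = clusters.pop(a)
        members_b, cen_b = clusters.pop(b)
        na, nb = len(members_a), len(members_b)
        # The size-weighted mean of two centroids is the mean of all members.
        centroid = [(na * p + nb * q) / (na + nb) for p, q in zip(cen_a, cen_b)]
        clusters[next_id] = (members_a + members_b, centroid)
        merges.append((sorted(members_a), sorted(members_b), d))
        next_id += 1
    return merges


# Toy matrix (made-up values): rows are genes, columns are tumors.
expression = [
    [1.0, 2.0, 4.0, 8.0],    # gene 0
    [2.0, 4.0, 8.0, 16.0],   # gene 1, same pattern as gene 0
    [8.0, 4.0, 2.0, 1.0],    # gene 2, opposite pattern
    [16.0, 8.0, 4.0, 2.0],   # gene 3, same pattern as gene 2
]
centered = log2_median_center(expression)
gene_merges = centroid_linkage(centered)                 # cluster genes
array_merges = centroid_linkage(                         # cluster arrays
    [list(col) for col in zip(*centered)]
)
for members_a, members_b, dist in gene_merges:
    print(members_a, members_b, round(dist, 3))
```

In the toy data, genes 0 and 1 join first, genes 2 and 3 join next, and the two anti-correlated pairs merge last at a distance of 2. On the real ~2000-gene list this naive search over all cluster pairs would be slow, which is one reason to use Cluster 3.0 and Java Treeview for the actual homework.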