Brad Windle, Ph.D. 628-1956 Unsupervised Learning and Microarrays Web Site: Link to Courses and.

Slides:



Advertisements
Similar presentations
Yinyin Yuan and Chang-Tsun Li Computer Science Department
Advertisements

Basic Gene Expression Data Analysis--Clustering
Original Figures for "Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression Monitoring"
Microarray Data Analysis (Lecture for CS397-CXZ Algorithms in Bioinformatics) March 19, 2004 ChengXiang Zhai Department of Computer Science University.
Supervised and unsupervised analysis of gene expression data Bing Zhang Department of Biomedical Informatics Vanderbilt University
Instance-based Classification Examine the training samples each time a new query instance is given. The relationship between the new query instance and.
Yan Guo Assistant Professor Department of Cancer Biology Vanderbilt University USA.
Cluster analysis for microarray data Anja von Heydebreck.
Gene 210 Cancer Genomics April 29, Key events in investigating the cancer genome M R Stratton Science 2011;331:
UNSUPERVISED ANALYSIS GOAL A: FIND GROUPS OF GENES THAT HAVE CORRELATED EXPRESSION PROFILES. THESE GENES ARE BELIEVED TO BELONG TO THE SAME BIOLOGICAL.
The Broad Institute of MIT and Harvard Clustering.
GENIE – GEne Network Inference with Ensemble of trees Van Anh Huynh-Thu Department of Electrical Engineering and Computer Science, Systems and Modeling,
Microarrays Dr Peter Smooker,
Microarray Data Preprocessing and Clustering Analysis
Bio277 Lab 2: Clustering and Classification of Microarray Data Jess Mar Department of Biostatistics Quackenbush Lab DFCI
Part II: Discriminative Margin Clustering Joint work with: Rob Tibshirani, Dept of Statistics Patrick O. Brown, School of Medicine Stanford University.
Alizadeh et. al. (2000) Stephen Ayers 12/2/01. Clustering “Clustering is finding a natural grouping in a set of data, so that samples within a cluster.
Introduction to Hierarchical Clustering Analysis Pengyu Hong 09/16/2005.
Introduction to Bioinformatics - Tutorial no. 12
 Goal A: Find groups of genes that have correlated expression profiles. These genes are believed to belong to the same biological process and/or are co-regulated.
Genomic signatures to guide the use of chemotherapeutics Authors: Anil Potti et. al Presenter: Jong Cheol Jeong.
Cluster Analysis Hierarchical and k-means. Expression data Expression data are typically analyzed in matrix form with each row representing a gene and.
Computational learning of stem cell fates Martina Koeva 09/10/07.
Analyzing Metabolomic Datasets Jack Liu Statistical Science, RTP, GSK
Paola CASTAGNOLI Maria FOTI Microarrays. Applicazioni nella genomica funzionale e nel genotyping DIPARTIMENTO DI BIOTECNOLOGIE E BIOSCIENZE.
Microarray Gene Expression Data Analysis A.Venkatesh CBBL Functional Genomics Chapter: 07.
Gene expression profiling identifies molecular subtypes of gliomas
Analysis and Management of Microarray Data Dr G. P. S. Raghava.
Functional genomics + Data mining BCH364C/391L Systems Biology / Bioinformatics – Spring 2015 Edward Marcotte, Univ of Texas at Austin.
Clustering of DNA Microarray Data Michael Slifker CIS 526.
BioQUEST / SCALE-IT Module From Omics Data to Knowledge Case 1: Microarrays Namyong Lee Minnesota State University, Mankato Matthew Macauley Clemson University.
Bioinformatics Brad Windle Ph# Web Site:
Building and Running caGrid Workflows in Taverna 1 Computation Institute, University of Chicago and Argonne National Laboratory, Chicago, IL, USA 2 Mathematics.
Selection of Patient Samples and Genes for Disease Prognosis Limsoon Wong Institute for Infocomm Research Joint work with Jinyan Li & Huiqing Liu.
Microarrays.
Microarrays and Their Uses Brad Windle, Ph.D
Biology-Driven Clustering of Microarray Data Applications to the NCI60 Data Set K.R. Coombes, K.A. Baggerly, D.N. Stivers, J. Wang, D. Gold, H.G. Sung,
Microarray data analysis David A. McClellan, Ph.D. Introduction to Bioinformatics Brigham Young University Dept. Integrative Biology.
Microarrays and Gene Expression Analysis. 2 Gene Expression Data Microarray experiments Applications Data analysis Gene Expression Databases.
1 FINAL PROJECT- Key dates –last day to decided on a project * 11-10/1- Presenting a proposed project in small groups A very short presentation (Max.
Quantitative analysis of 2D gels Generalities. Applications Mutant / wild type Physiological conditions Tissue specific expression Disease / normal state.
Evolutionary Algorithms for Finding Optimal Gene Sets in Micro array Prediction. J. M. Deutsch Presented by: Shruti Sharma.
Gene Expression Analysis. 2 DNA Microarray First introduced in 1987 A microarray is a tool for analyzing gene expression in genomic scale. The microarray.
An Overview of Clustering Methods Michael D. Kane, Ph.D.
Bioinformatics MEDC601 Lecture by Brad Windle Ph# Office: Massey Cancer Center, Goodwin Labs Room 319 Web site for lecture:
Whole Genome Approaches to Cancer 1. What other tumor is a given rare tumor most like? 2. Is tumor X likely to respond to drug Y?
Course Work Project Project title “Data Analysis Methods for Microarray Based Gene Expression Analysis” Sushil Kumar Singh (batch ) IBAB, Bangalore.
Nuria Lopez-Bigas Methods and tools in functional genomics (microarrays) BCO17.
MLL translocations specify a distinct gene expression profile that distinguishes a unique leukemia Armstrong et al, Nature Genetics 30, (2002)
CZ5225: Modeling and Simulation in Biology Lecture 3: Clustering Analysis for Microarray Data I Prof. Chen Yu Zong Tel:
Introduction to Microarrays Kellie J. Archer, Ph.D. Assistant Professor Department of Biostatistics
Analyzing Expression Data: Clustering and Stats Chapter 16.
Regulating The Cell Cycle. Warm Up – The Cell Cycle The cell spends 80% of the time in _______________ and 20% of the time in ________________ What are.
Prof. Yechiam Yemini (YY) Computer Science Department Columbia University (c)Copyrights; Yechiam Yemini; Lecture 2: Introduction to Paradigms 2.3.
Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression Monitoring T.R. Golub et al., Science 286, 531 (1999)
Pan-cancer analysis of prognostic genes Jordan Anaya Omnes Res, In this study I have used publicly available clinical and.
Gene Expression Analysis Gabor T. Marth Department of Biology, Boston College BI420 – Introduction to Bioinformatics.
Case Study: Characterizing Diseased States from Expression/Regulation Data Tuck et al., BMC Bioinformatics, 2006.
C LUSTERING José Miguel Caravalho. CLUSTER ANALYSIS OR CLUSTERING IS THE TASK OF ASSIGNING A SET OF OBJECTS INTO GROUPS ( CALLED CLUSTERS ) SO THAT THE.
 Cancer  Compound perturbations  Gene perturbations  Tumor development  Cancer metastasis  Cancer treatments Altered Caspase-8 Expression.
Gene Expression Profiling Brad Windle, Ph.D
CZ5211 Topics in Computational Biology Lecture 3: Clustering Analysis for Microarray Data I Prof. Chen Yu Zong Tel:
Gene expression.
Gene Expression Analysis and Proteins
Multivariate Statistical Methods
Class Prediction Based on Gene Expression Data Issues in the Design and Analysis of Microarray Experiments Michael D. Radmacher, Ph.D. Biometric Research.
Altered Caspase-8 Expression
Drug sensitivity predictions from CCLE data.
Global analysis of the chemical–genetic interaction map.
Presentation transcript:

Brad Windle, Ph.D Unsupervised Learning and Microarrays Web Site: Link to Courses and then lecture for this class

Gene Expression Profiling Unsupervised Learning Cluster Analysis and Applications Good review of microarray data analysis is Computational analysis of microarray data. Quackenbush J. Nat Rev Genet 2001 Jun;2(6):

Reductionism versus Systems Approach Why generate global analyses? as opposed to picking a gene/protein and hoping you get lucky and it has great significance to the big picture or to mankind’s health.

Normalizing Data Northern blot For normalizing samples, you would divide experimental values by the mean of the values thought to be constant through the samples

Sample values are typically normalized by dividing by the mean of the reference values or mean of all values

What about normalizing gene values across all the samples? Rationale for normalizing samples does not apply to genes One strategy is to subtract the mean (mean centering).

Log transformation //

Gene to Gene Variability

Cluster Analysis Goal - puts items (genes) together in clusters based on similarity of expression across various conditions, either similarity of absolute expression levels or overall similarity in pattern

Pearson

Hierarchical Clustering

Divisive Agglomerative (Aggregative) Clustering Methods

Cluster Linkage Methods Nearest Neighbor or Single Linkage Furthest Neighbor or Complete Linkage Average Neighbors or Average Linkage

X Y Z

1 2 3 K-Means Clustering and it’s relative Self-Organizing Maps (SOM)

Ranking Order Clustering

Cluster Playground 3

Applications of Gene Expression Profiling and Cluster Analysis Tissue or Tumor Classification Gene Classification Drug Classification Drug Target Identification

B-Cell Lymphoma NATURE 403, , 2000 Indistinguishable by histology Yet half responded well to therapy and half did not Where there differences in gene expression that correlate with drug response? Gene expression profiles showed half the lymphomas were of GC B-Cell lineage and the other of Activated B-Cell lineage A subset of genes predicts therapeutic outcome

M1 M2 M3 M4 M5 M6 M7 M8 M9 M10 M11 M12 M13 M14 M15 M16 M17 M18 D1D2D3D4D5D6 D7D8D9D100D11D12 D13D14D15D16D17D18 Gene Expression Profiling of Yeast Mutants and Drugs Cell 102, 109–126, 2000 Mutants Drugs M4 D17 Erg2Dyclonine Human sigma receptor

Validation of cdc28 Kinase Target Inhibition SCIENCE 281, , 1998 cdc28 - D1D2 } Cdc28-regulated genes } Phosphate metabolism genes Nucleotide analogs that block cdc28p D1 and D2 Pho85

Drug Cells A B C D E COMPARE Clustering Drugs Based on Cell Line Sensitivities Nature Genetics 24: , 2000

T1 T2 A7 T2 A7 T1

Profiling

Clustering NCI 60 Cancer Cell Lines Nature Genetics 24: Genes 9 Types of Tissues/Tumors Breast CNS Colon Leukemia Lung Melanoma Ovarian Prostate Renal

Filtering Data Filter out data with the program Cluster, based on SD cuts