Alizadeh et. al. (2000) Stephen Ayers 12/2/01. Clustering “Clustering is finding a natural grouping in a set of data, so that samples within a cluster.

Slides:



Advertisements
Similar presentations
Analysis of Microarray Genomic Data of Breast Cancer Patients Hui Liu, MS candidate Department of statistics Prof. Eric Suess, faculty mentor Department.
Advertisements

Basic Gene Expression Data Analysis--Clustering
Information Retrieval Lecture 7 Introduction to Information Retrieval (Manning et al. 2007) Chapter 17 For the MSc Computer Science Programme Dell Zhang.
Wilson WH et al. Proc ASH 2012;Abstract 686.
Cluster analysis for microarray data Anja von Heydebreck.
Principal Component Analysis (PCA) for Clustering Gene Expression Data K. Y. Yeung and W. L. Ruzzo.
MOLECULAR GENETICS OF B CELL LYMPHOMAS: AN UPDATE Michel Trudel, MD, FRCPC Shaikh Khalifa Medical Center.
Unsupervised learning: Clustering Ata Kaban The University of Birmingham
Copyright, ©, 2002, John Wiley & Sons, Inc.,Karp/CELL & MOLECULAR BIOLOGY 3E Molecular Portraits of Cancer Microarrays and Cell Biology.
Logical Analysis of Diffuse Large B Cell Lymphoma Gabriela Alexe 1, Sorin Alexe 1, David Axelrod 2, Peter Hammer 1, and David Weissmann 3 of RUTCOR(1)
Image Segmentation Chapter 14, David A. Forsyth and Jean Ponce, “Computer Vision: A Modern Approach”.
Microarray Data Preprocessing and Clustering Analysis
Part II: Discriminative Margin Clustering Joint work with: Rob Tibshirani, Dept of Statistics Patrick O. Brown, School of Medicine Stanford University.
PCluster: Probabilistic Agglomerative Clustering of Gene Expression Profiles Nir Friedman Presenting: Inbar Matarasso 09/05/2005 The School of Computer.
Introduction to Hierarchical Clustering Analysis Pengyu Hong 09/16/2005.
Microarray Data Analysis
Introduction to Bioinformatics - Tutorial no. 12
Cluster Analysis for Gene Expression Data Ka Yee Yeung Center for Expression Arrays Department of Microbiology.
Microarray analysis 2 Golan Yona. 2) Analysis of co-expression Search for similarly expressed genes experiment1 experiment2 experiment3 ……….. Gene i:
DNA Microarrays Examining Gene Expression. Prof. GrossBiology 4 DNA MicroArrays DNA MicroArrays use hybridization technology to examine gene expression.
Supervised gene expression data analysis using SVMs and MLPs Giorgio Valentini
Georg Gerber Lecture #6, 2/6/02
Principal Component Analysis (PCA) for Clustering Gene Expression Data K. Y. Yeung and W. L. Ruzzo.
Chapter 7 Essential Concepts in Molecular Pathology Companion site for Molecular Pathology Author: William B. Coleman and Gregory J. Tsongalis.
Functional genomics + Data mining BCH364C/391L Systems Biology / Bioinformatics – Spring 2015 Edward Marcotte, Univ of Texas at Austin.
Clustering of DNA Microarray Data Michael Slifker CIS 526.
Arthur Edwards Broad Summer Research Program in Genomics Cancer Program 08/06/07 Genome-wide miRNA Expression Analysis in Lymphoma miRNAs Lymphoma.
Background Diffuse large B-cell lymphoma (DLBCL) is the most commonly occurring lymphoma in the Western world. It’s account for about one-third of all.
Diagnostically and Prognostically Significant Genetic Alterations in Diffuse Large B-Cell Lymphoma Friederike Kreisel, MD Department of Pathology and Immunology.
Building and Running caGrid Workflows in Taverna 1 Computation Institute, University of Chicago and Argonne National Laboratory, Chicago, IL, USA 2 Mathematics.
Scenario 6 Distinguishing different types of leukemia to target treatment.
Biology-Driven Clustering of Microarray Data Applications to the NCI60 Data Set K.R. Coombes, K.A. Baggerly, D.N. Stivers, J. Wang, D. Gold, H.G. Sung,
Taylor Rassmann.  Grouping data objects into X tree of clusters and uses distance matrices as clustering criteria  Two Hierarchical Clustering Categories:
Quantitative analysis of 2D gels Generalities. Applications Mutant / wild type Physiological conditions Tissue specific expression Disease / normal state.
Evolutionary Algorithms for Finding Optimal Gene Sets in Micro array Prediction. J. M. Deutsch Presented by: Shruti Sharma.
Statistical Analysis of DNA Microarray. An Example of HDLSS in Genetics.
Whole Genome Approaches to Cancer 1. What other tumor is a given rare tumor most like? 2. Is tumor X likely to respond to drug Y?
1 Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling Author: Ash A. Alizadeh, Michael B. Eisen et al. Source: Nature.
Examples of Classifying Expression Data / 7.90 Computational Functional Genomics Spring 2002.
Hierarchical Clustering Produces a set of nested clusters organized as a hierarchical tree Can be visualized as a dendrogram – A tree like diagram that.
Brad Windle, Ph.D Unsupervised Learning and Microarrays Web Site: Link to Courses and.
Prof. Yechiam Yemini (YY) Computer Science Department Columbia University (c)Copyrights; Yechiam Yemini; Lecture 2: Introduction to Paradigms 2.3.
Hierarchical Clustering
Pan-cancer analysis of prognostic genes Jordan Anaya Omnes Res, In this study I have used publicly available clinical and.
Hierarchical clustering approaches for high-throughput data Colin Dewey BMI/CS 576 Fall 2015.
Leukemia Cell Study Strode Note: Meaningless title.
C LUSTERING José Miguel Caravalho. CLUSTER ANALYSIS OR CLUSTERING IS THE TASK OF ASSIGNING A SET OF OBJECTS INTO GROUPS ( CALLED CLUSTERS ) SO THAT THE.
Functional genomics + Data mining
Anatomic sites(lymph node)
Supplemental Figure 1 Supplemental Figure 1: Electron Force Microscopy image of a DBS filter incubated with human blood.
Discussion and Conclusions Acknowledgements and References
Hallett, et al., - Supplementary Figure 1
Image from Gene-Chips (Micorrrays) Statistics for microarray analysis (SMA)
John Nicholas Owen Sarah Smith
Focus on lymphomas Cancer Cell
Gene expression profiles as measured by the NanoString nCounter System in 54 patients with metastatic gastric cancer (GC). Gene expression profiles as.
Cluster Analysis in Bioinformatics
Volume 72, Issue 7, Pages (October 2007)
Class Prediction Based on Gene Expression Data Issues in the Design and Analysis of Microarray Experiments Michael D. Radmacher, Ph.D. Biometric Research.
(A) Hierarchical clustering was performed to identify groups of patients with similar RNASeq expression of 20 genes associated with reduced survivability.
QuantiGene Plex Represents a Promising Diagnostic Tool for Cell-of-Origin Subtyping of Diffuse Large B-Cell Lymphoma  John S. Hall, Suzanne Usher, Richard.
Hierarchical Clustering
Patient organoids respond more diverse to drugs and with lower therapeutic potential than 2D cultured patient cells Patient organoids respond more diverse.
Pharmacology Division, National Cancer Center Research Institute
Signatures of the Immune Response
Gene Expression Profiles of Cutaneous B Cell Lymphoma
M-Wnt and E-Wnt cells cluster tightly with claudin-low and basal-like breast tumors, respectively, by microarray analysis. M-Wnt and E-Wnt cells cluster.
IL1R8 is downmodulated in human lymphoma cell lines.
Coexpression of other immune genes with ImSig core signatures.
Pancreatic adenocarcinoma, chronic pancreatitis, and normal pancreas samples can be distinguished on the basis of gene expression profiling. Pancreatic.
Presentation transcript:

Alizadeh et. al. (2000) Stephen Ayers 12/2/01

Clustering “Clustering is finding a natural grouping in a set of data, so that samples within a cluster will be more similar to each other than they are to samples in other clusters.” Finding groups of correlated genes “signature groups” Genes without well established relationships Extract features of groups

Hierarchical Clustering Tiers of points from a bottom layer of 1 point in each of n clusters to top level of n points, all in one cluster Usually represented in dendrogram

Divisive Top-down Start with all samples and successively split into separate clusters

Agglomerative Bottom-up approach Less computationally intensive Start with n singletons and successively merge clusters –Place all values in separate clusters –Merge most similar clusters into higher clusters –Repeat until all clusters have been merged

Average-Linkage Method Available > 1.Compute similarity matrix 2.Scan matrix to find most highest similarity Uses form of the correlation coefficient 3.A node is created between these values 4.Values are replaced by node

Diffuse Large B-cell Lymphoma Most common subtype of non-Hodgkin’s Lymphoma 25,000 cases/year 40% of patients respond well Possible undetected heterogeneity Found 2 classes using clustering (Eisen 1998): Germinal Center B-like and Activated B-like

Lymphochip 17,856 cDNA clones total 12,069 germinal center B-cell library 2,338 lymphomic cancer genes 3,186 genes important to lymphocyte or cancer biology ¼ of genes = duplicates

Expression Analysis DLBCL, Follicular Lymphoma, Chronic Lympohcytic Leukemia Lymphocyte subpopulations with a range of conditions -normal human tonsils, lymph nodes -lymphoma, leukemia cell lines

Figure 1

Figure 2:

Figure 3:

Figure 4: GC Activated

Figure 5

Conclusions More categories likely Changes in treatment Possible drug targets