Clustering by Passing Messages Between Data Points Brendan J. Frey and Delbert Dueck Science, 2007.

Slides:



Advertisements
Similar presentations
Clustering. How are we doing on the pass sequence? Pretty good! We can now automatically learn the features needed to track both people But, it sucks.
Advertisements

CS6800 Advanced Theory of Computation
Principal Component Analysis Based on L1-Norm Maximization Nojun Kwak IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
Cluster Analysis Measuring latent groups. Cluster Analysis - Discussion Definition Vocabulary Simple Procedure SPSS example ICPSR and hands on.
Gossip Algorithms and Implementing a Cluster/Grid Information service MsSys Course Amar Lior and Barak Amnon.
Clustering the Temporal Sequences of 3D Protein Structure Mayumi Kamada +*, Sachi Kimura, Mikito Toda ‡, Masami Takata +, Kazuki Joe + + : Graduate School.
Ab initio gene prediction Genome 559, Winter 2011.
Introduction to Bioinformatics
Context-aware Query Suggestion by Mining Click-through and Session Data Authors: H. Cao et.al KDD 08 Presented by Shize Su 1.
Content Based Image Clustering and Image Retrieval Using Multiple Instance Learning Using Multiple Instance Learning Xin Chen Advisor: Chengcui Zhang Department.
Clustered alignments of gene- expression time series data Adam A. Smith, Aaron Vollrath, Cristopher A. Bradfield and Mark Craven Department of Biosatatistics.
Non-metric affinity propagation for unsupervised image categorization Delbert Dueck and Brendan J. Frey ICCV 2007.
© University of Minnesota Data Mining for the Discovery of Ocean Climate Indices 1 CSci 8980: Data Mining (Fall 2002) Vipin Kumar Army High Performance.
CPSC 668Set 3: Leader Election in Rings1 CPSC 668 Distributed Algorithms and Systems Spring 2008 Prof. Jennifer Welch.
HCS Clustering Algorithm
Segmentation Divide the image into segments. Each segment:
Clustering (Part II) 10/07/09. Outline Affinity propagation Quality evaluation.
Segmentation Graph-Theoretic Clustering.
Clustering (Part II) 11/26/07. Spectral Clustering.
Abstract Extracting a matte by previous approaches require the input image to be pre-segmented into three regions (trimap). This pre-segmentation based.
A Distributed Algorithm for Minimum-Weight Spanning Trees by R. G. Gallager, P.A. Humblet, and P. M. Spira ACM, Transactions on Programming Language and.
Optimized Numerical Mapping Scheme for Filter-Based Exon Location in DNA Using a Quasi-Newton Algorithm P. Ramachandran, W.-S. Lu, and A. Antoniou Department.
Introduction to Data Mining Engineering Group in ACL.
Approximation algorithms for large-scale kernel methods Taher Dameh School of Computing Science Simon Fraser University March 29 th, 2010.
Models of Influence in Online Social Networks
Graph-based consensus clustering for class discovery from gene expression data Zhiwen Yum, Hau-San Wong and Hongqiang Wang Bioinformatics, 2007.
Social Network Analysis via Factor Graph Model
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS 2007 (TPDS 2007)
Reinforcement Learning (II.) Exercise Solutions Ata Kaban School of Computer Science University of Birmingham.
Professor: S. J. Wang Student : Y. S. Wang
Machine Learning Problems Unsupervised Learning – Clustering – Density estimation – Dimensionality Reduction Supervised Learning – Classification – Regression.
Chapter 14: SEGMENTATION BY CLUSTERING 1. 2 Outline Introduction Human Vision & Gestalt Properties Applications – Background Subtraction – Shot Boundary.
A Clustering Algorithm based on Graph Connectivity Balakrishna Thiagarajan Computer Science and Engineering State University of New York at Buffalo.
© by Yu Hen Hu 1 ECE533 Digital Image Processing Image Segmentation.
Digital Image Processing CCS331 Relationships of Pixel 1.
Image segmentation Prof. Noah Snavely CS1114
COMON NETWORK STRUCTURES BY :Karla Barragán. STAR topology Star networks are one of the most common computer network topologies. In its simplest form,
Scalable Multi-Class Traffic Management in Data Center Backbone Networks Amitabha Ghosh (UtopiaCompression) Sangtae Ha (Princeton) Edward Crabbe (Google)
Data Mining Algorithms for Large-Scale Distributed Systems Presenter: Ran Wolff Joint work with Assaf Schuster 2003.
Unsupervised Learning. Supervised learning vs. unsupervised learning.
1 An Arc-Path Model for OSPF Weight Setting Problem Dr.Jeffery Kennington Anusha Madhavan.
Clustering by soft-constraint affinity propagation: applications to gene- expression data Michele Leone, Sumedha and Martin Weight Bioinformatics, 2007.
Computer Science 1 Using Clustering Information for Sensor Network Localization Haowen Chan, Mark Luk, and Adrian Perrig Carnegie Mellon University
Nonlinear differential equation model for quantification of transcriptional regulation applied to microarray data of Saccharomyces cerevisiae Vu, T. T.,
CSCE 668 DISTRIBUTED ALGORITHMS AND SYSTEMS Spring 2014 Prof. Jennifer Welch CSCE 668 Set 3: Leader Election in Rings 1.
© Tan,Steinbach, Kumar Introduction to Data Mining 4/18/ Data Mining: Cluster Analysis This lecture node is modified based on Lecture Notes for Chapter.
Distance-Based Approaches to Inferring Phylogenetic Trees BMI/CS 576 Colin Dewey Fall 2010.
On Mechanism in Clustering Speaker: Caiming Zhong
Markov Networks: Theory and Applications Ying Wu Electrical Engineering and Computer Science Northwestern University Evanston, IL 60208
Motion Segmentation at Any Speed Shrinivas J. Pundlik Department of Electrical and Computer Engineering, Clemson University, Clemson, SC.
Computational Vision CSCI 363, Fall 2012 Lecture 17 Stereopsis II
Analysis of Massive Data Sets Prof. dr. sc. Siniša Srbljić Doc. dr. sc. Dejan Škvorc Doc. dr. sc. Ante Đerek Faculty of Electrical Engineering and Computing.
Data-Driven 3D Voxel Patterns for Object Category Recognition Andrew Sharp.
Ontology Engineering and Feature Construction for Predicting Friendship Links in the Live Journal Social Network Author:Vikas Bahirwani 、 Doina Caragea.
Gaussian Mixture Model classification of Multi-Color Fluorescence In Situ Hybridization (M-FISH) Images Amin Fazel 2006 Department of Computer Science.
Hidden Markov Models BMI/CS 576
Clustering Clustering definition: Partition a given set of objects into M groups (clusters) such that the objects of each group are ‘similar’ and ‘different’
Diversified Trajectory Pattern Ranking in Geo-Tagged Social Media
Semi-supervised Affinity Propagation
Segmentation Graph-Theoretic Clustering.
1 Department of Engineering, 2 Department of Mathematics,
Grouping.
1 Department of Engineering, 2 Department of Mathematics,
Jianping Fan Dept of Computer Science UNC-Charlotte
1 Department of Engineering, 2 Department of Mathematics,
© 2003 University of Wisconsin
“Clustering by Passing Messages Between Data Points”
Reporter: Wenkai Cui Institution: Tsinghua University Date:
Clustering.
Presentation transcript:

Clustering by Passing Messages Between Data Points Brendan J. Frey and Delbert Dueck Science, 2007

Outline Introduction Method Description Experiments Conclusion 2

Introduction Clustering: based on a measure of similarity to cluster data. Exemplar: the centers are selected from actual data points. 3

Introduction A common approach: k-centers clustering. It’s sensitive to the initial selection of exemplars. 4

Introduction In k-means algorithm, the number of exemplars need be specified beforehand. How to apply clustering if we don’t know the number of exemplars? 5

Method Description A new approach: affinity propagation. We view each data point as a node in a network and consider all data points as potential exemplars. 6

Similarity and Preference Affinity propagation needs two information – Similarities between data points: – Preferences: Similarity indicates how well the data point k is suited to be the exemplar for data point i. Preference influences the number of clusters. 7

Messages exchanged Affinity propagation recursively transmits real- valued messages along edges of the network until a good set of exemplars and clusters emerges. The messages include: – responsibility – availability Availabilities and responsibilities can be combined to identify exemplars. 8

Responsibility and availability Responsibility : reflects the accumulated evidence for how well-suited point k is to serve as the exemplar for point i. 9 From data point i to candidate exemplar point k, it takes into account other potential exemplars for point i.

Responsibility and availability Availability : reflects the accumulated evidence for how appropriate it would be for point i to choose point k as its exemplar. 10 From candidate exemplar point k to point i, it takes into account the support from other points that point k should be an exemplar.

How to send messages? The availabilities are initialized to 0,, it means each point doesn’t decide which exemplar it belongs to. The responsibilities are updated by: 11 (For the first iteration.) If r is bigger, it means the point k is more well- suited for point i than other exemplars k’.

How to send messages? Self-responsibility : for i = k, it will be 12 preference The similarities with all other exemplars. How appropriate it would be for data point k as an exemplar itself? If, exemplar is more appropriate to belong to other exemplars.

How to send messages? Availabilities are updated by: 13 It’s the sum of responsibilities for supporting points i’ to exemplar k. If a = 0, it means exemplar point k is more well- suited to point i.

How to send messages? If availability is less than 0, it will increase the other points’ responsibility: 14 Availability < 0 Responsibility from data point i to exemplar k increases!

How to send messages? Self-availability : for i = k, it will be 15 How appropriate it would be for data point k as an exemplar itself? Based on the responsibilities from other data points i.

How to identify the cluster? For point i, we would like to find: If k = i, the data point i is an exemplar itself. Otherwise, the data point k is the exemplar of point i. 16

Method Description Each iteration of affinity propagation consisted of: – Updating all responsibilities given the availabilities. – Updating all availabilities given the responsibilities. – Combining responsibilities and availabilities to monitor the exemplar decisions. When does the algorithm terminate? 17

Method Description The procedure may be terminated: – after a fixed number of iterations. – after changes in the messages fall below a threshold. – after the local decisions stay constant for some number of iterations. 18

Method Description For example: 19

Experiments Clustering images of faces. Clustering putative exons to find genes. Identifying a restricted number of Canadian and American cities, in terms of estimated commercial airline travel time. 20

Clustering images of faces Use affinity propagation and k-centers clustering. 900 grayscale images extracted from the Olivetti face database. 21

Clustering images of faces Experimental results: 22

Clustering putative exons to find genes segments of DNA (60 bases long) corresponding to putative exons were mined from the genome of mouse chromosome 1. The measure of similarity between putative exons was based on their proximity in the genome and the degree of coordination of their transcription levels across the 12 tissues. 23

Clustering putative exons to find genes The similarity matrix consisted of 99.73% similarities with values of -∞, corresponding to distant DNA segments that could not possibly be part of the same gene. 24

Clustering putative exons to find genes Experimental results: 25

Clustering putative exons to find genes Experimental results: 26

Identifying the cities Due to headwinds, the transit time was in many cases different depending on the direction of travel. The 36% of the similarities were asymmetric. Further, for 97% of city pairs i and k, there was a third city j such that the triangle inequality was violated because of a long stopover delay. 27

Identifying the cities Experimental results: 28

Conclusion Affinity propagation is the first method to make use of the idea ‘message passing’ to solve the fundamental problem of clustering data. Because of its simplicity and performance, it will prove to be of board value in science and engineering. 29