K ERNEL - BASED W EIGHTED M ULTI - VIEW C LUSTERING Grigorios Tzortzis and Aristidis Likas Department of Computer Science, University of Ioannina, Greece.

Slides:



Advertisements
Similar presentations
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Advertisements

A Robust Super Resolution Method for Images of 3D Scenes Pablo L. Sala Department of Computer Science University of Toronto.
A Unified Framework for Context Assisted Face Clustering
Recognizing Human Actions by Attributes CVPR2011 Jingen Liu, Benjamin Kuipers, Silvio Savarese Dept. of Electrical Engineering and Computer Science University.
C ONVEX M IXTURE M ODELS FOR M ULTI - VIEW C LUSTERING Grigorios Tzortzis and Aristidis Likas Department of Computer Science, University of Ioannina, Greece.
Image Matting and Its Applications Chen-Yu Tseng Advisor: Sheng-Jyh Wang
Watching Unlabeled Video Helps Learn New Human Actions from Very Few Labeled Snapshots Chao-Yeh Chen and Kristen Grauman University of Texas at Austin.
Relational Learning with Gaussian Processes By Wei Chu, Vikas Sindhwani, Zoubin Ghahramani, S.Sathiya Keerthi (Columbia, Chicago, Cambridge, Yahoo!) Presented.
Multiple Criteria for Evaluating Land Cover Classification Algorithms Summary of a paper by R.S. DeFries and Jonathan Cheung-Wai Chan April, 2000 Remote.
Content Based Image Clustering and Image Retrieval Using Multiple Instance Learning Using Multiple Instance Learning Xin Chen Advisor: Chengcui Zhang Department.
A Unified View of Kernel k-means, Spectral Clustering and Graph Cuts Dhillon, Inderjit S., Yuqiang Guan, and Brian Kulis.
ACM Multimedia th Annual Conference, October , 2004
Announcements  Project proposal is due on 03/11  Three seminars this Friday (EB 3105) Dealing with Indefinite Representations in Pattern Recognition.
A Unified View of Kernel k-means, Spectral Clustering and Graph Cuts
Prénom Nom Document Analysis: Data Analysis and Clustering Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
1 Integrating User Feedback Log into Relevance Feedback by Coupled SVM for Content-Based Image Retrieval 9-April, 2005 Steven C. H. Hoi *, Michael R. Lyu.
Presentation in IJCNN 2004 Biased Support Vector Machine for Relevance Feedback in Image Retrieval Hoi, Chu-Hong Steven Department of Computer Science.
Dept. of Computer Science & Engineering, CUHK Pseudo Relevance Feedback with Biased Support Vector Machine in Multimedia Retrieval Steven C.H. Hoi 14-Oct,
Agenda The Subspace Clustering Problem Computer Vision Applications
Jacinto C. Nascimento, Member, IEEE, and Jorge S. Marques
Radial Basis Function Networks
Relaxed Transfer of Different Classes via Spectral Partition Xiaoxiao Shi 1 Wei Fan 2 Qiang Yang 3 Jiangtao Ren 4 1 University of Illinois at Chicago 2.
Dimensionality reduction Usman Roshan CS 675. Supervised dim reduction: Linear discriminant analysis Fisher linear discriminant: –Maximize ratio of difference.
Graph-based consensus clustering for class discovery from gene expression data Zhiwen Yum, Hau-San Wong and Hongqiang Wang Bioinformatics, 2007.
Jinhui Tang †, Shuicheng Yan †, Richang Hong †, Guo-Jun Qi ‡, Tat-Seng Chua † † National University of Singapore ‡ University of Illinois at Urbana-Champaign.
Identifying Computer Graphics Using HSV Model And Statistical Moments Of Characteristic Functions Xiao Cai, Yuewen Wang.
Presented By Wanchen Lu 2/25/2013
Machine Learning1 Machine Learning: Summary Greg Grudic CSCI-4830.
Mean-shift and its application for object tracking
Mining Discriminative Components With Low-Rank and Sparsity Constraints for Face Recognition Qiang Zhang, Baoxin Li Computer Science and Engineering Arizona.
Exploiting Ontologies for Automatic Image Annotation M. Srikanth, J. Varner, M. Bowden, D. Moldovan Language Computer Corporation
Boris Babenko Department of Computer Science and Engineering University of California, San Diego Semi-supervised and Unsupervised Feature Scaling.
General Tensor Discriminant Analysis and Gabor Features for Gait Recognition by D. Tao, X. Li, and J. Maybank, TPAMI 2007 Presented by Iulian Pruteanu.
Machine Learning Problems Unsupervised Learning – Clustering – Density estimation – Dimensionality Reduction Supervised Learning – Classification – Regression.
Machine Learning Using Support Vector Machines (Paper Review) Presented to: Prof. Dr. Mohamed Batouche Prepared By: Asma B. Al-Saleh Amani A. Al-Ajlan.
Data Mining and Machine Learning Lab Unsupervised Feature Selection for Linked Social Media Data Jiliang Tang and Huan Liu Computer Science and Engineering.
LexPageRank: Prestige in Multi- Document Text Summarization Gunes Erkan and Dragomir R. Radev Department of EECS, School of Information University of Michigan.
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
Transductive Regression Piloted by Inter-Manifold Relations.
Low-Rank Kernel Learning with Bregman Matrix Divergences Brian Kulis, Matyas A. Sustik and Inderjit S. Dhillon Journal of Machine Learning Research 10.
Graph-based Text Classification: Learn from Your Neighbors Ralitsa Angelova , Gerhard Weikum : Max Planck Institute for Informatics Stuhlsatzenhausweg.
Andreas Papadopoulos - [DEXA 2015] Clustering Attributed Multi-graphs with Information Ranking 26th International.
A Model for Learning the Semantics of Pictures V. Lavrenko, R. Manmatha, J. Jeon Center for Intelligent Information Retrieval Computer Science Department,
Chapter 11 Statistical Techniques. Data Warehouse and Data Mining Chapter 11 2 Chapter Objectives  Understand when linear regression is an appropriate.
Tell Me What You See and I will Show You Where It Is Jia Xu 1 Alexander G. Schwing 2 Raquel Urtasun 2,3 1 University of Wisconsin-Madison 2 University.
Regularization and Feature Selection in Least-Squares Temporal Difference Learning J. Zico Kolter and Andrew Y. Ng Computer Science Department Stanford.
Learning Spectral Clustering, With Application to Speech Separation F. R. Bach and M. I. Jordan, JMLR 2006.
Probabilistic Latent Query Analysis for Combining Multiple Retrieval Sources Rong Yan Alexander G. Hauptmann School of Computer Science Carnegie Mellon.
Topic Models Presented by Iulian Pruteanu Friday, July 28 th, 2006.
1Ellen L. Walker Category Recognition Associating information extracted from images with categories (classes) of objects Requires prior knowledge about.
Automatic Video Tagging using Content Redundancy Stefan Siersdorfer 1, Jose San Pedro 2, Mark Sanderson 2 1 L3S Research Center, Germany 2 University of.
ACADS-SVMConclusions Introduction CMU-MMAC Unsupervised and weakly-supervised discovery of events in video (and audio) Fernando De la Torre.
About Me Swaroop Butala  MSCS – graduating in Dec 09  Specialization: Systems and Databases  Interests:  Learning new technologies  Application of.
Adaptive Multi-view Clustering via Cross Trace Lasso
G REEDY U NSUPERVISED M ULTIPLE K ERNEL L EARNING Grigorios Tzortzis and Aristidis Likas Department of Computer Science, University of Ioannina, Greece.
Learning Photographic Global Tonal Adjustment with a Database of Input / Output Image Pairs.
Optimal Reverse Prediction: Linli Xu, Martha White and Dale Schuurmans ICML 2009, Best Overall Paper Honorable Mention A Unified Perspective on Supervised,
Ultra-high dimensional feature selection Yun Li
Learning Kernel Classifiers 1. Introduction Summarized by In-Hee Lee.
Iterative K-Means Algorithm Based on Fisher Discriminant UNIVERSITY OF JOENSUU DEPARTMENT OF COMPUTER SCIENCE JOENSUU, FINLAND Mantao Xu to be presented.
哈工大信息检索研究室 HITIR ’ s Update Summary at TAC2008 Extractive Content Selection Using Evolutionary Manifold-ranking and Spectral Clustering Reporter: Ph.d.
SUPERVISED AND UNSUPERVISED LEARNING Presentation by Ege Saygıner CENG 784.
A distributed PSO – SVM hybrid system with feature selection and parameter optimization Cheng-Lung Huang & Jian-Fan Dun Soft Computing 2008.
Motoki Shiga, Ichigaku Takigawa, Hiroshi Mamitsuka
Clustering Clustering definition: Partition a given set of objects into M groups (clusters) such that the objects of each group are ‘similar’ and ‘different’
Bag-of-Visual-Words Based Feature Extraction
Personalized Social Image Recommendation
Graph Based Multi-Modality Learning
Unsupervised Learning II: Soft Clustering with Gaussian Mixture Models
Paper Reading Dalong Du April.08, 2011.
Presentation transcript:

K ERNEL - BASED W EIGHTED M ULTI - VIEW C LUSTERING Grigorios Tzortzis and Aristidis Likas Department of Computer Science, University of Ioannina, Greece

O UTLINE Introduction Feature Space Clustering Kernel-based Weighted Multi-view Clustering Experimental Evaluation Summary 2 I.P.AN Research Group, University of Ioannina

O UTLINE Introduction Feature Space Clustering Kernel-based Weighted Multi-view Clustering Experimental Evaluation Summary 3 I.P.AN Research Group, University of Ioannina

M ULTI - VIEW D ATA Most machine learning approaches assume instances are represented by a single feature space In many real life problems multi-view data arise naturally Different measuring methods – Infrared and visual cameras Different media – Text, video, audio 4 I.P.AN Research Group, University of Ioannina Multi-view data are instances with multiple representations from different feature spaces, e.g. different vector and/or graph spaces

E XAMPLES OF M ULTI - VIEW D ATA Web pages Web page text Anchor text Hyper-links 5 I.P.AN Research Group, University of Ioannina Scientific articles Abstract text Citations graph Such data have raised interest in a novel problem, called multi-view learning Most studies address the semi-supervised setting We will focus on unsupervised clustering of multi-view data Images Color Texture Annotation Text

M ULTI - VIEW C LUSTERING Motivation Views capture different aspects of the data and may contain complementary information A robust partitioning could be derived by simultaneously exploiting all views, that outperforms single view segmentations Simple solution Concatenate the views and apply a classic clustering algorithm Not very effective 6 I.P.AN Research Group, University of Ioannina Given a multiply represented dataset, split this dataset into M disjoint - homogeneous groups, by taking into account every view

M ULTI - VIEW C LUSTERING Most existing multi-view methods rely equally on all views Degenerate views often occur – Noisy, irrelevant views Results will deteriorate if such views are included in the clustering process Views should participate in the solution according to their quality A view ranking mechanism is necessary 7 I.P.AN Research Group, University of Ioannina

C ONTRIBUTION We focus on multi-view clustering and rank the views based on their conveyed information This issue has been overlooked in the literature We represent each view with a kernel matrix and combine the views using a weighted sum of the kernels Weights express the quality of the views and determine the amount of their contribution to the solution We incorporate in our model a parameter that controls the sparsity of the weights This parameter adjusts the sensitivity of the weights to the differences in quality among the views 8 I.P.AN Research Group, University of Ioannina

C ONTRIBUTION We develop two simple iterative procedures to recover the clusters and automatically learn the weights Kernel k-means and its spectral relaxation are utilized The weights are estimated by closed-form expressions We perform experiments with synthetic and real data to evaluate our framework 9 I.P.AN Research Group, University of Ioannina

O UTLINE Introduction Feature Space Clustering Kernel-based Weighted Multi-view Clustering Experimental Evaluation Summary 10 I.P.AN Research Group, University of Ioannina

F EATURE S PACE C LUSTERING 11 I.P.AN Research Group, University of Ioannina

K ERNEL T RICK 12 I.P.AN Research Group, University of Ioannina

K ERNEL K - MEANS 13 I.P.AN Research Group, University of Ioannina Kernel k-means ≡ k-means in feature space

K ERNEL K - MEANS 14 I.P.AN Research Group, University of Ioannina 1 Tzortzis, G., Likas, A., The global kernel k-means algorithm for clustering in feature space, IEEE TNN, 2009

S PECTRAL R ELAXATION OF K ERNEL K - MEANS 15 I.P.AN Research Group, University of Ioannina 1 Dhillon, I.S., Guan, Y., Kulis, B., Weighted graph cuts without eigenvectors: A multilevel approach, IEEE TPAMI, 2007 Spectral methods can substitute kernel k-means and vice versa Constant

O UTLINE Introduction Feature Space Clustering Kernel-based Weighted Multi-view Clustering Experimental Evaluation Summary 16 I.P.AN Research Group, University of Ioannina

K ERNEL - BASED W EIGHTED M ULTI - VIEW C LUSTERING Why? Kernel k-means is a simple, yet effective clustering technique Complementary information in the views can boost clustering accuracy Degenerate views that degrade performance exist in practice Target Split the dataset by simultaneously considering all views Automatically determine the relevance of each view to the clustering task How? Represent views with kernels Associate a weight with each kernel Learn a linear combination of the kernels together with the cluster labels Weights determine the degree that each kernel-view participates in the solution and should reflect its quality 17 I.P.AN Research Group, University of Ioannina We propose an extension of the kernel k-means objective to the multi-view setting that: Ranks the views based on the quality of the conveyed information Differentiates their contribution to the solution according to the ranking We propose an extension of the kernel k-means objective to the multi-view setting that: Ranks the views based on the quality of the conveyed information Differentiates their contribution to the solution according to the ranking

K ERNEL MIXING 18 I.P.AN Research Group, University of Ioannina

M ULTI - VIEW K ERNEL K - MEANS (MVKKM) 19 I.P.AN Research Group, University of Ioannina

M ULTI - VIEW K ERNEL K - MEANS (MVKKM) 20 I.P.AN Research Group, University of Ioannina

MVKKM T RAINING 21 I.P.AN Research Group, University of Ioannina

W EIGHT U PDATE A NALYSIS 22 I.P.AN Research Group, University of Ioannina

M ULTI - VIEW S PECTRAL C LUSTERING (MVS PEC ) 23 I.P.AN Research Group, University of Ioannina Explore the spectral relaxation of kernel k-means and employ spectral clustering to optimize the MVKKM objective

MVS PEC T RAINING 24 I.P.AN Research Group, University of Ioannina

MVKKM VS. MVS PEC 25 I.P.AN Research Group, University of Ioannina MVKKMMVSpec Monotonic convergence to a local minimum Discrete clusters are derived at each iteration

O UTLINE Introduction Feature Space Clustering Kernel-based Weighted Multi-view Clustering Experimental Evaluation Summary 26 I.P.AN Research Group, University of Ioannina

E XPERIMENTAL E VALUATION 27 I.P.AN Research Group, University of Ioannina 1 Blaschko, M. B., Lampert, C. H., Correlational spectral clustering, CVPR, Tzortzis, G., Likas, A., Multiple View Clustering Using a Weighted Combination of Exemplar-based Mixture Models, IEEE TNN, 2010

E XPERIMENTAL S ETUP MVKKM and MVSpec weights are uniformly initialized Global kernel k-means 1 is utilized to deterministically get initial clusters for MVKKM Multiple restarts are avoided Linear kernels are employed for all views For MVCMM, Gaussian convex mixture models are adopted The number of clusters is set equal to the true number of classes in the dataset Performance is measured in terms of NMI Higher NMI values indicate a better match between cluster and class labels 28 I.P.AN Research Group, University of Ioannina 1 Tzortzis, G., Likas, A., The global kernel k-means algorithm for clustering in feature space, IEEE TNN, 2009

S YNTHETIC D ATA We created a two view dataset The second view is a noisy version of the first that mixes the clusters The dataset is not linearly separable Use rbf kernels to represent the views 29 I.P.AN Research Group, University of Ioannina

S YNTHETIC D ATA 30 I.P.AN Research Group, University of Ioannina

R EAL M ULTI - VIEW D ATASETS Multiple Features – Collection of handwritten digits Five views Ten classes 200 instances per class Extracted several four class subsets Corel – Image collection Seven views (color and texture) 34 classes 100 instances per class Extracted several four class subsets 31 I.P.AN Research Group, University of Ioannina

M ULTIPLE F EATURES 32 I.P.AN Research Group, University of Ioannina Digits 0236 Digits 1367

M ULTIPLE F EATURES 33 I.P.AN Research Group, University of Ioannina Digits 0236Digits 1367

C OREL 34 I.P.AN Research Group, University of Ioannina bus, leopard, train, ship owl, wildlife, hawk, rose

C OREL 35 I.P.AN Research Group, University of Ioannina bus, leopard, train, ship owl, wildlife, hawk, rose

E VALUATION C ONCLUSIONS 36 I.P.AN Research Group, University of Ioannina

O UTLINE Introduction Feature Space Clustering Kernel-based Weighted Multi-view Clustering Experimental Evaluation Summary 37 I.P.AN Research Group, University of Ioannina

S UMMARY We studied the multi-view problem under the unsupervised setting and represented views with kernels We proposed two iterative methods that rank the views by learning a weighted combination of the view kernels We introduced a parameter that moderates the sparsity of the weights We derived closed-form expressions for the weights We provided experimental results for the efficacy of our framework 38 I.P.AN Research Group, University of Ioannina

39 I.P.AN Research Group, University of Ioannina Thank you!