Supervised Clustering of Label Ranking Data Mihajlo Grbovic, Nemanja Djuric, Slobodan Vucetic {mihajlo.grbovic, nemanja.djuric,

Slides:



Advertisements
Similar presentations
A Comparison of Implicit and Explicit Links for Web Page Classification Dou Shen 1 Jian-Tao Sun 2 Qiang Yang 1 Zheng Chen 2 1 Department of Computer Science.
Advertisements

Mustafa Cayci INFS 795 An Evaluation on Feature Selection for Text Clustering.
+ Multi-label Classification using Adaptive Neighborhoods Tanwistha Saha, Huzefa Rangwala and Carlotta Domeniconi Department of Computer Science George.
Feature Grouping-Based Fuzzy-Rough Feature Selection Richard Jensen Neil Mac Parthaláin Chris Cornelis.
1 Semi-supervised learning for protein classification Brian R. King Chittibabu Guda, Ph.D. Department of Computer Science University at Albany, SUNY Gen*NY*sis.
1 Machine Learning: Lecture 10 Unsupervised Learning (Based on Chapter 9 of Nilsson, N., Introduction to Machine Learning, 1996)
Yue Han and Lei Yu Binghamton University.
Learning on Probabilistic Labels Peng Peng, Raymond Chi-wing Wong, Philip S. Yu CSE, HKUST 1.
Explorations in Tag Suggestion and Query Expansion Jian Wang and Brian D. Davison Lehigh University, USA SSM 2008 (Workshop on Search in Social Media)
Jierui Xie, Boleslaw Szymanski, Mohammed J. Zaki Department of Computer Science Rensselaer Polytechnic Institute Troy, NY 12180, USA {xiej2, szymansk,
Week 9 Data Mining System (Knowledge Data Discovery)
Analyzing System Logs: A New View of What's Important Sivan Sabato Elad Yom-Tov Aviad Tsherniak Saharon Rosset IBM Research SysML07 (Second Workshop on.
Iterative Optimization and Simplification of Hierarchical Clusterings Doug Fisher Department of Computer Science, Vanderbilt University Journal of Artificial.
Semi-Supervised Clustering Jieping Ye Department of Computer Science and Engineering Arizona State University
5/30/2006EE 148, Spring Visual Categorization with Bags of Keypoints Gabriella Csurka Christopher R. Dance Lixin Fan Jutta Willamowski Cedric Bray.
Data Mining: A Closer Look Chapter Data Mining Strategies (p35) Moh!
ML ALGORITHMS. Algorithm Types Classification (supervised) Given -> A set of classified examples “instances” Produce -> A way of classifying new examples.
Evaluation of Utility of LSA for Word Sense Discrimination Esther Levin, Mehrbod Sharifi, Jerry Ball
1/16 Final project: Web Page Classification By: Xiaodong Wang Yanhua Wang Haitang Wang University of Cincinnati.
Algorithms for Data Analytics Chapter 3. Plans Introduction to Data-intensive computing (Lecture 1) Statistical Inference: Foundations of statistics (Chapter.
Evaluating Performance for Data Mining Techniques
A Geometric Framework for Unsupervised Anomaly Detection: Detecting Intrusions in Unlabeled Data Authors: Eleazar Eskin, Andrew Arnold, Michael Prerau,
Math 5364 Notes Chapter 8: Cluster Analysis Jesse Crawford Department of Mathematics Tarleton State University.
Data Mining Joyeeta Dutta-Moscato July 10, Wherever we have large amounts of data, we have the need for building systems capable of learning information.
Unsupervised Learning Reading: Chapter 8 from Introduction to Data Mining by Tan, Steinbach, and Kumar, pp , , (
CIKM’09 Date:2010/8/24 Advisor: Dr. Koh, Jia-Ling Speaker: Lin, Yi-Jhen 1.
Classification and Ranking Approaches to Discriminative Language Modeling for ASR Erinç Dikici, Murat Semerci, Murat Saraçlar, Ethem Alpaydın 報告者:郝柏翰 2013/01/28.
Apache Mahout. Mahout Introduction Machine Learning Clustering K-means Canopy Clustering Fuzzy K-Means Conclusion.
Transfer Learning with Applications to Text Classification Jing Peng Computer Science Department.
Data Mining and Machine Learning Lab Unsupervised Feature Selection for Linked Social Media Data Jiliang Tang and Huan Liu Computer Science and Engineering.
Xiangnan Kong,Philip S. Yu Multi-Label Feature Selection for Graph Classification Department of Computer Science University of Illinois at Chicago.
A Novel Local Patch Framework for Fixing Supervised Learning Models Yilei Wang 1, Bingzheng Wei 2, Jun Yan 2, Yang Hu 2, Zhi-Hong Deng 1, Zheng Chen 2.
Low-Rank Kernel Learning with Bregman Matrix Divergences Brian Kulis, Matyas A. Sustik and Inderjit S. Dhillon Journal of Machine Learning Research 10.
MACHINE LEARNING 8. Clustering. Motivation Based on E ALPAYDIN 2004 Introduction to Machine Learning © The MIT Press (V1.1) 2  Classification problem:
Gang WangDerek HoiemDavid Forsyth. INTRODUCTION APROACH (implement detail) EXPERIMENTS CONCLUSION.
Data Science and Big Data Analytics Chap 4: Advanced Analytical Theory and Methods: Clustering Charles Tappert Seidenberg School of CSIS, Pace University.
Optimal Dimensionality of Metric Space for kNN Classification Wei Zhang, Xiangyang Xue, Zichen Sun Yuefei Guo, and Hong Lu Dept. of Computer Science &
HAITHAM BOU AMMAR MAASTRICHT UNIVERSITY Transfer for Supervised Learning Tasks.
Computational Approaches for Biomarker Discovery SubbaLakshmiswetha Patchamatla.
Support Vector Machines and Gene Function Prediction Brown et al PNAS. CS 466 Saurabh Sinha.
Data Mining, ICDM '08. Eighth IEEE International Conference on Duy-Dinh Le National Institute of Informatics Hitotsubashi, Chiyoda-ku Tokyo,
Machine Learning and Data Mining Clustering (adapted from) Prof. Alexander Ihler TexPoint fonts used in EMF. Read the TexPoint manual before you delete.
Iowa State University Department of Computer Science Center for Computational Intelligence, Learning, and Discovery Harris T. Lin, Sanghack Lee, Ngot Bui.
CS378 Final Project The Netflix Data Set Class Project Ideas and Guidelines.
Support Vector Machines and Kernel Methods for Co-Reference Resolution 2007 Summer Workshop on Human Language Technology Center for Language and Speech.
26/01/20161Gianluca Demartini Ranking Categories for Faceted Search Gianluca Demartini L3S Research Seminars Hannover, 09 June 2006.
CISC 849 : Applications in Fintech Namami Shukla Dept of Computer & Information Sciences University of Delaware iCARE : A Framework for Big Data Based.
A Supervised Machine Learning Algorithm for Research Articles Leonidas Akritidis, Panayiotis Bozanis Dept. of Computer & Communication Engineering, University.
A Statistical Approach to Texture Classification Nicholas Chan Heather Dunlop Project Dec. 14, 2005.
BOOTSTRAPPING INFORMATION EXTRACTION FROM SEMI-STRUCTURED WEB PAGES Andrew Carson and Charles Schafer.
Clustering Algorithms Sunida Ratanothayanon. What is Clustering?
Predicting User Interests from Contextual Information R. W. White, P. Bailey, L. Chen Microsoft (SIGIR 2009) Presenter : Jae-won Lee.
Advanced Gene Selection Algorithms Designed for Microarray Datasets Limitation of current feature selection methods: –Ignores gene/gene interaction: single.
Learning Kernel Classifiers 1. Introduction Summarized by In-Hee Lee.
E-commerce in Your Inbox Product Recommendations at Scale
Support Vector Machine (SVM) Presented by Robert Chen.
SUPERVISED AND UNSUPERVISED LEARNING Presentation by Ege Saygıner CENG 784.
1 Kernel Machines A relatively new learning methodology (1992) derived from statistical learning theory. Became famous when it gave accuracy comparable.
Data Mining and Text Mining. The Standard Data Mining process.
Experience Report: System Log Analysis for Anomaly Detection
Machine Learning with Spark MLlib
Semi-Supervised Clustering
Sofus A. Macskassy Fetch Technologies
KDD 2004: Adversarial Classification
Learning with information of features
Prepared by: Mahmoud Rafeek Al-Farra
Prepared by: Mahmoud Rafeek Al-Farra
Clustering.
Fusing Rating-based and Hitting-based Algorithms in Recommender Systems Xin Xin
Low-Rank Sparse Feature Selection for Patient Similarity Learning
Presentation transcript:

Supervised Clustering of Label Ranking Data Mihajlo Grbovic, Nemanja Djuric, Slobodan Vucetic {mihajlo.grbovic, nemanja.djuric, Temple University Department of Computer and Information Sciences Center for Data Analytics and Biomedical Informatics Philadelphia, USA SIAM SDM 2012, Anaheim, California, USA

Introduction Label Ranking Performance Measures Related Work Supervised clustering in context of Label Ranking Motivation Performance Measures Approaches Baseline Approaches Placket-Luce Mixture Model Empirical Evaluation Experiments on Synthetic Data Experiments on Real-world Data Page 2 Outline Grbovic M., Djuric N., Vucetic S., Supervised Clustering of Label Ranking Data, SIAM SDM 2012

Introduction Label Ranking Setup: L = 5 labels Features (x)y Page 3 Costumer Features Product Costumer Features: age, gender, how often they buy from us, how much on average they spend, etc. Grbovic M., Djuric N., Vucetic S., Supervised Clustering of Label Ranking Data, SIAM SDM

Label Ranking Setup: L = 5 labels π = , pairwise label preferences: 1 > 4 > 2 > 5 > 3 Goal: Learn a model that maps instances x to a total label order π h : x n → π n Features (x)Label Ranking (π) Page 4 D = {(x n, π n ), n = 1…N} Introduction Costumer Features Product Ranking Grbovic M., Djuric N., Vucetic S., Supervised Clustering of Label Ranking Data, SIAM SDM

π = Label Ranking: Missing Information Partial Ranking Features (x)Label Ranking (π) ? ? Page 5 Introduction Grbovic M., Djuric N., Vucetic S., Supervised Clustering of Label Ranking Data, SIAM SDM

Label Ranking: Performance Measure Notation: π(i) the class label at i-th position in the order π -1 (j) the position of the y j class label in the order Distance between two rankings: true ranking (π) and predicted ranking (ρ): Page 6 Given Data set: D = {(x n, π n ), n = 1…N} Kendall tau distance - counts the number of discordant label pairs Label Ranking Loss: Introduction

Label Ranking: Related Work Page 7 1. Map into classification - L(L-1)/2 classifiers - 1 (d x L) dimensional problem 2. kNN based algorithms 3. Utility functions - Learn mappings - Prediction: rank the utility scores f k : x → R, k = 1,…, L

Label Ranking: Supervised Clustering Page 8 Introduction SYNTHETIC DATA 2 features 5 labels Each permutation represented with a color (similar color – similar rank) 5 natural clusters in feature space 3 natural clusters in label space Grbovic M., Djuric N., Vucetic S., Supervised Clustering of Label Ranking Data, SIAM SDM 2012

Label Ranking: Supervised Clustering Page 9 Introduction GOAL: Cluster data instances (customers) in the feature space by taking into consideration the assigned, potentially incomplete label rankings (product preferences) Such that the rankings of instances within a cluster are more similar to each other than to the rankings of instances in the other clusters Extract cluster centroid-rankings (preferences that represent each cluster uniquely) Grbovic M., Djuric N., Vucetic S., Supervised Clustering of Label Ranking Data, SIAM SDM 2012

Introduction Page 10 Grbovic M., Djuric N., Vucetic S., Supervised Clustering of Label Ranking Data, SIAM SDM 2012 Traditional ClusteringSupervised Clustering Label Ranking: Supervised Clustering ρ={4,3,1,5,2}ρ={1,2,3,4,5}

Label Ranking: Supervised Clustering Page 11 Introduction Example: Target marketing A company with several products would like to cluster its costumers (in feature space) Purpose: designing cluster-specific promotional material For each cluster, the company can make a different catalog, by promoting products in different order that best reflects the taste of its target costumers Grbovic M., Djuric N., Vucetic S., Supervised Clustering of Label Ranking Data, SIAM SDM 2012

Performance Measures  “Tightness” of clusters in label ranking space How similar are the rankings of instances within the clusters How far are cluster central ranking from cluster member rankings  Happiness of new costumer when he receives the catalog by mail How close is the cluster central ranking to true costumer ranking Introduction Page 12 Grbovic M., Djuric N., Vucetic S., Supervised Clustering of Label Ranking Data, SIAM SDM 2012 Label Ranking: Supervised Clustering

Approaches Heuristic Baselines 1.Cluster in Feature Space → Find Central Cluster Rankings Kmeans → Mallows 2. Cluster in Label Ranking Space → Multi-Class Classification Naïve → SVM EBMS * → SVM 3. Add Label Rankings to Features → Unsupervised Clustering Naïve Kmeans 4. 1-Rank (represent all data using one ranking) * M. Meila and L. Bao, An exponential model for infinite rankings, Journal of Machine Learning Research, 11 (2010) Page 13 Grbovic M., Djuric N., Vucetic S., Supervised Clustering of Label Ranking Data, SIAM SDM 2012

Approaches Page 14 Grbovic M., Djuric N., Vucetic S., Supervised Clustering of Label Ranking Data, SIAM SDM 2012 Plackett-Luce Mixture Model

Approaches Page 15 Grbovic M., Djuric N., Vucetic S., Supervised Clustering of Label Ranking Data, SIAM SDM 2012 Plackett-Luce Mixture Model (K clusters) K clusters: Likelihood:

Empirical Evaluation Page 16 Grbovic M., Djuric N., Vucetic S., Supervised Clustering of Label Ranking Data, SIAM SDM 2012 ρ={1,2,3,4,5,6} ρ={3,1,6,2,5,4} ρ={6,5,4,3,2,1}

Empirical Evaluation Page 17 Grbovic M., Djuric N., Vucetic S., Supervised Clustering of Label Ranking Data, SIAM SDM 2012

Empirical Evaluation Page 18 Grbovic M., Djuric N., Vucetic S., Supervised Clustering of Label Ranking Data, SIAM SDM 2012 Sushi Data Set (L=10)

Empirical Evaluation Page 19 Grbovic M., Djuric N., Vucetic S., Supervised Clustering of Label Ranking Data, SIAM SDM 2012

Empirical Evaluation Page 20 Grbovic M., Djuric N., Vucetic S., Supervised Clustering of Label Ranking Data, SIAM SDM 2012

Empirical Evaluation Page 21 Grbovic M., Djuric N., Vucetic S., Supervised Clustering of Label Ranking Data, SIAM SDM 2012

Empirical Evaluation Page 22 Grbovic M., Djuric N., Vucetic S., Supervised Clustering of Label Ranking Data, SIAM SDM 2012

Conclusion Page 23 Grbovic M., Djuric N., Vucetic S., Supervised Clustering of Label Ranking Data, SIAM SDM 2012 Conclusion This paper presents the first attempt at supervised clustering of complex label rank data We established several baselines for supervised clustering of label ranking data and proposed a Plackett-Luce (PL) mixture model specifically tailored for this application We empirically showed the strength of the PL model by experiments on real-world and synthetic data In addition to the supervised clustering scenario, we compared the PL model to the previously proposed label ranking algorithms in terms of predictive accuracy

THANK YOU