Nir Geffen, Yotam Margolin
Supervisor: Professor Zeev Volkovich
ORT Braude College – SE Department
Our goal is to study the results of different clustering ensemble techniques and to present the distinction between the cluster ensemble and clustering aggregation approaches via a self-learning methodology, implemented for image segmentation.
Introduction
What Does It Do?
Clustering
Spectral Clustering
Cluster Ensembles
Consensus Functions: HGPA, MCLA, Volkovich-Yahalom
Main Algorithm
SE Documents
What does it do? [Pipeline diagram: Preprocess → HGPA, MCLA, VYCAA → Comparison]
Clustering is a method of unsupervised learning aimed at partitioning a given data set into subsets named clusters, so that items belonging to the same cluster are similar to each other while items belonging to different clusters are not.
What is wrong with classic clustering? Spectral clustering works on the eigenvectors and eigenvalues of a similarity graph, with noise removed. [Figure: the same data set clustered by k-means vs. clustered by spectral clustering]
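As a hedged illustration (not the project's code), the sketch below reproduces the classic two-rings example behind the figure: k-means splits the plane linearly and mixes the rings, while spectral clustering, working on the eigenvectors of the graph Laplacian, recovers them. All variable names here are illustrative.

```python
# Minimal sketch: k-means vs. spectral clustering on two concentric circles,
# the standard case where k-means fails because the clusters are not
# linearly separable.
from sklearn.datasets import make_circles
from sklearn.cluster import KMeans, SpectralClustering

X, _ = make_circles(n_samples=400, factor=0.4, noise=0.05, random_state=0)

# k-means cuts the plane with a straight boundary and mixes the two rings.
kmeans_labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X)

# Spectral clustering embeds the points via the eigenvectors of the graph
# Laplacian of a nearest-neighbor similarity graph, so it recovers the rings.
spectral_labels = SpectralClustering(
    n_clusters=2, affinity="nearest_neighbors", n_neighbors=10, random_state=0
).fit_predict(X)
```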
As no clustering algorithm is agreed to be superior for every data set, a common practice is to construct several cluster solutions and aggregate them. We use the consensus-function approach to combine the resulting partitions into a new one, in order to increase the robustness of the clustering process.
Algorithms that solve the cluster ensemble problem are also known as consensus functions; most of them rely on graph theory.
Cluster-based Similarity Partitioning Algorithm (CSPA): simple; considered the brute-force approach.
Hyper-Graph Partitioning Algorithm (HGPA): balanced, but not always optimal.
Meta-CLustering Algorithm (MCLA): the high-end solution; yields robust results.
A criterion by which to choose a specific consensus function is ANMI. ANMI (Average Normalized Mutual Information) is defined as the average of the NMI that the final clustering shares with each of the individual solutions. Since mutual information satisfies I(X, Y) ≤ min(H(X), H(Y)), normalizing it yields an NMI score in [0, 1].
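A minimal sketch of the criterion, assuming the Strehl–Ghosh definition in [2] (NMI normalizes I(X, Y) by sqrt(H(X)·H(Y))); the helper name `anmi` is ours:

```python
# ANMI of a candidate consensus labeling: the average NMI it shares with
# the individual clusterings in the ensemble.
import numpy as np
from sklearn.metrics import normalized_mutual_info_score

def anmi(consensus, ensemble):
    """Average NMI between `consensus` and each labeling in `ensemble`."""
    return np.mean([normalized_mutual_info_score(consensus, p)
                    for p in ensemble])
```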
HGPA:
1. Create a hyper-graph (the hyper-edges are the clusters from all clusterings).
2. Repeat K-1 times: obtain a subset (cluster) by min-cutting the hyper-graph while maintaining a vertex imbalance of at most 5%.
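As a concrete illustration of step 1 only (the helper name is ours, not the project's code), the sketch below builds the binary incidence matrix that represents the hyper-graph; the K-way min-cut of step 2 is typically delegated to a dedicated hypergraph partitioner such as hMETIS, as in [2].

```python
# Each cluster from each base clustering becomes one hyper-edge, encoded
# as a 0/1 indicator column of an incidence matrix H (objects x hyper-edges).
import numpy as np

def incidence_matrix(ensemble):
    """One indicator column per cluster of every base clustering."""
    columns = []
    for labels in ensemble:
        labels = np.asarray(labels)
        for c in np.unique(labels):
            columns.append((labels == c).astype(int))
    return np.column_stack(columns)  # shape: (n_objects, n_hyperedges)
```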
MCLA:
1. Create a hyper-graph G.
2. Expand the hyper-edges (create a meta-graph from G).
3. Collapse the meta-graph (cluster the meta-graph into K clusters).
4. Compete for objects.
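A hedged, simplified sketch of these four steps (not the project's code): hyper-edges are compared by Jaccard similarity to form the meta-graph, and spectral clustering stands in for the graph partitioner that collapses it (the original MCLA in [2] uses METIS for that step).

```python
import numpy as np
from sklearn.cluster import SpectralClustering

def mcla(ensemble, k):
    # Step 1: binary incidence matrix H (objects x hyper-edges).
    H = np.column_stack([(np.asarray(labels) == c).astype(int)
                         for labels in ensemble
                         for c in np.unique(labels)])
    # Step 2: meta-graph edge weights = Jaccard similarity of hyper-edges.
    inter = H.T @ H
    sizes = H.sum(axis=0)
    union = sizes[:, None] + sizes[None, :] - inter
    jaccard = inter / np.maximum(union, 1)
    # Step 3: collapse the meta-graph into k meta-clusters.
    meta = SpectralClustering(n_clusters=k, affinity="precomputed",
                              random_state=0).fit_predict(jaccard)
    # Step 4: each object competes, joining its strongest meta-cluster
    # (highest average association with the meta-cluster's hyper-edges).
    scores = np.column_stack(
        [H[:, meta == m].mean(axis=1) if np.any(meta == m)
         else np.zeros(H.shape[0]) for m in range(k)])
    return scores.argmax(axis=1)
```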
Use various partitions of the same data set to define a new metric on the data set. Using the new metric as an enhanced input for a clustering algorithm produces better and more robust partitions. This process can be applied repeatedly: in each step the metric is updated using the original data as well as the new cluster partition.
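The actual Volkovich-Yahalom construction is in the work-in-preparation paper [1]; as a hedged illustration of the iterative idea only, the sketch below substitutes a co-association distance for the new metric (points are far apart if the current partitions rarely place them together), re-clusters under it each round, and feeds the result back in. The function name and defaults are ours.

```python
import numpy as np
from sklearn.cluster import KMeans, AgglomerativeClustering

def self_learning(X, k, rounds=3, r=10):
    # Start from r base partitions of the original data.
    partitions = [KMeans(n_clusters=k, n_init=10, random_state=s).fit_predict(X)
                  for s in range(r)]
    for _ in range(rounds):
        # Co-association distance: fraction of partitions separating i and j.
        co = np.mean([np.equal.outer(p, p) for p in partitions], axis=0)
        dist = 1.0 - co
        # Re-cluster under the updated metric (precomputed distances).
        new = AgglomerativeClustering(n_clusters=k, metric="precomputed",
                                      linkage="average").fit_predict(dist)
        partitions.append(new)  # the new partition updates the next metric
    return partitions[-1]
```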
Main algorithm:
1. Produce r individual spectral partitions.
2. Use MCLA to obtain Sc_MCLA(x); use HGPA to obtain Sc_HGPA(x); use Volkovich-Yahalom to obtain Sc_VYCAA(x).
3. By the ANMI criterion, get the final decision Sc*(x) from Sc_MCLA(x), Sc_HGPA(x) and Sc_VYCAA(x).
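A minimal sketch of the final decision step, assuming the three candidate consensus labelings have already been computed: the candidate sharing the highest average NMI with the r base partitions wins. The helper name is ours.

```python
import numpy as np
from sklearn.metrics import normalized_mutual_info_score

def best_by_anmi(candidates, ensemble):
    """Return the candidate labeling with the highest ANMI over `ensemble`."""
    def anmi(c):
        return np.mean([normalized_mutual_info_score(c, p) for p in ensemble])
    return max(candidates, key=anmi)

# Usage: sc_star = best_by_anmi([sc_mcla, sc_hgpa, sc_vycaa], base_partitions)
```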
GUI – Main window [screenshot]
GUI – Results window [screenshot]
Unit tests are our first line in the test plan (Test-Driven Development). Coding conventions. Lint the code for errors such as dead code or uninitialized pointers. Usage and code-coverage tests.
References
[1] Z. Volkovich, O. Yahalom, "Clustering aggregation via the self-learning approach", work in preparation.
[2] A. Strehl, J. Ghosh, "Cluster Ensembles – A Knowledge Reuse Framework for Combining Multiple Partitions", Journal of Machine Learning Research 3 (2002).
[3] A. Y. Ng, M. I. Jordan, Y. Weiss, "On spectral clustering: Analysis and an algorithm", Advances in Neural Information Processing Systems 14, 2002.
[4] X. Ma, W. Wan, L. Jiao, "Spectral Clustering Ensemble for Image Segmentation", GEC '09: Proceedings of the First ACM/SIGEVO Summit on Genetic and Evolutionary Computation, 2009.
[5] I. S. Dhillon, Y. Guan, B. Kulis, "Kernel k-means, Spectral Clustering and Normalized Cuts", KDD 2004.
[6] E. David, "Spectral Clustering", Image Processing Seminar, 2008.
Thank you for listening!