Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Advisor : Dr. Hsu Graduate : Sheng-Hsuan Wang Author : Sanghamitra.

Presentation transcript:

Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Advisor : Dr. Hsu Graduate : Sheng-Hsuan Wang Author : Sanghamitra Bandyopadhyay Department of Information Management An automatic shape independent clustering technique Pattern Recognition, Vol 37, 2004, pp

Intelligent Database Systems Lab N.Y.U.S.T. I. M. Outline: Motivation; Objective; Introduction; Graph theoretical clustering based on relative neighborhood; The proposed clustering method; Experimental results; Conclusions; Personal opinion; Review.

Motivation: Most clustering techniques share two problems. The number of clusters must be predefined. Clusters of arbitrary shape cannot be identified.

Objective: This paper proposes a clustering technique that can automatically detect any number of well-separated clusters. It is based on the relative neighborhood graph and iterative partitioning, coupled with a post-processing step for merging small clusters.

Introduction: Clustering. A data set X of n points {x_1, x_2, …, x_n} is divided into K clusters {C_1, C_2, …, C_K}.
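The slide does not spell out what "divided into K clusters" requires. As an assumption (not stated in the transcript), a hard partition is usually taken to mean that the clusters are non-empty, disjoint, and together cover X:

```latex
C_i \neq \emptyset \quad (i = 1, \dots, K), \qquad
C_i \cap C_j = \emptyset \quad (i \neq j), \qquad
\bigcup_{i=1}^{K} C_i = X .
```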

Introduction: Minimum spanning tree (MST). The concept of inconsistent edges. Extensions of MST-based methods. The concept of the relative neighborhood of a finite planar set. This paper uses the concept of relative neighborhood to design a clustering algorithm called CLUSTER.

Graph theoretical clustering based on relative neighborhood: Relative neighborhood graph (RNG). X = {x_1, x_2, …, x_n}. Two points x_i and x_j are said to be relative neighbors if d(x_i, x_j) <= max[d(x_i, x_k), d(x_j, x_k)] for all k != i, j.
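A minimal Python sketch of the relative-neighbor test may help here. It assumes the standard RNG definition (two points are relative neighbors when no third point is strictly closer to both of them than they are to each other) and Euclidean distance. The function name `relative_neighborhood_graph` is hypothetical, and this brute-force version runs in O(n^3), so it is an illustration rather than the construction used in the paper.

```python
import math
from itertools import combinations


def relative_neighborhood_graph(points):
    """Return RNG edges as (i, j, distance) tuples with i < j (brute force)."""
    n = len(points)
    # Pairwise Euclidean distances.
    d = [[math.dist(p, q) for q in points] for p in points]
    edges = []
    for i, j in combinations(range(n), 2):
        # The pair (i, j) is rejected if some third point k is strictly
        # closer to both i and j than i and j are to each other.
        blocked = any(
            max(d[i][k], d[j][k]) < d[i][j]
            for k in range(n)
            if k != i and k != j
        )
        if not blocked:
            edges.append((i, j, d[i][j]))
    return edges
```

For three collinear points (0, 0), (1, 0), (2, 0), the middle point blocks the long edge, so only the two short edges remain.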

Graph theoretical clustering based on relative neighborhood: Clustering based on a limited neighborhood set, using the region of influence of two points x_i and x_j in the RNG. Ref. [15] introduces an additional parameter, called the relative edge consistency.

Graph theoretical clustering based on relative neighborhood: Determine the connected components of the graph.
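Finding connected components can be sketched with a small union-find. The rule used here (drop every edge whose weight exceeds a threshold, then read off the components) is an assumption made for illustration; the transcript does not show the exact edge-removal rule. The name `components_after_threshold` is hypothetical.

```python
def components_after_threshold(n, edges, thresh):
    """Drop edges with weight > thresh; return components as sorted vertex lists."""
    parent = list(range(n))

    def find(a):
        # Follow parent links to the root, halving the path on the way.
        while parent[a] != a:
            parent[a] = parent[parent[a]]
            a = parent[a]
        return a

    for i, j, w in edges:
        if w <= thresh:  # keep only edges at or below the threshold
            ri, rj = find(i), find(j)
            if ri != rj:
                parent[ri] = rj

    groups = {}
    for v in range(n):
        groups.setdefault(find(v), []).append(v)
    return sorted(groups.values())
```

With edges given as `(i, j, weight)` tuples, removing a single long edge from a chain splits the chain into two components.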

The proposed clustering method: The algorithm, called CLUSTER, is based on successive thresholding of the RNG until a termination criterion is attained.

The proposed clustering method: RNG = (X, E), where the vertices of the graph are the points in X and E is the set of edges in the RNG. Let the weight of an edge e_ij be equal to d(x_i, x_j). Let m be the cardinality of E.

CLUSTER Algorithm

Some characteristics of the CLUSTER Algorithm: Termination conditions. Inter-cluster relative neighbors are close to each other (Max < 2 Min in CLUSTER). An appropriate thresh (>= 2 Min) is not found. |Component| = 1.
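The three termination conditions above can be sketched as a single check applied to one component. Here `Min` and `Max` are taken to be the smallest and largest edge weights inside the component, which is an assumption. The transcript does not say how an "appropriate" thresh is chosen, so `largest_gap_thresh` below is a placeholder rule invented for this sketch (midpoint of the widest gap between sorted edge weights), not the paper's criterion. Both function names are hypothetical.

```python
def largest_gap_thresh(weights, floor):
    """Placeholder rule (assumption): midpoint of the widest gap, if >= floor."""
    ws = sorted(set(weights))
    if len(ws) < 2:
        return None
    # Pick the adjacent pair of distinct weights with the widest gap.
    a, b = max(zip(ws, ws[1:]), key=lambda pair: pair[1] - pair[0])
    thresh = (a + b) / 2
    return thresh if thresh >= floor else None


def should_stop(component, weights, find_thresh=largest_gap_thresh):
    """Return (stop, thresh) for one component and its RNG edge weights."""
    if len(component) == 1 or not weights:
        return True, None  # |Component| = 1
    lo, hi = min(weights), max(weights)
    if hi < 2 * lo:
        return True, None  # Max < 2 Min
    thresh = find_thresh(weights, 2 * lo)
    if thresh is None:
        return True, None  # no appropriate thresh >= 2 Min was found
    return False, thresh
```

A component whose edge weights are all within a factor of two of each other is left alone, while one with a clearly longer edge is split at the returned threshold.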

Some characteristics of the CLUSTER Algorithm: An overfragmented condition will not arise. Hierarchical clusters. The number of clusters is equal to the number of components formed on termination of the algorithm. Merge threshold. Check the size of a component. Edge length. Outliers or noise.

Some characteristics of the CLUSTER Algorithm: The complexity of CLUSTER is O(m log m), where m is the number of edges in the RNG. Reduction of the complexity: discretization of the values between Min and Max, with the step defined as a fraction of Max - Min. The complexity of constructing the RNG is O(n^2). The overall complexity of the clustering algorithm is O(n^2 + m log m), i.e., O(n^2).
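The discretization step can be illustrated with a short Python sketch. It assumes the candidate thresholds are evenly spaced between Min and Max with a step equal to a given fraction of Max - Min, and that both endpoints are included; the function name `candidate_thresholds` and the endpoint handling are assumptions, not details taken from the slides.

```python
def candidate_thresholds(lo, hi, fraction):
    """Evenly spaced candidates from lo to hi with step fraction * (hi - lo)."""
    steps = round(1 / fraction)  # e.g. fraction 0.001 gives 1000 intervals
    width = hi - lo
    return [lo + k * width / steps for k in range(steps + 1)]
```

With a fraction of 0.001 the range is cut into 1000 intervals, so only 1001 candidate values need to be examined regardless of how many edges the component has.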

Experimental results: Eight data sets of different characteristics. Parameters: the merge condition is a cluster size below 5% of the size of the data set. With the fraction set to 0.001, the range [Min, Max] is discretized into 1000 intervals. = 3.

Normal: 2-dimensional, 3-class, Gaussian distributions, N = 300.

RC1

ADS1

ADS2

Encircle

Concentric & Concentric_noisy

Conclusions: CLUSTER is based on an iterative partitioning of the relative neighborhood graph. The number of clusters need not be predefined. The cluster shape can be convex or non-convex. It is able to identify an appropriate threshold value. A post-processing step merges small clusters. Outliers in the data are not merged. It is able to provide a hierarchy of clusters.

Personal opinion: Adjusting the threshold according to the current state of the clusters is a good idea.

Review: Graph theoretical clustering, e.g., MST. Relative neighborhood graph, RNG. CLUSTER: hierarchical iterative partitioning (top-down), based on the RNG.