Download presentation
Presentation is loading. Please wait.
Published byShona Gregory Modified over 8 years ago
1
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Dual clustering : integrating data clustering over optimization and constraint domains Advisor : Dr. Hsu Presenter : Wen-Cheng Tsai Author : Cheng-Ru Lin, Ken-Hao Liu Ming-Syan Chen, TKDE,2005
2
Intelligent Database Systems Lab N.Y.U.S.T. I. M. 2 Outline Motivation Objective Method Experience Conclusion Personal Comments
3
Intelligent Database Systems Lab N.Y.U.S.T. I. M. 3 Motivation In the conventional spatial clustering, the input data set is partitioned into several compact regions and data point which are similar to another in their nongeometric attributes may be scattered over different region, thus making the corresponding objective difficult to achieve.
4
Intelligent Database Systems Lab N.Y.U.S.T. I. M. 4 Objective Our goal is to optimize the objective function in the optimization domain while satisfying the constraint specified in the constraint domain. We devise an effective algorithm, named Interlaced Clustering-Classification (ICC), to solve this problem.
5
Intelligent Database Systems Lab N.Y.U.S.T. I. M. 5 Method ICC Algorithm Data Set 1 Step 1 Step 4 Step 6 Result of ICC, k=5, ω=0.7, l=5 Projection on the constraint domain Projection on the optimization domain
6
Intelligent Database Systems Lab N.Y.U.S.T. I. M. 6 Experience Data Set 2 Result of ICC with ω=0.9 Projection on the constraint domain Projection on the optimization domain Result of CR with p=3Result of CR with p=1/3 Result of KNNC with k=5Result of KNNC with k=30 Result of ICC with ω=0.7 Data Set1
7
Intelligent Database Systems Lab N.Y.U.S.T. I. M. 7 Conclusion ICC algorithm combines the information in both domains and iteratively performs a clustering algorithm on the optimization domain and also a classification algorithm on the constraint domain to reach the target clustering effectively.
8
Intelligent Database Systems Lab N.Y.U.S.T. I. M. 8 Personal Comments Advantages ─ reach the target clustering effectively on both domains ─ deal with both domains at the same time Disadvantage …
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.