Computational Intelligence

Slides:



Advertisements
Similar presentations
CLUSTERING.
Advertisements

Computational Intelligence Winter Term 2009/10 Prof. Dr. Günter Rudolph Lehrstuhl für Algorithm Engineering (LS 11) Fakultät für Informatik TU Dortmund.
Computational Intelligence Winter Term 2009/10 Prof. Dr. Günter Rudolph Lehrstuhl für Algorithm Engineering (LS 11) Fakultät für Informatik TU Dortmund.
Computational Intelligence Winter Term 2010/11 Prof. Dr. Günter Rudolph Lehrstuhl für Algorithm Engineering (LS 11) Fakultät für Informatik TU Dortmund.
Clustering Clustering of data is a method by which large sets of data is grouped into clusters of smaller sets of similar data. The example below demonstrates.
Computational Intelligence Winter Term 2011/12 Prof. Dr. Günter Rudolph Lehrstuhl für Algorithm Engineering (LS 11) Fakultät für Informatik TU Dortmund.
Computational Intelligence Winter Term 2014/15 Prof. Dr. Günter Rudolph Lehrstuhl für Algorithm Engineering (LS 11) Fakultät für Informatik TU Dortmund.
Computational Intelligence
Computational Intelligence Winter Term 2010/11 Prof. Dr. Günter Rudolph Lehrstuhl für Algorithm Engineering (LS 11) Fakultät für Informatik TU Dortmund.
Computational Intelligence Winter Term 2014/15 Prof. Dr. Günter Rudolph Lehrstuhl für Algorithm Engineering (LS 11) Fakultät für Informatik TU Dortmund.
Introduction to Bioinformatics
© University of Minnesota Data Mining for the Discovery of Ocean Climate Indices 1 CSci 8980: Data Mining (Fall 2002) Vipin Kumar Army High Performance.
Cluster Analysis.  What is Cluster Analysis?  Types of Data in Cluster Analysis  A Categorization of Major Clustering Methods  Partitioning Methods.
Definitions Let i) standard q-ary alphabet. iii) is a set of n elements ii) is the set of all q! permutations of q symbols. n-sequence q-partition.
What is Cluster Analysis?
7-1 Chapter Seven SAMPLING DESIGN. 7-2 Sampling What is it? –Drawing a conclusion about the entire population from selection of limited elements in a.
Chapter 3: Cluster Analysis  3.1 Basic Concepts of Clustering  3.2 Partitioning Methods  3.3 Hierarchical Methods The Principle Agglomerative.
Evaluating Performance for Data Mining Techniques
CPSC 386 Artificial Intelligence Ellen Walker Hiram College
1 Chapter 5 Numerical Integration. 2 A Review of the Definite Integral.
1 Motivation Web query is usually two or three words long. –Prone to ambiguity –Example “keyboard” –Input device of computer –Musical instruments How can.
So Far……  Clustering basics, necessity for clustering, Usage in various fields : engineering and industrial fields  Properties : hierarchical, flat,
CLUSTERING. Overview Definition of Clustering Existing clustering methods Clustering examples.
1 Pattern Recognition: Statistical and Neural Lonnie C. Ludeman Lecture 29 Nov 11, 2005 Nanjing University of Science & Technology.
Computational Intelligence Winter Term 2015/16 Prof. Dr. Günter Rudolph Lehrstuhl für Algorithm Engineering (LS 11) Fakultät für Informatik TU Dortmund.
Computational Intelligence Winter Term 2015/16 Prof. Dr. Günter Rudolph Lehrstuhl für Algorithm Engineering (LS 11) Fakultät für Informatik TU Dortmund.
1 Pattern Recognition: Statistical and Neural Lonnie C. Ludeman Lecture 28 Nov 9, 2005 Nanjing University of Science & Technology.
Computational Intelligence Winter Term 2015/16 Prof. Dr. Günter Rudolph Lehrstuhl für Algorithm Engineering (LS 11) Fakultät für Informatik TU Dortmund.
Computational Intelligence Winter Term 2015/16 Prof. Dr. Günter Rudolph Lehrstuhl für Algorithm Engineering (LS 11) Fakultät für Informatik TU Dortmund.
The geometric GMST problem with grid clustering Presented by 楊劭文, 游岳齊, 吳郁君, 林信仲, 萬高維 Department of Computer Science and Information Engineering, National.
Unsupervised Learning Part 2. Topics How to determine the K in K-means? Hierarchical clustering Soft clustering with Gaussian mixture models Expectation-Maximization.
Copyright © Cengage Learning. All rights reserved.
Unsupervised Learning
Copyright © Cengage Learning. All rights reserved.
Computational Intelligence
Fuzzy Logic in Pattern Recognition
Computational Intelligence
Computational Intelligence
Chapter 6: Discrete Probability
Computation of the solutions of nonlinear polynomial systems
Computational Intelligence
On statistical models of cluster stability Z. Volkovich a, b, Z
Computational Intelligence
Computational Intelligence
Riemann Sums and the Definite Integral
Clustering (3) Center-based algorithms Fuzzy k-means
Clustering Evaluation The EM Algorithm
Chapter 10 Two-Sample Tests and One-Way ANOVA.
CSE 4705 Artificial Intelligence
Computational Intelligence
Computational Intelligence
Critical Issues with Respect to Clustering
Computational Intelligence
Computational Intelligence
Computational Intelligence
Basic Counting.
Computational Intelligence
Computational Intelligence
Cluster Validity For supervised classification we have a variety of measures to evaluate how good our model is Accuracy, precision, recall For cluster.
Basic Counting Lecture 9: Nov 5, 6.
LECTURE 21: CLUSTERING Objectives: Mixture Densities Maximum Likelihood Estimates Application to Gaussian Mixture Models k-Means Clustering Fuzzy k-Means.
Fuzzy Clustering Algorithms
Computational Intelligence
Clustering.
Computational Intelligence
Computational Intelligence
Data Mining CSCI 307, Spring 2019 Lecture 24
Computational Intelligence
Computational Intelligence
Unsupervised Learning
Presentation transcript:

Computational Intelligence Winter Term 2017/18 Prof. Dr. Günter Rudolph Lehrstuhl für Algorithm Engineering (LS 11) Fakultät für Informatik TU Dortmund

Plan for Today Fuzzy Clustering G. Rudolph: Computational Intelligence ▪ Winter Term 2017/18 2 G. Rudolph: Computational Intelligence ▪ Winter Term 2017/18 2

Cluster Formation and Analysis Introductory Example: Textile Industry → production of T-shirts (for men) best for producer : one size best for consumer: made-to-measure vs.  compromize: S, M, L, XL, 2XL 5 sizes → OK, but which lengths for which size? G. Rudolph: Computational Intelligence ▪ Winter Term 2017/18 3 G. Rudolph: Computational Intelligence ▪ Winter Term 2017/18 3

Cluster Formation and Analysis idea:  select, say, 2000 men at random and measure their “body lengths“ arm‘s length, collar size, chest girth, …  arrange these 2000 men into five disjoint groups such that – deviations from mean of group as small as possible – differences between group means as large as possible in general: arrange objects into groups / clusters such that – elements within a cluster are as homogeneous as possible – elements across clusters are as heterogeneous as possible G. Rudolph: Computational Intelligence ▪ Winter Term 2017/18 4 G. Rudolph: Computational Intelligence ▪ Winter Term 2017/18 4

Cluster Formation and Analysis numerical example: 1000 points uniformly sampled in [0,1] x [0,1]  form 5 cluster G. Rudolph: Computational Intelligence ▪ Winter Term 2017/18 5 G. Rudolph: Computational Intelligence ▪ Winter Term 2017/18 5

Hard / Crisp Clustering given data points x1, x2, …, xN objective: group data points into cluster such that - points within cluster are as homogeneous as possible - points across clusters are as heterogeneous as possible  crisp clustering is just a partitioning of data set { x1, x2, …, xN }, i.e., { x1, x2, …, xN } and where is Cluster and denotes the number of clusters. Constraint: hence G. Rudolph: Computational Intelligence ▪ Winter Term 2017/18 6 G. Rudolph: Computational Intelligence ▪ Winter Term 2017/18 6

Hard / Crisp Clustering Complexity: How many choices to assign N objects into clusters? more precisely: → objects are distinguishable / labelled → clusters are nondistinguishable / unlabelled and nonempty  Stirling number of 2nd kind  enumeration hopeless!  iterative improvement procedure required! G. Rudolph: Computational Intelligence ▪ Winter Term 2017/18 7 G. Rudolph: Computational Intelligence ▪ Winter Term 2017/18 7

Hard / Crisp Clustering idea: define objective function that measures compactness of clusters and quality of partition → elements in cluster Cj should be as homogeneous as possible! → sum of squared distances to unknown center y should be as small as possible → (Euclidean norm) G. Rudolph: Computational Intelligence ▪ Winter Term 2017/18 8 G. Rudolph: Computational Intelligence ▪ Winter Term 2017/18 8

Hard / Crisp Clustering → elements in each cluster Cj should be as homogeneous as possible! → Definition Theorem G. Rudolph: Computational Intelligence ▪ Winter Term 2017/18 9 G. Rudolph: Computational Intelligence ▪ Winter Term 2017/18 9

Crisp K-Means Clustering G. Rudolph: Computational Intelligence ▪ Winter Term 2017/18 10 G. Rudolph: Computational Intelligence ▪ Winter Term 2017/18 10

From Crisp to Fuzzy Clustering objective for crisp clustering: → rewrite objective: expresses membership objective for fuzzy clustering: G. Rudolph: Computational Intelligence ▪ Winter Term 2017/18 11 G. Rudolph: Computational Intelligence ▪ Winter Term 2017/18 11

Fuzzy K-Means Clustering where subject to G. Rudolph: Computational Intelligence ▪ Winter Term 2017/18 12 G. Rudolph: Computational Intelligence ▪ Winter Term 2017/18 12

Fuzzy K-Means Clustering two questions: ad a)  → weighted mean!  G. Rudolph: Computational Intelligence ▪ Winter Term 2017/18 13 G. Rudolph: Computational Intelligence ▪ Winter Term 2017/18 13

Fuzzy K-Means Clustering ad b) apply Lagrange multiplier method:   G. Rudolph: Computational Intelligence ▪ Winter Term 2017/18 14 G. Rudolph: Computational Intelligence ▪ Winter Term 2017/18 14

Fuzzy K-Means Clustering after insertion: problems: - choice of K calculate quality measure for each #cluster; then choose best - choice of m try some values; typical: m=2; use interval → fuzzy type-2 G. Rudolph: Computational Intelligence ▪ Winter Term 2017/18 15 G. Rudolph: Computational Intelligence ▪ Winter Term 2017/18 15

Example: Special Case |Ji| > 1 black dot is center of - red cluster - blue cluster - yellow cluster in case of equal weights uij = 1 / |Ji| for j  Ji appears plausible but: different values algorithmically better → cluster centers more likely to separate again (→ tiny randomization?) G. Rudolph: Computational Intelligence ▪ Winter Term 2017/18 16 G. Rudolph: Computational Intelligence ▪ Winter Term 2017/18 16

Measures for Cluster Quality  Partition Coefficient ( “larger is better“ ) → crisp partition → entirely fuzzy  Partition Entropy ( “smaller is better“ ) → entirely fuzzy → crisp partition G. Rudolph: Computational Intelligence ▪ Winter Term 2017/18 17 G. Rudolph: Computational Intelligence ▪ Winter Term 2017/18 17