Data Clustering (a very short introduction) Intuition: grouping of data into clusters so that elements from the same cluster are more similar to each other.

Slides:



Advertisements
Similar presentations
Prepared by Dr. Taha MAhdy
Advertisements

SEEM Tutorial 4 – Clustering. 2 What is Cluster Analysis?  Finding groups of objects such that the objects in a group will be similar (or.
Data Mining Cluster Analysis: Basic Concepts and Algorithms
Agglomerative Hierarchical Clustering 1. Compute a distance matrix 2. Merge the two closest clusters 3. Update the distance matrix 4. Repeat Step 2 until.
Study Group Randomized Algorithms 21 st June 03. Topics Covered Game Tree Evaluation –its expected run time is better than the worst- case complexity.
PCA + SVD.
Closest Pair Given a set S = {p1, p2,..., pn} of n points in the plane find the two points of S whose distance is the smallest. Images in this presentation.
Assessment. Schedule graph may be of help for selecting the best solution Best solution corresponds to a plateau before a high jump Solutions with very.
Slides by Olga Sorkine, Tel Aviv University. 2 The plan today Singular Value Decomposition  Basic intuition  Formal definition  Applications.
Jinxiang Chai CSCE441: Computer Graphics Coordinate & Composite Transformations 0.
CS292 Computational Vision and Language Pattern Recognition and Classification.
Lecture 15 Today Transformations between coordinate systems 1.Cartesian to cylindrical transformations 2.Cartesian to spherical transformations.
Polar Differentiation
Chapter 4 Linear Transformations. Outlines Definition and Examples Matrix Representation of linear transformation Similarity.
Chapter 7 Matrix Mathematics Matrix Operations Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Clustering Unsupervised learning Generating “classes”
Vector calculus 1)Differential length, area and volume
Point set alignment Closed-form solution of absolute orientation using unit quaternions Berthold K. P. Horn Department of Electrical Engineering, University.
POLAR COORDINATES (Ch )
Chapter 9 Superposition and Dynamic Programming 1 Chapter 9 Superposition and dynamic programming Most methods for comparing structures use some sorts.
Mathematics. Session Three Dimensional Geometry–1(Straight Line)
1 1.8 © 2016 Pearson Education, Inc. Linear Equations in Linear Algebra INTRODUCTION TO LINEAR TRANSFORMATIONS.
VIPIN VIJAYAN 11/11/03 A Performance Analysis of Two Distributed Computing Abstractions.
Visual Information Systems Recognition and Classification.
Conformational Space.  Conformation of a molecule: specification of the relative positions of all atoms in 3D-space,  Typical parameterizations:  List.
Texture Detection & Texture related clustering C601 Project Jing Qin Fall 2003.
Problem y For the machine element shown, locate the z coordinate
Lessons from Programme Evaluation in Romania First Annual Conference on Evaluation Bucharest 18 February 2008.
Section 1.1 Rectangular Coordinates; Graphing Utilities; Introduction to Graphing Equations.
CS 8751 ML & KDDData Clustering1 Clustering Unsupervised learning Generating “classes” Distance/similarity measures Agglomerative methods Divisive methods.
Ch. Eick: Introduction to Hierarchical Clustering and DBSCAN 1 Remaining Lectures in Advanced Clustering and Outlier Detection 2.Advanced Classification.
Hierarchical Clustering Produces a set of nested clusters organized as a hierarchical tree Can be visualized as a dendrogram – A tree like diagram that.
1 a1a1 A1A1 a2a2 a3a3 A2A Mixed Strategies When there is no saddle point: We’ll think of playing the game repeatedly. We continue to assume that.
Ch. Eick: Some Ideas for Task4 Project2 Ideas on Creating Summaries that Characterize Clustering Results Focus: Primary Focus Cluster Summarization (what.
Lecture 11 CS5661 Structural Bioinformatics – Structure Comparison Motivation Concepts Structure Comparison.
CHAPTER 1 COMPLEX NUMBER. CHAPTER OUTLINE 1.1 INTRODUCTION AND DEFINITIONS 1.2 OPERATIONS OF COMPLEX NUMBERS 1.3 THE COMPLEX PLANE 1.4 THE MODULUS AND.
Review of Complex Numbers. Introduction to Complex Numbers Complex numbers could be represented by the form Where x and y are real numbers Complex numbers.
1.8 Solving Absolute Value Equations and Inequalities Objectives: Write, solve, and graph absolute value equations and inequalities in mathematical and.
1. Randomized Hill Climbing Neighborhood Randomized Hill Climbing: Sample p points randomly in the neighborhood of the currently best solution; determine.
Jinxiang Chai CSCE441: Computer Graphics Coordinate & Composite Transformations 0.
CALCULUS III CHAPTER 5: Orthogonal curvilinear coordinates
Data Mining Cluster Analysis: Basic Concepts and Algorithms Lecture Notes Introduction to Data Mining by Tan, Steinbach, Kumar © Tan,Steinbach, Kumar Introduction.
Find the optimal alignment ? +. Optimal Alignment Find the highest number of atoms aligned with the lowest RMSD (Root Mean Squared Deviation) Find a balance.
COMPUTER GRAPHICS AND LINEAR ALGEBRA AN INTRODUCTION.
University of Utah Introduction to Electromagnetics Lecture 14: Vectors and Coordinate Systems Dr. Cynthia Furse University of Utah Department of Electrical.
Clustering (2) Center-based algorithms Fuzzy k-means Density-based algorithms ( DBSCAN as an example ) Evaluation of clustering results Figures and equations.
Simplify. Complex Numbers Complex Numbers Intro Definition of Pure Imaginary Numbers: For any positive real number, “b” Where i is the imaginary unit.
CA 302 Computer Graphics and Visual Programming
CHAPTER 1 COMPLEX NUMBER.
Add and subtract complex numbers
Problem y Determine the moments of inertia of the shaded area
LINEAR TRANSFORMATIONS
Cartesian Coordinate System
determinant does not exist
Information Organization: Clustering
[ ] [ ] [ ] [ ] EXAMPLE 3 Scalar multiplication Simplify the product:
ME 142 Engineering Computation I
Algorithms.
CSCE441: Computer Graphics Coordinate & Composite Transformations
Large scale multilingual and multimodal integration
Coordinate Transformation in 3D Final Project Presentation
Definition A Polar Coordinate System is a method of locating a point (r, ) in a plane where r is a distance from a point called the pole directed at.
Objectives Write equations and graph circles in the coordinate plane.
Problem y Determine the moments of inertia of the shaded area
SEEM4630 Tutorial 3 – Clustering.
Rectangular Coordinates; Introduction to Graphing Equations
Solving Equations with Absolute Values
Ordered Pair – (11 - 2) CS-708.
Synthesis Evaluation Analysis Application Comprehension Knowledge
Introduction to Machine Learning
Presentation transcript:

Data Clustering (a very short introduction) Intuition: grouping of data into clusters so that elements from the same cluster are more similar to each other than they are to a element from a different cluster. What is the best clustering? I II III

Major decision steps: Pattern representation Definition of a pattern proximity measure Method for clustering Data abstraction (cluster representation, e.g. centroids …) Assessment of output (evaluation of the output)

Proper pattern representation can simplify clustering Identify circles: Cartesian coordinates? -> use polar coordinates (r,θ)

Definition of a pattern proximity measure Similarity between two clusters C1 ={x i } C2 ={y j } Examples: Single Link min ij dist(x i,y j ) < e Complete Link max ij dist(x i,y j ) < e

Pose Clustering Aims to solve the LCP problem. 1)Compute a set of transformations that align one structure with the other. 2)Cluster transformations. 3)Check large clusters. Idea: a large common point set will produce a large number of similar transformations.

Example: Clustering of 3D transformations Goal: Prevent redundant solutions Representation: 1) 3x3 matrix + 1x3 vector Problem: how to measure distance between two transformations? 2) Image of points, T(S) dist(T 1,T 2 )= dist(T 1 (S),T 2 (S)) (for example RMSD or bottleneck) Problem: time complexity Solution: use less points, 3-4 farthest points are stable enough