5.3. Mapper on 3D Shape Database

Slides:



Advertisements
Similar presentations
Section 2.5: Graphs and Trees
Advertisements

Original Figures for "Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression Monitoring"
Clustering.
Clustering Categorical Data The Case of Quran Verses
Indexing and Retrieval of Dynamic Brain Images*: Construction of Time-Space Graphs for Cognitive Processes in Human Brain Ulukbek Ibraev, Ph.D. Candidate,
University of CreteCS4831 The use of Minimum Spanning Trees in microarray expression data Gkirtzou Ekaterini.
L16: Micro-array analysis Dimension reduction Unsupervised clustering.
Finding generators for H1.
The Shortest Path Problem
Final Exam Review CS485/685 Computer Vision Prof. Bebis.
Clustering of DNA Microarray Data Michael Slifker CIS 526.
ArrayCluster: an analytic tool for clustering, data visualization and module finder on gene expression profiles 組員:李祥豪 謝紹陽 江建霖.
Computer Vision Lecture 5. Clustering: Why and How.
IEEE Int'l Symposium on Signal Processing and its Applications 1 An Unsupervised Learning Approach to Content-Based Image Retrieval Yixin Chen & James.
CLUSTERING. Overview Definition of Clustering Existing clustering methods Clustering examples.
Sampling distributions rule of thumb…. Some important points about sample distributions… If we obtain a sample that meets the rules of thumb, then…
Computational Biology Clustering Parts taken from Introduction to Data Mining by Tan, Steinbach, Kumar Lecture Slides Week 9.
Data Mining Course 2007 Eric Postma Clustering. Overview Three approaches to clustering 1.Minimization of reconstruction error PCA, nlPCA, k-means clustering.
Clustering High-Dimensional Data. Clustering high-dimensional data – Many applications: text documents, DNA micro-array data – Major challenges: Many.
Computational Biology Group. Class prediction of tumor samples Supervised Clustering Detection of Subgroups in a Class.
CSE182 L14 Mass Spec Quantitation MS applications Microarray analysis.
Data Visualization Fall The Data as a Quantity Quantities can be classified in two categories: Intrinsically continuous (scientific visualization,
Welcome to MATH:7450 (22M:305) Topics in Topology: Scientific and Engineering Applications of Algebraic Topology Week 1: Introduction to Topological Data.
Source: CSE 214 – Computer Science II Graphs.
Detection of closed sharp edges in point clouds Speaker: Liuyu Time:
Evaluation of Gender Classification Methods with Automatically Detected and Aligned Faces Speaker: Po-Kai Shen Advisor: Tsai-Rong Chang Date: 2010/6/14.
Sept 25, 2013: Applicable Triangulations.
Computational Biology
Unsupervised Learning
MATH:7450 (22M:305) Topics in Topology: Scientific and Engineering Applications of Algebraic Topology Nov 22, 2013: Topological methods for exploring low-density.
Hierarchical clustering
3.1 Clustering Finding a good clustering of the points is a fundamental issue in computing a representative simplicial complex. Mapper does not place any.
Date of download: 10/13/2017 Copyright © ASME. All rights reserved.
Copyright © Cengage Learning. All rights reserved.
Clustering Manpreet S. Katari.
Day 38 – Line of Best Fit (Day 2)
We propose a method which can be used to reduce high dimensional data sets into simplicial complexes with far fewer points which can capture topological.
Sept 23, 2013: Image data Application.
MATH:7450 (22M:305) Topics in Topology: Scientific and Engineering Applications of Algebraic Topology Nov 22, 2013: Topological methods for exploring low-density.
Constraint Satisfaction Problems vs. Finite State Problems
Unsupervised Learning
Copyright © Cengage Learning. All rights reserved.
Hansheng Xue School of Computer Science and Technology
plosone. org/article/info%3Adoi%2F %2Fjournal. pone
Graph Analysis by Persistent Homology
3.1 Clustering Finding a good clustering of the points is a fundamental issue in computing a representative simplicial complex. Mapper does not place any.
Fitting Curve Models to Edges
Clustering.
Guest lecturer: Isabel K. Darcy
Jianping Fan Dept of CS UNC-Charlotte
Hierarchical clustering approaches for high-throughput data
Suppose your data points live in Rn.
(A) Graph depicting interaction between the five regulatory motifs measured by synergy and cooccurrence. (A) Graph depicting interaction between the five.
Clustering.
Multivariate Statistical Methods
Global approach to the diagnosis of leukemia using gene expression profiling by Torsten Haferlach, Alexander Kohlmann, Susanne Schnittger, Martin Dugas,
By Charlie Fractal Mentor: Dr. Vignesh Subbian
6.1: Discrete and Continuous Random Variables
Noah Snavely.
Project HW 6 (Due 3/4) points You are given the following dataset to analyze using TDA Mapper a.) What do you expect the output of TDA mapper to.
Dynamic Processes Shape Spatiotemporal Properties of Retinal Waves
Day 38 – Line of Best Fit (Day 2)
Interpretation of Similar Gene Expression Reordering
Group 9 – Data Mining: Data
8.3 Estimating a Population Mean
Knowledge-Guided Sample Clustering
Clustering.
Introduction to Artificial Intelligence Lecture 22: Computer Vision II
Unsupervised Learning
Presentation transcript:

5.3. Mapper on 3D Shape Database The top row shows the rendering of one model from each of the 7 classes. The bottom row shows the same model colored by the E1 function (setting p = 1 in equation 4–1) computed on the mesh.

In this section we apply Mapper to a collection of 3D shapes from a publicly available database of objects [SP]. These shapes correspond to seven different classes of objects (camels, cats, elephants, faces, heads, horses and lions). For each class there are between 9 and 11 objects, which are different poses of the same shape, see Figure 7. This repository contains many 3D shapes in the form of triangulated meshes. We preprocessed the shapes as follows. Let the set of points be P. From these points, we selected 4000 landmark points using a Euclidean maxmin procedure as described in [dSVG04]. Let this set be Y = { Pi, i ∈ L}, where L is the set of indices of the landmarks.

Choosing Landmark points: MaxMin data points. In this very simplified case my data points lie in a two-dimensional plane. Normally data points are high dimensional. For example, I may be comparing the expression or thousands of genes in tumor cells to healthy cells using microarray data. OR I might be comparing politicians voting records. Or I might be comparing the stats of basketball players. These three applications were all, by the way, published by Lum et al this past February in Nature’s Scientific Reports. I have included a link to their paper on my youtube site. http://www.nature.com/srep/2013/130207/srep01236/full/srep01236.html 1.) choose point l1 randomly

Choosing Landmark points: MaxMin data points. In this very simplified case my data points lie in a two-dimensional plane. Normally data points are high dimensional. For example, I may be comparing the expression or thousands of genes in tumor cells to healthy cells using microarray data. OR I might be comparing politicians voting records. Or I might be comparing the stats of basketball players. These three applications were all, by the way, published by Lum et al this past February in Nature’s Scientific Reports. I have included a link to their paper on my youtube site. http://www.nature.com/srep/2013/130207/srep01236/full/srep01236.html 1.) choose point l1 randomly

Choosing Landmark points: MaxMin data points. In this very simplified case my data points lie in a two-dimensional plane. Normally data points are high dimensional. For example, I may be comparing the expression or thousands of genes in tumor cells to healthy cells using microarray data. OR I might be comparing politicians voting records. Or I might be comparing the stats of basketball players. These three applications were all, by the way, published by Lum et al this past February in Nature’s Scientific Reports. I have included a link to their paper on my youtube site. http://www.nature.com/srep/2013/130207/srep01236/full/srep01236.html 2.) If {l1, …, lk-1} have been chosen, choose lk such that {l1, …, lk-1} is in D - {l1, …, lk-1} and min {d(lk, l1), …, d(lk, lk-1)} ≥ min {d(v, l1), …, d(v, lk-1)}

Choosing Landmark points: MaxMin data points. In this very simplified case my data points lie in a two-dimensional plane. Normally data points are high dimensional. For example, I may be comparing the expression or thousands of genes in tumor cells to healthy cells using microarray data. OR I might be comparing politicians voting records. Or I might be comparing the stats of basketball players. These three applications were all, by the way, published by Lum et al this past February in Nature’s Scientific Reports. I have included a link to their paper on my youtube site. http://www.nature.com/srep/2013/130207/srep01236/full/srep01236.html 2.) If {l1} chosen, choose l2 such that {l1} is in D - {l1, …, lk-1} and min {d(l2, l1)} ≥ min {d(v, l1)}

Choosing Landmark points: MaxMin data points. In this very simplified case my data points lie in a two-dimensional plane. Normally data points are high dimensional. For example, I may be comparing the expression or thousands of genes in tumor cells to healthy cells using microarray data. OR I might be comparing politicians voting records. Or I might be comparing the stats of basketball players. These three applications were all, by the way, published by Lum et al this past February in Nature’s Scientific Reports. I have included a link to their paper on my youtube site. http://www.nature.com/srep/2013/130207/srep01236/full/srep01236.html 2.) If {l1, …, lk-1} have been chosen, choose lk such that {l1, …, lk-1} is in D - {l1, …, lk-1} and min {d(lk, l1), …, d(lk, lk-1)} ≥ min {d(v, l1), …, d(v, lk-1)}

Choosing Landmark points: MaxMin data points. In this very simplified case my data points lie in a two-dimensional plane. Normally data points are high dimensional. For example, I may be comparing the expression or thousands of genes in tumor cells to healthy cells using microarray data. OR I might be comparing politicians voting records. Or I might be comparing the stats of basketball players. These three applications were all, by the way, published by Lum et al this past February in Nature’s Scientific Reports. I have included a link to their paper on my youtube site. http://www.nature.com/srep/2013/130207/srep01236/full/srep01236.html 2.) If {l1, l2} have been chosen, choose l2 such that {l1, …, lk-1} is in D - {l1, …, lk-1} and min {d(l3, l1), d(l3, l2)} ≥ min {d(v, l1), d(v, l2)}

Choosing Landmark points: MaxMin data points. In this very simplified case my data points lie in a two-dimensional plane. Normally data points are high dimensional. For example, I may be comparing the expression or thousands of genes in tumor cells to healthy cells using microarray data. OR I might be comparing politicians voting records. Or I might be comparing the stats of basketball players. These three applications were all, by the way, published by Lum et al this past February in Nature’s Scientific Reports. I have included a link to their paper on my youtube site. http://www.nature.com/srep/2013/130207/srep01236/full/srep01236.html 2.) If {l1, …, lk-1} have been chosen, choose lk such that {l1, …, lk-1} is in D - {l1, …, lk-1} and min {d(lk, l1), …, d(lk, lk-1)} ≥ min {d(v, l1), …, d(v, lk-1)}

In order to use this as the point cloud input to Mapper, we computed distances between the points in Y as follows. First, we computed the adjacency matrix A for the set P by using the mesh information e.g. if Pi and Pj were connected on the given mesh, then A(i, j) = d(Pi,Pj), where d(x, y) is the Euclidean distance between x, y ∈ P. Finally, the matrix of distances D between points of Y was computed using Dijkstra’s algorithm on the graph specified by A.

In order to apply Mapper to this set of shapes we chose to use E1(x) as our filter function (setting p = 1 in equation 4–1), see Figure 7. In order to minimize the effect of bias due to the distribution of local features we used a global threshold for the clustering within all intervals which was determined as follows. We found the threshold for each interval by the histogram heuristic described in Section 3.1, and used the median of these thresholds as the global threshold.

The output of Mapper in this case (single filter function) is a graph. We use GraphViz to produce a visualization of this graph. Mapper results on a few shapes from the database

1. Mapper is able to recover the graph representing the skeleton of the shape with fair accuracy. As an example, consider the horse shapes. In both cases, the three branches at the bottom of the recovered graph represent the font two legs and the neck. The blue colored section of the graph represents the torso and the top three branches represent the hind legs and the tail. In these examples, endowing the nodes of the recovered graph with the mean position of the clusters they represent would recover a skeleton. 2. Different poses of the same shape have qualitatively similar Mapper results however different shapes produce significantly different results. This suggests that certain intrinsic information about the shapes, which