Neighbourhood relation preservation (NRP) A rank-based data visualisation quality assessment criterion Jigang Sun PhD studies finished in July 2011 PhD Supervisor : Colin Fyfe, Malcolm Crowe University of the West of Scotland
outline Multidimensional Scaling (MDS); The need for a common quality measure for data visualisation; Local continuity meta-criterion (LCMC); Definition of neighbourhood relation preservation (NRP); Illustration of LCMC and NRP on mappings of data sets created by different methods;
Multidimensional Scaling (MDS) A group of information visualisation methods that projects data points from high dimensional data space to low, typically two dimensional, latent space in which the structure of the original data set can be identified by eye. For example…
By LeftSammon, using graph distances, k=20 Samples of high dimensional data (each image is 28*28=784 dimensions) 2 dimensional projection
Various methods The classical MDS, the stress function to be minimised is defined to be Sammon Mapping (1969) Each method has its own criterion
Instead of we use base function My insight: 1. The above can be performed very efficiently. 2. The higher order Taylor series terms are better for analysis. to create LeftSammon mapping Various methods
Each method has its own criterion.
Mappings of open box By Sammon’s mapping By LeftSammon mapping Mappings can be assessed by eye
By CMDS By LeftExp By RightExp By Isomap Mappings of open box
by Sammon's mappingby LeftSammon mapping Sammon vs LeftSammon mapping Assessing mapping quality by eye is usually difficult
Local continuity meta-criterion (LCMC) Problem: loose constraints
Rank based quality measures Traditional rank is used in trustworthiness and continuity (T&C ) Problem 1: change of intermediate points is not considered p is mapped perfectly since rank of p does not change Rank is discrete; distance is continuous
Rank based quality measures Problem 2: angle constraint is not considered
Neighbourhood relation preservation (NRP)
Assessment to mappings of open box
Mappings of MNIST digits By CMDS By Isomap By LeftExp By RightExp
Assessment of mappings of digits
Conclusions Multidimensional Scaling (MDS); List of objective function of some MDS methods; The need for a common quality measure for data visualisation; Local continuity meta-criterion (LCMC); Definition of Neighbourhood relation preservation (NRP); Comparison of LCMC and NRP on mappings of data sets created by different methods; Thank you! Any questions?