Computer Vision Group, University of BonnVision Laboratory, Stanford University Abstract This paper empirically compares nine image dissimilarity measures that are based on distributions of color and texture features summarizing over 1,000 CPU hours of computational experiments. Ground truth is collected via a novel random sampling scheme for color, and via an image partitioning method for texture. Quantitative performance evaluations are given for classification, image retrieval, and segmentation tasks, and for a wide variety of dissimilarity measures. It is demonstrated how the selection of a measure, based on large scale evaluation, substantially improves the quality of classification, retrieval, and unsupervised segmentation of color and texture images.
Computer Vision Group, University of BonnVision Laboratory, Stanford University Goals +compare distribution-based image dissimilarity measures +evaluate dependency on parameter settings +develop generic and statistically sound benchmarking methodology +examine influence in different applications: classification, retrieval, annotation and unsupervised segmentation
Computer Vision Group, University of BonnVision Laboratory, Stanford University Image Representation Distributions: –adaptive binning (multivariate histograms) –marginal histograms –cumulative marginal histograms
Computer Vision Group, University of BonnVision Laboratory, Stanford University Image Representation dissimilarity measures: for multivariate (full) distributions for marginal distributions Color: CIELab color space Texture: Gabor filter responses
Computer Vision Group, University of BonnVision Laboratory, Stanford University Heuristic Dissimilarity Measures Minkowski-distance e.g. p = 1 [see 8] (Histogram Intersection), [see 9] Weighted-Mean-Variance (WMV) [see 4]
Computer Vision Group, University of BonnVision Laboratory, Stanford University Statistical Dissimilarity Measures Kolmogorov-Smirnoff distance (KS) [see 2] Cramer/Von Mises (CvM) -statistic [see 6]
Computer Vision Group, University of BonnVision Laboratory, Stanford University Information-Theoretic Measures Kullback-Leibler divergence (KL) [see 5] Jeffrey divergence (JD) [see 6]
Computer Vision Group, University of BonnVision Laboratory, Stanford University Ground Distance Measures Quadratic Form (QF) via similarity matrix A to incorporate similarities between bins [see 3] Earth Movers Distance (EMD) by solving the transportation problem for the optimal admissible flow g ij between the two distributions. d ij is the dissimilarity between bins. [see 7]
Computer Vision Group, University of BonnVision Laboratory, Stanford University Properties
Computer Vision Group, University of BonnVision Laboratory, Stanford University Methodology quality measure: separating into different tasks (classification, retrieval, segmentation) parameters: select best possible for every measure by exhaustive evaluation evaluate processing steps separately: such as representation, dissimilarity measures, application ground truth: collected by sampling given images
Computer Vision Group, University of BonnVision Laboratory, Stanford University Parameter Settings exhaustive search over parameter values: –K nearest neighbors ( k = 1, 3, 5, 7) –sample size:( color: 4, 8, 16, 32, 64 pixels texture: 8 2, 16 2, 32 2, 64 2, 128 2, pixels) –number of bins: ( 4, 8, 16, 32, 64, 128, 256; for EMD only for 4, 8, 16, 32) –number of Gabor filters: (12, 24, 40) quality measures: –classification: K-NN classifier with leave-one-out –image retrieval: precision vs. number of retrieved images –unsupervised segmentation: pixel-wise error
Computer Vision Group, University of BonnVision Laboratory, Stanford University Results: Texture Segmentation + Unsupervised Grouping by normalized pairwise clustering [6]
Computer Vision Group, University of BonnVision Laboratory, Stanford University Results: Color Classification 94 images from Corel Database, 16 Samples from each image Full distributions:
Computer Vision Group, University of BonnVision Laboratory, Stanford University Results: Color Classification Marginal distributions:
Computer Vision Group, University of BonnVision Laboratory, Stanford University Results: Texture Classification 94 images from Brodatz Album, 16 samples from each image Full distributions:
Computer Vision Group, University of BonnVision Laboratory, Stanford University Results: Texture Classification Marginal distributions:
Computer Vision Group, University of BonnVision Laboratory, Stanford University Results: Color Retrieval
Computer Vision Group, University of BonnVision Laboratory, Stanford University Results: Texture Retrieval
Computer Vision Group, University of BonnVision Laboratory, Stanford University Conclusion no overall best measure, but different tools for different tasks marginal histograms and aggregate measures good for large feature spaces and small samples multivariate histograms effective on large sample sizes and/or well-adapted binning EMD attractive for moderate similarities
Computer Vision Group, University of BonnVision Laboratory, Stanford University Literature [1]M.Flickner et al. Query by image and video content: The cubic system. IEEE Computer [2]D. Geman et al. Boundary detection by constraint optimization. PAMI [3]J. Hafner et al. Efficient color histogram indexing for quadratic form distance function. PAMI [4]B. Manjunath and W. Ma. Texture features for browsing and retrieval of image data. PAMI [5]T. Ojala et al. A comparative study of texture measures with classification based on feature distributions. Pattern Recognition [6]J. Puzicha et al. Non-parametric similarity measures for unsupervised texture segmentation and image retrieval. CVPR [7]Y. Rubner et al. A metric for distributions with applications to image databases. ICCV [8]M. Swain and D. Ballard. Color indexing. IJCV [9]H. Voorhees and T. Poggio. Computing texture boundaries from images. Nature 1988.
Computer Vision Group, University of BonnVision Laboratory, Stanford University
Computer Vision Group, University of BonnVision Laboratory, Stanford University
Computer Vision Group, University of BonnVision Laboratory, Stanford University
Computer Vision Group, University of BonnVision Laboratory, Stanford University Results: Texture Segmentation +Evaluated over database of 100 Brodatz images