3D Shape Histograms for Similarity Search and Classification in Spatial Databases. Mihael Ankerst,Gabi Kastenmuller, Hans-Peter-Kriegel,Thomas Seidl Univ.

Slides:



Advertisements
Similar presentations
Distinctive Image Features from Scale-Invariant Keypoints
Advertisements

Distinctive Image Features from Scale-Invariant Keypoints David Lowe.
Aggregating local image descriptors into compact codes
Principal Component Analysis Based on L1-Norm Maximization Nojun Kwak IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
Biomedical Person Identification via Eye Printing Masoud Alipour Ali Farhadi Ali Farhadi Nima Razavi.
Presented by Xinyu Chang
電腦視覺 Computer and Robot Vision I
November 12, 2013Computer Vision Lecture 12: Texture 1Signature Another popular method of representing shape is called the signature. In order to compute.
Extended Gaussian Images
Chapter 8 Content-Based Image Retrieval. Query By Keyword: Some textual attributes (keywords) should be maintained for each image. The image can be indexed.
Similarity Search for Adaptive Ellipsoid Queries Using Spatial Transformation Yasushi Sakurai (NTT Cyber Space Laboratories) Masatoshi Yoshikawa (Nara.
Multimedia DBs. Multimedia dbs A multimedia database stores text, strings and images Similarity queries (content based retrieval) Given an image find.
Image Indexing and Retrieval using Moment Invariants Imran Ahmad School of Computer Science University of Windsor – Canada.
Xianfeng Gu, Yaling Wang, Tony Chan, Paul Thompson, Shing-Tung Yau
Chapter 5: Linear Discriminant Functions
Lecture Notes for CMPUT 466/551 Nilanjan Ray
Reflective Symmetry Detection in 3 Dimensions
Visual Querying By Color Perceptive Regions Alberto del Bimbo, M. Mugnaini, P. Pala, and F. Turco University of Florence, Italy Pattern Recognition, 1998.
Basic Concepts and Definitions Vector and Function Space. A finite or an infinite dimensional linear vector/function space described with set of non-unique.
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John Wiley.
Scale Invariant Feature Transform (SIFT)
Dimension reduction : PCA and Clustering Christopher Workman Center for Biological Sequence Analysis DTU.
Presented by Zeehasham Rasheed
KNN, LVQ, SOM. Instance Based Learning K-Nearest Neighbor Algorithm (LVQ) Learning Vector Quantization (SOM) Self Organizing Maps.
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John Wiley.
San Diego, 06/12/03 San Diego, 06/12/03 Martin Pfeifle, Database Group, University of Munich Using Sets of Feature Vectors for Similarity Search on Voxelized.
A Global Geometric Framework for Nonlinear Dimensionality Reduction Joshua B. Tenenbaum, Vin de Silva, John C. Langford Presented by Napat Triroj.
Smart Traveller with Visual Translator for OCR and Face Recognition LYU0203 FYP.
Atul Singh Junior Undergraduate CSE, IIT Kanpur.  Dimension reduction is a technique which is used to represent a high dimensional data in a more compact.
E.G.M. PetrakisDimensionality Reduction1  Given N vectors in n dims, find the k most important axes to project them  k is user defined (k < n)  Applications:
Dimensionality Reduction
CSci 6971: Image Registration Lecture 5: Feature-Base Regisration January 27, 2004 Prof. Chuck Stewart, RPI Dr. Luis Ibanez, Kitware Prof. Chuck Stewart,
Spatial and Temporal Databases Efficiently Time Series Matching by Wavelets (ICDE 98) Kin-pong Chan and Ada Wai-chee Fu.
Review Rong Jin. Comparison of Different Classification Models  The goal of all classifiers Predicating class label y for an input x Estimate p(y|x)
CS 485/685 Computer Vision Face Recognition Using Principal Components Analysis (PCA) M. Turk, A. Pentland, "Eigenfaces for Recognition", Journal of Cognitive.
Face Recognition Using Neural Networks Presented By: Hadis Mohseni Leila Taghavi Atefeh Mirsafian.
AdvisorStudent Dr. Jia Li Shaojun Liu Dept. of Computer Science and Engineering, Oakland University 3D Shape Classification Using Conformal Mapping In.
Computer vision.
Alignment Introduction Notes courtesy of Funk et al., SIGGRAPH 2004.
Recognition and Matching based on local invariant features Cordelia Schmid INRIA, Grenoble David Lowe Univ. of British Columbia.
Shape Matching for Model Alignment 3D Scan Matching and Registration, Part I ICCV 2005 Short Course Michael Kazhdan Johns Hopkins University.
Alignment and Matching
COMMON EVALUATION FINAL PROJECT Vira Oleksyuk ECE 8110: Introduction to machine Learning and Pattern Recognition.
The X-Tree An Index Structure for High Dimensional Data Stefan Berchtold, Daniel A Keim, Hans Peter Kriegel Institute of Computer Science Munich, Germany.
A Statistical Approach to Speed Up Ranking/Re-Ranking Hong-Ming Chen Advisor: Professor Shih-Fu Chang.
Shape Analysis and Retrieval Statistical Shape Descriptors Notes courtesy of Funk et al., SIGGRAPH 2004.
COLOR HISTOGRAM AND DISCRETE COSINE TRANSFORM FOR COLOR IMAGE RETRIEVAL Presented by 2006/8.
Axial Flip Invariance and Fast Exhaustive Searching with Wavelets Matthew Bolitho.
Computer Vision Lab. SNU Young Ki Baik Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE)
Features-based Object Recognition P. Moreels, P. Perona California Institute of Technology.
SINGULAR VALUE DECOMPOSITION (SVD)
Efficient EMD-based Similarity Search in Multimedia Databases via Flexible Dimensionality Reduction / 16 I9 CHAIR OF COMPUTER SCIENCE 9 DATA MANAGEMENT.
2005/12/021 Content-Based Image Retrieval Using Grey Relational Analysis Dept. of Computer Engineering Tatung University Presenter: Tienwei Tsai ( 蔡殿偉.
2005/12/021 Fast Image Retrieval Using Low Frequency DCT Coefficients Dept. of Computer Engineering Tatung University Presenter: Yo-Ping Huang ( 黃有評 )
Query Sensitive Embeddings Vassilis Athitsos, Marios Hadjieleftheriou, George Kollios, Stan Sclaroff.
A Flexible New Technique for Camera Calibration Zhengyou Zhang Sung Huh CSPS 643 Individual Presentation 1 February 25,
Chapter 13 (Prototype Methods and Nearest-Neighbors )
Multimedia and Time-Series Data When Is “ Nearest Neighbor ” Meaningful? Group member: Terry Chan, Edward Chu, Dominic Leung, David Mak, Henry Yeung, Jason.
Spatial Range Querying for Gaussian-Based Imprecise Query Objects Yoshiharu Ishikawa, Yuichi Iijima Nagoya University Jeffrey Xu Yu The Chinese University.
A Statistical Approach to Texture Classification Nicholas Chan Heather Dunlop Project Dec. 14, 2005.
CS Machine Learning Instance Based Learning (Adapted from various sources)
Computer Vision Lecture 7 Classifiers. Computer Vision, Lecture 6 Oleh Tretiak © 2005Slide 1 This Lecture Bayesian decision theory (22.1, 22.2) –General.
Pattern Classification All materials in these slides were taken from Pattern Classification (2nd ed) by R. O. Duda, P. E. Hart and D. G. Stork, John Wiley.
SIFT Scale-Invariant Feature Transform David Lowe
Scale Invariant Feature Transform (SIFT)
Dimension reduction : PCA and Clustering by Agnieszka S. Juncker
K Nearest Neighbor Classification
REMOTE SENSING Multispectral Image Classification
Recognition and Matching based on local invariant features
Lecture 16. Classification (II): Practical Considerations
Presentation transcript:

3D Shape Histograms for Similarity Search and Classification in Spatial Databases. Mihael Ankerst,Gabi Kastenmuller, Hans-Peter-Kriegel,Thomas Seidl Univ of Munich, Germany

Outline Introduction 3D Shape Similarity Model Quadratic Form Distance Functions Extensibility of Histogram Models Query Processing Experimental Results and Conclusion

Outline Introduction 3D Shape Similarity Model Quadratic Form Distance Functions Extensibility of Histogram Models Query Processing Experimental Results and Conclusion

Introduction Classification  the problem of assigning an appropriate class to the query object Applications -molecular biology, medical imaging mechanical engg., astronomy Objects of same class have some characteristic properties in common. These could be geometric properties, thematic properties.

Classification in Molecular Databases Classification schemata is already available We need a fast filter classification algorithm Dali System - a sophisticated classification algorithm for proteins CATH – hierarchical classification of protein domain structures Four levels – class, architecture, topology and homologous super family.

Nearest Neighbor Classification In general classification is done after training Object is assigned if it matches the description of the class Nearest neighbor classifiers –find the nearest neighbor and return its class K- nearest neighbors - #k, Weights of neighbors

Geometry Based Similarity Search Spatial objects transformed into high dimensional vector space In 2D shapes can be represented as ordered set of surface points, approx rectangular coverings etc. Section Coding technique – each polygon’s circumcircle is decomposed into number of sectors, and each of these sectors are normalized. Similarity is defined in terms of Euclidean distance between resulting feature vectors.

Invariance Properties Similarity models need to incorporate invariance against translation, rotation, scaling etc. Most of the methods include a preprocessing step such as rotation of objects to a normalized orientation, translation of center of mass to origin etc. Robustness against errors is not considered in most of these models

Outline Introduction 3D Shape Similarity Model Quadratic Form Distance Functions Extensibility of Histogram Models Query Processing Experimental Results and Conclusion

3D Shape Similarity Model We extend the concept of section coding technique to 3D. Shape Histograms – feature vectors Quadratic Distance Function

Shape Histograms Feature transform maps a complex object onto a feature vector in a multidimensional space. 3D shape histograms are also feature vectors Based on partitioning the space into complete and disjoint cells called the bins of the histogram We can use any space (geometric, thematic etc.)

Shell Model 3D space is decomposed into concentric shells around the center point Independent of rotation around the center Radii of the shells are determined from the extension of the objects Shells of uniform thickness

Sector Model 3D space is decomposed into sectors that emerge from the center point of the model Distribute points uniformly on the surface of the sphere. The Voronoi diagram gives an appropriate decomposition of the space.

Combined Model Combination of shell and sector models Results in a higher dimensionality We can different combinations of shells and sectors for the same dimensionality

Euclidean Distance Euclidean Distance between two N dimensional vectors p and q is given by Individual components of the feature vectors are assumed to be independent No relationships of the components such as substitutability and compensability may be regarded

Euclidean Distance Consider 3 objects a, b and c We can clearly see ‘a and b’ are closely related than ‘a and c’ or ‘b and c’ However due to rotation, the peaks of ‘a’ and ‘b’ are mapped into different bins and hence the Euclidean distance does not reflect similarity in this case

Outline Introduction 3D Shape Similarity Model Quadratic Form Distance Functions Extensibility of Histogram Models Query Processing Experimental Results and Conclusion

Quadratic Form Distance Function Quadratic form distance function is defined in terms of similarity matrix ‘A’ The components a ij of A represent similarity of the components i and j in the underlying space Euclidean distance is a specific case of Quad Form Distance where A= I, the Identity Matrix

Quadratic Form Distance Functions Euclidean distance of two vectors is totally determined Weighted Euclidean distance is a little more flexible, for it controls the effect of individual vector component onto the overall distance On top of this, General Quad form distance function also specifies cross-dependencies of the dimensions

Quadratic Form Distance Functions The neighborhood of the bins can be represented as the similarity weights Let d(i,j) represent the distance of the cells that correspond to bin i and j For shells the bin distance is the difference in the corresponding radii For sectors the bin distance is the difference in the angles of sector centers

Quadratic Form Distance Functions When provided with appropriate distance function, the similarity matrix can be computed as a ij = e -σ.d(i,j) where the parameter σ controls the global shape of the similarity matrix.

Invariance Properties During normalization, we perform translation and rotation of all objects Translation is done such that the COM maps onto the Origin Principal Axes Transform is done This generally leads to unique orientation of the object

Principal Axes Transform Compute the Covariance matrix for a given 3D set of points (x,y,z)

Principal Axes Transform The eigen vectors of this matrix represent the principal axes of the original 3D point set The eigen values indicate the variance of the points in the respective direction As a result of PAT all the covariances of the transformed points vanish

Outline Introduction 3D Shape Similarity Model Quadratic Form Distance Functions Extensibility of Histogram Models Query Processing Experimental Results and Conclusion

Extensibility of Histogram Models Along with spatial properties we can also consider thematic properties General approach to manage both thematic and spatial properties is to use combined histograms Combined histogram is the cartesian product of the individual histograms

Outline Introduction 3D Shape Similarity Model Quadratic Form Distance Functions Extensibility of Histogram Models Query Processing Experimental Results and Conclusion

Query Processing In case of Quad Form Distance Function, the evaluation time of a single database object increases quadratically with dimension

Optimal Multistep k- Nearest Neighbor Search In order to achieve a good performance, the paradigm of mutlistep query processing is used An index-based filter step produces a set of candidates Refinement step performs the expensive exact evaluation of the candidates Filter is responsible for completeness and refinement for correctness

Optimal Multistep k- Nearest Neighbor Search Based on multi-dimensional index structure, the filter step performs an incremental ranking objects ordered by their increasing filter distance to the query are reported In order to guarantee no false dismissals caused by the filter step, d j (p,q) ≤ d o (p,q) Where d j =filter distance and d 0 = object distance

Reduction in Dimensionality of Quadratic Forms Objects in high dimensional spaces are managed by reducing their dimensionality Typically this is done by Principal Component Analysis, Discrete Fourier transform, Similarity Matrix decomposition, Feature Subselection etc. These approaches can also be used in case of Quadratic Form Distance

Reduction in Dimensionality of Quadratic Forms An algorithm to reduce the similarity matrix from a high-dim. space down to a low-dim. space was developed in the context of multimedia databases. The method guarantees three things the reduced distance function is a lower bound of the given high- dimensional distance function. the reduced distance function again is a quadratic form the reduced distance function is the greatest of all lower-bounding distance functions in the reduced space.

Experimental Evaluation Data is taken from Brookhaven Protein Databank. Molecules are represented as surface points for the computation of shape histograms Reduced Feature Vectors for the filter step are managed by a X-tree of dimension 10.

Experimental Evaluation Similarity Matrices are computed by an adapted formula from where the similarity weights a ij of bin i and j are defined as a ij = e -σ.d(i,j) σ = 10

Basic Similarity Search

Classification by Shape Similarity Every class has at least two molecules From Preprocessing, 3422 proteins have been classified into 281 classes 3models pure shell model, pure sector model and combined model have been considered. The accuracy for the combined model is the best

Classification by Shape Similarity