Fast Algorithm for Nearest Neighbor Search Based on a Lower Bound Tree Yong-Sheng Chen Yi-Ping Hung Chiou-Shann Fuh 8 th International Conference on Computer.

Slides:



Advertisements
Similar presentations
Aggregating local image descriptors into compact codes
Advertisements

Spatio-temporal Databases
Presented by Xinyu Chang
Searching on Multi-Dimensional Data
Efficiently searching for similar images (Kristen Grauman)
A NOVEL LOCAL FEATURE DESCRIPTOR FOR IMAGE MATCHING Heng Yang, Qing Wang ICME 2008.
Image Indexing and Retrieval using Moment Invariants Imran Ahmad School of Computer Science University of Windsor – Canada.
2004/05/03 Clustering 1 Clustering (Part One) Ku-Yaw Chang Assistant Professor, Department of Computer Science and Information.
A Robust Pedestrian Detection Approach Based on Shapelet Feature and Haar Detector Ensembles Wentao Yao, Zhidong Deng TSINGHUA SCIENCE AND TECHNOLOGY ISSNl.
Fast High-Dimensional Feature Matching for Object Recognition David Lowe Computer Science Department University of British Columbia.
Spatio-temporal Databases Time Parameterized Queries.
Object retrieval with large vocabularies and fast spatial matching
Scalable Wavelet Video Coding Using Aliasing- Reduced Hierarchical Motion Compensation Xuguang Yang, Member, IEEE, and Kannan Ramchandran, Member, IEEE.
Techniques and Data Structures for Efficient Multimedia Similarity Search.
1 MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING By Kaan Tariman M.S. in Computer Science CSCI 8810 Course Project.
Introduction to Bioinformatics - Tutorial no. 12
Euripides G.M. PetrakisIR'2001 Oulu, Sept Indexing Images with Multiple Regions Euripides G.M. Petrakis Dept.
Multiple Object Class Detection with a Generative Model K. Mikolajczyk, B. Leibe and B. Schiele Carolina Galleguillos.
Spatial and Temporal Databases Efficiently Time Series Matching by Wavelets (ICDE 98) Kin-pong Chan and Ada Wai-chee Fu.
FLANN Fast Library for Approximate Nearest Neighbors
Fast Subsequence Matching in Time-Series Databases Christos Faloutsos M. Ranganathan Yannis Manolopoulos Department of Computer Science and ISR University.
Fast Pattern Matching. Fast Pattern Matching: Presentation Plan Pattern Matching: Definitions Classic Pattern Matching Algorithms Fast pattern matching.
Digital Camera and Computer Vision Laboratory Department of Computer Science and Information Engineering National Taiwan University, Taipei, Taiwan, R.O.C.
Multimedia and Time-series Data
Digital Camera and Computer Vision Laboratory Department of Computer Science and Information Engineering National Taiwan University, Taipei, Taiwan, R.O.C.
Presented by Tienwei Tsai July, 2005
Nearest Neighbor Searching Under Uncertainty
CSE 185 Introduction to Computer Vision Pattern Recognition 2.
PMLAB Finding Similar Image Quickly Using Object Shapes Heng Tao Shen Dept. of Computer Science National University of Singapore Presented by Chin-Yi Tsai.
Approximate XML Joins Huang-Chun Yu Li Xu. Introduction XML is widely used to integrate data from different sources. Perform join operation for XML documents:
Introduction to The NSP-Tree: A Space-Partitioning Based Indexing Method Gang Qian University of Central Oklahoma November 2006.
M- tree: an efficient access method for similarity search in metric spaces Reporter : Ximeng Liu Supervisor: Rongxing Lu School of EEE, NTU
Fast search methods Pasi Fränti Clustering methods: Part 5 Speech and Image Processing Unit School of Computing University of Eastern Finland
Using Support Vector Machines to Enhance the Performance of Bayesian Face Recognition IEEE Transaction on Information Forensics and Security Zhifeng Li,
An Efficient Search Strategy for Block Motion Estimation Using Image Features Digital Video Processing 1 Term Project Feng Li Michael Su Xiaofeng Fan.
Nearest Neighbor Queries Chris Buzzerd, Dave Boerner, and Kevin Stewart.
Efficient EMD-based Similarity Search in Multimedia Databases via Flexible Dimensionality Reduction / 16 I9 CHAIR OF COMPUTER SCIENCE 9 DATA MANAGEMENT.
2005/12/021 Fast Image Retrieval Using Low Frequency DCT Coefficients Dept. of Computer Engineering Tatung University Presenter: Yo-Ping Huang ( 黃有評 )
An Approximate Nearest Neighbor Retrieval Scheme for Computationally Intensive Distance Measures Pratyush Bhatt MS by Research(CVIT)
Approximate Nearest Neighbors: Towards Removing the Curse of Dimensionality Piotr Indyk, Rajeev Motwani The 30 th annual ACM symposium on theory of computing.
A NOVEL METHOD FOR COLOR FACE RECOGNITION USING KNN CLASSIFIER
Euripides G.M. PetrakisIR'2001 Oulu, Sept Indexing Images with Multiple Regions Euripides G.M. Petrakis Dept. of Electronic.
A Fast LBG Codebook Training Algorithm for Vector Quantization Presented by 蔡進義.
1 Approximate XML Query Answers Presenter: Hongyu Guo Authors: N. polyzotis, M. Garofalakis, Y. Ioannidis.
Fast Query-Optimized Kernel Machine Classification Via Incremental Approximate Nearest Support Vectors by Dennis DeCoste and Dominic Mazzoni International.
Database Management Systems, R. Ramakrishnan 1 Algorithms for clustering large datasets in arbitrary metric spaces.
Content Based Color Image Retrieval vi Wavelet Transformations Information Retrieval Class Presentation May 2, 2012 Author: Mrs. Y.M. Latha Presenter:
Matching of Objects Moving Across Disjoint Cameras Eric D. Cheng and Massimo Piccardi IEEE International Conference on Image Processing
Digital Camera and Computer Vision Laboratory Department of Computer Science and Information Engineering National Taiwan University, Taipei, Taiwan, R.O.C.
Example Apply hierarchical clustering with d min to below data where c=3. Nearest neighbor clustering d min d max will form elongated clusters!
1 Queryy Sampling Based High Dimensional Hybrid Index Junqi Zhang, Xiangdong Zhou Fudan University.
Bayesian Hierarchical Clustering Paper by K. Heller and Z. Ghahramani ICML 2005 Presented by David Williams Paper Discussion Group ( )
On the relevance of facial expressions for biometric recognition Marcos Faundez-Zanuy, Joan Fabregas Escola Universitària Politècnica de Mataró (Barcelona.
CSCI 631 – Foundations of Computer Vision March 15, 2016 Ashwini Imran Image Stitching.
High-Dimensional Data. Topics Motivation Similarity Measures Index Structures.
Strategies for Spatial Joins
Parametric calibration of speed–density relationships in mesoscopic traffic simulator with data mining Adviser: Yu-Chiang Li Speaker: Gung-Shian Lin Date:2009/10/20.
SIMILARITY SEARCH The Metric Space Approach
An Image Database Retrieval Scheme Based Upon Multivariate Analysis and Data Mining Presented by C.C. Chang Dept. of Computer Science and Information.
Nonparametric Density Estimation – k-nearest neighbor (kNN) 02/20/17
IMAGE PROCESSING RECOGNITION AND CLASSIFICATION
Fast nearest neighbor searches in high dimensions Sami Sieranoja
Probabilistic Data Management
Parametric calibration of speed–density relationships in mesoscopic traffic simulator with data mining Adviser: Yu-Chiang Li Speaker: Gung-Shian Lin Date:2009/10/20.
MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING
MACHINE LEARNING TECHNIQUES IN IMAGE PROCESSING
Color Image Retrieval based on Primitives of Color Moments
Chair Professor Chin-Chen Chang Feng Chia University
BIRCH: Balanced Iterative Reducing and Clustering using Hierarchies
Color Image Retrieval based on Primitives of Color Moments
Presentation transcript:

Fast Algorithm for Nearest Neighbor Search Based on a Lower Bound Tree Yong-Sheng Chen Yi-Ping Hung Chiou-Shann Fuh 8 th International Conference on Computer Vision

2 Outline  Introduction  Multilevel Structure and LB-Tree  Agglomerative Clustering  Data Transformation  Winner-Update Search  Conclusions

3 Nearest Neighbor Search Problem  Given a fixed set of s points in R d, P={p 1,p 2,…,p s }  For a query point q, find in P the point, p î, closest to q D1D1 D2D2 DdDd

4 Applications of Nearest Neighbor Search  Object (pattern) recognition  Image matching  Stereo matching  Data compression  Block motion estimation  Vector quantization  Information retrieval in database systems  Image and video databases  DNA sequence databases

5  Space partition methods  k-d tree, 1975  R-tree, 1984  SS-tree, 1996  SR-tree, 1997  Pyramid, 1998  Elimination-based methods  Branch and bound, 1975  Projection elimination, 1975, 1987  Threshold rejection, 1997 Literature Review

6 Central Idea of This Work  We skip distance calculation for a point in database by calculating and comparing its distance lower bound.  Distance lower bound was derive from Minkowski ’ s inequality.  Computational cost of the distance lower bound is less than that of the distance itself.  Optimal search is guaranteed.

7 Central Idea of This Work  Reduce the number of distance lower bounds actually calculated by using  an LB-tree constructed in the preprocessing stage  the winner-update search strategy for traversing the constructed LB-tree  Tighten distance lower bounds by  constructing the LB-tree with an agglomerative clustering method  transforming each data point with  Wavelet transform  KL transform (principal component analysis)

8 Proposed Algorithm for NN Search Sample points in database data transformation Query point data transformation search by using winner-update strategy agglomerative clustering multilevel structure Lower-bound tree

9 Multilevel Structure of a Point  For a point p=[p 1, p 2, …, p d ] in R d, d=2 L, we denote its multilevel structure: {p 0, p 1, …, p L }    EX

10 Lower Bound Tree  Multilevel structures of every points in the database, p 1,p 2,…,p s  Idea of node reduction  Select representatives  Hierarchical, agglomerative clustering

11 Hierarchical, Agglomerative Clustering D1D1 D2D2 Ex: r1r1 r2r2

12 Lower Bound Tree  Representative for each cluster  mean point m l j*  distance r l j * of the farthest point in the cluster from m l j*  Each point p *l satisfies

13 Distance Lower Bound Using LB-Tree  For each point p*, we can derive the lower bound of its distance to the query point q : p *l qlql m l j* r l j*

14 Data Transformation  Why Data Transformation?  Make anterior dims more discriminative than posterior dims.  Lower bound can be tightened.  By data content, two transformations used in this work:  Haar wavelet transform(autocorrelated data)  KL transform (principal component analysis) (object recognition)

15 Winner-Update Search for LB-Tree Traversal Given a query point q Calculate d LB for all the nodes in level 0 Choose the node having the minimum d LB as the temporary winner While the winner is not at the bottom level Replace the winner node with its children Calculate d LB for each new child node Choose the node having the minimum d LB as the temporary winner Output the final winner

16 Other Query Types  Winner-update algorithm can be easily extended to support other useful query types:  Progressive search for k -nearest neighbors  Search for k -nearest neighbors within a distance threshold  Search for neighbors close to the nearest neighbor

17 Conclusions  Fast nearest neighbor search techniques, including  Lower bound tree  Winner-update search strategy  Data transformation  Wavelet transform  Principal component analysis  Various useful query types  According to our experiments on Nayar ’ s object recognition database, our algorithm can be more than one thousand faster than the full search algorithm.

18 Thank you.