Effective Image Database Search via Dimensionality Reduction Anders Bjorholm Dahl and Henrik Aanæs IEEE Computer Society Conference on Computer Vision.

Slides:

Advertisements

Similar presentations

Distinctive Image Features from Scale-Invariant Keypoints David Lowe.

Advertisements

Presented by Xinyu Chang

Evaluating Color Descriptors for Object and Scene Recognition Koen E.A. van de Sande, Student Member, IEEE, Theo Gevers, Member, IEEE, and Cees G.M. Snoek,

Computer vision: models, learning and inference Chapter 13 Image preprocessing and feature extraction.

Clustering with k-means and mixture of Gaussian densities Jakob Verbeek December 3, 2010 Course website:

TP14 - Indexing local features

Learning Globally-Consistent Local Distance Functions for Shape-Based Image Retrieval and Classification Computer Vision, ICCV IEEE 11th International.

Query Specific Fusion for Image Retrieval

Herv´ eJ´ egouMatthijsDouzeCordeliaSchmid INRIA INRIA INRIA

CS4670 / 5670: Computer Vision Bag-of-words models Noah Snavely Object

Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,

Image Indexing and Retrieval using Moment Invariants Imran Ahmad School of Computer Science University of Windsor – Canada.

Image alignment Image from

Bag-of-features models Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.

CVPR 2008 James Philbin Ondˇrej Chum Michael Isard Josef Sivic

Packing bag-of-features ICCV 2009 Herv´e J´egou Matthijs Douze Cordelia Schmid INRIA.

Large-scale matching CSE P 576 Larry Zitnick

Bag of Features Approach: recent work, using geometric information.

Robust and large-scale alignment Image from

Object retrieval with large vocabularies and fast spatial matching

Lecture 28: Bag-of-words models

Object-based Image Representation Dr. B.S. Manjunath Sitaram Bhagavathy Shawn Newsam Baris Sumengen Vision Research Lab University of California, Santa.

Video Google: Text Retrieval Approach to Object Matching in Videos Authors: Josef Sivic and Andrew Zisserman ICCV 2003 Presented by: Indriyati Atmosukarto.

3D Hand Pose Estimation by Finding Appearance-Based Matches in a Large Database of Training Views

Distinctive Image Feature from Scale-Invariant KeyPoints

Bag-of-features models

Video Google: Text Retrieval Approach to Object Matching in Videos Authors: Josef Sivic and Andrew Zisserman University of Oxford ICCV 2003.

Scale Invariant Feature Transform (SIFT)

Multiple Object Class Detection with a Generative Model K. Mikolajczyk, B. Leibe and B. Schiele Carolina Galleguillos.

Agenda Introduction Bag-of-words models Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based.

Object Recognition and Augmented Reality

Large-Scale Content-Based Image Retrieval Project Presentation CMPT 880: Large Scale Multimedia Systems and Cloud Computing Under supervision of Dr. Mohamed.

Review: Intro to recognition Recognition tasks Machine learning approach: training, testing, generalization Example classifiers Nearest neighbor Linear.

Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,

CS 766: Computer Vision Computer Sciences Department, University of Wisconsin-Madison Indexing and Retrieval James Hill, Ozcan Ilikhan, Mark Lenz {jshill4,

Indexing Techniques Mei-Chen Yeh.

Distinctive Image Features from Scale-Invariant Keypoints By David G. Lowe, University of British Columbia Presented by: Tim Havinga, Joël van Neerbos.

CSE 185 Introduction to Computer Vision Pattern Recognition.

Keypoint-based Recognition Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 03/04/10.

Learning Visual Similarity Measures for Comparing Never Seen Objects By: Eric Nowark, Frederic Juric Presented by: Khoa Tran.

CSE 473/573 Computer Vision and Image Processing (CVIP)

AUTOMATIC ANNOTATION OF GEO-INFORMATION IN PANORAMIC STREET VIEW BY IMAGE RETRIEVAL Ming Chen, Yueting Zhuang, Fei Wu College of Computer Science, Zhejiang.

04/30/13 Last class: summary, goggles, ices Discrete Structures (CS 173) Derek Hoiem, University of Illinois 1 Image: wordpress.com/2011/11/22/lig.

Bag-of-features models. Origin 1: Texture recognition Texture is characterized by the repetition of basic elements or textons For stochastic textures,

A Statistical Approach to Speed Up Ranking/Re-Ranking Hong-Ming Chen Advisor: Professor Shih-Fu Chang.

Video Google: A Text Retrieval Approach to Object Matching in Videos Josef Sivic and Andrew Zisserman.

Computer Vision Lab Seoul National University Keyframe-Based Real-Time Camera Tracking Young Ki BAIK Vision seminar : Mar Computer Vision Lab.

IEEE Int'l Symposium on Signal Processing and its Applications 1 An Unsupervised Learning Approach to Content-Based Image Retrieval Yixin Chen & James.

Features-based Object Recognition P. Moreels, P. Perona California Institute of Technology.

Texture Detection & Texture related clustering C601 Project Jing Qin Fall 2003.

Efficient EMD-based Similarity Search in Multimedia Databases via Flexible Dimensionality Reduction / 16 I9 CHAIR OF COMPUTER SCIENCE 9 DATA MANAGEMENT.

2005/12/021 Content-Based Image Retrieval Using Grey Relational Analysis Dept. of Computer Engineering Tatung University Presenter: Tienwei Tsai ( 蔡殿偉.

Event retrieval in large video collections with circulant temporal encoding CVPR 2013 Oral.

2005/12/021 Fast Image Retrieval Using Low Frequency DCT Coefficients Dept. of Computer Engineering Tatung University Presenter: Yo-Ping Huang ( 黃有評 )

Yixin Chen and James Z. Wang The Pennsylvania State University

Bundling Features for Large Scale Partial-Duplicate Web Image Search Zhong Wu ∗, Qifa Ke, Michael Isard, and Jian Sun Microsoft Research.

CS654: Digital Image Analysis

Line Matching Jonghee Park GIST CV-Lab..  Lines –Fundamental feature in many computer vision fields 3D reconstruction, SLAM, motion estimation –Useful.

776 Computer Vision Jan-Michael Frahm Spring 2012.

Video Google: Text Retrieval Approach to Object Matching in Videos Authors: Josef Sivic and Andrew Zisserman University of Oxford ICCV 2003.

Recognizing specific objects Matching with SIFT Original suggestion Lowe, 1999,2004.

Fast nearest neighbor searches in high dimensions Sami Sieranoja

Video Google: Text Retrieval Approach to Object Matching in Videos

Large-scale Instance Retrieval

Mixtures of Gaussians and Advanced Feature Encoding

By Suren Manvelyan, Crocodile (nile crocodile?) By Suren Manvelyan,

CS 1674: Intro to Computer Vision Scene Recognition

Aim of the project Take your image Submit it to the search engine

Large Scale Image Deduplication

Video Google: Text Retrieval Approach to Object Matching in Videos

Presentation transcript:

Effective Image Database Search via Dimensionality Reduction Anders Bjorholm Dahl and Henrik Aanæs IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops

Outline  Introduction  Methods  LF-clustering  Experiments and Results  Discussion and Conclusion

Introduction  The bag-of-words approach 1. Feature extraction from the database images 2. Building the bag-of-words representation 3. Searching with a query image

Introduction  The Bag-of-word Model

Methods  Feature representation  Clustering  Feature assignment  Image matching

Feature representation  PCA is applied to reduce the dimensionality of the feature vectors  The reduction of the SIFT descriptor is from 128 to between 3 and 12 dimensions  After dimension reduction we add color to our features  the mean RGB value in a 10 × 10 pixels patch around the localization of each feature

Feature representation   is the PCA reduced SIFT feature  is the mean RGB values  is a weighing parameter ( ) 1. normalized to unit length 2. normalized

Clustering  Similar but faster than Mean-shift clustering

Feature assignment  Similarity of images are found by comparing frequency vectors of a query image to images in the database  Give each visual words a weight [16]  [16] D. Nister and H. Stewenius. Scalable recognition with a vocabulary tree. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), volume 2, pages 2161–2168, June 2006.

Image matching  Frequency vectors are compared using the norm  which is found to be superior to the euclidean distance [16]  norm gives equal weight to the overlapping and non-overlapping parts  Inverted files are used for fast image retrieval

Experiments and Results  Data set  first 1400 images form [16]  a series of 4 images of the same scene  Use three of the images from one scene to train the model and the last for testing  The test result is the percentage of the correct images ranked in top 3  data set is relatively small

Experiments and Results  Data set ：

Experiments and Results  Experiments  Color added PCA SIFT  3, 8, and 12 dimensional PCA SIFT features added features are 6, 11, and 15 dimensions  compare with SIFT features reduced with PCA to 6, 11 and 15 dimensions (without color)  Clustering experiments  LF-clustering  from 8,000 to 12,000 clusters  k-means  10 clusters in 4 levels resulting in 10,000 clusters

Experiments and Results  Results

Experiments and Results  Results

Discussion and Conclusion  did not apply LF-clustering to the 128 dimensional SIFT features, because it performed very poorly  for future work the model should be tested on a larger set of data  A problem of the design of the bag-of-words model is it static nature  not designed for adding or removing images from the database