Survey of Algorithms to Query Image Databases COMP 290-72:Computational Geometry Benjamin Lok 11/2/98 Image from Kodak’s PhotoQuilt.

Slides:



Advertisements
Similar presentations
Object Recognition Using Locality-Sensitive Hashing of Shape Contexts Andrea Frome, Jitendra Malik Presented by Ilias Apostolopoulos.
Advertisements

Ranking Multimedia Databases via Relevance Feedback with History and Foresight Support / 12 I9 CHAIR OF COMPUTER SCIENCE 9 DATA MANAGEMENT AND EXPLORATION.
A Multi-Resolution Technique for Comparing Images Using the Hausdorff Distance Daniel P. Huttenlocher and William J. Rucklidge Nov Presented by.
Wavelets Fast Multiresolution Image Querying Jacobs et.al. SIGGRAPH95.
Word Spotting DTW.
Presented by Xinyu Chang
Robust Global Registration Natasha Gelfand Niloy Mitra Leonidas Guibas Helmut Pottmann.
3D Shape Histograms for Similarity Search and Classification in Spatial Databases. Mihael Ankerst,Gabi Kastenmuller, Hans-Peter-Kriegel,Thomas Seidl Univ.
Announcements Final Exam May 13th, 8 am (not my idea).
Computer Vision Detecting the existence, pose and position of known objects within an image Michael Horne, Philip Sterne (Supervisor)
Multimedia DBs. Multimedia dbs A multimedia database stores text, strings and images Similarity queries (content based retrieval) Given an image find.
Image Indexing and Retrieval using Moment Invariants Imran Ahmad School of Computer Science University of Windsor – Canada.
Instructor: Mircea Nicolescu Lecture 13 CS 485 / 685 Computer Vision.
Image Similarity and the Earth Mover’s Distance Empirical Evaluation of Dissimilarity Measures for Color and Texture Y. Rubner, J. Puzicha, C. Tomasi and.
Discriminative and generative methods for bags of features
Fast High-Dimensional Feature Matching for Object Recognition David Lowe Computer Science Department University of British Columbia.
Comparing Images Using the Hausdorff Distance Mark Bouts 6th April 2006.
Iterative closest point algorithms
A Study of Approaches for Object Recognition
1 Numerical geometry of non-rigid shapes Spectral Methods Tutorial. Spectral Methods Tutorial 6 © Maks Ovsjanikov tosca.cs.technion.ac.il/book Numerical.
Chamfer Matching & Hausdorff Distance Presented by Ankur Datta Slides Courtesy Mark Bouts Arasanathan Thayananthan.
Object Recognition Using Geometric Hashing
Scale Invariant Feature Transform (SIFT)
Smart Traveller with Visual Translator for OCR and Face Recognition LYU0203 FYP.
Multiple Object Class Detection with a Generative Model K. Mikolajczyk, B. Leibe and B. Schiele Carolina Galleguillos.
1 Numerical geometry of non-rigid shapes Non-Euclidean Embedding Non-Euclidean Embedding Lecture 6 © Alexander & Michael Bronstein tosca.cs.technion.ac.il/book.
Three-Dimensional Face Recognition Using Surface Space Combinations Thomas Heseltine, Nick Pears, Jim Austin Advanced Computer Architecture Group Department.
Fast multiresolution image querying CS474/674 – Prof. Bebis.
Content-Based Image Retrieval using the EMD algorithm Igal Ioffe George Leifman Supervisor: Doron Shaked Winter-Spring 2000 Technion - Israel Institute.
Presenter: Stefan Zickler
Image Processing David Kauchak cs458 Fall 2012 Empirical Evaluation of Dissimilarity Measures for Color and Texture Jan Puzicha, Joachim M. Buhmann, Yossi.
Navigating and Browsing 3D Models in 3DLIB Hesham Anan, Kurt Maly, Mohammad Zubair Computer Science Dept. Old Dominion University, Norfolk, VA, (anan,
Image Based Positioning System Ankit Gupta Rahul Garg Ryan Kaminsky.
Hubert CARDOTJY- RAMELRashid-Jalal QURESHI Université François Rabelais de Tours, Laboratoire d'Informatique 64, Avenue Jean Portalis, TOURS – France.
Computer vision.
Efficient Algorithms for Robust Feature Matching Mount, Netanyahu and Le Moigne November 7, 2000 Presented by Doe-Wan Kim.
Internet-scale Imagery for Graphics and Vision James Hays cs195g Computational Photography Brown University, Spring 2010.
FlowString: Partial Streamline Matching using Shape Invariant Similarity Measure for Exploratory Flow Visualization Jun Tao, Chaoli Wang, Ching-Kuang Shene.
Geometric Operations and Morphing.
Mesh Watermarking based on 2D Transferred Domain Jongyun Jun Tae-Joon Kim CS548 Term Project Presentation 2010/05/13.
Shape Analysis and Retrieval Statistical Shape Descriptors Notes courtesy of Funk et al., SIGGRAPH 2004.
Spin Image Correlation Steven M. Kropac April 26, 2005.
Images Similarity by Relative Dynamic Programming M. Sc. thesis by Ady Ecker Supervisor: prof. Shimon Ullman.
Efficient EMD-based Similarity Search in Multimedia Databases via Flexible Dimensionality Reduction / 16 I9 CHAIR OF COMPUTER SCIENCE 9 DATA MANAGEMENT.
Visual Categorization With Bags of Keypoints Original Authors: G. Csurka, C.R. Dance, L. Fan, J. Willamowski, C. Bray ECCV Workshop on Statistical Learning.
2005/12/021 Content-Based Image Retrieval Using Grey Relational Analysis Dept. of Computer Engineering Tatung University Presenter: Tienwei Tsai ( 蔡殿偉.
Bart M. ter Haar Romeny.  Question: can top-points be used for object- retrieval tasks?
2005/12/021 Fast Image Retrieval Using Low Frequency DCT Coefficients Dept. of Computer Engineering Tatung University Presenter: Yo-Ping Huang ( 黃有評 )
Query Sensitive Embeddings Vassilis Athitsos, Marios Hadjieleftheriou, George Kollios, Stan Sclaroff.
1 Embedding and Similarity Search for Point Sets under Translation Minkyoung Cho and David M. Mount University of Maryland SoCG 2008.
CS654: Digital Image Analysis Lecture 36: Feature Extraction and Analysis.
Content-Based Image Retrieval (CBIR) By: Victor Makarenkov Michael Marcovich Noam Shemesh.
Computer Vision Set: Object Recognition Slides by C.F. Olson 1 Object Recognition.
CSE 185 Introduction to Computer Vision Feature Matching.
Local features: detection and description
Application: Multiresolution Curves Jyun-Ming Chen Spring 2001.
CSCI 631 – Foundations of Computer Vision March 15, 2016 Ashwini Imran Image Stitching.
Image Representation and Description – Representation Schemes
Visual Information Retrieval
Face Detection EE368 Final Project Group 14 Ping Hsin Lee
Recognition: Face Recognition
Principal Component Analysis (PCA)
The Earth Mover's Distance
Investigating the Hausdorff Distance
Spectral Methods Tutorial 6 1 © Maks Ovsjanikov
K Nearest Neighbor Classification
Geometric Hashing: An Overview
Kiera Henning, Jiwon Kim,
Comparing Images Using Hausdorff Distance
Feature descriptors and matching
Presentation transcript:

Survey of Algorithms to Query Image Databases COMP :Computational Geometry Benjamin Lok 11/2/98 Image from Kodak’s PhotoQuilt

Outline of Talk w Overview of the problem w Three methods Color based Shape based Vision based w Conclusions Image from Microsoft Clip Gallery

Problem w Query an image database w What does a “match” mean? Application dependent Notion of subjectivity Sensitivity to noise

Problem w Semantic similarity is still not possible ex. “All images with cats” w To determine similarity, we need to a new: Metric Space Images from Microsoft Clip Gallery and website

Ambiguity Guy Girl Guy Images from “Shrine to Long Haired Men” and “Videos of Women Getting Their Heads Shaved” websites

Yossi Rubner, Leonidas Guibas, Carlos Tomasi (1997) Stanford Vision Laboratory The Earth Mover’s Distance, Multi-Dimensional Scaling, and Color-Based Image Retrieval Image from Microsoft Clip Gallery

Color Signatures w Utilize the CIE-LAB color space Based on human perception of color w Map each pixel to a point in color space Common color values increase weight of point w Group clusters into points (8-12 per image) Rubner, Guibas, and Tomasi

Earth Mover’s Distance w To compare two images, compute the “work” needed to move the cluster points from one image to the other Rubner, Guibas, and Tomasi

Earth Mover’s Distance (cont) w Solving a linear programming problem: Given two signatures: p = {(p 1,w p1 ),…,(p m,w pm )} and q = {(q 1,w q1 ),…,(q n,w qn )} Find C where C ij is the amount of weight p i matched q j

Applications w Visualize Databases (Queries and Results) w Scale the multiple dimensions into 2D using MDS and minimize STRESS Rubner, Guibas, and Tomasi

Database Visualization Rubner, Guibas, and Tomasi

Algorithm Recap w Map pixels to 3D color space points w Locate and compress “clusters” of points w 8 to 12 points determine the color signature w Calculate the Earth Mover’s Distance to determine “distance” between two images Image from YenPen Stationary Website

Advantages w Based on human perception of color w Some invariance to small change in viewpoint and lighting w Meaningful metric w Relatively fast w Can embed multiple metrics Disadvantages w Depending on application, query format might be not be intuitive w Not much use for non- color images w False positives a real possibility depending on working domain

Shaped-based Image Retrieval Using Geometric Hashing Scott D. Cohen and Leonidas J. Guibas 1997 Stanford Vision Laboratory Image from Microsoft Clip Gallery

Overview w Implementation Search through 500 Chinese characters w Goals Provide invariance to scale, rotation, and translation speed and accuracy Cohen and Guibas

Generating a Illustration w Illustration - set of curves that summarize an image Edgel detection Medial Axis determination Cohen and Guibas

Approximating with Polylines w Convert medial axis representation to polylines w Tradeoff between speed and accuracy Cohen and Guibas

Geometric Hashing w Geometric Hashing - method used to compare two point sets under some transformation group w Take each point and use it as the origin of a coordinate system Cohen and Guibas

Geometric Hashing (cont) w If translating P by q j - p i produces a good match I i (P) and I j (Q) will match. w This property can be generalized to other transformation groups. w Each line segment is a basis of a coordinate system Translation, Rotation, and Orientation defined I(P) = transform all other segments into new CS Cohen and Guibas

Notes on GH w Each segment will be transformed to 2m Coordinate systems w I(P) stores O(m 2 ) segments w Can be done as preprocessing step w Expressing the different possible transformations using each segment as a basis Cohen and Guibas

Querying the Database w Query image undergoes the feature extraction process w For each query feature, a nearest-neighbor query is applied and the k closest or within some j w Similarity score increases if database image has a feature that is “close” to the query feature Cohen and Guibas

“Closeness” w How do you describe the closeness of two lines? w Transform to a 4D space made of (l, ,a,b) w With two (l, ,a,b) descriptions for lines, can compute distance w Divide by standard deviation over sample of database features

Details w Closeness is relative to database contents w Nearest-neighbor algorithm by Arya, Mount, et. al (1994). Query time for k nearest features is O(k log n) Cohen and Guibas

Advantages w Fast Queries database of 500 characters in 1 second on SGI Indy w Queries based on important features Disadvantages w Working domain currently limited w Could get too expensive as complexity in images increases Cohen and Guibas

A Multi-Resolution Technique for Comparing Images Using the Hausdorff Distance Daniel Huttenlocher and William Rucklidge 1992Cornell University Huttenlocher and Rucklidge

Directed Hausdorff Distance w Given A={a 1, …, a m } and B={b 1, …, b m } w Identifies the point in A farthest from any point in B w Measures the degree of mismatch between between two sets.

Properties of Hausdorff Distances w Not symmetric h(A,B) != h(B,A) w Compute k th maximum Notion of rank Reduces sensitivity Fraction of A within h(A,B) of B Obscured portions w h(A,B) = hypothesis h(B,A) = test

Transformations t( ) = w Given A is an image, B is the model w Without Orientation, if A is in B then A undergoes transformation t. f B (t)=H(t(B),A) t=(t x,t y,s x,s y )forward f A (t)=H(A,t(B)) reverse

Bidirectional Hausdorff Distance w Solve for which values of t, the following holds: w Results in searching a four dimensional space

Restricting Search Space w Slope of f(t)=H LK (A,t(B)) is linear w Divide space into cells w Calculate H LK (A,t c (B)) w Determine a maximum delta per cell Based on limit in scale and translation Allows for quick rejection and acceptance w Label cells as interesting or disregard

Restricting Search Space w Create smaller cells from interesting cells w Bounds based on transformations w Quickly narrow down to areas that could possibly be within  of A

Subtleties w Discretization useful if working in computer vision domain (integers) w Can compare partially obscured images w Optimizations Early rejection/acceptance w Pretty slow 200 to 250 seconds Website on submarines

Huttenlocher and Rucklidge

Advantages w Accurate w Geared towards image processing and vision w Partially obscured images w Searches similar to humans Disadvantages w Slow w No Orientation w Database must be specialized w Potential problems in generating queries

Recap w Three Algorithms Color Based Color Signatures Earth Mover’s Distance Shaped Based Polylines Transform Invariant Sets Vision Based Hausdorff Distance Subdivision of Transformation Space

Final Thoughts w Algorithms work well in various domains w Query construction not formalized w Other methods: wavelet-based texture-based object-based w Took 5 minutes to find “Shrine to Men with Long Hair” and “Videos of Women Getting Their Head Shaved” All other images generated by author using Paint Shop Pro