Location Recognition Given: A query image A database of images with known locations Two types of approaches: Direct matching: directly match image features.

Slides:

Advertisements

Similar presentations

Introduction to Information Retrieval Introduction to Information Retrieval Lecture 7: Scoring and results assembly.

Advertisements

Image Retrieval with Geometry-Preserving Visual Phrases

Google News Personalization: Scalable Online Collaborative Filtering

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki

Wheres Waldo: Matching People in Images of Crowds Rahul GargDeva RamananSteven M. Seitz Noah Snavely Problem Definition University of Washington University.

Complex Networks for Representation and Characterization of Images For CS790g Project Bingdong Li 9/23/2009.

Zhimin CaoThe Chinese University of Hong Kong Qi YinITCS, Tsinghua University Xiaoou TangShenzhen Institutes of Advanced Technology Chinese Academy of.

Three things everyone should know to improve object retrieval

Foreground Focus: Finding Meaningful Features in Unlabeled Images Yong Jae Lee and Kristen Grauman University of Texas at Austin.

Human Identity Recognition in Aerial Images Omar Oreifej Ramin Mehran Mubarak Shah CVPR 2010, June Computer Vision Lab of UCF.

VisualRank: Applying PageRank to Large-Scale Image Search Yushi Jing, Member, IEEE, and Shumeet Baluja, Member, IEEE.

Carolina Galleguillos, Brian McFee, Serge Belongie, Gert Lanckriet Computer Science and Engineering Department Electrical and Computer Engineering Department.

Search Engines Information Retrieval in Practice All slides ©Addison Wesley, 2008.

Query Specific Fusion for Image Retrieval

CVPR2013 Poster Representing Videos using Mid-level Discriminative Patches.

Patch to the Future: Unsupervised Visual Prediction

Multi-Local Feature Manifolds for Object Detection Oscar Danielsson Stefan Carlsson Josephine Sullivan

1 Building a Dictionary of Image Fragments Zicheng Liao Ali Farhadi Yang Wang Ian Endres David Forsyth Department of Computer Science, University of Illinois.

CS4670 / 5670: Computer Vision Bag-of-words models Noah Snavely Object

Watching Unlabeled Video Helps Learn New Human Actions from Very Few Labeled Snapshots Chao-Yeh Chen and Kristen Grauman University of Texas at Austin.

Stephan Gammeter, Lukas Bossard, Till Quack, Luc Van Gool.

Object Detection by Matching Longin Jan Latecki. Contour-based object detection Database shapes: …..

Re-ranking for NP-Chunking: Maximum-Entropy Framework By: Mona Vajihollahi.

Content Based Image Clustering and Image Retrieval Using Multiple Instance Learning Using Multiple Instance Learning Xin Chen Advisor: Chengcui Zhang Department.

CVPR 2008 James Philbin Ondˇrej Chum Michael Isard Josef Sivic

WISE: Large Scale Content-Based Web Image Search Michael Isard Joint with: Qifa Ke, Jian Sun, Zhong Wu Microsoft Research Silicon Valley 1.

Personalized Search Result Diversification via Structured Learning

Using Structure Indices for Efficient Approximation of Network Properties Matthew J. Rattigan, Marc Maier, and David Jensen University of Massachusetts.

Object retrieval with large vocabularies and fast spatial matching

1 Image Recognition - I. Global appearance patterns Slides by K. Grauman, B. Leibe.

Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.

MANISHA VERMA, VASUDEVA VARMA PATENT SEARCH USING IPC CLASSIFICATION VECTORS.

Visual Querying By Color Perceptive Regions Alberto del Bimbo, M. Mugnaini, P. Pala, and F. Turco University of Florence, Italy Pattern Recognition, 1998.

Computer Vision Group University of California Berkeley Recognizing Objects in Adversarial Clutter: Breaking a Visual CAPTCHA Greg Mori and Jitendra Malik.

Multiple Object Class Detection with a Generative Model K. Mikolajczyk, B. Leibe and B. Schiele Carolina Galleguillos.

Review Rong Jin. Comparison of Different Classification Models  The goal of all classifiers Predicating class label y for an input x Estimate p(y|x)

Improving web image search results using query-relative classifiers Josip Krapacy Moray Allanyy Jakob Verbeeky Fr´ed´eric Jurieyy.

Wang, Z., et al. Presented by: Kayla Henneman October 27, 2014 WHO IS HERE: LOCATION AWARE FACE RECOGNITION.

1 Fingerprint Classification sections Fingerprint matching using transformation parameter clustering R. Germain et al, IEEE And Fingerprint Identification.

Tree Kernels for Parsing: (Collins & Duffy, 2001) Advanced Statistical Methods in NLP Ling 572 February 28, 2012.

PageRank for Product Image Search Kevin Jing (Googlc IncGVU, College of Computing, Georgia Institute of Technology) Shumeet Baluja (Google Inc.) WWW 2008.

Jifeng Dai 2011/09/27.  Introduction  Structural SVM  Kernel Design  Segmentation and parameter learning  Object Feature Descriptors  Experimental.

A Simple Unsupervised Query Categorizer for Web Search Engines Prashant Ullegaddi and Vasudeva Varma Search and Information Extraction Lab Language Technologies.

Eric H. Huang, Richard Socher, Christopher D. Manning, Andrew Y. Ng Computer Science Department, Stanford University, Stanford, CA 94305, USA ImprovingWord.

Diversified Top-k Graph Pattern Matching 1 Yinghui Wu UC Santa Barbara Wenfei Fan University of Edinburgh Southwest Jiaotong University Xin Wang.

A Statistical Approach to Speed Up Ranking/Re-Ranking Hong-Ming Chen Advisor: Professor Shih-Fu Chang.

Efficient Region Search for Object Detection Sudheendra Vijayanarasimhan and Kristen Grauman Department of Computer Science, University of Texas at Austin.

Automatic Image Annotation by Using Concept-Sensitive Salient Objects for Image Content Representation Jianping Fan, Yuli Gao, Hangzai Luo, Guangyou Xu.

Features-based Object Recognition P. Moreels, P. Perona California Institute of Technology.

Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.

Unsupervised Learning of Visual Sense Models for Polysemous Words Kate Saenko Trevor Darrell Deepak.

Chapter 4: Pattern Recognition. Classification is a process that assigns a label to an object according to some representation of the object’s properties.

Visual Categorization With Bags of Keypoints Original Authors: G. Csurka, C.R. Dance, L. Fan, J. Willamowski, C. Bray ECCV Workshop on Statistical Learning.

Automatic Video Tagging using Content Redundancy Stefan Siersdorfer 1, Jose San Pedro 2, Mark Sanderson 2 1 L3S Research Center, Germany 2 University of.

Searching Specification Documents R. Agrawal, R. Srikant. WWW-2002.

Total Recall: Automatic Query Expansion with a Generative Feature Model for Object Retrieval O. Chum, et al. Presented by Brandon Smith Computer Vision.

A Multiresolution Symbolic Representation of Time Series Vasileios Megalooikonomou Qiang Wang Guo Li Christos Faloutsos Presented by Rui Li.

Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 A self-organizing map for adaptive processing of structured.

Unsupervised Auxiliary Visual Words Discovery for Large-Scale Image Object Retrieval Yin-Hsi Kuo1,2, Hsuan-Tien Lin 1, Wen-Huang Cheng 2, Yi-Hsuan Yang.

Bundling Features for Large Scale Partial-Duplicate Web Image Search Zhong Wu ∗, Qifa Ke, Michael Isard, and Jian Sun Microsoft Research.

Learning to Rank: From Pairwise Approach to Listwise Approach Authors: Zhe Cao, Tao Qin, Tie-Yan Liu, Ming-Feng Tsai, and Hang Li Presenter: Davidson Date:

Contextual Search and Name Disambiguation in Using Graphs Einat Minkov, William W. Cohen, Andrew Y. Ng Carnegie Mellon University and Stanford University.

1 Bilinear Classifiers for Visual Recognition Computational Vision Lab. University of California Irvine To be presented in NIPS 2009 Hamed Pirsiavash Deva.

Another Example: Circle Detection

Large-Scale Content-Based Audio Retrieval from Text Queries

Nonparametric Semantic Segmentation

Machine Learning Basics

Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science

“The Truth About Cats And Dogs”

Feature Selection for Ranking

Presentation transcript:

Location Recognition Given: A query image A database of images with known locations Two types of approaches: Direct matching: directly match image features to 3D points (high memory requirement) Retrieval based: retrieve a short list of most similar images and perform image matching We base our approach on retrieval based framework, but using a different representation

Retrieval based approach Idea: retrieve a short list of most similar images and perform image matching Problem: noisy ranking + expensive matching = inefficiency

Location Representations Landmarks with corresponding sets of images (categories) Latitude and longitude coordinates (geo-tags) 3D geometry (3D point clouds) Our approach: locations are implicitly encoded in an image graph nodes correspond to images edges correspond to overlapping, geometrically consistent image pairs

Locations in an image graph Question: How can we learn from the image graph? One approach is to formulate as a similarity measure learning problem

Global similarity learning Idea: similar to learning for image matching, learn a global similarity measure using matching/non-matching image pairs Problem: not specific enough for recognize location within a large dataset

Instance similarity learning Idea: for each individual image in the database, learn a specific similarity function

Instance similarity learning Idea: for each individual image in the database, learn a specific similarity function

Instance similarity learning Idea: for each individual image in the database, learn a specific similarity function

Instance similarity learning Idea: for each individual image in the database, learn a specific similarity function Problem: some images have too few training examples

Our Approach Idea: for each neighborhood (subgraph) in the image graph, learn a specific similarity function

Approach At training time: 1.Obtain image matching graph 2.Compute a covering of the graph with a set of subgraphsLearn and calibrate an SVM-based similarity measure for each subgraph At testing time: 1.Use the models in Step 3 to compute a similarity between the query image to each database image, and generate a ranked shortlist of possible image matchesPerform geometric verification with the top database images in the shortlist (image matching)

Approach At training time: 1.Obtain image matching graph 2.Compute a covering of the graph with a set of subgraphsLearn and calibrate an SVM-based similarity measure for each subgraph At testing time: 1.Use the models in Step 3 to compute a similarity between the query image to each database image, and generate a ranked shortlist of possible image matchesPerform geometric verification with the top database images in the shortlist (image matching)

Computing Subgraphs What makes a good subgraph? contain images that are largely similarhas as many positive examples as possible Minimum dominating set (NP-hard) We use a greedy algorithm: at each iteration select the image that covers the most “uncovered” images “Covered”: is selected or has at least one neighbor that is selected Selected images (center images) define the neighborhoods One image could belong to multiple neighborhoods

An example covering of the Dubrovnik dataset # images: 6044, # centers: 188

Approach At training time: 1.Obtain image matching graph 2.Compute a covering of the graph with a set of subgraphsLearn and calibrate an SVM-based similarity measure for each subgraph At testing time: 1.Use the models in Step 3 to compute a similarity between the query image to each database image, and generate a ranked shortlist of possible image matchesPerform geometric verification with the top database images in the shortlist (image matching)

Similarity Measure Learning Each image is represented as BoW vector For each subgraph, define Positives: subgraph membersNegatives: subsample the rest Learn an SVM classifier and calibrate to output a probability value using Platt’s method

Approach At training time: 1.Obtain image matching graph 2.Compute a covering of the graph with a set of subgraphsLearn and calibrate an SVM-based similarity measure for each subgraph At testing time: 1.Use the models in Step 3 to compute a similarity between the query image to each database image, and generate a ranked shortlist of possible image matchesPerform geometric verification with the top database images in the shortlist (image matching)

Generating a Short List Given a query image, every subgraph has a probability score However, a short list of most similar images from all database images is needed A simple approach: concatenate subgraph members according to scores rank within each subgraph using BoW similarity for image appearing in multiple subgraphs, only count the highest scoring subgraph

Generating a Short List A simple approach: concatenating subgraph members according to scores

Problem: If the first subgraph/neighborhood is wrong, the result is worse than BoW ranking, which has more diversity Idea: to encourage diversity, rank using conditional probabilities conditioned on first failure to obtain the 2nd candidate, and so on Generating a Short List update factor

Generating a Short List Example ranking result with encouraged diversity v.s. without Regularization using BoW ranking also helps increase diversity

Approach At training time: 1.Obtain image matching graph 2.Compute a covering of the graph with a set of subgraphsLearn and calibrate an SVM-based similarity measure for each subgraph At testing time: 1.Use the models in Step 3 to compute a similarity between the query image to each database image, and generate a ranked shortlist of possible image matchesPerform geometric verification with the top database images in the shortlist (image matching)

Experiments Key bottleneck: quality of the shortlist The higher the first match appears, the less computation is needed different from retrieval, where recall is also important Evaluate using accuracies at top k (k = 1, 2, 5, 10) Baselines 1.BoW ranking 2.Co-ocset (BoW ranking considering co-occuring stat.) 3.GPS based approach 4.Two alternative learning formulations: a.single global similarity using image pairs b.instance similarity using each image as center

Experiments Visual Vocabulary size: 1M Two versions of vocabularies specific (trained using each dataset) generic (trained with a common unrelated dataset) Using 3 standard benchmark datasets

Experiments Unsupervised methods perform similarly GPS based model and other learning formulations perform worse Our approach (GBP) improves accuracies consistently Probabilistic reranking (+RR) and BoW regularization (+BoW) improves results further Top k accuracies and mAP are not the same objectives

Conclusions Compared to BoW ranking higher performance with little extra overhead during query time Compared to direct matching less memory usage and better scalability thanks to BoW framework Graph-based recognition could be extended to other recognition tasks (e.g. object recognition) Song Cao, Noah Snavely: Graph-Based Discriminative Learning for Location Recognition. CVPR 2013 (to appear).Graph-Based Discriminative Learning for Location Recognition