VisualRank: Applying PageRank to Large-Scale Image Search Yushi Jing, Member, IEEE, and Shumeet Baluja, Member, IEEE.

Slides:



Advertisements
Similar presentations
Complex Networks for Representation and Characterization of Images For CS790g Project Bingdong Li 9/23/2009.
Advertisements

Relevance Feedback and User Interaction for CBIR Hai Le Supervisor: Dr. Sid Ray.
Context-based object-class recognition and retrieval by generalized correlograms by J. Amores, N. Sebe and P. Radeva Discussion led by Qi An Duke University.
Location Recognition Given: A query image A database of images with known locations Two types of approaches: Direct matching: directly match image features.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology VisualRank- Applying PageRank to Large-Scale Image Search.
Aggregating local image descriptors into compact codes
1 Evaluation Rong Jin. 2 Evaluation  Evaluation is key to building effective and efficient search engines usually carried out in controlled experiments.
Three things everyone should know to improve object retrieval
Presented by Xinyu Chang
Content-Based Image Retrieval
Crawling, Ranking and Indexing. Organizing the Web The Web is big. Really big. –Over 3 billion pages, just in the indexable Web The Web is dynamic Problems:
WWW 2014 Seoul, April 8 th SNOW 2014 Data Challenge Two-level message clustering for topic detection in Twitter Georgios Petkos, Symeon Papadopoulos, Yiannis.
TI: An Efficient Indexing Mechanism for Real-Time Search on Tweets Chun Chen 1, Feng Li 2, Beng Chin Ooi 2, and Sai Wu 2 1 Zhejiang University, 2 National.
Pete Bohman Adam Kunk.  Introduction  Related Work  System Overview  Indexing Scheme  Ranking  Evaluation  Conclusion.
VisualRank: Applying PageRank to Large-Scale Image Search Yushi Jing, Member, IEEE Shumeet Baluja, Member, IEEE IEEE TRANSACTIONS ON PATTERN ANALYSIS AND.
Discriminative Relevance Feedback With Virtual Textual Representation For Efficient Image Retrieval Suman Karthik and C.V.Jawahar.
Region-based Querying of Solar Data Using Descriptor Signatures Authors: Juan M. Banda 1, Chang Liu 1 and Rafal A. Angryk 2 1 Montana State University,
MIT CSAIL Vision interfaces Approximate Correspondences in High Dimensions Kristen Grauman* Trevor Darrell MIT CSAIL (*) UT Austin…
Evaluating Search Engine
Coherency Sensitive Hashing (CSH) Simon Korman and Shai Avidan Dept. of Electrical Engineering Tel Aviv University ICCV2011 | 13th International Conference.
Tour the World: building a web-scale landmark recognition engine ICCV 2009 Yan-Tao Zheng1, Ming Zhao2, Yang Song2, Hartwig Adam2 Ulrich Buddemeier2, Alessandro.
1 Content Based Image Retrieval Using MPEG-7 Dominant Color Descriptor Student: Mr. Ka-Man Wong Supervisor: Dr. Lai-Man Po MPhil Examination Department.
Fast and Compact Retrieval Methods in Computer Vision Part II A. Torralba, R. Fergus and Y. Weiss. Small Codes and Large Image Databases for Recognition.
Expectation Maximization Method Effective Image Retrieval Based on Hidden Concept Discovery in Image Database By Sanket Korgaonkar Masters Computer Science.
Retrieval Evaluation. Brief Review Evaluation of implementations in computer science often is in terms of time and space complexity. With large document.
Video Google: Text Retrieval Approach to Object Matching in Videos Authors: Josef Sivic and Andrew Zisserman University of Oxford ICCV 2003.
1 An Empirical Study on Large-Scale Content-Based Image Retrieval Group Meeting Presented by Wyman
Agenda Introduction Bag-of-words models Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based.
Overview of Search Engines
1/16 Final project: Web Page Classification By: Xiaodong Wang Yanhua Wang Haitang Wang University of Cincinnati.
SIEVE—Search Images Effectively through Visual Elimination Ying Liu, Dengsheng Zhang and Guojun Lu Gippsland School of Info Tech,
Improving web image search results using query-relative classifiers Josip Krapacy Moray Allanyy Jakob Verbeeky Fr´ed´eric Jurieyy.
Indexing Techniques Mei-Chen Yeh.
Presenting by, Prashanth B R 1AR08CS035 Dept.Of CSE. AIeMS-Bidadi. Sketch4Match – Content-based Image Retrieval System Using Sketches Under the Guidance.
Unsupervised Learning of Categories from Sets of Partially Matching Image Features Kristen Grauman and Trevor Darrel CVPR 2006 Presented By Sovan Biswas.
School of Information Technology & Electrical Engineering Multiple Feature Hashing for Real-time Large Scale Near-duplicate Video Retrieval Jingkuan Song*,
PageRank for Product Image Search Yushi Jing, Shumeet Baluja College of Computing, Georgia Institute of Technology Google, Inc. WWW 2008 Referred Track:
PageRank for Product Image Search Kevin Jing (Googlc IncGVU, College of Computing, Georgia Institute of Technology) Shumeet Baluja (Google Inc.) WWW 2008.
Improving Web Spam Classification using Rank-time Features September 25, 2008 TaeSeob,Yun KAIST DATABASE & MULTIMEDIA LAB.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
When Experts Agree: Using Non-Affiliated Experts To Rank Popular Topics Meital Aizen.
Nearest Neighbor Paul Hsiung March 16, Quick Review of NN Set of points P Query point q Distance metric d Find p in P such that d(p,q) < d(p’,q)
Implicit User Feedback Hongning Wang Explicit relevance feedback 2 Updated query Feedback Judgments: d 1 + d 2 - d 3 + … d k -... Query User judgment.
Video Google: A Text Retrieval Approach to Object Matching in Videos Josef Sivic and Andrew Zisserman.
CANONICAL IMAGE SELECTION FROM THE WEB ACM International Conference on Image and Video Retrieval, 2007 Yushi Jing Shumeet Baluja Henry Rowley.
Binxing Jiao et. al (SIGIR ’10) Presenter : Lin, Yi-Jhen Advisor: Dr. Koh. Jia-ling Date: 2011/4/25 VISUAL SUMMARIZATION OF WEB PAGES.
The Anatomy of a Large-Scale Hyper textual Web Search Engine S. Brin, L. Page Presenter :- Abhishek Taneja.
In Defense of Nearest-Neighbor Based Image Classification Oren Boiman The Weizmann Institute of Science Rehovot, ISRAEL Eli Shechtman Adobe Systems Inc.
MSRI workshop, January 2005 Object Recognition Collected databases of objects on uniform background (no occlusions, no clutter) Mostly focus on viewpoint.
Algorithmic Detection of Semantic Similarity WWW 2005.
2005/12/021 Content-Based Image Retrieval Using Grey Relational Analysis Dept. of Computer Engineering Tatung University Presenter: Tienwei Tsai ( 蔡殿偉.
2005/12/021 Fast Image Retrieval Using Low Frequency DCT Coefficients Dept. of Computer Engineering Tatung University Presenter: Yo-Ping Huang ( 黃有評 )
Geometric Hashing: A General and Efficient Model-Based Recognition Scheme Yehezkel Lamdan and Haim J. Wolfson ICCV 1988 Presented by Budi Purnomo Nov 23rd.
1 A Compact Feature Representation and Image Indexing in Content- Based Image Retrieval A presentation by Gita Das PhD Candidate 29 Nov 2005 Supervisor:
Gang WangDerek HoiemDavid Forsyth. INTRODUCTION APROACH (implement detail) EXPERIMENTS CONCLUSION.
P ROBING THE L OCAL -F EATURE S PACE OF I NTEREST P OINTS Wei-Ting Lee, Hwann-Tzong Chen Department of Computer Science National Tsing Hua University,
Automatic Video Tagging using Content Redundancy Stefan Siersdorfer 1, Jose San Pedro 2, Mark Sanderson 2 1 L3S Research Center, Germany 2 University of.
Semi-Automatic Image Annotation Liu Wenyin, Susan Dumais, Yanfeng Sun, HongJiang Zhang, Mary Czerwinski and Brent Field Microsoft Research.
Concept-based P2P Search How to find more relevant documents Ingmar Weber Max-Planck-Institute for Computer Science Joint work with Holger Bast Torino,
26/01/20161Gianluca Demartini Ranking Categories for Faceted Search Gianluca Demartini L3S Research Seminars Hannover, 09 June 2006.
Content Based Color Image Retrieval vi Wavelet Transformations Information Retrieval Class Presentation May 2, 2012 Author: Mrs. Y.M. Latha Presenter:
Video Google: Text Retrieval Approach to Object Matching in Videos Authors: Josef Sivic and Andrew Zisserman University of Oxford ICCV 2003.
Navigation Aided Retrieval Shashank Pandit & Christopher Olston Carnegie Mellon & Yahoo.
Xifeng Yan Philip S. Yu Jiawei Han SIGMOD 2005 Substructure Similarity Search in Graph Databases.
IST 516 Fall 2011 Dongwon Lee, Ph.D.
Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science
Geometric Hashing: An Overview
Multimedia Information Retrieval
Color Image Retrieval based on Primitives of Color Moments
Information Retrieval and Web Design
Presentation transcript:

VisualRank: Applying PageRank to Large-Scale Image Search Yushi Jing, Member, IEEE, and Shumeet Baluja, Member, IEEE

Outline Introduction PageRank VisualRank Similarity graph Web-Scale Visual Rank System Experimental Results

Introduction Image search has become a popular feature in many search engines, including Yahoo!, MSN, Google, etc. The majority of image searches use very little, if any, image information. The success of text-based search of Web pages and, in part, to the difficulty and expense of using image based signals.

Our work belongs to the general category of Content- Based Image Retrieval (CBIR). Many systems proposed in the past [2], [3], [4], [5] are considered as “pure” CBIR systems. On the other hand, “composite” CBIR systems [6], [7] allow flexible query interfaces and a diverse set of signal sources. – For example, Fergus et al. [7] proposed the use of “visual filters” to re-rank Google image search results. [7] R. Fergus, P. Perona, and A. Zisserman, “A Visual Category Filter for Google Images,” Proc. Eighth European Conf. Computer Vision, pp , 2004.

“visual filters” are learned from the top 1,000 search results via Parts-based probabilistic models [8], a form of Probabilistic Graphical Models (PGMs). PGMs have several important limitations for Web image retrieval. A suboptimal model structure can significantly reduce modeling and especially the classification performance [9]. Learning complex structures and their associated parameters is computationally prohibitive for large- scale Web image retrieval.

Although a well-trained object category filter can improve the relevancy of image search results, they offer limited capability to directly control how and why one visual theme, or image, is ranked higher than others. A simpler and more intuitive alternative is to compute the pairwise visual similarity among images from which a global ordering can be derived and potentially combined with other nonvisual signals. Separate ranking. Ranking algorithms :PageRank. Our work extends [18], [19], [6]; we present VisualRank, to improve Google image search results.

PageRank

VisualRank d > 0.8 is often chosen In place of the uniform damping vector p in (2), we can use a nonuniform vector q to bias the computation.

Features Global features like color histograms and shape analysis, when used alone, are often too restrictive for the breadth of image types that need to be handled. In this work, we use the SIFT features.

Similarity Graph Hough Transform

Pairwise Compare

Web-Scale Visual Rank System The similarity matrix S* from (1) can be computed for all unique pairs of images. It is computationally infeasible to generate the similarity graph S for the billions of images. A more efficient approach is to use a hash table to store all the local descriptors such that similar descriptors fall into the same bin.

Hashing We will use a version of the Locality-Sensitive Hashing (LSH) approximate matching approach[39]. Two features are considered as a match if they were hashed into the same bin. Hush function : a is a d-dimensional random vector. b is a real number chosen uniformly from the range [0, W]. W defines the quantization of the features. V is the original feature vector.

Summary of the System

Experimental Results We concentrated on the 2,000 most popular product queries on Google (product search). For each query, we extracted the top 1,000 search results from Google image search. A simple threshold was employed if, in the set of 1,000 images returned, fewer than 5 percent of the images had at least one connection, VisualRank was not used.

Experiment 1: User Study on Retrieval Relevancy We mixed the top 10 VisualRank selected images with the top 10 images from Google, removed the duplicates, and presented them to the user.

Experiment 2: Additional Comparison with Alternative Methods We use a nonuniform damping vector in (3) to increase the effect of images highly ranked in the search engine. We call the resulting algorithm VisualRankbias. Finally, to compare against a pure CBIR system, we formulate an alternative heuristic approach, named HeuristicRank.

Experiment 3: Satisfaction and Click Measurement Collected clicks for the top 40 images presented by the Google search results on 130 common product queries. The top 40 images are reranked using VisualRank. Examine the number of clicks each method received from only the top 20 images. The images selected by VisualRank to be in the top 20 would have received approximately 17.5 percent more clicks than those in the default ranking.

An Alternate Query Set: Landmarks We gathered 80 common landmark related queries. Typical queries included: “Eiffel Tower,” “Big Ben,” “Coliseum,” and “Lincoln Memorial.”