Content-Based Image Retrieval

Slides:



Advertisements
Similar presentations
Wiki-Reality: Augmenting Reality with Community Driven Websites Speaker: Yi Wu Intel Labs/vision and image processing research Collaborators: Douglas Gray,
Advertisements

Improvements and extras Paul Thomas CSIRO. Overview of the lectures 1.Introduction to information retrieval (IR) 2.Ranked retrieval 3.Probabilistic retrieval.
What happens when you create an HTML document and include pictures.
Multimedia Database Systems
Content-Based Image Retrieval
Florian Schroff, Antonio Criminisi & Andrew Zisserman ICCV 2007 Harvesting Image Databases from the Web.
VisualRank: Applying PageRank to Large-Scale Image Search Yushi Jing, Member, IEEE Shumeet Baluja, Member, IEEE IEEE TRANSACTIONS ON PATTERN ANALYSIS AND.
VisualRank: Applying PageRank to Large-Scale Image Search Yushi Jing, Member, IEEE, and Shumeet Baluja, Member, IEEE.
CS4670 / 5670: Computer Vision Bag-of-words models Noah Snavely Object
Search Engines. 2 What Are They?  Four Components  A database of references to webpages  An indexing robot that crawls the WWW  An interface  Enables.
Content-Based Image Retrieval Selim Aksoy Department of Computer Engineering Bilkent University
Information Retrieval in Practice
Effective Image Database Search via Dimensionality Reduction Anders Bjorholm Dahl and Henrik Aanæs IEEE Computer Society Conference on Computer Vision.
Visiwords John Tait Chief Scientific Officer. Warning A few half formed ideas from the world of image and video indexing which may be of interest to MT.
WISE: Large Scale Content-Based Web Image Search Michael Isard Joint with: Qifa Ke, Jian Sun, Zhong Wu Microsoft Research Silicon Valley 1.
Information Retrieval Review
Object retrieval with large vocabularies and fast spatial matching
Lecture 28: Bag-of-words models
Video Google: Text Retrieval Approach to Object Matching in Videos Authors: Josef Sivic and Andrew Zisserman ICCV 2003 Presented by: Indriyati Atmosukarto.
Beyond bags of features: Adding spatial information Many slides adapted from Fei-Fei Li, Rob Fergus, and Antonio Torralba.
Video Google: Text Retrieval Approach to Object Matching in Videos Authors: Josef Sivic and Andrew Zisserman University of Oxford ICCV 2003.
WebInSight: Making Web Images Accessible Jeff Bigham Richard Ladner Ryan Kaminsky Gordon Hempton Oscar Danielsson.
Search engines fdm 20c introduction to digital media lecture warren sack / film & digital media department / university of california, santa.
Agenda Introduction Bag-of-words models Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based.
Jeffrey P. Bigham Richard Ladner, Ryan Kaminsky, Gordon Hempton, Oscar Danielsson University of Washington Computer Science & Engineering.
Overview of Search Engines
Keypoint-based Recognition and Object Search
Object Recognition and Augmented Reality
CS 766: Computer Vision Computer Sciences Department, University of Wisconsin-Madison Indexing and Retrieval James Hill, Ozcan Ilikhan, Mark Lenz {jshill4,
Distinctive Image Features from Scale-Invariant Keypoints By David G. Lowe, University of British Columbia Presented by: Tim Havinga, Joël van Neerbos.
Unsupervised Learning of Categories from Sets of Partially Matching Image Features Kristen Grauman and Trevor Darrel CVPR 2006 Presented By Sovan Biswas.
Keypoint-based Recognition Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 03/04/10.
Multimedia Databases (MMDB)
04/30/13 Last class: summary, goggles, ices Discrete Structures (CS 173) Derek Hoiem, University of Illinois 1 Image: wordpress.com/2011/11/22/lig.
Category Discovery from the Web slide credit Fei-Fei et. al.
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
Video Google: A Text Retrieval Approach to Object Matching in Videos Josef Sivic and Andrew Zisserman.
10/31/13 Object Recognition and Augmented Reality Computational Photography Derek Hoiem, University of Illinois Dali, Swans Reflecting Elephants.
MSRI workshop, January 2005 Object Recognition Collected databases of objects on uniform background (no occlusions, no clutter) Mostly focus on viewpoint.
Object Recognition Part 2 Authors: Kobus Barnard, Pinar Duygulu, Nado de Freitas, and David Forsyth Slides by Rong Zhang CSE 595 – Words and Pictures Presentation.
Lecture 08 27/12/2011 Shai Avidan הבהרה: החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.
CS798: Information Retrieval Charlie Clarke Information retrieval is concerned with representing, searching, and manipulating.
Computer Vision Group Department of Computer Science University of Illinois at Urbana-Champaign.
Xiaoying Gao Computer Science Victoria University of Wellington COMP307 NLP 4 Information Retrieval.
Video Google: Text Retrieval Approach to Object Matching in Videos Authors: Josef Sivic and Andrew Zisserman University of Oxford ICCV 2003.
Information Retrieval in Practice
Information Retrieval in Practice
Information Architecture
Information Organization: Overview
Search Engine Architecture
Content-Based Image Retrieval
Proposal for Term Project
CS 2770: Computer Vision Feature Matching and Indexing
Data and Applications Security Developments and Directions
Video Google: Text Retrieval Approach to Object Matching in Videos
Color-Texture Analysis for Content-Based Image Retrieval
By Suren Manvelyan, Crocodile (nile crocodile?) By Suren Manvelyan,
Boolean Retrieval Term Vocabulary and Posting Lists Web Search Basics
Finding Clusters within a Class to Improve Classification Accuracy
CSCE 561 Information Retrieval System Models
Data Mining Chapter 6 Search Engines
Document Clustering Matt Hughes.
HTML Images.
Multimedia Information Retrieval
CS246: Information Retrieval
Video Google: Text Retrieval Approach to Object Matching in Videos
Information Organization: Overview
Information Retrieval and Web Design
Introduction to Search Engines
Information Retrieval and Web Design
Presentation transcript:

Content-Based Image Retrieval Jeff Bigham So far we haven’t really touched upon what you’d think would be one of the fundamental goals of computer vision. And that is, recognizing objects within pictures or videos well enough to allow one to search for that object by name and have pictures of it returned to you. Much like, you type words into Google and it returns documents that contain those words.

Who is Steve Seitz? 1. 7. 4. 10. 2. 8. Top 10 results from Google Images. 11. 5. 3. 9. 6. 12.

Goals Efficient use of Visual Resources Effective Search Intuitive Interfaces Understanding?

Image Search Landscape Inherits problems of text search Ambiguity Spam But vision is hard uses little or no “vision” often reduces to text search Rich UI Text Images Video

Find me an image of a… “Pretty girl doing something active, sporty in a summery setting, beach - not wearing lycra, exercise clothes - more relaxed in tee-shirt. Feature is about deodorant so girl should look active - not sweaty but happy, healthy, carefree - nothing too posed or set up - nice and natural looking.” A typical query issued to a manual image search by a marketing person that illustrates why CB-IR is very hard. http://images.google.com

images.google.com & flickr.com Collect Associated Text alt, title, longdesc attributes of img tag Information from path (e.g., cow.gif) nearby text on page Refinement remove duplicates cluster-based improvement Other PageRank, etc.

Matching Words and Pictures Learn text/image segment association Segment image Generate feature vector for each segment Associate text with the segment features Build generative probabilistic model for text given segmentation features

Matching Words and Pictures Kobus Barnard, Pinar Duygulu, Nando de Freitas, David Forsyth, David Blei, and Michael I. Jordan, "Matching Words and Pictures", Journal of Machine Learning Research, Vol 3, pp 1107-1135, 2003.

Other Ways of Searching Search by Text Search by Example Search by Drawing Search by Model

Other Search Interfaces? Josef Sivic, Andrew Zisserman, "Video Google: A Text Retrieval Approach to Object Matching in Videos," iccv, p. 1470,  Ninth IEEE International Conference on Computer Vision (ICCV'03) - Volume 2,  2003.

Video Google Viewpoint Invariant Descriptors

Video Google Visual Vocabulary

Inverted Index Document 1 Now is the time for all good men Word Document aid 1 all and 2 can come 1, 2 country for good … the 1, 1, 2 Document 1 Now is the time for all good men to come to the aid of their country. Document 2 Summer has come and passed. The innocent can never last.

Video Google (Demo of Video Google)

Computer Vision is Hard But, humans do it well Luis von Ahn (ESP Game Demo)

Vision as Security? True image search is a hard AI problem

The End