Content-Based Image Retrieval

Slides:



Advertisements
Similar presentations
Image Retrieval With Relevant Feedback Hayati Cam & Ozge Cavus IMAGE RETRIEVAL WITH RELEVANCE FEEDBACK Hayati CAM Ozge CAVUS.
Advertisements

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Content-Based Image Retrieval Selim Aksoy Department of Computer Engineering Bilkent University
Content-based Image Retrieval (CBIR) Searching a large database for images that match a query: –What kinds of databases? –What kinds of queries? –What.
Relevance Feedback and User Interaction for CBIR Hai Le Supervisor: Dr. Sid Ray.
Content-Based Image Retrieval Selim Aksoy Department of Computer Engineering Bilkent University
Introduction to Information Retrieval
Multimedia Database Systems
Three things everyone should know to improve object retrieval
Discriminative Relevance Feedback With Virtual Textual Representation For Efficient Image Retrieval Suman Karthik and C.V.Jawahar.
Image Retrieval Basics Uichin Lee KAIST KSE Slides based on “Relevance Models for Automatic Image and Video Annotation & Retrieval” by R. Manmatha (UMASS)
Chapter 8 Content-Based Image Retrieval. Query By Keyword: Some textual attributes (keywords) should be maintained for each image. The image can be indexed.
ARNOLD SMEULDERS MARCEL WORRING SIMONE SANTINI AMARNATH GUPTA RAMESH JAIN PRESENTERS FATIH CAKIR MELIHCAN TURK Content-Based Image Retrieval at the End.
Content-Based Image Retrieval Selim Aksoy Department of Computer Engineering Bilkent University
Lecture 12 Content-Based Image Retrieval
1 Content-Based Retrieval (CBR) -in multimedia systems Presented by: Chao Cai Date: March 28, 2006 C SC 561.
Content Based Image Clustering and Image Retrieval Using Multiple Instance Learning Using Multiple Instance Learning Xin Chen Advisor: Chengcui Zhang Department.
ISP 433/533 Week 2 IR Models.
Image Search Presented by: Samantha Mahindrakar Diti Gandhi.
Automatic Image Annotation and Retrieval using Cross-Media Relevance Models J. Jeon, V. Lavrenko and R. Manmathat Computer Science Department University.
CS335 Principles of Multimedia Systems Content Based Media Retrieval Hao Jiang Computer Science Department Boston College Dec. 4, 2007.
Content-based Image Retrieval (CBIR)
Stockman MSU CSE1 Image Database Access  Find images from personal collections  Find images on the web  Find images from medical cases  Find images.
T.Sharon 1 Internet Resources Discovery (IRD) Introduction to MMIR.
Visual Querying By Color Perceptive Regions Alberto del Bimbo, M. Mugnaini, P. Pala, and F. Turco University of Florence, Italy Pattern Recognition, 1998.
Visual Information Retrieval Chapter 1 Introduction Alberto Del Bimbo Dipartimento di Sistemi e Informatica Universita di Firenze Firenze, Italy.
Information retrieval Finding relevant data using irrelevant keys Example: database of photographic images sorted by number, date. DBMS: Well structured.
A fuzzy video content representation for video summarization and content-based retrieval Anastasios D. Doulamis, Nikolaos D. Doulamis, Stefanos D. Kollias.
Content-based Image Retrieval (CBIR)
SIEVE—Search Images Effectively through Visual Elimination Ying Liu, Dengsheng Zhang and Guojun Lu Gippsland School of Info Tech,
Presenting by, Prashanth B R 1AR08CS035 Dept.Of CSE. AIeMS-Bidadi. Sketch4Match – Content-based Image Retrieval System Using Sketches Under the Guidance.
Content-Based Video Retrieval System Presented by: Edmund Liang CSE 8337: Information Retrieval.
Multimedia and Time-series Data
Multimedia Databases (MMDB)
Image and Video Retrieval INST 734 Doug Oard Module 13.
Content-Based Image Retrieval Readings: Chapter 8: Queries Commercial Systems Retrieval Features Indexing in the FIDS System Lead-in to Object.
Image Retrieval Part I (Introduction). 2 Image Understanding Functions Image indexing similarity matching image retrieval (content-based method)
Content-Based Image Retrieval
COLOR HISTOGRAM AND DISCRETE COSINE TRANSFORM FOR COLOR IMAGE RETRIEVAL Presented by 2006/8.
Xiaoying Gao Computer Science Victoria University of Wellington Intelligent Agents COMP 423.
IEEE Int'l Symposium on Signal Processing and its Applications 1 An Unsupervised Learning Approach to Content-Based Image Retrieval Yixin Chen & James.
IBM QBIC: Query by Image and Video Content Jianping Fan Department of Computer Science University of North Carolina at Charlotte Charlotte, NC 28223
2005/12/021 Fast Image Retrieval Using Low Frequency DCT Coefficients Dept. of Computer Engineering Tatung University Presenter: Yo-Ping Huang ( 黃有評 )
Content-Based Image Retrieval QBIC Homepage The State Hermitage Museum db2www/qbicSearch.mac/qbic?selLang=English.
Content-Based Image Retrieval (CBIR) By: Victor Makarenkov Michael Marcovich Noam Shemesh.
1/12/ Multimedia Data Mining. Multimedia data types any type of information medium that can be represented, processed, stored and transmitted over.
Yixin Chen and James Z. Wang The Pennsylvania State University
Query by Image and Video Content: The QBIC System M. Flickner et al. IEEE Computer Special Issue on Content-Based Retrieval Vol. 28, No. 9, September 1995.
Xiaoying Gao Computer Science Victoria University of Wellington COMP307 NLP 4 Information Retrieval.
VISUAL INFORMATION RETRIEVAL Presented by Dipti Vaidya.
Relevance Feedback in Image Retrieval System: A Survey Tao Huang Lin Luo Chengcui Zhang.
Jianping Fan Department of Computer Science University of North Carolina at Charlotte Charlotte, NC Relevance Feedback for Image Retrieval.
1 Content-based Image Retrieval (CBIR) Searching a large database for images that match a query: What kinds of databases? What kinds of queries? What constitutes.
Content-Based Image Retrieval Readings: Chapter 8:
Content-Based Image Retrieval
Visual Information Retrieval
Introduction Multimedia initial focus
Multimedia Content-Based Retrieval
Content-Based Image Retrieval Readings: Chapter 8:
Personalized Social Image Recommendation
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Content-based Image Retrieval (CBIR)
Content-based Image Retrieval (CBIR)
Multimedia Information Retrieval
Project Implementation for ITCS4122
بازیابی تصاویر بر اساس محتوا
Content-based Image Retrieval (CBIR)
Content-based Image Retrieval (CBIR)
Multimedia Information Retrieval
Presentation transcript:

Content-Based Image Retrieval Selim Aksoy Department of Computer Engineering Bilkent University saksoy@cs.bilkent.edu.tr

Image retrieval Searching a large database for images that match a query: What kind of databases? What kind of queries? What constitutes a match? How do we make such searches efficient? CS 484, Fall 2015 ©2015, Selim Aksoy

Applications Art Collections Medical Image Databases Fine Arts Museum of San Francisco Medical Image Databases CT, MRI, Ultrasound, The Visible Human Scientific Databases Earth Sciences General Image Collections for Licensing Corbis, Getty Images The World Wide Web Google, Microsoft, Flickr CS 484, Fall 2015 ©2015, Selim Aksoy

Corel data set 60,000 images with annotated keywords CS 484, Fall 2015 ©2015, Selim Aksoy

Fine Arts Museum of San Francisco 80,000 images CS 484, Fall 2015 ©2015, Selim Aksoy

Query formulation Text description (keywords) Query by example Query by sketch Symbolic description (man and woman on a beach) Relevance feedback CS 484, Fall 2015 ©2015, Selim Aksoy

Google query on “rose” CS 484, Fall 2015 ©2015, Selim Aksoy

Corel query on “rose” CS 484, Fall 2015 ©2015, Selim Aksoy

Corbis query on “rose” CS 484, Fall 2015 ©2015, Selim Aksoy

Difficulties with keywords Images may not have keywords. (An image is worth … how many key-words?) Query is not easily satisfied by keywords. “A casually dressed couple gazing into each others eyes lovingly with dramatic clouds in the background.” “Pretty girl doing something active, sporty in a summery setting, beach - not wearing lycra, exercise clothes - more relaxed in tee-shirt. Feature is about deodorant so girl should look active - not sweaty but happy, healthy, carefree - nothing too posed or set up - nice and natural looking.” Content-based image retrieval (CBIR) CS 484, Fall 2015 ©2015, Selim Aksoy

Content-based image retrieval Image Database Query Image Distance Measure Retrieved Images Image Feature Extraction User Feature Space Images CS 484, Fall 2015 ©2015, Selim Aksoy

Image representations and features Iconic Global Region-based Object-based Image features: Color Texture Shape Objects and their relationships (this is the most powerful, but you have to be able to recognize the objects!) CS 484, Fall 2015 ©2015, Selim Aksoy

Image similarity Distance measures: Probabilistic similarity measures: Euclidean distance Other Lp metrics Histogram intersection Cosine distance Earth mover’s distance Probabilistic similarity measures: P( relevance | two images ) P( relevance | two images ) / P( irrelevance | two images) CS 484, Fall 2015 ©2015, Selim Aksoy

Global histograms Searching using global color histograms CS 484, Fall 2015 ©2015, Selim Aksoy

Global histograms “Airplanes” using color histograms (4/12) “Sunsets” using Gabor texture (3/12) CS 484, Fall 2015 ©2015, Selim Aksoy

Query by image content (QBIC) CS 484, Fall 2015 ©2015, Selim Aksoy

Color histograms in QBIC The QBIC color histogram distance is: dhist(I,Q) = (h(I) - h(Q))T A (h(I) - h(Q)). h(I) is a K-bin histogram of a database image. h(Q) is a K-bin histogram of the query image. A is a K x K similarity matrix. R G B Y C V 1 0 0 .5 0 .5 0 1 0 .5 .5 0 0 0 1 1 R G B Y C V ? How similar is blue to cyan? CS 484, Fall 2015 ©2015, Selim Aksoy

Color percentages in QBIC %40 red, %30 yellow, %10 black CS 484, Fall 2015 ©2015, Selim Aksoy

Color layout in QBIC CS 484, Fall 2015 ©2015, Selim Aksoy

Earth mover’s distance CS 484, Fall 2015 ©2015, Selim Aksoy

Earth mover’s distance Visualization using EMD and multidimensional scaling CS 484, Fall 2015 ©2015, Selim Aksoy

Probabilistic similarity measures Two classes: Relevance class A Irrelevance class B Bayes classifier Assign (i,j) to Discriminant function for classification Rank images according to posterior ratio values based on feature differences. CS 484, Fall 2015 ©2015, Selim Aksoy

Probabilistic similarity measures “Residential interiors” (12/12) “Fields” (12/12) CS 484, Fall 2015 ©2015, Selim Aksoy

Shape-based retrieval CS 484, Fall 2015 ©2015, Selim Aksoy

Shape-based retrieval CS 484, Fall 2015 ©2015, Selim Aksoy

Elastic shape matching CS 484, Fall 2015 ©2015, Selim Aksoy

Iconic matching CS 484, Fall 2015 ©2015, Selim Aksoy

Iconic matching Wavelet-based image compression Quantization of coefficients CS 484, Fall 2015 ©2015, Selim Aksoy

Iconic matching CS 484, Fall 2015 ©2015, Selim Aksoy

Region-based retrieval: Blobworld CS 484, Fall 2015 ©2015, Selim Aksoy

Region-based retrieval: Blobworld CS 484, Fall 2015 ©2015, Selim Aksoy

Retrieval using spatial relationships Build graph using regions and their spatial relationships. Similarity is computed using graph matching. sky sand tiger grass above adjacent inside image abstract regions CS 484, Fall 2015 ©2015, Selim Aksoy

Combining multiple features Text query on “rose” CS 484, Fall 2015 ©2015, Selim Aksoy

Combining multiple features Visual query on CS 484, Fall 2015 ©2015, Selim Aksoy

Combining multiple features Text query on “rose” and visual query on CS 484, Fall 2015 ©2015, Selim Aksoy

Video Google: object matching CS 484, Fall 2015 ©2015, Selim Aksoy

Video Google Viewpoint invariant descriptors Visual vocabulary CS 484, Fall 2015 ©2015, Selim Aksoy

Video Google Inverted index Document 1 Now is the time for all good men to come to the aid of their country. Word Document aid 1 all and 2 can come 1, 2 country for good … the 1, 1, 2 Document 2 Summer has come and passed. The innocent can never last. CS 484, Fall 2015 ©2015, Selim Aksoy

Video skimming CS 484, Fall 2015 ©2015, Selim Aksoy

Event detection, indexing, retrieval CS 484, Fall 2015 ©2015, Selim Aksoy

Informedia Digital Video Library IDVL interface returned for "El Nino" query along with different multimedia abstractions from certain documents. CS 484, Fall 2015 ©2015, Selim Aksoy

Informedia Digital Video Library IDVL interface returned for “bin ladin" query. The results can be tuned using many classifiers. CS 484, Fall 2015 ©2015, Selim Aksoy

Relevance feedback In real interactive CBIR systems, the user should be allowed to interact with the system to “refine” the results of a query until he/she is satisfied. CS 484, Fall 2015 ©2015, Selim Aksoy

Relevance feedback Example methods: Query point movement Query point is moved toward positive examples and moved away from negative examples. Weighting features The CBIR system should automatically adjust the weight that were given by the user for the relevance of previously retrieved documents. Weighting similarity measures Feature density estimation Probabilistic relevance feedback CS 484, Fall 2015 ©2015, Selim Aksoy

Relevance feedback Positive feedback Negative feedback CS 484, Fall 2015 ©2015, Selim Aksoy

Relevance feedback “Sunsets” using color histograms (1/12) Using combined features (6/12) After 1st feedback (12/12) CS 484, Fall 2015 ©2015, Selim Aksoy

Relevance feedback “Auto racing” using color histograms (3/12) Using combined features (9/12) After 1st feedback (12/12) CS 484, Fall 2015 ©2015, Selim Aksoy

Indexing for fast retrieval Use of key images and the triangle inequality for efficient retrieval. CS 484, Fall 2015 ©2015, Selim Aksoy

Indexing for fast retrieval Offline Choose a small set of key images. Store distances from database images to keys. Online (given query Q) Compute the distance from Q to each key. Obtain lower bounds on distances to database images. Threshold or return all images in order of lower bounds. CS 484, Fall 2015 ©2015, Selim Aksoy

Indexing for fast retrieval Hierarchical cellular tree CS 484, Fall 2015 ©2015, Selim Aksoy

Indexing for fast retrieval CS 484, Fall 2015 ©2015, Selim Aksoy

Performance evaluation Two traditional measures for retrieval performance in the information retrieval literature are precision and recall. Given a particular number of images retrieved, precision is defined as the percentage of retrieved images that are actually relevant, and recall is defined as the percentage of relevant images that are retrieved. CS 484, Fall 2015 ©2015, Selim Aksoy

Current research objective Query Image Retrieved Images boat User Image Database … Animals Buildings Office Buildings Houses Transportation Boats Vehicles Images Object-oriented Feature Extraction Categories CS 484, Fall 2015 ©2015, Selim Aksoy

Demos Google Image Search (http://images.google.com/) Microsoft Image Search (https://www.bing.com/images) Yahoo Image Search (http://images.search.yahoo.com/) Vitalas (http://vitalas.ercim.eu/) FIDS (http://www.cs.washington.edu/research/ imagedatabase/demo/fids/) Like Visual Shopping (http://www.like.com/) Now Google shopping (http://www.google.com/shopping) Google Similar Images http://googleblog.blogspot.com/2009/10/similar-images-graduates-from-google.html Google Image Swirl http://googleresearch.blogspot.com/2009/11/explore-images-with-google-image-swirl.html CS 484, Fall 2015 ©2015, Selim Aksoy