PicHunter A Bayesian Image Retrieval System

Slides:



Advertisements
Similar presentations
5th Intensive Course on Soil Micromorphology Naples th - 14th September Image Analysis Lecture 3 Image Processing/Analysis Basic Requirements.
Advertisements

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
CSCE643: Computer Vision Bayesian Tracking & Particle Filtering Jinxiang Chai Some slides from Stephen Roth.
ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Jensen’s Inequality (Special Case) EM Theorem.
Optimal Design Laboratory | University of Michigan, Ann Arbor 2011 Design Preference Elicitation Using Efficient Global Optimization Yi Ren Panos Y. Papalambros.
PHP-based Image Recognition and Retrieval of Late 18th Century Artwork Ben Goodwin Handouts are available for students writing summaries for class assignments.
Texture Segmentation Based on Voting of Blocks, Bayesian Flooding and Region Merging C. Panagiotakis (1), I. Grinias (2) and G. Tziritas (3)
Human-Computer Interaction Human-Computer Interaction Segmentation Hanyang University Jong-Il Park.
A KLT-Based Approach for Occlusion Handling in Human Tracking Chenyuan Zhang, Jiu Xu, Axel Beaugendre and Satoshi Goto 2012 Picture Coding Symposium.
Relevance Feedback Content-Based Image Retrieval Using Query Distribution Estimation Based on Maximum Entropy Principle Irwin King and Zhong Jin Nov
The Capacity of Color Histogram Indexing Dong-Woei Lin NTUT CSIE.
1 CS 430 / INFO 430 Information Retrieval Lecture 8 Query Refinement: Relevance Feedback Information Filtering.
NCKU CSIE Visualization & Layout for Image Libraries Baback Moghaddam, Qi Tian IEEE Int’l Conf. on CVPR 2001 Speaker: 蘇琬婷.
Content-based Image Retrieval CE 264 Xiaoguang Feng March 14, 2002 Based on: J. Huang. Color-Spatial Image Indexing and Applications. Ph.D thesis, Cornell.
1 Adaptive relevance feedback based on Bayesian inference for image retrieval Reporter : Erica Li Date :
Image Search Presented by: Samantha Mahindrakar Diti Gandhi.
CS335 Principles of Multimedia Systems Content Based Media Retrieval Hao Jiang Computer Science Department Boston College Dec. 4, 2007.
Content-Based Image Retrieval (CBIR) Student: Mihaela David Professor: Michael Eckmann Most of the database images in this presentation are from the Annotated.
Relevance Feedback based on Parameter Estimation of Target Distribution K. C. Sia and Irwin King Department of Computer Science & Engineering The Chinese.
1 Integrating User Feedback Log into Relevance Feedback by Coupled SVM for Content-Based Image Retrieval 9-April, 2005 Steven C. H. Hoi *, Michael R. Lyu.
1998/5/21by Chang I-Ning1 ImageRover: A Content-Based Image Browser for the World Wide Web Introduction Approach Image Collection Subsystem Image Query.
Object Detection and Tracking Mike Knowles 11 th January 2005
1 An Empirical Study on Large-Scale Content-Based Image Retrieval Group Meeting Presented by Wyman
Presented by Zeehasham Rasheed
Latent Semantic Analysis (LSA). Introduction to LSA Learning Model Uses Singular Value Decomposition (SVD) to simulate human learning of word and passage.
Dorin Comaniciu Visvanathan Ramesh (Imaging & Visualization Dept., Siemens Corp. Res. Inc.) Peter Meer (Rutgers University) Real-Time Tracking of Non-Rigid.
Face Processing System Presented by: Harvest Jang Group meeting Fall 2002.
A fuzzy video content representation for video summarization and content-based retrieval Anastasios D. Doulamis, Nikolaos D. Doulamis, Stefanos D. Kollias.
Relevance Feedback Content-Based Image Retrieval Using Query Distribution Estimation Based on Maximum Entropy Principle Irwin King and Zhong Jin The Chinese.
The Bayesian Image Retrieval System,PicHunter Theory, Implementation, and Psychophysical Experiments.
Optimizing Learning with SVM Constraint for Content-based Image Retrieval* Steven C.H. Hoi 1th March, 2004 *Note: The copyright of the presentation material.
Presenting by, Prashanth B R 1AR08CS035 Dept.Of CSE. AIeMS-Bidadi. Sketch4Match – Content-based Image Retrieval System Using Sketches Under the Guidance.
Modeling (Chap. 2) Modern Information Retrieval Spring 2000.
Multimedia Databases (MMDB)
FRIP: A Region-Based Image Retrieval Tool Using Automatic Image Segmentation and Stepwise Boolean AND Matching IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 7,
1 TEMPLATE MATCHING  The Goal: Given a set of reference patterns known as TEMPLATES, find to which one an unknown pattern matches best. That is, each.
1 Physical Fluctuomatics 5th and 6th Probabilistic information processing by Gaussian graphical model Kazuyuki Tanaka Graduate School of Information Sciences,
Content-Based Image Retrieval
Interactive Graph Cuts for Optimal Boundary & Region Segmentation of Objects in N-D Images (Fri) Young Ki Baik, Computer Vision Lab.
Y. Kotani · F. Ino · K. Hagihara Springer Science + Business Media B.V Reporter: 李長霖.
CSE 185 Introduction to Computer Vision Pattern Recognition 2.
Particle Filters for Shape Correspondence Presenter: Jingting Zeng.
COLOR HISTOGRAM AND DISCRETE COSINE TRANSFORM FOR COLOR IMAGE RETRIEVAL Presented by 2006/8.
Context-Sensitive Information Retrieval Using Implicit Feedback Xuehua Shen : department of Computer Science University of Illinois at Urbana-Champaign.
MSRI workshop, January 2005 Object Recognition Collected databases of objects on uniform background (no occlusions, no clutter) Mostly focus on viewpoint.
Face Detection Using Large Margin Classifiers Ming-Hsuan Yang Dan Roth Narendra Ahuja Presented by Kiang “Sean” Zhou Beckman Institute University of Illinois.
2005/12/021 Fast Image Retrieval Using Low Frequency DCT Coefficients Dept. of Computer Engineering Tatung University Presenter: Yo-Ping Huang ( 黃有評 )
1 A Compact Feature Representation and Image Indexing in Content- Based Image Retrieval A presentation by Gita Das PhD Candidate 29 Nov 2005 Supervisor:
Semi-Automatic Image Annotation Liu Wenyin, Susan Dumais, Yanfeng Sun, HongJiang Zhang, Mary Czerwinski and Brent Field Microsoft Research.
Sequential Monte-Carlo Method -Introduction, implementation and application Fan, Xin
Content-Based Image Retrieval (CBIR) By: Victor Makarenkov Michael Marcovich Noam Shemesh.
Real-Time Tracking with Mean Shift Presented by: Qiuhua Liu May 6, 2005.
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
NTU & MSRA Ming-Feng Tsai
Attila Kiss, Tamás Németh, Szabolcs Sergyán, Zoltán Vámossy, László Csink Budapest Tech Recognition of a Moving Object in a Stereo Environment Using a.
Relevance Feedback in Image Retrieval System: A Survey Tao Huang Lin Luo Chengcui Zhang.
1 Kernel Machines A relatively new learning methodology (1992) derived from statistical learning theory. Became famous when it gave accuracy comparable.
Statistical-Mechanical Approach to Probabilistic Image Processing -- Loopy Belief Propagation and Advanced Mean-Field Method -- Kazuyuki Tanaka and Noriko.
Efficient Image Classification on Vertically Decomposed Data
Multimedia Content-Based Retrieval
Content-based Image Retrieval
Efficient Image Classification on Vertically Decomposed Data
Image Segmentation Techniques
Outline S. C. Zhu, X. Liu, and Y. Wu, “Exploring Texture Ensembles by Efficient Markov Chain Monte Carlo”, IEEE Transactions On Pattern Analysis And Machine.
Authors: Wai Lam and Kon Fan Low Announcer: Kyu-Baek Hwang
Handwritten Characters Recognition Based on an HMM Model
Color Image Retrieval based on Primitives of Color Moments
Random Neural Network Texture Model
Color Image Retrieval based on Primitives of Color Moments
Presentation transcript:

PicHunter A Bayesian Image Retrieval System Ingemar Cox (1,2,3,4) T. Conway (3) Joumana Ghosn (2,3) Matt Miller (1,2,3,4) Thomas Minka (3,4) Steve Omohundro (1) Thomas Papathomas (2,3) Peter N. Yianilos (1,2,3,4)

Project overview Target Testing and the PicHunter Bayesian Multimedia Retrieval System, I.J. Cox, Matt Miller, S.M. Omohundo, P.N. Yianilos, Proceedings of the Forum on Research & Technology Advances in Digital Libraries, pp 66-75, 1996. PicHunter: Bayesian Relevance Feedback for Image Retrieval, I.J. Cox, Matt Miller, Stephen Omohundo, P.N. Yianilos, 13th International Conference on Pattern Recognition, Vol.III, Track C, pp.361-369, August 1996. Introduces PicHunter, the Bayesian framework, and describes a working system including measured user performance. Hidden Annotation in Content Based Image Retrieval, I.J. Cox, Joumana Ghosn, Matt Miller, T. Papathomas, P.N. Yianilos, IEEE Workshop on Content-Based Access of Image & Video Libraries, pp.76-81, June 1997 Introduces the idea of ``hidden annotation'', and reports results demonstrating that it improves performance.

Project overview An Optimized Interaction Strategy for Bayesian Relevance Feedback, I. J. Cox, M. L. Miller, T. Minka, P. N. Yianilos, IEEE International Conference on Computer Vision and Pattern Recognition - CVPR '98, Santa Barbara, CA, pp. 553-558, 1998. Introduces an improved stochastic image display strategy allowing the system to ``ask better questions.'' Psychophysical Studies of the Performance of an Image Database Retrieval System, T. Papathomas, T. Conway, I. Cox, J. Ghosn, M. Miller, T. Minka, P. Yianilos, Proceedings of the Human Vision & Electric Imaging III, San Jose, CA Vol 3299, pp. 591-602, January 1998 Describes Psychophysical studies of the system in a controlled environment.

Project summary The Bayesian Image Retrieval System, PicHunter: Theorgy, Implementation and Psychophysical Experiements, I. J. Cox, M. L. Miller, T. P. Minka, T. V. Papathomas, P. N. Yianilos, IEEE Transactions on Image Processing, 9, 1, 20-37, (2000)

Introduction A search consists of To date, emphasis on query phase Repeated relevance feedback To date, emphasis on query phase better representations, relevance feedback crude or non-existent Lack of quantitative measures for comparing performance of search algorithms

The main ideas Bayesian relevance feedback Quantifiable testing Learn from human interactions Model the user's actions, not his/her query Quantifiable testing Target testing Baseline testing Optimize the image display

User interface

Target testing The user is shown an image from the database. His/her task is to use the system to find it. We measure the number of interactions required. This, then, is easily compared against a simple linear search Not a perfect model for all intended uses --- but something we can measure and use for comparisons

Features Pictorial features Originally 18 global features % of pixels that are one of 11 colors Mean color saturation Median intensity of the image Image width and height A measure of global contrast Two measures of the number of edgels computed at different thresholds

Features Hidden annotation Provides semantic labels 147 attributes Boolean vector, normalized Hamming distance

Bayesian relevance feedback At denotes the current user action, Dt is the current display H the session history including the current images displayed. Thus, Ht = {D1, A1, D2, A2,… Dt, At} T is a target image.

Bayesian relevance feedback We build a predictive model P(A|T,H) Then from Bayes rule

Bayesian relevance feedback Assume time-invariance and same for all users

Absolute-distance model Only one image, Xq, in the display Dt can be selected at each iteration The probability of Ti increases or decreases depending on the distance d(Ti, Xq) P(T=Ti) = P(T=Ti) G(d(Ti, Xq))

Relative-distance model Let Q={Xq1, Xq2,…XqC} denote the set of selected in images in display Dt and Let N={Xn1, Xn2 …XnL} denote the set of unselected images Then we compute the distance difference d(Ti, Xqk) – d(T1,Xnm) for all pairs {Xqk, Xnm} The probabilities of images Tc that are closer to Xqk are increased while those closer to Xnm are decreased.

Display updating algorithm Most probable display Most informative display (Max. mutual information) Sampling Query by example

Experimental setup Database of 4522 images M/N, A/R, P/S/B 1500 annotated M/N, A/R, P/S/B Memory/ no memory (relevance feedback history) Absolute / relative distance Pictorial / semantic/ both features

Experimental notation MRB – memory, relative distance, pictorial and semantic features MAB – memory, absolute distance, pictorial and semantic features NRB – no memory, … NAB MRS – memory, relative, semantic features MRP – memory, relative, pictorial features

Experimental results Memory, metric and features MRB MAB NRB NAB MRS MRP 6 naïve users 25.4 35.8 45.5 33.2 15.6 35.1 2 exp. users 13.1 31.6 28.4 22.2 8.8 18.9

Baseline testing Similarity testing How many images are examined before the user sees a similar image? Compare to number needed when randomly searching the database

Target versus category search MRB/T MRS/T MRB/C RAND/C Naïve users 25.4 15.6 12.2 19.7 Exp. users 13.1 8.8 8.9 20.1

Improved pictorial features HSV 64-element histogram HSV 256-element autocorrelogram RGB 128-element color coherence vector

Experimental results (User learning) Before explanation After explanation Pictorial only 17.1 13.2 Pictorial and semantic 11.7 9.5

Display updating algorithms Most probable display Most informative display (Max. mutual information) Sampling Query by example

Most Probable Display Performs quite well However, greed strategy suffers from “over-learning” PicHunter “gets stuck” in a local maximum Display after display of “lions”, say

Most-Informative Display Try to minimize the total number of iterations required in a search Try to elicit as much information from the user as possible Information theory suggests entropy as an estimate of the number of questions one needs to ask to resolve the ambiguity

Most-informative display Consider the ideal (deterministic) case, in which the display consists of two images

Most-informative display Generalization to the non-deterministic case

Most informative display To perform minimization is non-trivial Perform Monte Carlo simulation Draw random displays {X1, X2… XND} from the distribution P(T=Ti) Sampling is a special case of most informative method where only one Monte Carlo sample is drawn

Simulation results: deterministic

Simulation results: deterministic

Simulation results: non-deterministic

Simulation results: non-deterministic

Experimental results: Display strategies EB’ EP’ ES RB’ MRB’ AB’ NAB’ RS MRS RP MRP Naïve users 11.3 25.8 16.0 12.0 20.4 11.8 29.6 Exp. users 6.8 10.2 8.3 8.65 11.5

Future directions More efficient algorithms Automatic detection of hidden features Explore slightly richer user interfaces Explore increased use of online learning