IIIT Hyderabad Learning in Large Scale Image Retrieval Systems Under the guidance of: Dr. C. V. Jawahar & Dr. Vikram Pudi by Pradhee Tandon Roll No. 200607020.

Slides:



Advertisements
Similar presentations
Using Large-Scale Web Data to Facilitate Textual Query Based Retrieval of Consumer Photos.
Advertisements

Image Retrieval With Relevant Feedback Hayati Cam & Ozge Cavus IMAGE RETRIEVAL WITH RELEVANCE FEEDBACK Hayati CAM Ozge CAVUS.
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Relevance Feedback and User Interaction for CBIR Hai Le Supervisor: Dr. Sid Ray.
Image Retrieval: Current Techniques, Promising Directions, and Open Issues Yong Rui, Thomas Huang and Shih-Fu Chang Published in the Journal of Visual.
Language Models Naama Kraus (Modified by Amit Gross) Slides are based on Introduction to Information Retrieval Book by Manning, Raghavan and Schütze.
Effective Keyword Based Selection of Relational Databases Bei Yu, Guoliang Li, Karen Sollins, Anthony K.H Tung.
IIIT Hyderabad Multimodal Semantic Indexing for Image Retrieval P. L. Chandrika Advisors: Dr. C. V. Jawahar Centre for Visual Information Technology, IIIT-
VisualRank: Applying PageRank to Large-Scale Image Search Yushi Jing, Member, IEEE, and Shumeet Baluja, Member, IEEE.
Low Complexity Keypoint Recognition and Pose Estimation Vincent Lepetit.
Visualization and Cluster
Optimal Design Laboratory | University of Michigan, Ann Arbor 2011 Design Preference Elicitation Using Efficient Global Optimization Yi Ren Panos Y. Papalambros.
PHP-based Image Recognition and Retrieval of Late 18th Century Artwork Ben Goodwin Handouts are available for students writing summaries for class assignments.
Relevance Feedback Content-Based Image Retrieval Using Query Distribution Estimation Based on Maximum Entropy Principle Irwin King and Zhong Jin Nov
Multimedia Indexing and Retrieval Kowshik Shashank Project Advisor: Dr. C.V. Jawahar.
ICIP 2000, Vancouver, Canada IVML, ECE, NTUA Face Detection: Is it only for Face Recognition?  A few years earlier  Face Detection Face Recognition 
Morris LeBlanc.  Why Image Retrieval is Hard?  Problems with Image Retrieval  Support Vector Machines  Active Learning  Image Processing ◦ Texture.
Content-based Image Retrieval CE 264 Xiaoguang Feng March 14, 2002 Based on: J. Huang. Color-Spatial Image Indexing and Applications. Ph.D thesis, Cornell.
Image Search Presented by: Samantha Mahindrakar Diti Gandhi.
A Study of Approaches for Object Recognition
1 Statistical correlation analysis in image retrieval Reporter : Erica Li 2004/9/30.
ACM Multimedia th Annual Conference, October , 2004
Expectation Maximization Method Effective Image Retrieval Based on Hidden Concept Discovery in Image Database By Sanket Korgaonkar Masters Computer Science.
CS335 Principles of Multimedia Systems Content Based Media Retrieval Hao Jiang Computer Science Department Boston College Dec. 4, 2007.
Relevance Feedback based on Parameter Estimation of Target Distribution K. C. Sia and Irwin King Department of Computer Science & Engineering The Chinese.
February, Content-Based Image Retrieval Saint-Petersburg State University Natalia Vassilieva Il’ya Markov
1 Integrating User Feedback Log into Relevance Feedback by Coupled SVM for Content-Based Image Retrieval 9-April, 2005 Steven C. H. Hoi *, Michael R. Lyu.
1998/5/21by Chang I-Ning1 ImageRover: A Content-Based Image Browser for the World Wide Web Introduction Approach Image Collection Subsystem Image Query.
Video Google: Text Retrieval Approach to Object Matching in Videos Authors: Josef Sivic and Andrew Zisserman University of Oxford ICCV 2003.
Presentation in IJCNN 2004 Biased Support Vector Machine for Relevance Feedback in Image Retrieval Hoi, Chu-Hong Steven Department of Computer Science.
1 An Empirical Study on Large-Scale Content-Based Image Retrieval Group Meeting Presented by Wyman
Dept. of Computer Science & Engineering, CUHK Pseudo Relevance Feedback with Biased Support Vector Machine in Multimedia Retrieval Steven C.H. Hoi 14-Oct,
Presented by Zeehasham Rasheed
Ranking by Odds Ratio A Probability Model Approach let be a Boolean random variable: document d is relevant to query q otherwise Consider document d as.
1 BrainWave Biosolutions Limited Accelerating Life Science Research through Technology.
Relevance Feedback Content-Based Image Retrieval Using Query Distribution Estimation Based on Maximum Entropy Principle Irwin King and Zhong Jin The Chinese.
SIEVE—Search Images Effectively through Visual Elimination Ying Liu, Dengsheng Zhang and Guojun Lu Gippsland School of Info Tech,
Large-Scale Content-Based Image Retrieval Project Presentation CMPT 880: Large Scale Multimedia Systems and Cloud Computing Under supervision of Dr. Mohamed.
Jinhui Tang †, Shuicheng Yan †, Richang Hong †, Guo-Jun Qi ‡, Tat-Seng Chua † † National University of Singapore ‡ University of Illinois at Urbana-Champaign.
Multimedia Databases (MMDB)
Content-Based Image Retrieval
1 Efficient Search Ranking in Social Network ACM CIKM2007 Monique V. Vieira, Bruno M. Fonseca, Rodrigo Damazio, Paulo B. Golgher, Davi de Castro Reis,
Glasgow 02/02/04 NN k networks for content-based image retrieval Daniel Heesch.
80 million tiny images: a large dataset for non-parametric object and scene recognition CS 4763 Multimedia Systems Spring 2008.
IEEE Int'l Symposium on Signal Processing and its Applications 1 An Unsupervised Learning Approach to Content-Based Image Retrieval Yixin Chen & James.
INTERACTIVELY BROWSING LARGE IMAGE DATABASES Ronald Richter, Mathias Eitz and Marc Alexa.
Competence Centre on Information Extraction and Image Understanding for Earth Observation 29th March 2007 Category - based Semantic Search Engine 1 Mihai.
Event retrieval in large video collections with circulant temporal encoding CVPR 2013 Oral.
2005/12/021 Fast Image Retrieval Using Low Frequency DCT Coefficients Dept. of Computer Engineering Tatung University Presenter: Yo-Ping Huang ( 黃有評 )
1 A Compact Feature Representation and Image Indexing in Content- Based Image Retrieval A presentation by Gita Das PhD Candidate 29 Nov 2005 Supervisor:
Probabilistic Latent Query Analysis for Combining Multiple Retrieval Sources Rong Yan Alexander G. Hauptmann School of Computer Science Carnegie Mellon.
An Approximate Nearest Neighbor Retrieval Scheme for Computationally Intensive Distance Measures Pratyush Bhatt MS by Research(CVIT)
A Novel Visualization Model for Web Search Results Nguyen T, and Zhang J IEEE Transactions on Visualization and Computer Graphics PAWS Meeting Presented.
Content-Based Image Retrieval (CBIR) By: Victor Makarenkov Michael Marcovich Noam Shemesh.
Supplementary Slides. More experimental results MPHSM already push out many irrelevant images Query image QHDM result, 4 of 36 ground truth found ANMRR=
A Distributed Multimedia Data Management over the Grid Kasturi Chatterjee Advisors for this Project: Dr. Shu-Ching Chen & Dr. Masoud Sadjadi Distributed.
A Genetic Algorithm-Based Approach to Content-Based Image Retrieval Bo-Yen Wang( 王博彥 )
Query by Image and Video Content: The QBIC System M. Flickner et al. IEEE Computer Special Issue on Content-Based Retrieval Vol. 28, No. 9, September 1995.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Jianping Fan Department of Computer Science University of North Carolina at Charlotte Charlotte, NC Relevance Feedback for Image Retrieval.
1 Dongheng Sun 04/26/2011 Learning with Matrix Factorizations By Nathan Srebro.
Visual Information Retrieval
Kyriaki Dimitriadou, Brandeis University
Multimedia Content-Based Retrieval
Content-based Image Retrieval
Self-Organizing Maps for Content-Based Image Database Retrieval
Image Segmentation Techniques
Revision (Part II) Ke Chen
Revision (Part II) Ke Chen
Multimedia Information Retrieval
Presentation transcript:

IIIT Hyderabad Learning in Large Scale Image Retrieval Systems Under the guidance of: Dr. C. V. Jawahar & Dr. Vikram Pudi by Pradhee Tandon Roll No

IIIT Hyderabad Image Retrieval Explosive growth in images Easy access to most of these on the web Contemporary systems used tags The best commercial systems are still tag based Inadequate and unreliable Manual tagging is infeasible Content based retrieval is the best option

IIIT Hyderabad Content Based Image Retrieval Image Feature Database Feature Extraction Comparison Module Query Results QueryResults

IIIT Hyderabad Content Based Image Retrieval Image Feature Database Feature Index Feature Extraction Comparison Module Query Results

IIIT Hyderabad Content Based Image Retrieval Feature Extraction Comparison Module Query Results Feature Index Relevance Learning Memory & Logs Rf

IIIT Hyderabad Content Based Image Retrieval Feature Extraction Comparison Module Query Results Feature Index Relevance Learning Learning Memory Rf

IIIT Hyderabad Scope of work Features Color Histograms Texture Filters Shape Context SIFT GLOH Spatial indexing methods Kd – trees R-tree Distance Metrics Euclidean Mahalanobis KL Divergence Relevance Feedback Short term learning Long term learning Content Free Retrieval Active Learning Diversity Retrieval

IIIT Hyderabad Requirements Efficient Image Retrieval Learning relevant features in images Learning image-image relationships Diversity in retrieval for improved learning

IIIT Hyderabad Requirements Efficient Image Retrieval Learning relevant features in images Learning image-image relationships Diversity in retrieval for improved learning

IIIT Hyderabad FISH – The System

IIIT Hyderabad Implementation of FISH Image Representation in FISH MPEG-7 Colour Structure Descriptor Maximum Response Filters for Textures Developed on the LAMP stack, using C/C++, Perl, PHP, HTML, MySQL and Apache TPIE toolkit from Duke University for B+ tree implementation

IIIT Hyderabad Indexing Scheme Interactive response over large databases (less than a second) Efficient scalable index (dynamic with data) Similarity indexing scheme (r-tree, kd-tree, ss-tree) Support for changing similarity metrics (metric changes with learning) B+ tree based index Nataraj et. al, MMM 2007, Efficient Search with Changing Similarity Measures on Large Multimedia Datasets

IIIT Hyderabad The Retrieval Algorithm Feature Extraction Comparison Module Query Results Feature Index Relevance Learning Learning Memory Rf Retrieval in FISH

IIIT Hyderabad Retrieval Performance Retrieval times with increasing #Dimensions in ( secs ) & DB size fixed at 1 lakh Retrieval times with increasing DB size in ( secs ) & #dimensions fixed at 10

IIIT Hyderabad Requirements Efficient Image Retrieval Learning relevant features in images Learning image-image relationships Diversity in retrieval for improved learning

IIIT Hyderabad Content Based Image Retrieval Feature Extraction Comparison Module Query Results Feature Index Relevance Learning Memory & Logs Rf

IIIT Hyderabad Learning - expectations Effective – capture user intent correctly Efficient – interactive retrieval response Scalable – limited computational overhead for large collections Adaptive – caters to individual user’s subjectivities Intra-query or short term learning (STL) Evolving – incrementally improves across users and queries Inter-query or long term learning (LTL) Dynamic – seamlessly absorbs changes in the collection

IIIT Hyderabad Learning - Method Relative relevance of features using feedback Numerous methods can be used Discriminative variance is as - Weights are incrementally learnt over iterations using – At the end of the session long term learning is updated for the relevant images using – Image to image dissimilarity is computed using – Weighted Mahalanobis

IIIT Hyderabad Improved accuracy Precision across sessions using LTL Rank Convergence of top N relevant samples Sum of ranks of Top 10 relevant images converges close zero (downshifted) over multiple sessions with long term learning

IIIT Hyderabad Improved retrieval System learns the yellow flower in the hedge over sessions System learns the rock and sky pattern over sessions Top 9 results for queries across 3 different sessions (left-most are queries too)

IIIT Hyderabad Optimized Retrieval

IIIT Hyderabad Long term memory allows learning of relevant image features Converges to popular content over sessions For example, Assume, features are associated with individual pixels, colors Consider a gray image, pixels for more relevant features are colored brighter Content from Learning Actual imageContent image

IIIT Hyderabad Visual Content Extraction After a large number of sessions Over sessions

IIIT Hyderabad Requirements Efficient Image Retrieval Learning relevant features in images Learning image-image relationships Diversity in retrieval for improved learning

IIIT Hyderabad Content Based Image Retrieval Feature Extraction Comparison Module Query Results Feature Index Relevance Learning Memory & Logs Rf

IIIT Hyderabad Image-Image Relations Query Given a history of patterns in behavior and a current partial pattern, collaborative filtering predicts the next pattern for the latter Content Free Image Retrieval or CFIR, uses feedback logs to predict the next set of results for the current pattern

IIIT Hyderabad Hybrid Image Retrieval We integrate them in a Bayesian inference like framework, –The a priori relationships from logs –The evidence from visual similarity –Retrieval is an a posteriori estimation problem CBIRCFIR Semantic gapNo semantic gap No cold startCold start Handles unseen queriesCannot handle unseen queries Not affected by sparsenessAffected by sparseness

IIIT Hyderabad Bayesian Image Retrieval System Architecture of the proposed Bayesian Image Retrieval System

IIIT Hyderabad Bayesian Image Retrieval... posterior = prior * evidence Efficient a priori updates –The prior probabilities are not stored, reduces updates –Co-relevance between images are stored in a matrix –The a priori is estimated using the co-relevance values Evidence computation –Weights are learnt using discriminative variance method –Weighted Mahalanobis for (dis)similarity

IIIT Hyderabad Concept Discovery a priori matrix has embedded patterns of similar co-relevances Co-relevance patterns can be summarized into ‘k’ concepts –cluster the patterns into V concepts 1…k. –clustering is repetitive but offline –exhaustive comparisons are avoided

IIIT Hyderabad Accuracy with Bayesian Gain in precision with Bayesian Using real human user feedback logs Using annotation based feedback logs Gain in precision across sessions

IIIT Hyderabad Accuracy with Bayesian CBIR results and Bayesian results

IIIT Hyderabad Requirements Efficient Image Retrieval Learning relevant features in images Learning image-image relationships Diversity in retrieval for improved learning

IIIT Hyderabad Diversity in Image Retrieval Query

IIIT Hyderabad Skylines – the natural solution Results should be similar in a variety of different ways Skylines return non-dominated samples Non-dominated samples are closer to the query than all the others, in at-least one way (attribute)

IIIT Hyderabad Skyline Extraction Architecture of the proposed skyline based similarity retrieval system

IIIT Hyderabad Diversity with Skylines

IIIT Hyderabad Efficient Skylines Real image data with 12 and 9 dimensions with real images Synthetic data with 10 dimensions and and data points

IIIT Hyderabad Preferential Skylines Relevance feedback represents user’s preference Weights learned using feature relevance Skylines are then computed in user space

IIIT Hyderabad Designed and implemented a web- based image retrieval system, called FISH Proposed an efficient feature relevance learning algorithm Integration of complimentary CFIR and CBIR a Bayesian inference framework Skylines to retrieve diversely similar samples for a given query Contributions

IIIT Hyderabad Future directions Videos are richer and the next step Efficient higher level concept discovery is needed Skylines with preference should be explored further

IIIT Hyderabad Publications Pradhee Tandon, Piyush Nigam, Vikram Pudi, C. V. Jawahar, “FISH: A Practical System for Fast Interactive Image Search in Huge Databases”, in Proceedings of the 7th ACM International Conference on Image and Video Retrieval (CIVR ’08), July 6-8, 2008, Niagara Falls, Canada. Pradhee Tandon, C. V. Jawahar, “Long Term Learning for Content Extraction in Image Retrieval”, in Proceedings of the 15th National Conference on Communications (NCC ’09), January 16-18, 2009, Guwahati, India. Pradhee Tandon, C. V. Jawahar, “Bayesian Image Retrieval” submitted to 3rd International Conference on Pattern Recognition and Machine Intelligence (PReMI ’09), December 16-20, 2009, New Delhi, India.

IIIT Hyderabad Thank you

IIIT Hyderabad Addendum

IIIT Hyderabad The Retrieval Algorithm *Learning discussed in detail later

IIIT Hyderabad Bayesian Image Retrieval The a priori probability of retrieving image ‘a’ with query ‘q’ is P(R) = n(q,a)/n(a) –where n(a) denotes relevant retrievals for ‘a’ The evidence from visual similarity is computed as p(S|R) = f(w,q,a) –where weights ‘w’ are refined using relevance feedback The posterior probability of retrieval is computed as p(R|S) = p(S|R) P(R) –the denominator can be ignored –PicHunter is a hybrid but does no feature learning –Zhong et. al, use Bayes inference for a probabilistic decision only

IIIT Hyderabad Skyline Extraction