Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki 2008-11629.

Slides:

Advertisements

Similar presentations

Query Classification Using Asymmetrical Learning Zheng Zhu Birkbeck College, University of London.

Advertisements

ECG Signal processing (2)

Feature Selection as Relevant Information Encoding Naftali Tishby School of Computer Science and Engineering The Hebrew University, Jerusalem, Israel NIPS.

Supervised Learning Techniques over Twitter Data Kleisarchaki Sofia.

Machine learning continued Image source:

Software Quality Ranking: Bringing Order to Software Modules in Testing Fei Xing Michael R. Lyu Ping Guo.

Relevance Feedback Content-Based Image Retrieval Using Query Distribution Estimation Based on Maximum Entropy Principle Irwin King and Zhong Jin Nov

Content Based Image Clustering and Image Retrieval Using Multiple Instance Learning Using Multiple Instance Learning Xin Chen Advisor: Chengcui Zhang Department.

Pattern Recognition and Machine Learning

Chapter 11 Beyond Bag of Words. Question Answering n Providing answers instead of ranked lists of documents n Older QA systems generated answers n Current.

1 CS 430: Information Discovery Lecture 22 Non-Textual Materials 2.

Morris LeBlanc.  Why Image Retrieval is Hard?  Problems with Image Retrieval  Support Vector Machines  Active Learning  Image Processing ◦ Texture.

Image Search Presented by: Samantha Mahindrakar Diti Gandhi.

Automatic Image Annotation and Retrieval using Cross-Media Relevance Models J. Jeon, V. Lavrenko and R. Manmathat Computer Science Department University.

CS335 Principles of Multimedia Systems Content Based Media Retrieval Hao Jiang Computer Science Department Boston College Dec. 4, 2007.

Relevance Feedback based on Parameter Estimation of Target Distribution K. C. Sia and Irwin King Department of Computer Science & Engineering The Chinese.

Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.

1 LM Approaches to Filtering Richard Schwartz, BBN LM/IR ARDA 2002 September 11-12, 2002 UMASS.

Visual Information Retrieval Chapter 1 Introduction Alberto Del Bimbo Dipartimento di Sistemi e Informatica Universita di Firenze Firenze, Italy.

Dept. of Computer Science & Engineering, CUHK Pseudo Relevance Feedback with Biased Support Vector Machine in Multimedia Retrieval Steven C.H. Hoi 14-Oct,

Presented by Zeehasham Rasheed

ICME 2004 Tzvetanka I. Ianeva Arjen P. de Vries Thijs Westerveld A Dynamic Probabilistic Multimedia Retrieval Model.

Face Processing System Presented by: Harvest Jang Group meeting Fall 2002.

Review Rong Jin. Comparison of Different Classification Models  The goal of all classifiers Predicating class label y for an input x Estimate p(y|x)

Multimedia Data Mining Arvind Balasubramanian Multimedia Lab (ECSS 4.416) The University of Texas at Dallas.

Overview of Search Engines

Introduction to machine learning

Information Retrieval in Practice

Slide Image Retrieval: A Preliminary Study Guo Min Liew and Min-Yen Kan National University of Singapore Web IR / NLP Group (WING)

MediaEval Workshop 2011 Pisa, Italy 1-2 September 2011.

Multimedia Databases (MMDB)

Adaptive News Access Daniel Billsus Presented by Chirayu Wongchokprasitti.

Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.

COMMON EVALUATION FINAL PROJECT Vira Oleksyuk ECE 8110: Introduction to machine Learning and Pattern Recognition.

1 CS 430 / INFO 430 Information Retrieval Lecture 23 Non-Textual Materials 2.

Finding Better Answers in Video Using Pseudo Relevance Feedback Informedia Project Carnegie Mellon University Carnegie Mellon Question Answering from Errorful.

Data Mining Chapter 1 Introduction -- Basic Data Mining Tasks -- Related Concepts -- Data Mining Techniques.

Glasgow 02/02/04 NN k networks for content-based image retrieval Daniel Heesch.

CS 782 – Machine Learning Lecture 4 Linear Models for Classification  Probabilistic generative models  Probabilistic discriminative models.

Automatic Image Annotation by Using Concept-Sensitive Salient Objects for Image Content Representation Jianping Fan, Yuli Gao, Hangzai Luo, Guangyou Xu.

1 CS 430: Information Discovery Lecture 22 Non-Textual Materials: Informedia.

Survey of Approaches to Information Retrieval of Speech Message Kenney Ng Spoken Language Systems Group Laboratory for Computer Science Massachusetts Institute.

LANGUAGE MODELS FOR RELEVANCE FEEDBACK Lee Won Hee.

PSEUDO-RELEVANCE FEEDBACK FOR MULTIMEDIA RETRIEVAL Seo Seok Jun.

Visual Categorization With Bags of Keypoints Original Authors: G. Csurka, C.R. Dance, L. Fan, J. Willamowski, C. Bray ECCV Workshop on Statistical Learning.

Probabilistic Latent Query Analysis for Combining Multiple Retrieval Sources Rong Yan Alexander G. Hauptmann School of Computer Science Carnegie Mellon.

1Ellen L. Walker Category Recognition Associating information extracted from images with categories (classes) of objects Requires prior knowledge about.

Image Classification for Automatic Annotation

Data Mining, ICDM '08. Eighth IEEE International Conference on Duy-Dinh Le National Institute of Informatics Hitotsubashi, Chiyoda-ku Tokyo,

Chapter 20 Classification and Estimation Classification – Feature selection Good feature have four characteristics: –Discrimination. Features.

KNN & Naïve Bayes Hongning Wang Today’s lecture Instance-based classifiers – k nearest neighbors – Non-parametric learning algorithm Model-based.

Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.

1 CS 430 / INFO 430 Information Retrieval Lecture 17 Metadata 4.

NTU & MSRA Ming-Feng Tsai

Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:

Learning Kernel Classifiers 1. Introduction Summarized by In-Hee Lee.

Statistical techniques for video analysis and searching chapter Anton Korotygin.

SUPERVISED AND UNSUPERVISED LEARNING Presentation by Ege Saygıner CENG 784.

ECE 8443 – Pattern Recognition ECE 8527 – Introduction to Machine Learning and Pattern Recognition Objectives: Bayes Rule Mutual Information Conditional.

Digital Video Library - Jacky Ma.

Visual Information Retrieval

Large-Scale Content-Based Audio Retrieval from Text Queries

Introduction Multimedia initial focus

Multimedia Information Retrieval

Compact Query Term Selection Using Topically Related Text

Image Segmentation Techniques

Multimedia Information Retrieval

Content Based Image Retrieval

EM Algorithm and its Applications

Presentation transcript:

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki

Quiz Whats Negative Pseudo-Relevance feedback in multimedia retrieval?

Introduction As a result of high demand of content based access to video information. Content based implies that searching can be done not only through manually indexed terms-directly evaluate if the video content(image and the audio) is similar to the query. Need to allow users to query and retrieve based on the audio information and the imagery of the video, Content-based video retrieval (CBVR) or Content based multimedia retrieval Using pattern recognition technique. NPRF retrieves images/items which are not similar to the query or relevant information.

CBVR rely on pre-defined generic similarity which determines the distance between the two images. Limitations include:- a.visual features representation limited to capturing fairly low-level physical features(color, texture or shape). b.Different query scenarios require different similarity metrics to model the distribution of examples. E.g. Sky and water(sea)

Standard relevance system iteratively asks the user more training examples as relevant or non-relevant for the learning algorithms. After a interactive relevance feedback, the system must then re- build a new classifier. The top-ranked example from generic similarity metric doesnt always make the correct result due to poor performance of the current visual information retrieval in applications. e.g.-cars shape

The Information Digital Video Library System. Focuses on information extraction from video. Involves the integration of speech, image and natural language. After retrieving the metadata, the system enables full content search and retrieval of the spoken language and visual documents.

Informedia interface provides multiple levels of abstractions including:- a.Visual Icons with relevance measure b.Short titles or headlines c.Topic identification of stories d.Filmstrip(storyboard) views e.Transcript f.Dynamic maps g.Active video skims h.Face detections and recognition i.Image retrieval

Relevance and Pseudo-Relevance Feedback in Information Retrieval Main retrieval technique Pseudo-Relevance Feedback is an automatic retrieval approach without any user intervention. Starting with a small no. of positive examples and no negative examples, then extract the strong negative to train the classifier. Transductive learning and co-training are two of paradigms to utilize the information of unlabeled data. Co-training is used to the multimedia retrieval since redundant information is available from different modalities.

Pseudo-Relevance Feedback Define the query-text description plus audio, image or video. Video retrieval algorithm retrieves a set of relevant video shots from given data collections. Taking target(T)and query(Q) the retrieval algorithm should provide permutation of the video shots t(i) in T which is sorted by their similarity to the user queries q(i) in Q. The difference between two video segments is measured through a similarity metric between their feature vectors. Then the video collected are separated into two parts of each query positive examples (T+) and Negative examples T(-). Precision and recall are performance measures for retrieval systems, But we use mean average precision since we want the rank. Precision after every retrieved relevant shot is computed and these precisions are averaged. The average precision of this average precision gives the mean average precision. The main idea in PRF approach is to automatically feed back the data which are identified based on generic similarity metric

Analysis We can define the positive distance d+ as the distance between the positive data T+ and the queries. The negative distance d- is defined also. The distance d+ and d- will converge towards a gaussian distribution when the no. of examples goes to infinity. Therefore the probability density function(pdf) p(x) for both distance are in form of,

Which sometimes is also called the error function er f(x).

Statistical Model for average precision Let p(t) be the probability density of T for the data distribution, p(+)t (positive) an d P(-)t(Negative) distributions.

Probabilistic Output and combination Fusion, combining the base metric and PRF metric. Reduce the prediction variance and offer more stable results. Linearly normalize the scores to a certain interval e.g.[-1,1] As a result all scores(-,+) are gaussian distributed, then we can obtain the probability by applying bayes rule. Parametric sigmoid model to fit the posterior directly

Base Similarity Metric Algorithm used to generate the base retrieval scores. Expressed as follows; Can handle multiple examples in arbitrary metric spaces. Retrieval algorithm is assigned a score for each video frame, while the basic unit is a v ideo shot(Multiple frames)-choose the maximal retrieval score of a frame within a vid eo shots retrieval score.

Sampling strategy No. of feedback training examples will be sampled as the input to a learning algorithm(+ e.g.) Subset of the e.g. that are dissimilar to the queries will be considered as (- e.g.).

Classification Algorithm SVMs are known to yield good generalization performance compared to other classification algorithms. The decision function is of the form;

Query

Results

Conclusion Improved information retrieval, negative pseudo-relevance feedback. Using learning algorithm for classification-very successful. Multimedia query e.g. provide the (+) training e.g. for machine learning theory (-) training e.g. are obtained from the initial simple Euclidian similarity metric. SVM classifier that learns to weight the discriminating features-improves retrieval performance. NPRF shows the ability to separate Gaussian distribution of the (-) and (+) image reducing the variances.

Answer 3 rd slide. Questions? Thanks