Presented by Zeehasham Rasheed

Slides:



Advertisements
Similar presentations
Using Large-Scale Web Data to Facilitate Textual Query Based Retrieval of Consumer Photos.
Advertisements

Image Retrieval With Relevant Feedback Hayati Cam & Ozge Cavus IMAGE RETRIEVAL WITH RELEVANCE FEEDBACK Hayati CAM Ozge CAVUS.
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
DECISION TREES. Decision trees  One possible representation for hypotheses.
Chapter 5: Introduction to Information Retrieval
Discriminative Relevance Feedback With Virtual Textual Representation For Efficient Image Retrieval Suman Karthik and C.V.Jawahar.
Relevance Feedback Content-Based Image Retrieval Using Query Distribution Estimation Based on Maximum Entropy Principle Irwin King and Zhong Jin Nov
Li-Jia Li Yongwhan Lim Li Fei-Fei Chong Wang David M. Blei B UILDING AND U SING A S EMANTIVISUAL I MAGE H IERARCHY CVPR, 2010.
The use of unlabeled data to improve supervised learning for text summarization MR Amini, P Gallinari (SIGIR 2002) Slides prepared by Jon Elsas for the.
1 CS 430 / INFO 430 Information Retrieval Lecture 8 Query Refinement: Relevance Feedback Information Filtering.
Image Search Presented by: Samantha Mahindrakar Diti Gandhi.
1 Statistical correlation analysis in image retrieval Reporter : Erica Li 2004/9/30.
CONTENT BASED FACE RECOGNITION Ankur Jain 01D05007 Pranshu Sharma Prashant Baronia 01D05005 Swapnil Zarekar 01D05001 Under the guidance of Prof.
Prénom Nom Document Analysis: Data Analysis and Clustering Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
SVM Active Learning with Application to Image Retrieval
1 Integrating User Feedback Log into Relevance Feedback by Coupled SVM for Content-Based Image Retrieval 9-April, 2005 Steven C. H. Hoi *, Michael R. Lyu.
A novel log-based relevance feedback technique in content- based image retrieval Reporter: Francis 2005/6/2.
1998/5/21by Chang I-Ning1 ImageRover: A Content-Based Image Browser for the World Wide Web Introduction Approach Image Collection Subsystem Image Query.
Dept. of Computer Science & Engineering, CUHK Pseudo Relevance Feedback with Biased Support Vector Machine in Multimedia Retrieval Steven C.H. Hoi 14-Oct,
Pattern Recognition. Introduction. Definitions.. Recognition process. Recognition process relates input signal to the stored concepts about the object.
Multiple Object Class Detection with a Generative Model K. Mikolajczyk, B. Leibe and B. Schiele Carolina Galleguillos.
A fuzzy video content representation for video summarization and content-based retrieval Anastasios D. Doulamis, Nikolaos D. Doulamis, Stefanos D. Kollias.
Relevance Feedback Content-Based Image Retrieval Using Query Distribution Estimation Based on Maximum Entropy Principle Irwin King and Zhong Jin The Chinese.
Introduction to Machine Learning Approach Lecture 5.
Transfer Learning From Multiple Source Domains via Consensus Regularization Ping Luo, Fuzhen Zhuang, Hui Xiong, Yuhong Xiong, Qing He.
Methods in Medical Image Analysis Statistics of Pattern Recognition: Classification and Clustering Some content provided by Milos Hauskrecht, University.
Thien Anh Dinh1, Tomi Silander1, Bolan Su1, Tianxia Gong
APPLICATIONS OF DATA MINING IN INFORMATION RETRIEVAL.
Exploiting Ontologies for Automatic Image Annotation M. Srikanth, J. Varner, M. Bowden, D. Moldovan Language Computer Corporation
Marcin Marszałek, Ivan Laptev, Cordelia Schmid Computer Vision and Pattern Recognition, CVPR Actions in Context.
Machine Learning in Spoken Language Processing Lecture 21 Spoken Language Processing Prof. Andrew Rosenberg.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
Finding Better Answers in Video Using Pseudo Relevance Feedback Informedia Project Carnegie Mellon University Carnegie Mellon Question Answering from Errorful.
Partially Supervised Classification of Text Documents by Bing Liu, Philip Yu, and Xiaoli Li Presented by: Rick Knowles 7 April 2005.
Automatic Image Annotation by Using Concept-Sensitive Salient Objects for Image Content Representation Jianping Fan, Yuli Gao, Hangzai Luo, Guangyou Xu.
Web Image Retrieval Re-Ranking with Relevance Model Wei-Hao Lin, Rong Jin, Alexander Hauptmann Language Technologies Institute School of Computer Science.
Enhancing Cluster Labeling Using Wikipedia David Carmel, Haggai Roitman, Naama Zwerdling IBM Research Lab (SIGIR’09) Date: 11/09/2009 Speaker: Cho, Chin.
LANGUAGE MODELS FOR RELEVANCE FEEDBACK Lee Won Hee.
PSEUDO-RELEVANCE FEEDBACK FOR MULTIMEDIA RETRIEVAL Seo Seok Jun.
Understanding User’s Query Intent with Wikipedia G 여 승 후.
Date : 2013/03/18 Author : Jeffrey Pound, Alexander K. Hudek, Ihab F. Ilyas, Grant Weddell Source : CIKM’12 Speaker : Er-Gang Liu Advisor : Prof. Jia-Ling.
Competence Centre on Information Extraction and Image Understanding for Earth Observation 29th March 2007 Category - based Semantic Search Engine 1 Mihai.
CVPR2013 Poster Detecting and Naming Actors in Movies using Generative Appearance Models.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology Mining Logs Files for Data-Driven System Management Advisor.
Probabilistic Latent Query Analysis for Combining Multiple Retrieval Sources Rong Yan Alexander G. Hauptmann School of Computer Science Carnegie Mellon.
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Supervised Learning Resources: AG: Conditional Maximum Likelihood DP:
Image Classification for Automatic Annotation
Unsupervised Mining of Statistical Temporal Structures in Video Liu ze yuan May 15,2011.
KNN & Naïve Bayes Hongning Wang Today’s lecture Instance-based classifiers – k nearest neighbors – Non-parametric learning algorithm Model-based.
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
Personalization Services in CADAL Zhang yin Zhuang Yuting Wu Jiangqin College of Computer Science, Zhejiang University November 19,2006.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
Enhanced hypertext categorization using hyperlinks Soumen Chakrabarti (IBM Almaden) Byron Dom (IBM Almaden) Piotr Indyk (Stanford)
1 CS 430 / INFO 430 Information Retrieval Lecture 12 Query Refinement and Relevance Feedback.
Computer Vision Lecture 7 Classifiers. Computer Vision, Lecture 6 Oleh Tretiak © 2005Slide 1 This Lecture Bayesian decision theory (22.1, 22.2) –General.
KNN & Naïve Bayes Hongning Wang
Unsupervised Learning Part 2. Topics How to determine the K in K-means? Hierarchical clustering Soft clustering with Gaussian mixture models Expectation-Maximization.
Queensland University of Technology
Visual Information Retrieval
Semantic Video Classification
Multimedia Information Retrieval
Image Segmentation Techniques
iSRD Spam Review Detection with Imbalanced Data Distributions
Unsupervised Learning II: Soft Clustering with Gaussian Mixture Models
Ying Dai Faculty of software and information science,
Region and Shape Extraction
Ying Dai Faculty of software and information science,
LECTURE 23: INFORMATION THEORY REVIEW
Information Retrieval
Ying Dai Faculty of software and information science,
Presentation transcript:

Presented by Zeehasham Rasheed Concept-Oriented Indexing of Video Databases: Towards Semantic Sensitive Retrieval and Browsing J.Fan, H.Luo, K.Ahmed Presented by Zeehasham Rasheed

Outline Introduction Proposed Framework Semantic Video Classification Semantic Video Database Performance Analysis Conclusion and Future work

Introduction Digital Video now plays an important role in Medical education. Several content based video retrieval (CBVR) system have been proposed. Challenging problems are semantic gap, semantic video classification and video database indexing.

Proposed Framework Semantic video content framework using Principal video shots Semantic video concept model Semantic Video Classifier training framework Concept oriented video database

Challenging Issues The performance of semantic video classifiers largely depends on the quality of features and automatic semantic video object extraction is in general very hard.

Existing CBVR systems are unable to support video access at the semantic level because of semantic gap. So to bridge the semantic gap, the rule based approach uses domain knowledge to define rules for extracting semantic video concepts.

Semantic video classification techniques can be classified into Rule based approach and statistical approach. Rule based approach provide ease to insert, delete and modify the existing rules Statistical approach uses machine learning techniques.

Semantic Sensitive Video Content Analysis It is necessary to understand what are the suitable video patterns for interpreting certain domains for medical education. A good semantic sensitive video content framework should be able to enhance the quality of features.

Developed a novel framework by using Principal video shots for video content representation and feature extraction. Based on the knowledge of Medical consultants, a set of multimodal salient objects and semantic medical concepts have been designed.

Multimodal Salient objects include visual, auditory and image textual salient objects. Visual salient objects include human faces, blood-red regions, skin region. Auditory salient objects include single human speech, multiple human speech.

Semantic Medical concepts include lecture presentation, gastrointestinal surgery, dialog, traumatic surgery. So all these are required to select the principal video shots.

Semantic Video Concept and Database Modeling Which database model can be used to support concept oriented video database Proposed a novel framework to organize large scale video collection into domain dependent concept heirarchy.

The deepest level of concept hierarchy is defined as the domain dependent elementary semantic medical concepts. For example five different types of principal video shots such as human face, slides, text lines, slides, human speech, are related to elementary semantic medical concept “Lecture Presentation”

Semantic Video Classification Major step is to classify the principal video shots into most relevant elementary semantic medical concept. Use one against all rule to label training samples Where X are the perceptual features and C is the semantic label for sample

Posterior probability that a principal video shot with feature X can be assigned to elementary semantic medical concept C is determined by Bayesian Framework.

In the last, to achieve better likelihood for higher classification accuracy, they used maximum a posterior probability (MAP) as a classifier.

The MAP estimation can be achieved by using the expectation maximization algorithm They called it adaptive EM algorithm

Testing The principal video shots and their features are extracted from test video clips. Linear Discriminant analysis is used to obtain more representative features. Given an unlabeled principal video shot and its feature values.

It is finally assigned to best elementary semantic medical concept corresponds to maximum posterior probability

Concept Oriented Video Database Organization Uses the following technique to support the statistical video database indexing Each database node (semantic medical concept node) is described by the semantic label (keyword), visual summary and statistical properties of the class distribution.

Representation of database node is done by following parameters

Hierarchical Video Retrieval Intuitive approach for the naive users to specify queries. Query Concept Specification via Browsing: support user to get a good idea of video content quickly by browsing the visual summary for semantic medical concept nodes. They can pick one or multiple video clips as their query.

Query Concept Specification via Keywords: Keywords are most useful for the naive users to specify their queries at semantic level. Query Concept Specification via Pattern combination: user can interpret query by using general combinations of principal video shots.

Query Concept Evaluation for query-by-example After query concepts are interpreted by selected video clips, search is performed. User can then label those retrieved videos as relevant or irrelevant. To improve the search results for the next iteration, some steps have been taken.

Information sample selection: Irrelevant video data samples obtained in the previous query and located in the nearest neighbor sphere is used to shrink the sampling area. So in this way, irrelevant samples are taken out from the sampling area of the current query iteration.

Best Search Direction Prediction: Relevance feedback improves the query results and reduces the size of query iterations, the best search direction for the next query iteration can be predicted by combining such nearest neighbor sphere reduction.

Query Refinement: Only the previous query vector and positive samples are used to determine query vector for the next iteration. It is based on Rocchio’s formula

Performance Analysis Benchmark Matrics: 1- Classification Accuracy (misclassification ratio versus classification accuracy ration) 2- Retrieval Accuracy (Precision versus Recall)

Experiments Our experiments are conducted on two image/video databases: skin database (i.e., marked face database) from Purdue University and medical video database. The skin database consists of 1265 face images. 150 face images are selected as the labeled samples for classifier training. The medical video database includes more than 35000 principal video shots from 45 h of MPEG medical videos, where 1500 principal video shots are selected as the training samples and labeled by our medical consultant.

Conclusion and Future Work Adaptive EM algorithm have improved the classification accuracy. A novel semantic sensitive video content framework via principal video shots have been proposed. Future work is to obtain more accurate estimation using unlabeled data.

Questions ???