CS335 Principles of Multimedia Systems Content Based Media Retrieval Hao Jiang Computer Science Department Boston College Dec. 4, 2007.

Slides:



Advertisements
Similar presentations
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Advertisements

Context-based object-class recognition and retrieval by generalized correlograms by J. Amores, N. Sebe and P. Radeva Discussion led by Qi An Duke University.
Multimedia Database Systems
Discriminative Relevance Feedback With Virtual Textual Representation For Efficient Image Retrieval Suman Karthik and C.V.Jawahar.
UCB Computer Vision Animals on the Web Tamara L. Berg CSE 595 Words & Pictures.
- Recovering Human Body Configurations: Combining Segmentation and Recognition (CVPR’04) Greg Mori, Xiaofeng Ren, Alexei A. Efros and Jitendra Malik -
CS4670 / 5670: Computer Vision Bag-of-words models Noah Snavely Object
1 Image Retrieval Hao Jiang Computer Science Department 2009.
1 Content-Based Retrieval (CBR) -in multimedia systems Presented by: Chao Cai Date: March 28, 2006 C SC 561.
Computer Vision Group, University of BonnVision Laboratory, Stanford University Abstract This paper empirically compares nine image dissimilarity measures.
Object Recognition with Invariant Features n Definition: Identify objects or scenes and determine their pose and model parameters n Applications l Industrial.
Computer Vision Group University of California Berkeley Shape Matching and Object Recognition using Shape Contexts Jitendra Malik U.C. Berkeley (joint.
Recognition using Regions CVPR Outline Introduction Overview of the Approach Experimental Results Conclusion.
Event prediction CS 590v. Applications Video search Surveillance – Detecting suspicious activities – Illegally parked cars – Abandoned bags Intelligent.
Chapter 11 Beyond Bag of Words. Question Answering n Providing answers instead of ranked lists of documents n Older QA systems generated answers n Current.
1 CS 430: Information Discovery Lecture 22 Non-Textual Materials 2.
Object retrieval with large vocabularies and fast spatial matching
Image Search Presented by: Samantha Mahindrakar Diti Gandhi.
ADVISE: Advanced Digital Video Information Segmentation Engine
Automatic Image Annotation and Retrieval using Cross-Media Relevance Models J. Jeon, V. Lavrenko and R. Manmathat Computer Science Department University.
Multimedia Search and Retrieval Presented by: Reza Aghaee For Multimedia Course(CMPT820) Simon Fraser University March.2005 Shih-Fu Chang, Qian Huang,
1998/5/21by Chang I-Ning1 ImageRover: A Content-Based Image Browser for the World Wide Web Introduction Approach Image Collection Subsystem Image Query.
T.Sharon 1 Internet Resources Discovery (IRD) Introduction to MMIR.
Visual Information Retrieval Chapter 1 Introduction Alberto Del Bimbo Dipartimento di Sistemi e Informatica Universita di Firenze Firenze, Italy.
CS335 Principles of Multimedia Systems Multimedia and Human Computer Interfaces Hao Jiang Computer Science Department Boston College Nov. 20, 2007.
Presented by Zeehasham Rasheed
Project IST_1999_ ARTISTE – An Integrated Art Analysis and Navigation Environment Review Meeting N.1: Paris, C2RMF, November 28, 2000 Workpackage.
The Chinese University of Hong Kong Department of Computer Science and Engineering Lyu0202 Advanced Audio Information Retrieval System.
Agenda Introduction Bag-of-words models Visual words with spatial location Part-based models Discriminative methods Segmentation and recognition Recognition-based.
Information Retrieval in Practice
Unsupervised Learning of Categories from Sets of Partially Matching Image Features Kristen Grauman and Trevor Darrel CVPR 2006 Presented By Sovan Biswas.
Multimedia Databases (MMDB)
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
Marcin Marszałek, Ivan Laptev, Cordelia Schmid Computer Vision and Pattern Recognition, CVPR Actions in Context.
Content-Based Image Retrieval
Improving Web Spam Classification using Rank-time Features September 25, 2008 TaeSeob,Yun KAIST DATABASE & MULTIMEDIA LAB.
Finding Better Answers in Video Using Pseudo Relevance Feedback Informedia Project Carnegie Mellon University Carnegie Mellon Question Answering from Errorful.
Video Google: A Text Retrieval Approach to Object Matching in Videos Josef Sivic and Andrew Zisserman.
Research Projects 6v81 Multimedia Database Yohan Jin, T.A.
Beyond Sliding Windows: Object Localization by Efficient Subwindow Search The best paper prize at CVPR 2008.
1 CS 430: Information Discovery Lecture 22 Non-Textual Materials: Informedia.
MSRI workshop, January 2005 Object Recognition Collected databases of objects on uniform background (no occlusions, no clutter) Mostly focus on viewpoint.
Chapter 4: Pattern Recognition. Classification is a process that assigns a label to an object according to some representation of the object’s properties.
PSEUDO-RELEVANCE FEEDBACK FOR MULTIMEDIA RETRIEVAL Seo Seok Jun.
CVPR2013 Poster Detecting and Naming Actors in Movies using Generative Appearance Models.
March 31, 1998NSF IDM 98, Group F1 Group F Multi-modal Issues, Systems and Applications.
Semantic Extraction and Semantics-Based Annotation and Retrieval for Video Databases Authors: Yan Liu & Fei Li Department of Computer Science Columbia.
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
Soon Joo Hyun Database Systems Research and Development Lab. US-KOREA Joint Workshop on Digital Library t Introduction ICU Information and Communication.
An MPEG-7 Based Semantic Album for Home Entertainment Presented by Chen-hsiu Huang 2003/08/12 Presented by Chen-hsiu Huang 2003/08/12.
Query by Image and Video Content: The QBIC System M. Flickner et al. IEEE Computer Special Issue on Content-Based Retrieval Vol. 28, No. 9, September 1995.
Computer Vision Group Department of Computer Science University of Illinois at Urbana-Champaign.
Jianping Fan Department of Computer Science University of North Carolina at Charlotte Charlotte, NC Relevance Feedback for Image Retrieval.
BIT 3193 MULTIMEDIA DATABASE CHAPTER 4 : QUERING MULTIMEDIA DATABASES.
SUMMERY 1. VOLUMETRIC FEATURES FOR EVENT DETECTION IN VIDEO correlate spatio-temporal shapes to video clips that have been automatically segmented we.
Data Mining for Surveillance Applications Suspicious Event Detection
Digital Video Library - Jacky Ma.
Visual Information Retrieval
Introduction Multimedia initial focus
Multimedia Content-Based Retrieval
Search Engine Architecture
Color-Texture Analysis for Content-Based Image Retrieval
Paper Presentation: Shape and Matching
Multimedia Information Retrieval
Data Mining for Surveillance Applications Suspicious Event Detection
Rob Fergus Computer Vision
Aim of the project Take your image Submit it to the search engine
Brief Review of Recognition + Context
Multimedia Information Retrieval
Data Mining for Surveillance Applications Suspicious Event Detection
Presentation transcript:

CS335 Principles of Multimedia Systems Content Based Media Retrieval Hao Jiang Computer Science Department Boston College Dec. 4, 2007

CS335 Principles of Multimedia Systems Introduction  With the increase of multimedia content on the web, we need methods to find the image, audio or video.  Text only schemes have limitations –A lot of manual work. –Inaccurate description.  Content based media retrieval studies how to –find media data using classification and recognition methods based on “features” in audio, image and videos. –For example, we can use an exemplar (like a sample picture) to find other similar ones.  Applications: –Digital library, personal image album and audio folders, satellite image processing and medical applications.

CS335 Principles of Multimedia Systems An Image Retrieval Example (Viper) The query input.

CS335 Principles of Multimedia Systems An Image Retrieval Example (Viper) The query output.

CS335 Principles of Multimedia Systems User feedback.

CS335 Principles of Multimedia Systems Refined results. Better?

CS335 Principles of Multimedia Systems Another query for paintings.

CS335 Principles of Multimedia Systems Painting Search Result The shortlist returned from the search.

CS335 Principles of Multimedia Systems Content Based Media Retrieval  The input –A text description. –One or more exemplar images, audio or video clips. –A sketch, e.g., a dark background with an orange disk in the the middle used to search for sunset scenes. –Or the combination or them.  The output –A shortlist of images, or audio video clips. –The instances of the event you want to find in videos or audios. –Can be structured into a web page or other documents. –Usually allows user feedback to improve the result.  The basic task in content based media retrieval is comparing and searching multimedia data.

CS335 Principles of Multimedia Systems How Do We Evaluate the Performance?  Precision and Recall –Precision = (# of relevant items) / (# of items retrieved) –Recall = (# of relevant items) / (Total # of related items in the dataset)  The procedure of drawing a Recall-Precision Curve: –Compute the relevance score for each item in the database. –Sort the list. –Assume the sorted list is like r r r n n r r r n n … and we have total 6 relevant items in the database

CS335 Principles of Multimedia Systems The Recall-Precision Curve 1/6 2/6 3/6 4/6 5/6 1 1 Precision Recall Short list is like: r r r n n r r r n n … Q: Why do not we just use a single value instead of a curve?

CS335 Principles of Multimedia Systems The “Best” Recall-Precision Curve 1 Precision Recall 1/(# of relevant items) (# of relevant items)/ (# of total items) 1

CS335 Principles of Multimedia Systems Image Retrieval Methods  To find images in a database, we have to compare images quantitatively based on “features”.  We can compare the images as a whole using features like: –Color, textures and their spatial layouts.  We can also segment images into regions and use similar features in object detection.  In some recent systems, people use salient features such as SIFT (Scale Invariant Transform) like features, learning and pattern recognition methods.

CS335 Principles of Multimedia Systems Color Histogram Methods Color only schemes tend to find many unrelated images.

CS335 Principles of Multimedia Systems Improve Color Histogram Methods  If we can separate the foreground with background the result will be improved. Foreground Background

CS335 Principles of Multimedia Systems Improve Color Histogram Methods  Their spatial relations also help to find the right object. Color Blob 2 Color Blob 1

CS335 Principles of Multimedia Systems Finding Shapes  Finding similar shapes is a very useful tool in managing large number of images.  Chamfer matching is a standard method to compare the similarity of shapes.  General Hough Transform can also be used to find shapes in images.

CS335 Principles of Multimedia Systems Shape Context  Shape context is another widely used feature in shape retrieval. C ij is the distance of shape contexts h i and h j

CS335 Principles of Multimedia Systems Improve Matching Efficiency  Fast pruning in matching –Reprehensive shape contexts –Shapemes Greg Mori, Serge Belongie, and Jitendra Malik, Shape Contexts Enable Efficient Retrieval of Similar Shapes, CVPR, 2001

CS335 Principles of Multimedia Systems Example Results Reprehensive shape contexts in shape matching

CS335 Principles of Multimedia Systems Current Trends and Challenges  We now show a more “recent” work L. Fei-Fei, R. Fergus, and P. Perona. A Bayesian approach to unsupervised One-Shot learning of Object categories. ICCV The goal is to detect whether an object appears in an image.

CS335 Principles of Multimedia Systems SIFT features are used. The good features are In fact learned from Small set of training images.

CS335 Principles of Multimedia Systems Motor bike Results.

CS335 Principles of Multimedia Systems Competitions about Object Recognition  network.org/challenges/VOC/voc2007/

CS335 Principles of Multimedia Systems Retrieve Other Multimedia Data  Audio retrieval –Find a audio clip in a large database.  Video retrieval –Find a specific video clip. –Find a video short that has specific person or action. –Browsing video …

CS335 Principles of Multimedia Systems Data Structures in Media Retrieval  In multimedia data retrieval we often need to find the “nearest Neighbor” in the database from the exemplar.  We can abstract each media object as a feature vector. Our goal is to organize the database so that we can locate the most similar vector as quickly as possible.  Q: Think of some data structures that help to improve the searching.

CS335 Principles of Multimedia Systems K-d Tree  A 2D k-d tree a b c d e f a b c df

CS335 Principles of Multimedia Systems Summary  Content based multimedia retrieval is still not mature. Many problems still need to be solved.  There is no single method that solves all the problems.  We need better object detection and classification schemes.  Other related problems like multimedia data mining are also attracting more and more interest.