Published by Lindsey Atkinson. Modified over 9 years ago.
1
Semantic Extraction and Semantics-Based Annotation and Retrieval for Video Databases
Authors: Yan Liu & Fei Li, Department of Computer Science, Columbia University
Presented by: Maleq Khan
November 13, 2002
2
Introduction
- The rapid growth and wide application of video databases create a need for fast video data retrieval in response to user queries
3
Problem Statement
- Finding video clips in a large database quickly
- Semantic interpretation of visual content
  "Find video shots where President Bush is stepping off an airplane"
- Extraction and representation of temporal information
  "Find video shots where Purdue President Martin Jischke is shaking hands with President Bush after he stepped off an airplane"
- Representation of spatial information
4
Semantics Annotation
- Manual annotation is not feasible for a large database
- The same content admits many different semantic interpretations
- Automatic annotation is needed
5
Background
- Video shot: an unbroken sequence of frames
- Key frame: a frame that represents the salient features or content of a shot
- Video scene: a collection of semantically related and temporally adjacent shots
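The definitions above can be illustrated with a minimal sketch. It assumes each frame has already been reduced to a small feature vector (for example a coarse color histogram), cuts a new shot whenever consecutive frames differ by more than a threshold, and picks as key frame the frame closest to the shot's mean feature vector. The distance measure, the threshold, and the function names `split_into_shots` and `key_frame` are illustrative assumptions, not the method of the presented paper.

```python
from typing import List, Sequence


def l1_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Sum of absolute differences between two feature vectors."""
    return sum(abs(x - y) for x, y in zip(a, b))


def split_into_shots(frames: List[Sequence[float]], threshold: float) -> List[List[int]]:
    """Group frame indices into shots (unbroken runs of similar frames).

    Assumption: a shot boundary is declared when the L1 distance between
    consecutive frame features exceeds `threshold`.
    """
    if not frames:
        return []
    shots = []
    current = [0]
    for i in range(1, len(frames)):
        if l1_distance(frames[i - 1], frames[i]) > threshold:
            shots.append(current)
            current = [i]
        else:
            current.append(i)
    shots.append(current)
    return shots


def key_frame(frames: List[Sequence[float]], shot: List[int]) -> int:
    """Return the index of the frame closest to the shot's mean feature vector.

    Assumption: "salient" is approximated by "most typical of the shot".
    """
    dims = len(frames[shot[0]])
    mean = [sum(frames[i][d] for i in shot) / len(shot) for d in range(dims)]
    return min(shot, key=lambda i: l1_distance(frames[i], mean))
```

Grouping shots into scenes would require a notion of semantic relatedness, which this sketch does not attempt.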
6
Background (continued)
- Story unit, U: a collection of interesting objects in a shot
- Locale, d: the background of a shot
- A ≡ U_i d_j: story unit U_i takes place in locale d_j
- Dialogue: A B A B A B A …, A B A C A B A B
- Action: a progressive sequence of shots with contrasting visual data content
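As a rough illustration of the dialogue patterns listed above, the sketch below labels each shot by its (story unit, locale) pair and checks whether one label recurs at every other position, which covers both example sequences on the slide. This reading of "dialogue", the minimum length of four shots, and the function name `is_dialogue` are assumptions made for the example.

```python
from typing import Hashable, Sequence


def is_dialogue(labels: Sequence[Hashable], min_len: int = 4) -> bool:
    """Return True if one shot label recurs at every other position.

    Assumption: a dialogue is a sequence in which an "anchor" label
    (such as A) alternates with other labels, as in A B A B or A B A C.
    """
    if len(labels) < min_len:
        return False
    for start in (0, 1):
        anchors = labels[start::2]       # candidate anchor positions
        others = labels[1 - start::2]    # the interleaved shots
        if len(set(anchors)) == 1 and anchors[0] not in others:
            return True
    return False
```

For instance, `is_dialogue(list("ABACABAB"))` holds because A occupies every even position, while a progressive sequence such as `list("ABCD")` does not qualify.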
7
VIMS (Video Information Management System)
- Video data
- Segmentation
- Key frame computation
- Feature extraction: color, motion, shape, …
- Video query, retrieval and production
- Video browsing
8
Semantics-Based Query
- Image matching and content-based retrieval rely on visual similarity
- They cannot answer a semantics-based query such as "A red car is running by a tree"
- Extracting the temporal and spatial information hidden in video is necessary for semantic description
9
Semantic Description Model (diagram components)
- Color, motion, direction, …
- Sample database
- Temporal diagram
- Object tags
- Object recognition
- High level description
- Object searching
- High-level retrieval
- Temporal Comp.
10
Temporal Diagram (diagram labels; the nodes are scenes)
- Link to other video, using bibliographic data
- Link to other scene, for browsing
- Objects with position and recording info
- Links based on similarity in story
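One way to picture the temporal diagram is as a graph of scene nodes, each holding its objects and typed links. The sketch below is only a guess at such a structure: the class names, field names, and the three link kinds ("video", "scene", "story") are hypothetical and chosen to mirror the labels on the slide.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class TrackedObject:
    name: str
    position: Tuple[float, float, float, float]  # x, y, width, height (assumed layout)
    recorded_at: float                           # recording info, here a timestamp (assumed)


@dataclass
class Link:
    kind: str    # "video" (bibliographic), "scene" (browsing), or "story" (similarity)
    target: str  # identifier of the linked video or scene


@dataclass
class SceneNode:
    scene_id: str
    objects: List[TrackedObject] = field(default_factory=list)
    links: List[Link] = field(default_factory=list)


def linked_targets(node: SceneNode, kind: str) -> List[str]:
    """Return the targets of all links of the given kind, in insertion order."""
    return [link.target for link in node.links if link.kind == kind]
```

Keeping the link kind explicit lets browsing follow "scene" links while a story-level query follows only "story" links.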
11
Object Tracking
- Position is identified with a boundary rectangle
- Motion is defined as a change of position relative to a still object
- If the viewing direction changes by an angle θ, multiply all position info by cos θ
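The tracking rules above can be sketched as follows. Each object is a boundary rectangle, its position is taken as the rectangle's center, and motion is the change in that position measured relative to a reference object assumed to be still. The slide does not say exactly which values the cosine factor applies to; this sketch scales the positions measured after the viewing direction changed, which is one possible reading. The names `Box`, `relative_motion`, and `is_moving`, and the tolerance value, are assumptions.

```python
import math
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class Box:
    """Boundary rectangle: top-left corner plus width and height."""
    x: float
    y: float
    w: float
    h: float

    def center(self) -> Tuple[float, float]:
        return (self.x + self.w / 2, self.y + self.h / 2)


def relative_position(obj: Box, ref: Box) -> Tuple[float, float]:
    """Position of `obj` measured from the center of the still reference object."""
    ox, oy = obj.center()
    rx, ry = ref.center()
    return (ox - rx, oy - ry)


def relative_motion(obj_before: Box, obj_after: Box,
                    ref_before: Box, ref_after: Box,
                    view_angle_change: float = 0.0) -> Tuple[float, float]:
    """Change in relative position between two frames.

    Assumption: positions in the later frame are multiplied by
    cos(view_angle_change), with the angle given in radians.
    """
    bx, by = relative_position(obj_before, ref_before)
    ax, ay = relative_position(obj_after, ref_after)
    scale = math.cos(view_angle_change)
    return (ax * scale - bx, ay * scale - by)


def is_moving(motion: Tuple[float, float], tolerance: float = 1e-6) -> bool:
    """An object counts as moving if its relative displacement exceeds the tolerance."""
    return math.hypot(*motion) > tolerance
```

Because positions are measured from the reference object, a camera pan that shifts every rectangle by the same amount yields zero relative motion for stationary objects.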
12
Dynamic Tag Building
- An array stores the semantic descriptions (tags)
- For a new query, search the tags first
- If no match is found, run the extraction procedure and add the new semantic description to the tags
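The lookup-then-extend behavior described above resembles a cache, and can be sketched as below. The extraction procedure is passed in as a plain function because the slide does not specify it; the class name `TagStore`, its method names, and the exact-match lookup are assumptions for illustration.

```python
from typing import Callable, List, Optional, Tuple


class TagStore:
    """Array of (query, semantic description) tags that grows as queries arrive."""

    def __init__(self, extractor: Callable[[str], str]) -> None:
        self._extractor = extractor            # stand-in for the extraction procedure
        self._tags: List[Tuple[str, str]] = []

    def lookup(self, query: str) -> Optional[str]:
        """Linear search of the tag array; exact string match is an assumption."""
        for stored_query, description in self._tags:
            if stored_query == query:
                return description
        return None

    def answer(self, query: str) -> str:
        """Search the tags first; on a miss, run the procedure and store the result."""
        found = self.lookup(query)
        if found is not None:
            return found
        description = self._extractor(query)
        self._tags.append((query, description))
        return description

    def __len__(self) -> int:
        return len(self._tags)
```

With this arrangement the costly extraction runs at most once per distinct query, and repeated queries are answered from the tag array.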
13
Summary
- Automatic semantic extraction
- Object tracking
- Temporal diagram
- Automatic tag building
14
Comments
- Moving objects can be identified only if a relatively still object is given
- The method cannot distinguish different kinds of motion
- The temporal diagram is not complete
- The authors claim "real-time computation for large digital library" but give no theoretical or experimental result