1
ISP 433/633 Week 5 Multimedia IR
2
Goals –Increase access to media content –Decrease effort in media handling and reuse –Improve usefulness of media content Technology –Feature Extraction –Metadata
3
Type of Multimedia 1D –Audio (speech, music, sound effects, etc.) –MIDI 2D –Photographs –Graphics 3D –Video (2D + Time) –Animation (2D + Time) –Computer graphic models 4D –Computer graphic model animation (3D + Time)
4
Typical Queries Audio –Search for songs by humming Graphics –Search for diagrams by sketching Image –Check if company logo appears in the program as contracted Video –Detect unusual movement in a surveillance video
5
Challenge of Multimedia IR Often Unstructured Content is difficult to analyze and compare –Computers don’t understand the content of multimedia –Need to specify content and semantics Large storage requirement
6
Types of Queries (features) Attribute –Speaker of an audio clip, size of a video, color distribution of an image, etc. –Have an exact match Structural –“all the objects containing one image and one video clip”, “image displays accompanied by a jingle sound” Semantic –“images with the logo of Ford company”
7
Spatial Query Queries about the spatial relationships (intersection, containment, boundary, adjacency, proximity) of entities geometrically defined and located in space Used in GIS –Georeferenced data
8
Types of Spatial Queries Point-in-polygon –What do we have in the region at (x, y)? Distance and Buffer Zone Queries –What cities lie within 40 miles of the border between Northern and Southern Ireland? Path Queries –What is the shortest route from San Francisco to Los Angeles?
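The first two query types can be sketched in a few lines of Python. This is a minimal illustration, not how a GIS implements them: `point_in_polygon` uses the standard ray-casting test, and `within_distance` simplifies the buffer zone query to a distance from a single point rather than from a border line. The function names, the square region, and the city coordinates are all made up for the example.

```python
import math

def point_in_polygon(x, y, polygon):
    """Ray casting: count how often a ray going right from (x, y) crosses an edge."""
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        # Only edges that straddle the horizontal line through y can be crossed.
        if (y1 > y) != (y2 > y):
            # x-coordinate at which the edge crosses that horizontal line.
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside

def within_distance(points, center, radius):
    """Distance query: names of all points no farther than radius from center."""
    cx, cy = center
    return [name for name, (px, py) in points.items()
            if math.hypot(px - cx, py - cy) <= radius]

# Hypothetical data: a square region and three named locations.
square = [(0, 0), (4, 0), (4, 4), (0, 4)]
cities = {"A": (1, 1), "B": (5, 5), "C": (3, 0)}

print(point_in_polygon(2, 2, square))
print(within_distance(cities, (0, 0), 3))
```

Path queries need a graph of routes and a shortest-path algorithm, so they are left out of this sketch.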
9
More Spatial Queries Multimedia Queries: Use non-map georeferenced information. –What are the names of farmers affected by flooding in Monterey and Santa Cruz Counties?
10
Spatial Indexing and Access F-dimensional space –Reduce the problem into searching points in a multi-dimensional feature space Feature functions –Map an object into a point in feature space [Figure: Object A and Object B plotted as points on the axes Feature 1 and Feature 2, with the distance between them marked]
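As a rough sketch of the idea, the Python below maps each object to a point in a 2-dimensional feature space (F = 2) and compares objects by the distance between their points. The choice of features (mean and standard deviation of a numeric signal) and the names `features` and `distance` are assumptions for illustration only; the slides do not prescribe any particular feature function.

```python
import math
from statistics import mean, pstdev

def features(signal):
    """Hypothetical feature function: map a numeric signal to a 2-D point."""
    return (mean(signal), pstdev(signal))

def distance(p, q):
    """Euclidean distance between two points in feature space."""
    return math.dist(p, q)

# Two made-up objects, each a short sequence of samples.
object_a = [1, 2, 3, 4, 5]
object_b = [2, 3, 4, 5, 6]

point_a = features(object_a)
point_b = features(object_b)
print(point_a, point_b, distance(point_a, point_b))
```

The two signals have the same spread and differ only in their mean, so their feature points differ along one axis only.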
11
Matches Whole matches –All the objects within a certain distance from query Sub-pattern match –Parts of objects within a certain distance from query Nearest neighbors match All Pairs match
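Three of these match types can be sketched as brute-force scans over points in feature space. This is an illustrative Python sketch with invented names and data; a real system would use a spatial index instead of scanning every point, and sub-pattern matching is omitted because it depends on how objects are split into parts.

```python
import math
from itertools import combinations

def whole_match(points, query, eps):
    """All objects whose feature point lies within eps of the query point."""
    return sorted(name for name, p in points.items()
                  if math.dist(p, query) <= eps)

def nearest_neighbors(points, query, k=1):
    """The k objects closest to the query point."""
    ranked = sorted(points, key=lambda name: math.dist(points[name], query))
    return ranked[:k]

def all_pairs(points, eps):
    """Every pair of objects that lie within eps of each other."""
    return [(a, b) for a, b in combinations(sorted(points), 2)
            if math.dist(points[a], points[b]) <= eps]

# Hypothetical feature points for three objects.
points = {"A": (0.0, 0.0), "B": (1.0, 0.0), "C": (5.0, 5.0)}

print(whole_match(points, (0.5, 0.0), 1.0))
print(nearest_neighbors(points, (4.0, 4.0), k=1))
print(all_pairs(points, 1.5))
```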
12
R-tree Minimum bounding rectangle (MBR)
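The building block of an R-tree can be sketched without the tree itself. The Python below computes the minimum bounding rectangle of a set of 2-D points, tests whether two rectangles overlap (the check a search uses to decide which subtrees to visit), and computes the rectangle enclosing two others (what a parent node stores for its children). The tuple layout `(min_x, min_y, max_x, max_y)` and the function names are assumptions made for this example.

```python
def mbr(points):
    """Minimum bounding rectangle of 2-D points as (min_x, min_y, max_x, max_y)."""
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    return (min(xs), min(ys), max(xs), max(ys))

def intersects(a, b):
    """True if rectangles a and b overlap or touch."""
    return a[0] <= b[2] and b[0] <= a[2] and a[1] <= b[3] and b[1] <= a[3]

def enclose(a, b):
    """Smallest rectangle that contains both a and b."""
    return (min(a[0], b[0]), min(a[1], b[1]), max(a[2], b[2]), max(a[3], b[3]))

# Hypothetical object outline and query window.
outline = [(1, 2), (3, 1), (2, 5)]
box = mbr(outline)
print(box)
print(intersects(box, (0, 0, 2, 2)))
print(enclose((0, 0, 1, 1), (2, 2, 3, 3)))
```

Because a rectangle test is cheap, a search can discard whole groups of objects whose enclosing rectangle misses the query window before looking at any individual object.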
13
Feature Extraction How to represent an object with numerical feature values? Feature function –Preserve the distance between objects –Capture the characteristics of objects Don’t want too many dimensions; much is still in research –MDS, DSP, machine vision
14
Color Image Color Histogram – 256 dimension RGB average – 3 dimension
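Both features can be sketched in plain Python on a list of `(r, g, b)` pixels with 8-bit channels. The slide does not say how the 256 histogram bins are defined; this sketch assumes one common choice, quantizing each pixel to 3 bits of red, 3 bits of green, and 2 bits of blue (8 x 8 x 4 = 256 colors). The function names and the sample pixels are made up.

```python
def rgb_average(pixels):
    """3-dimensional feature: mean of the R, G, and B channels."""
    n = len(pixels)
    return tuple(sum(p[c] for p in pixels) / n for c in range(3))

def color_histogram(pixels):
    """256-dimensional feature: pixel counts per quantized color.

    Assumed binning: top 3 bits of R, top 3 bits of G, top 2 bits of B.
    """
    hist = [0] * 256
    for r, g, b in pixels:
        bin_index = ((r >> 5) << 5) | ((g >> 5) << 2) | (b >> 6)
        hist[bin_index] += 1
    return hist

# Hypothetical 4-pixel image: two red pixels, one blue, one green.
pixels = [(255, 0, 0), (255, 0, 0), (0, 0, 255), (0, 255, 0)]

avg = rgb_average(pixels)
hist = color_histogram(pixels)
print(avg)
print([(i, count) for i, count in enumerate(hist) if count])
```

The average is far cheaper to store and compare, but it discards the color distribution: very different images can share the same three averages while having clearly different histograms.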
15
Example Feature Extraction Product Smart Fire Alert (Fastcom technologies)
16
Problem of Automatic Feature Extraction Mismatch between percepts and concepts –Similar Percepts / Dissimilar Concepts [Images: Clown Nose / Red Sun]
17
Problem of Automatic Feature Extraction Dissimilar Percepts / Similar Concepts [Images: A Car / Another Car]
18
Metadata Content representation of the media Creation (annotation) –During capture –After capture Use metadata to manipulate media –Storage –Indexing –Search
19
Multimedia Content Description Interface (MPEG-7) Create standardized multimedia description framework Support range of abstraction levels from low-level signal characteristics to high-level semantic information
20
MPEG Moving Picture Experts Group (MPEG) Working group of ISO/IEC in charge of the development of standards for coded representation of digital audio and video Established in 1988, the group has produced –MPEG-1 Standard on which such products as Video CD and MP3 are based –MPEG-2 Standard on which such products as Digital Television set top boxes and DVD are based –MPEG-4 Standard for multimedia for the fixed and mobile web –MPEG-7 Standard for description and search of audio and visual content –MPEG-21 "Multimedia Framework" standard, work on which started in June 2000
21
Application of MPEG-7
22
MPEG-7 Structure
23
MPEG-7 Top Level Hierarchy
24
MPEG-7 Conceptual Description
25
MPEG-7 Still Image Description
26
MPEG-7 Video Segments Example
27
MPEG-7 Segment Relationship Graph
28
Some MPEG-7 Application Types Extraction from Media Search / Retrieval Others –Transcoding –Description Filtering
29
Example Application IBM VideoAnn - assists authors in the task of annotating video sequences with MPEG-7 metadata
30
More Example Applications 3D Murale - 3D Measurement & Virtual Reconstruction of Ancient Lost Worlds of Europe (EU: IST Project) Real-time video identification - monitors broadcast TV programs and identifies their contents (NEC) Virage - a digital asset management system for processing, indexing, storing and publishing video Content providers adopting MPEG-7: emusic.com
31
Challenges Creating metadata –Represent action sequences and higher level narrative structures –Integrate legacy metadata (keywords, natural language) –Gather more and better metadata at the point of capture (develop metadata cameras) –Develop “human-in-the-loop” indexing algorithms and interfaces Using metadata –Integrate linguistic and other query interfaces
32
Multimedia IR demos QBIC –http://www.hermitagemuseum.org/fcgi-bin/db2www/qbicSearch.mac/qbic?selLang=English