CH 14 Multimedia IR. Multimedia IR system The architecture of a Multimedia IR system depends on two main factors –The peculiar characteristics of multimedia.

Slides:



Advertisements
Similar presentations
Alexander Gelbukh Special Topics in Computer Science Advanced Topics in Information Retrieval Lecture 6 (book chapter 12): Multimedia.
Advertisements

XML: Extensible Markup Language
Spatial Database Systems. Spatial Database Applications GIS applications (maps): Urban planning, route optimization, fire or pollution monitoring, utility.
Lecture-7/ T. Nouf Almujally
Management Information Systems, Sixth Edition
Chapter 8 Content-Based Image Retrieval. Query By Keyword: Some textual attributes (keywords) should be maintained for each image. The image can be indexed.
Multimedia DBs. Multimedia dbs A multimedia database stores text, strings and images Similarity queries (content based retrieval) Given an image find.
Spatial Mining.
Multimedia DBs.
Indexing Time Series. Time Series Databases A time series is a sequence of real numbers, representing the measurements of a real variable at equal time.
Indexing Time Series Based on Slides by C. Faloutsos (CMU) and D. Gunopulos (UCR)
Image Search Presented by: Samantha Mahindrakar Diti Gandhi.
Efficient Similarity Search in Sequence Databases Rakesh Agrawal, Christos Faloutsos and Arun Swami Leila Kaghazian.
0 Two-dimensional color images 2-D color image (QBIC) –Compute a k-element color histogram for each image 16×10 6 → 256 A: color-to-color similarity matrix.
Chapter 2Modeling 資工 4B 陳建勳. Introduction.  Traditional information retrieval systems usually adopt index terms to index and retrieve documents.
ISP 433/633 Week 5 Multimedia IR. Goals –Increase access to media content –Decrease effort in media handling and reuse –Improve usefulness of media content.
T.Sharon 1 Internet Resources Discovery (IRD) Introduction to MMIR.
Visual Querying By Color Perceptive Regions Alberto del Bimbo, M. Mugnaini, P. Pala, and F. Turco University of Florence, Italy Pattern Recognition, 1998.
Spatial and Temporal Data Mining
Recommender systems Ram Akella February 23, 2011 Lecture 6b, i290 & 280I University of California at Berkeley Silicon Valley Center/SC.
Introduction to Databases Transparencies
ICS (072)Database Systems Background Review 1 Database Systems Background Review Dr. Muhammad Shafique.
Euripides G.M. PetrakisIR'2001 Oulu, Sept Indexing Images with Multiple Regions Euripides G.M. Petrakis Dept.
CH 11 Multimedia IR: Models and Languages
Chapter 14 The Second Component: The Database.
Recommender systems Ram Akella November 26 th 2008.
A fuzzy video content representation for video summarization and content-based retrieval Anastasios D. Doulamis, Nikolaos D. Doulamis, Stefanos D. Kollias.
Spatial and Temporal Databases Efficiently Time Series Matching by Wavelets (ICDE 98) Kin-pong Chan and Ada Wai-chee Fu.
Indexing Time Series.
Advanced Database Applications Database Indexing and Data Mining CS591-G1 -- Fall 2001 George Kollios Boston University.
1 Chapter 2 Database Environment. 2 Chapter 2 - Objectives u Purpose of three-level database architecture. u Contents of external, conceptual, and internal.
Information Retrieval in Practice
Fast Subsequence Matching in Time-Series Databases Christos Faloutsos M. Ranganathan Yannis Manolopoulos Department of Computer Science and ISR University.
Multimedia and Time-series Data
Multimedia Databases (MMDB)
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
Database System Concepts and Architecture
Querying Structured Text in an XML Database By Xuemei Luo.
COLOR HISTOGRAM AND DISCRETE COSINE TRANSFORM FOR COLOR IMAGE RETRIEVAL Presented by 2006/8.
IMAGE DATABASES Prof. Hyoung-Joo Kim OOPSLA Lab. Computer Engineering Seoul National University.
M- tree: an efficient access method for similarity search in metric spaces Reporter : Ximeng Liu Supervisor: Rongxing Lu School of EEE, NTU
5 - 1 Copyright © 2006, The McGraw-Hill Companies, Inc. All rights reserved.
Fast Subsequence Matching in Time-Series Databases Author: Christos Faloutsos etc. Speaker: Weijun He.
MANAGING DATA RESOURCES ~ pertemuan 7 ~ Oleh: Ir. Abdul Hayat, MTI.
E.G.M. PetrakisSearching Signals and Patterns1  Given a query Q and a collection of N objects O 1,O 2,…O N search exactly or approximately  The ideal.
Content-Based Image Retrieval Using Fuzzy Cognition Concepts Presented by Tienwei Tsai Department of Computer Science and Engineering Tatung University.
DATABASE MANAGEMENT SYSTEM ARCHITECTURE
2005/12/021 Content-Based Image Retrieval Using Grey Relational Analysis Dept. of Computer Engineering Tatung University Presenter: Tienwei Tsai ( 蔡殿偉.
2005/12/021 Fast Image Retrieval Using Low Frequency DCT Coefficients Dept. of Computer Engineering Tatung University Presenter: Yo-Ping Huang ( 黃有評 )
1Ellen L. Walker Category Recognition Associating information extracted from images with categories (classes) of objects Requires prior knowledge about.
Content-Based Image Retrieval QBIC Homepage The State Hermitage Museum db2www/qbicSearch.mac/qbic?selLang=English.
V. Clustering 인공지능 연구실 이승희 Text: Text mining Page:82-93.
1/12/ Multimedia Data Mining. Multimedia data types any type of information medium that can be represented, processed, stored and transmitted over.
Indexing Time Series. Outline Spatial Databases Temporal Databases Spatio-temporal Databases Multimedia Databases Time Series databases Text databases.
Query by Image and Video Content: The QBIC System M. Flickner et al. IEEE Computer Special Issue on Content-Based Retrieval Vol. 28, No. 9, September 1995.
1 Chapter 2 Database Environment Pearson Education © 2009.
FastMap : Algorithm for Indexing, Data- Mining and Visualization of Traditional and Multimedia Datasets.
BIT 3193 MULTIMEDIA DATABASE CHAPTER 4 : QUERING MULTIMEDIA DATABASES.
Data Models. 2 The Importance of Data Models Data models –Relatively simple representations, usually graphical, of complex real-world data structures.
A Self-organizing Semantic Map for Information Retrieval Xia Lin, Dagobert Soergel, Gary Marchionini presented by Yi-Ting.
Keogh, E. , Chakrabarti, K. , Pazzani, M. & Mehrotra, S. (2001)
Fast Subsequence Matching in Time-Series Databases.
Introduction Multimedia initial focus
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
Chapter 2 Database Environment Pearson Education © 2009.
Chapter 2 Database Environment.
Database Systems Instructor Name: Lecture-3.
Multimedia Information Retrieval
Chapter 2 Database Environment Pearson Education © 2009.
Chapter 2 Database Environment Pearson Education © 2009.
Presentation transcript:

CH 14 Multimedia IR

Multimedia IR system The architecture of a Multimedia IR system depends on two main factors –The peculiar characteristics of multimedia data –The kinds of operations to be performed on such data

Multimedia IR system Support variety of data –Different kinds of media Text, images (both still and moving), graphs, and sound –Mix of structured and unstructured data Metadata Semi-structured data –Data whose structure may not match, or only partially match, the structure prescribed by the data schema The system must typically extract some features from the multimedia objects

Multimedia IR system Data retrieval –Exploiting data attributes and the content of multimedia objects –Basic steps Query specification –Fuzzy predicates, content-based predicates, object attributes, structural predicates Query processing and optimization –Query is parsed and compiled into an internal form Query answer Query iteration

Multimedia IR system Combine DBMS and IR technology –DBMS: data modeling capabilities –IR system: similarity-based query capabilities

Data modeling Main tasks –A data model should be defined by which the user can specify the data to be stored into the system Support conventional and multimedia data types Provide methods to analyze, retrieve, and query such data –Provide a model for the internal representation of multimedia data

Object-oriented DBMS Provide rich data model –More suitable for modeling both multimedia data types and their semantic relationships Class –Attributes +operations –Inheritance Drawback –the performances of storage techniques, query processing, and transaction management is not comparable to that of relational DBMSs –Highly non-standard

Object-relational DBMS Extend the relational model –Represent complex data types –Maintain the performance and the simplicity of relational DBMSs and related query languages –Define abstract data types Allows one to define ad hoc data types for multimedia data

Internal representation Using attributes is not sufficient Feature –Information extracted from objects Multimedia object is represented as a set of features Features can be assigned manually, automatically, or using a hybrid approach

Internal representation Values of some specific features are assigned to a object by comparing the object with some previously classified objects Feature extraction cannot be precise –A weight is usually assigned to each feature value representing the uncertainty of assigning such a value to that feature 80% sure that a shape is a square

SQL3 Support extensible type system –Provide constructs to define user-dependent abstract data types, in an object-oriented like manner Collection data types –Sets, multisets, and lists –The elements of a collection must have compatible types

MULTOS MULTimedia Office Server –Client/server –Support filing and retrieval of multimedia objects Each document is described by a logical structure, a layout structure, and a conceptual structure Documents having similar conceptual structures are grouped into conceptual types

MULTOS Conceptual types are maintained in a hierarchy of generalization Strong type –Completely specifies the structure of its instances Weak type –Partially specifies the structure of its instances –Components of unspecified type (called spring component types) can appear in a document definition

Document PlaceDateReceiver+Sender AddressName CityStreetCountry AddressName CityStreetCountry Letter_body spring component type Conceptual structure of the type Generic_Letter

Complete conceptual structure of the type Business_Product_Letter Document PlaceDateReceiver+Sender AddressName CityStreetCountry AddressName CityStreet Company_Logo Image Country Product_Presentation Text Product_Description Text Product_Cose Text Signature Letter_body

Query languages Relational/object-oriented database system –Exact match of the values of attributes Multimedia IR system –Similarity-based approach (+exact match) Considers the structure and the content of the objects Content-based query –Retrieve multimedia objects depending on their globe content

Query languages In designing a multimedia query language, three main aspects require attention –How the user enters his/her request to the system –Which conditions on multimedia objects can be specified in the user request –How uncertainty, proximity, and weights impact the design of the query language

Request specification Interfaces –Browsing and navigation –Specifying the conditions the objects of interest must satisfy, by means of queries Queries can be specified in two different ways –Using a specific query language –Query by example Using actual data (object example)

Conditions on multimedia data Query predicates –Attribute predicates Concern the attributes for which an exact value is supplied for each object Exact-match retrieval –Structural predicates Concern the structure of multimedia objects Can be answered by metadata and information about the database schema “Find all multimedia objects containing at least one image and a video clip”

Conditions on multimedia data –Semantic predicates Concern the semantic content of the required data, depending on the features that have been extracted and stored for each multimedia object “Find all the red houses” Exact match cannot be applied

Uncertainty, proximity, and weights in query expressions Specify the degree of relevance of the retrieved objects –Using some imprecise terms and predicates Represent a set of possible acceptable values with respect to which the attribute or he features has to be matched Normal, unacceptable, typical –Particular proximity predicates The relationship represented is based on the computation of a semantic distance between the query object and stored ones Nearest object search

Uncertainty, proximity, and weights in query expressions –Assign each condition or term a given weight Specify the degree of precision by which a condition must be verified by an object “Find all the objects containing an image representing a screen (HIGH) and a keyboard (LOW)” The corresponding query is executed by assigning some importance and preference values to each predicate and term

SQL3 query language Major improvements of SQL3 –Functions and stored procedures Allow users to integrate external functionalities with data manipulation –Active database facilities Support active rules –The database is able to react to some system- or user- dependant events by executing specific actions Limitation –No IR techniques are integrated into the SQL3 query processor

MULTOS query language General form FIND DOCUMENTS VERSION version-clause SCOPE scope-clause TYPE type-clause WHERE condition-clause WITH component

MULTOS query language Three main classes of predicates –Data attributes –Textual components –Images The class to which an image should belong Existence and the number of occurrences of an object within an image Support imprecise query –Associating a preference and an importance value with the attributes in the query

MULTOS query language FIND DOCUMENTS VERSION LAST WHERE Document.Date > 1/1/1998 AND (*Sender.Name = “Olivetti” OR *Product_Presentation CONTAINS “Olivetti”) AND *Product_Description CONTAINS “Personal Computer” AND (*Address.Country = “Italy” OR TEXT CONTAINS “Italy”) AND WITH *Company_Logo

MULTOS query language FIND DOCUMENTS VERSION LAST WHERE (Document.Date BETWEEN (12/31/1998,1/31/98) PREFERRED BETWEEN (2/1/1998,2/15/98) ACCEPTABLE) HIGH AND (*Sender.Name = “Olivetti” OR *Product_Presentation CONTAINS “Olivetti”) HIGH AND (*Product_Description CONTAINS “Personal Computer”) HIGH AND (*Product_Description CONTAINS “good ergonomics”) LOW AND (*Address.Country = “Italy” OR TEXT CONTAINS “Italy”) HIGH AND WITH *Company_Logo HIGH (IMAGE MATCHES screen HIGH Keyboard HIGH AT LEAST 2 floppy_drives LOW) HIGH

Indexing and searching Searching similar patterns Distance function –Given two objects, O 1 and O 2, the distance (=dissimilarity) of the two objects is denoted by D(O 1,O 2 ) Similarity queries –Whole match –Sub-pattern match –Nearest neighbors –All pairs

Spatial access methods Map objects into points in f-D space, and to use multiattribute access methods (also referred to as spatial access methods or SAMs) to cluster them and to search for them Methods –R*-trees and the rest of the R-tree family –Linear quadtrees –Grid-files Linear quadtrees and grid files explode exponentially with the dimensionality

R-tree –Represent a spatial object by its minimum bounding rectangle (MBR) –Data rectangles are grouped to form parent nodes (recursively grouped) –The MBR of a parent node completely contains the MBRs of its children –MBRs are allowed to overlap –Nodes of the tree correspond to disk pages

R-tree Range query –Specify a region of interest, requiring all the data regions that intersect it –Retrieve Compute the MBR of the query region Recursively descend the R-tree, excluding the branches whose MBRs do not intersect the query MBR The retrieved data regions will be further examined for intersection with the query region

Generic multimedia indexing approach “Whole match” problem –A collection of N objects: O 1, O 2,…,O N –The distance/dissimilarity between two objects (O i,O j ) is given by the function D(O i,O j ) –User specifies a query object Q, and a tolerance ε –Goal Find the objects in the collection that are within distance εfrom the query object

GEMINI Generic Multimedia object INdexIng Ideas –A ‘quick-and-dirty’ test, to discard quickly the vast majority of non-qualifying objects (possibly, allowing some false alarms) –The use of spatial access methods, to achieve faster-than-sequential searching

GEMINI Example –Database: yearly stock price movements, with one price per day –Distance function Euclidean distance –The idea behind the quick-and-dirty test is to characterize a sequence with a single number (feature), which help us discard many non-qualifying sequences Average stock price over the year, standard deviation, some of the discrete Fourier transform (DFT) coefficients

GEMINI Mapping function –Let F() be the mapping of objects to f-dimensional points, that is, F(O) will be the f-D point that corresponds to object O Organize f-D points into a spatial access method, cluster them in a hierarchical structure, like the R*-trees Upon a query, we can exploit the R*-tree, to prune out large portions of the database that are not promising

GEMINI Search algorithm (for whole match query) –Map the query object Q into a point F(Q) in feature space –Using a spatial access method, retrieve all points within the desired tolerance εfrom F(Q) –Retrieve the corresponding objects, compute their actual distance from Q and discard the false alarms

GEMINI Lower Bounding lemma –To guarantee no false dismissals for whole-match queries, the feature extraction function F() should satisfy the following formula –D feature (): distance of two feature vectors (mapping F() from objects to points should make things look closer)

GEMINI GEMINI algorithm –Determine the distance function D() between two objects –Find one or more numerical feature-extraction functions, to provide a ‘quick-and-dirty’ test –Prove that the distance in feature space lower-bounds the actual distance D(), to guarantee correctness –Use a SAM (e.g., an R-tree), to store and retrieve the f-D feature vectors

GEMINI ‘Feature-extracting’ question –If we are allowed to use only one numerical feature to describe each data object, what should this feature be? The successful answers to the question should meet two goals –They should facilitate step 3 (the distance lower-bounding) –They should capture most of the characteristics of the objects

One-dimensional time series Search a collection of (equal-length) time series, to find the ones that are similar to a desirable series. –‘in a collection of yearly stock price movements, find the ones that are similar to IBM’ Distance function –Euclidean distance

One-dimensional time series Feature extraction –The coefficients of the Discrete Fourier Transform (DFT) Lower-bounding –Parseval’s theorem The DFT preserves the energy of a signal, as well as the distances between two signals

One-dimensional time series –If we keep the first f (f≤n) coefficients of the DEF as the features, we lower-bound the actual distance –There will be no false dismissals

One-dimensional time series DFT concentrates the energy in the first few coefficients, for a large class of signals, the colored noises. These signals have a skewed energy spectrum (O(F -b )) –b=2: random walks (brown noises) Model stock movements and exchange rates –b>2: black noises Model water level of rivers and rainfall patterns –b=1: pink noise ‘interesting signals’: musical scores and other works of art (white noise is unpredictable, brown noise is too predictable)

One-dimensional time series Experiments –Artificially generated random walks Sequence length n=1024 Database size N=50~400 –GEMINI vs. sequential scanning SAM: R*-tree –Response time

One-dimensional time series –GEMINI can be successfully applied to time series, and specifically to the ones that behave like ‘colored noise’ –For signals with skewed spectrum, the minimum in the response time is achieved for a mall number of Fourier coefficients (f=1,2,3). The minimum is rather flat, which implies that a suboptimal choice for f will give search time that is close to the minimum –The success in 1D series suggests that GEMINI is promising for 2D or higher-dimensionality signals, if those signals also have skewed spectrum

Two-dimensional color images QBIC (Query By Image Content) (IBM) –Query large online image databases using the images’ content as the basis of the queries –Content Color, texture, shape, position, and dominant edges of image items and regions Applications –Medical “Give me other images that contain a tumor with a texture like this one” –Photo-journalism “Give me images that have blue at the top and red at the bottom”

Two-dimensional color images Two datatypes –Image (≡scene) –Item A part of a scene –A person, a piece of outlined texture, an apple,… Image feature –Focus on the color features K-element color histogram for each item and scene, where k=256 or 64 colors Each component on the color histogram is the percentage of pixels that are most similar to that color

Two-dimensional color images Distance function A: color-to-color similarity matrix Obstacle of color indexing –Dimensionality curse –Quadratic nature of the distance function Cross-talk among the features Expansive Precludes efficient implementation of commonly used spatial access methods

Two-dimensional color images GEMINI –Feature extraction Average amount of red, green, and blue in a given color image (RGB color space) P is the number of pixels in the item R(P), G(p), B(p) are the red, green, and blue components respectively of the p-th pixel

Two-dimensional color images –Distance function –Lower-bounding Quadratic Distance Bounding Theorem

Two-dimensional color images Experiment –N=924 color images –K=256 colors –CPU time and disk accesses

Automatic feature extraction GEMINI is useful for any setting that we can extract features from Automatic feature extraction methods –Multidimensional Scaling (MDS) –FastMap Extracting features not only facilitates the use of off-the-shelf spatial access methods, but it also allows for visual data mining: we can plot a 2D or 3D projection of the data set, and inspect it for clusters, correlations, and other patterns