Multimedia Databases: Where are we?

Multimedia Databases: Where are we?
Vincent Oria, Department of Computer Science, New Jersey Institute of Technology
2005 Knowledge Fusion Workshop

Multimedia Databases: Challenges
- Retrieval and indexing, e.g. “Give me all video sequences showing Steve laughing”
- Storage, communication and performance
- Standards (MPEG)
- Streaming and resource scheduling: continuous data such as audio and video need continuous transport (QoS)
- Authoring
- Presentation

Outline
- Mono- or multi-media databases (or data repositories)
- Image content analysis and description
- Image querying
- Multimedia metadata standards
- Conclusion

Multimedia Database Architecture
[Figure: architecture diagram with two stages, "Multimedia Data Preprocessing System" and "Database Processing". Labels: MM Data, Pre-processor, Meta-Data, Recognized components, Additional Information, MM Data Instance, Multimedia DBMS, Query Interface, Users.]

Document Database Architecture
[Figure: architecture diagram with two stages, "Document Processing System" and "Database Processing". Labels: DTD files, DTD Manager, DTD Parser, Type Generator, DTD, C++ Types, SGML/XML Documents, SGML/XML Parser, Parse Tree, Instance Generator, C++ Objects, Document content, XML or SGML Document Instance, Multimedia DBMS, Query Interface, Users.]

Image Database Architecture
[Figure: architecture diagram with two stages, "Image Processing System" and "Database Processing". Labels: Image, Image Analysis and Pattern Recognition, Image Annotation, Syntactic Objects, Semantic Objects, Image Content Description, Meta-Data, Multimedia DBMS, Query Interface, Users.]

Video Database Architecture
[Figure: architecture diagram with two stages, "Video Processing System" and "Database Processing". Labels: Video, Video Analysis and Pattern Recognition, Video Annotation, Key Frames, Video Content Description, Meta-Data, Multimedia DBMS, Query Interface, Users.]

Multimedia at the Confluence of Many Disciplines
[Figure: disciplines contributing to multimedia]
- Database Systems: DBMS, modeling, query processing, …
- Artificial Intelligence: machine learning, neural networks, agents, knowledge representation, …
- Computer Graphics: human-computer interaction, 3D representation, …
- Information Retrieval: indexing, inverted files, …
- Visualization
- Networks: communication, timing, QoS, …
- Mathematics: coding, statistical and mathematical modeling, …
- Other

Image Content Analysis
Image content analysis can be categorized into two groups:
- Low-level features: vectors in a multi-dimensional space (color, texture, shape)
- Mid- to high-level features: try to infer semantics
Semantic gap (between low-level features and inferred semantics)

Image Content Analysis: Color
- Color space: a multidimensional space in which each dimension is a color component
- Examples of color spaces: RGB, CIELAB, CIE L*u*v*, YCbCr, YIQ, YUV, HSV
- RGB space: a color is a linear combination of 3 primary colors (red, green and blue)
- Color quantization: used to reduce the color resolution of an image
- Three widely used color features: global color histogram, local color histogram, dominant color
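Sketch (not from the slides): color quantization followed by a global color histogram, assuming 8-bit RGB pixels and a uniform quantizer. The function names, the `levels` parameter and the tiny test image are illustrative assumptions.

```python
from collections import Counter

def quantize(pixel, levels=4):
    """Map an 8-bit (r, g, b) pixel to a coarse bin with `levels` steps per channel."""
    return tuple(channel * levels // 256 for channel in pixel)

def global_color_histogram(pixels, levels=4):
    """Return {bin: fraction of pixels} over the quantized colors of the whole image."""
    if not pixels:
        return {}
    counts = Counter(quantize(p, levels) for p in pixels)
    return {bin_: n / len(pixels) for bin_, n in counts.items()}

# Hypothetical 2x2 image given as a flat list of RGB pixels.
image = [(255, 0, 0), (250, 10, 5), (0, 0, 255), (0, 255, 0)]
hist = global_color_histogram(image)
print(hist)
```

The two reddish pixels fall into the same bin after quantization, which is the point of reducing color resolution before counting. A local color histogram would apply the same function to each image block separately.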

Color Histograms
- Color histograms indicate color distribution without spatial information
- Color histogram distance metrics
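Sketch (not from the slides): the transcript does not preserve which distance metrics the slide listed, so the two below (L1 distance and histogram intersection) are assumed here only because they are commonly used for normalized histograms. The histograms and bin names are made up for illustration.

```python
def l1_distance(h1, h2):
    """Sum of absolute bin differences; 0 means identical histograms."""
    bins = set(h1) | set(h2)
    return sum(abs(h1.get(b, 0.0) - h2.get(b, 0.0)) for b in bins)

def intersection_similarity(h1, h2):
    """Histogram intersection; 1 means identical normalized histograms."""
    return sum(min(h1[b], h2[b]) for b in set(h1) & set(h2))

# Hypothetical normalized histograms keyed by bin name.
h_a = {"red": 0.5, "green": 0.25, "blue": 0.25}
h_b = {"red": 0.25, "green": 0.25, "yellow": 0.5}

print(l1_distance(h_a, h_b), intersection_similarity(h_a, h_b))
```

Because neither function looks at pixel positions, two images with the same colors arranged differently get distance 0, which is the "without spatial information" limitation noted above.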

Image Content Analysis: Texture
- Texture refers to visual patterns with properties of homogeneity that do not result from the presence of only a single color
- Examples of texture: tree bark, clouds, water, bricks and fabrics
- Texture features: contrast, uniformity, coarseness, roughness, frequency, density and directionality
- Two types of texture descriptors:
  - Statistical model-based: explores the gray-level spatial dependence of texture and extracts meaningful statistics as the texture representation
  - Transform-based: DCT, Fourier-Mellin transform, polar Fourier transform, Gabor and wavelet transforms
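Sketch (not from the slides): one way to realize a statistical descriptor is a gray-level co-occurrence matrix, from which contrast and uniformity are computed. The choice of a horizontal offset, the function names and the two test images are assumptions for illustration.

```python
def cooccurrence(image, dx=1, dy=0):
    """Normalized co-occurrence probabilities of gray-level pairs at offset (dx, dy)."""
    counts = {}
    rows, cols = len(image), len(image[0])
    for y in range(rows):
        for x in range(cols):
            ny, nx = y + dy, x + dx
            if 0 <= ny < rows and 0 <= nx < cols:
                pair = (image[y][x], image[ny][nx])
                counts[pair] = counts.get(pair, 0) + 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}

def contrast(p):
    """Large when neighboring gray levels differ strongly."""
    return sum(prob * (i - j) ** 2 for (i, j), prob in p.items())

def uniformity(p):
    """Large when few gray-level pairs dominate (also called energy)."""
    return sum(prob ** 2 for prob in p.values())

# Hypothetical 4x4 gray-level images: a flat patch and vertical stripes.
flat = [[1, 1, 1, 1] for _ in range(4)]
stripes = [[0, 3, 0, 3] for _ in range(4)]

for name, img in (("flat", flat), ("stripes", stripes)):
    p = cooccurrence(img)
    print(name, contrast(p), uniformity(p))
```

The flat patch has zero contrast and maximal uniformity, while the striped patch has high contrast, so the two statistics already separate these textures.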

Image Content Analysis: Shape
- Object segmentation approaches: global threshold-based approach, region growing, split-and-merge approach, edge detection approach
- Segmentation is still a difficult problem in computer vision; in general, perfect segmentation is difficult to achieve
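Sketch (not from the slides): the simplest of the listed approaches, a global threshold, on a gray-level image stored as nested lists. Using the mean intensity as the threshold is an assumption made for brevity; the slides do not say how the threshold is chosen.

```python
def mean_threshold(image):
    """Assumed threshold choice: the mean gray level of the image."""
    pixels = [p for row in image for p in row]
    return sum(pixels) / len(pixels)

def threshold_segment(image, threshold):
    """Label each pixel 1 (object) if brighter than the threshold, else 0 (background)."""
    return [[1 if pixel > threshold else 0 for pixel in row] for row in image]

# Hypothetical 3x3 image with a bright region on the right.
image = [
    [10, 12, 200],
    [11, 210, 220],
    [9, 10, 205],
]
mask = threshold_segment(image, mean_threshold(image))
for row in mask:
    print(row)
```

A single global threshold only works when object and background intensities are well separated, which is one reason segmentation remains difficult in general.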

Salient Objects vs. Salient Points
Generic low-level description of images in terms of salient objects and salient points
[Figure: original images, segmented images with region boundaries, and extracted salient points]

Modeling Images – Principles
- Support for multiple representations of an image
- Support for user-defined categorization of images
- Well-defined set of operations on images
- An image can have (semantic, functional, spatial) relationships with other images (or documents), which should be represented in the DBMS
- An image is composed of salient objects (meaningful image components)

Salient Object Modeling
- Multiple representations of a salient object (grid, vector) are allowed
- A salient object O is of a particular type, which belongs to a user-defined hierarchy of salient object types
- An image component may have (semantic, functional, spatial) relationships with other salient objects

General Approach to Similarity Queries
- A descriptor is a point in a multidimensional space
- Querying consists of defining a metric on the space and computing distances between the query point and the points in the space

Vector-Based Similarity Searches
1) Extract N numerical features from each object and map the objects to points in an N-dimensional space
2) Use a suitable distance (e.g., Euclidean) over that space, and search for “close” objects using a multi-dimensional (“spatial”) index (low distance = high similarity)
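Sketch (not from the slides): the two steps above with Euclidean distance, assuming the feature vectors have already been extracted. A linear scan stands in for the multi-dimensional index, and the database contents and names are invented for illustration.

```python
import math

def knn(query, database, k):
    """Return the k (name, distance) pairs closest to `query` by Euclidean distance."""
    scored = [(name, math.dist(query, vector)) for name, vector in database.items()]
    scored.sort(key=lambda item: item[1])
    return scored[:k]

# Hypothetical 3-dimensional feature vectors (N = 3).
database = {
    "img_a": (0.9, 0.1, 0.0),
    "img_b": (0.1, 0.8, 0.1),
    "img_c": (0.8, 0.2, 0.0),
}
nearest = knn((1.0, 0.0, 0.0), database, k=2)
print(nearest)
```

A spatial index such as an R-tree would replace the linear scan in practice; the ranking it returns should be the same.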

User-Defined Similarity
- Using the same distance function is not always appropriate
- Example: retrieve (only) black points
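Sketch (not from the slides): one common way to let a user adapt the notion of similarity is a weighted Euclidean distance, where the weights say how much each feature matters. This is an assumed illustration of why a single fixed distance is not always appropriate; the slide's own example figure is not preserved, and the vectors below are invented.

```python
import math

def weighted_distance(p, q, weights):
    """Euclidean distance in which each dimension is scaled by a user-chosen weight."""
    return math.sqrt(sum(w * (a - b) ** 2 for a, b, w in zip(p, q, weights)))

# Hypothetical (color, texture) descriptors.
query = (0.0, 0.5)
x = (0.1, 0.9)  # close in color, far in texture
y = (0.4, 0.5)  # far in color, identical in texture

equal = (1.0, 1.0)
color_only = (1.0, 0.0)

print(weighted_distance(query, x, equal), weighted_distance(query, y, equal))
print(weighted_distance(query, x, color_only), weighted_distance(query, y, color_only))
```

With equal weights y is the nearer object; with color-only weights x is nearer, so the answer to the same query depends on the distance function the user selects.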

Query Processing
- Integration problem for complex image queries
- Limits of the common approach
- How qualitative preferences contribute to an effective and efficient solution
- BesTop Operator
- iMPO Algorithm

Integration of Sub-Query Results
- Image similarity queries are processed by splitting the overall query into sub-queries. How can the sub-query results be integrated effectively and efficiently?
- The common approach is to define a monotonic scoring function (e.g. min or avg) that associates with each object Ii a numeric value si (its overall score)
- An object is better than (preferred to) another iff its overall score is higher
[Figure: for image I1, feature extraction (f0,1 to f0,3 and f1,1 to f1,3), feature comparison (s1,1 to s1,3) and scoring function (s1)]
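Sketch (not from the slides): combining per-feature sub-scores with the two monotonic scoring functions named above, min and avg. The sub-score values are invented for illustration.

```python
def avg(values):
    """Arithmetic mean of the sub-scores."""
    return sum(values) / len(values)

def rank(sub_scores, combine):
    """Order object IDs by overall score s_i = combine(sub-scores), best first."""
    overall = {obj: combine(scores) for obj, scores in sub_scores.items()}
    return sorted(overall, key=overall.get, reverse=True)

# Hypothetical sub-scores (s_i,1, s_i,2, s_i,3) for two images.
sub_scores = {
    "I1": [0.9, 0.3, 0.8],
    "I2": [0.6, 0.6, 0.6],
}

print(rank(sub_scores, min))  # ranking under min
print(rank(sub_scores, avg))  # ranking under avg
```

The two functions disagree on which image is best, which illustrates one limit of the approach: the result depends on a scoring function that the user has to pick in advance.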

Common Approach: Efficiency
- To minimize the number of DB accesses, “middleware” algorithms are applied to return only the top-k highest-scored objects: TA [FLN01], MEDRANK [FKS03], Upper [BGM02]
[Figure: ranked result lists of sub-query C (COLOR) and sub-query T (TEXTURE) over images I1 to I5 with scores Si,C and Si,T; a STOP marker shows where the top-k scan ends]
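Sketch (not from the slides): a simplified version of the threshold algorithm (TA) idea, assuming every object appears in every ranked list and that random access to any sub-score is available. The list orders follow the object IDs in the figure residue, but all score values are invented for illustration.

```python
import heapq

def threshold_algorithm(lists, k, combine=min):
    """Top-k over ranked lists {feature: [(object, score), ...]} sorted by descending score.

    Returns (top-k list of (object, overall score), number of positions scanned).
    """
    lookup = {name: dict(entries) for name, entries in lists.items()}  # random access
    depth = len(next(iter(lists.values())))
    seen = {}
    for position in range(depth):
        last_scores = []
        for entries in lists.values():
            obj, score = entries[position]  # sorted access
            last_scores.append(score)
            if obj not in seen:
                seen[obj] = combine([lookup[f][obj] for f in lists])
        # No unseen object can score higher than the combined last-seen scores.
        threshold = combine(last_scores)
        top = heapq.nlargest(k, seen.items(), key=lambda item: item[1])
        if len(top) == k and top[-1][1] >= threshold:
            return top, position + 1
    return heapq.nlargest(k, seen.items(), key=lambda item: item[1]), depth

texture = [("I2", 0.9), ("I1", 0.8), ("I3", 0.7), ("I5", 0.4), ("I4", 0.2)]
color = [("I2", 0.95), ("I3", 0.85), ("I4", 0.6), ("I1", 0.5), ("I5", 0.1)]

top2, scanned = threshold_algorithm({"texture": texture, "color": color}, k=2)
print(top2, scanned)
```

In this example the scan stops after three of the five positions, so I5 is never scored; the early stop relies on the scoring function being monotonic.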

Skyline Operator
- The Skyline operator returns the set of objects that are not dominated by any other object
- “An object dominates another object if it is equal or better in all its dimensions and better in at least one dimension”

Hotel name   Distance to beach (km)   Price ($)
Opera        0.5                      300
Chariot      2.5                      100
Star         3                        50
Hilton       3.5                      350

[Figure: orderings of the tuples t1 to t4]
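Sketch (not from the slides): the dominance test and a naive skyline computation applied to the hotel table, assuming that a smaller distance and a lower price are both "better". Function names are illustrative.

```python
def dominates(a, b):
    """True if a is no worse than b in every dimension and strictly better in one.

    Assumes smaller values are better in all dimensions.
    """
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))

def skyline(items):
    """Names of the objects not dominated by any other object (quadratic scan)."""
    return {
        name
        for name, values in items.items()
        if not any(dominates(other, values) for other in items.values())
    }

# (distance to beach in km, price in $) from the table above.
hotels = {
    "Opera": (0.5, 300),
    "Chariot": (2.5, 100),
    "Star": (3, 50),
    "Hilton": (3.5, 350),
}
print(sorted(skyline(hotels)))
```

Hilton is dominated by Opera, which is both closer and cheaper, so it drops out; the other three each win on at least one dimension against every competitor.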

Best Operator
- Skyline returns only the “first”, best results, but what happens if the cardinality of the Skyline is low and the user wants more results?
- The Best operator has been recently proposed; it is independent of the selected partial order
- Given a partial order (PO) and a level n (n > 0), Best computes the best results (level 1), the “second choices” (level 2), …, the “n-th choices” (level n)
- Skyline computes only the first level; the Best operator takes any partial order relation as input and returns the result level by level
[Figure: partial order over the tuples t1 to t4]
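Sketch (not from the slides): one reading of "level by level", assumed here, is to repeatedly take the undominated objects and remove them before computing the next level. The partial order is passed in as a function; the dominance relation and hotel data are the illustrative ones from the Skyline slide, with smaller values assumed better.

```python
def dominates(a, b):
    """Example partial order: no worse everywhere, strictly better somewhere (smaller is better)."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))

def best_levels(items, n, better=dominates):
    """Return up to n levels; level 1 is the skyline, level 2 the skyline of the rest, and so on."""
    remaining = dict(items)
    levels = []
    while remaining and len(levels) < n:
        level = sorted(
            name
            for name, values in remaining.items()
            if not any(better(other, values) for other in remaining.values())
        )
        levels.append(level)
        for name in level:
            del remaining[name]
    return levels

hotels = {
    "Opera": (0.5, 300),
    "Chariot": (2.5, 100),
    "Star": (3, 50),
    "Hilton": (3.5, 350),
}
print(best_levels(hotels, n=2))
```

Because `better` is a parameter, any other partial order can be supplied without changing the level computation, in line with the operator being independent of the selected partial order.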

MM Example: Best
[Figure: multimedia example of the Best operator]

MPEG-7 Objectives
MPEG-7, formally called “Multimedia Content Description Interface”, will standardise:
- A language to specify description schemes, i.e. a Description Definition Language (DDL)
- A set of Description Schemes and Descriptors
- A scheme for coding the description
http://www.cselt.it/mpeg/standards/mpeg-7/mpeg-7.htm

Content Context and Objectives
- Specification of a “Multimedia Content Description Interface”
- Developed by the International Organization for Standardization (ISO) and the International Electrotechnical Commission (IEC)
- Standardized representation of multimedia metadata in XML (XML Schema Language)
- Describes audio-visual content at a number of levels (features, structures, semantics, models, …)

MPEG-7 Context and Objectives
- Content description: format independent; may be applied to analogue media; different description granularities
- Supplementary data
- Application types

Content Description in MPEG-7
- Metadata: author, producer, copyright
- Semantics: objects, events, people
- Structure: regions, segments
- Features: colors, textures, shapes

MPEG-7 Framework
[Figure: framework diagram. Labels: MM Content, Description Generation, MPEG7 Description Definition Language (DDL), MPEG7 Description Schemes (DS) & Descriptors (D), MPEG7 Description, Encoder, MPEG7 Coded Description, Decoder, Search / Query Engine, Filter Agents, User.]

MPEG-7 Example

MPEG-7 Example (cont)
<object_hierarchy type="PHYSICAL"> <!-- Physical hierarchy -->
  <object_node id="10" object_ref="0"> <!-- Portrait -->
    <object_node id="11" object_ref="1"> <!-- Father -->
      <object_node id="12" object_ref="4"/> <!-- Father’s face -->
    </object_node>
    <object_node id="13" object_ref="2"> <!-- Mother -->
      <object_node id="14" object_ref="5"/> <!-- Mother’s face -->
    </object_node>
  </object_node>
</object_hierarchy>
<object_hierarchy type="LOGICAL"> <!-- Logical hierarchy: faces in the image -->
  <object_node id="15" object_ref="3"> <!-- Faces -->
    <object_node id="16" object_ref="4"/> <!-- Father’s face -->
    <object_node id="17" object_ref="5"/> <!-- Mother’s face -->
  </object_node>
</object_hierarchy>

MPEG-21
- MPEG-7 does not take into account the organization of the infrastructure of distributed multimedia systems
- MPEG-21 was initiated in 2000 to provide mechanisms for the design of distributed multimedia systems and associated services
- A new distribution entity is proposed and validated: the Digital Item

Conclusion: Towards more Semantics
- Semantic representation: MPEG-7
- Semantic acquisition:
  - Automated approaches for all-purpose images and videos have failed
  - Manual interpretation is tedious
  - How to minimize human intervention?

Towards more Semantics: Challenges
“A picture is worth a thousand words”
“It is not so much that a picture is worth a thousand words, for many fewer words can describe a still picture for most retrieval purpose, the issue has more to do with the fact that those words vary from one person to another”
“In spite of almost fifty years of research, design of a general-purpose machine pattern recognizer remains an elusive goal.... The best pattern recognizers in most instances are humans, yet we do not understand how humans recognize patterns.”

Conclusions
- Gap between research and applications in general
- Semantic gap
What is next?
- Capture more semantics
- Real multimedia databases and applications
- Indexing and dimensionality reduction
- Integration of multimedia and classical data