Using Multiple Synchronized Views Presenter: Teklu Urgessa Efficient Video Browsing.

Slides:



Advertisements
Similar presentations
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Advertisements

Automatic Video Shot Detection from MPEG Bit Stream Jianping Fan Department of Computer Science University of North Carolina at Charlotte Charlotte, NC.
Automated Shot Boundary Detection in VIRS DJ Park Computer Science Department The University of Iowa.
Procedure for Developing a Multimedia Presentation 6.02 Apply procedures to develop multimedia presentations used in business.
Using Multiple Synchronized Views Heymo Kou.  What is the two main technologies applied for efficient video browsing? (one for audio, one for visual.
Discussion on Video Analysis and Extraction, MPEG-4 and MPEG-7 Encoding and Decoding in Java, Java 3D, or OpenGL Presented by: Emmanuel Velasco City College.
Chapter 11 Beyond Bag of Words. Question Answering n Providing answers instead of ranked lists of documents n Older QA systems generated answers n Current.
1 CS 430: Information Discovery Lecture 22 Non-Textual Materials 2.
Multimedia for the Web: Creating Digital Excitement Multimedia Element -- Graphics.
ADVISE: Advanced Digital Video Information Segmentation Engine
Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,
CS335 Principles of Multimedia Systems Content Based Media Retrieval Hao Jiang Computer Science Department Boston College Dec. 4, 2007.
T.Sharon 1 Internet Resources Discovery (IRD) Video IR.
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
LYU 0102 : XML for Interoperable Digital Video Library Recent years, rapid increase in the usage of multimedia information, Recent years, rapid increase.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System.
Visual Information Retrieval Chapter 1 Introduction Alberto Del Bimbo Dipartimento di Sistemi e Informatica Universita di Firenze Firenze, Italy.
Modern Information Retrieval Chapter 1 Introduction.
Presented by Zeehasham Rasheed
LYU0103 Speech Recognition Techniques for Digital Video Library Supervisor : Prof Michael R. Lyu Students: Gao Zheng Hong Lei Mo.
1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System Supervisor: Prof Michael Lyu Presented by: Lewis Ng,
Overview of Search Engines
Application Software.  Topics Covered:  Software Categories  Desktop vs. Mobile Software  Installed vs. Web-Based Software.
Information Retrieval in Practice
Smart Learning Services Based on Smart Cloud Computing
Video Data Topic 4: Multimedia Technology. What is Video? A video is just a collection of bit-mapped images that when played quickly one after another.
Digital Sound and Video Chapter 10, Exploring the Digital Domain.
Video Data Topic 4: Multimedia Technology. What is Video? A video is just a collection of bit-mapped images that when played quickly one after another.
1 Seminar Presentation Multimedia Audio / Video Communication Standards Instructor: Dr. Imran Ahmad By: Ju Wang November 7, 2003.
Multimedia Databases (MMDB)
Multimedia Information Retrieval
Advanced Level Course. Site Extras Site Extras consist of four categories: Stationeries Site Trash Designs Components.
© 2011 The McGraw-Hill Companies, Inc. All rights reserved Chapter 6: Video.
An Overview of MPEG-21 Cory McKay. Introduction Built on top of MPEG-4 and MPEG-7 standards Much more than just an audiovisual standard Meant to be a.
Glencoe Introduction to Multimedia Chapter 9 Video 1 Chapter Video 9  Section 9.1 Video in Multimedia  Section 9.2 Work with Video Contents.
CHAPTER FOUR COMPUTER SOFTWARE.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System.
Business Software What is database software? p. 145 Allows you to create, access, and manage data Add, change, delete, sort, and retrieve data Next.
MULTIMEDIA DEFINITION OF MULTIMEDIA
CHAPTER TEN AUTHORING.
Understand business uses of presentation software and methods of distribution.
1 CS 430: Information Discovery Lecture 22 Non-Textual Materials: Informedia.
Prof. Thomas Sikora Technische Universität Berlin Communication Systems Group Thursday, 2 April 2009 Integration Activities in “Tools for Tag Generation“
March 31, 1998NSF IDM 98, Group F1 Group F Multi-modal Issues, Systems and Applications.
1 Applications of video-content analysis and retrieval IEEE Multimedia Magazine 2002 JUL-SEP Reporter: 林浩棟.
Data and Applications Security Developments and Directions Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #15 Secure Multimedia Data.
Semantic Extraction and Semantics-Based Annotation and Retrieval for Video Databases Authors: Yan Liu & Fei Li Department of Computer Science Columbia.
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
MULTIMEDIA Multimedia is the field concerned with the computer- controlled integration of text, graphics, drawings, still and moving images (Video), animation,
Procedure for Developing a Multimedia Presentation Apply procedures to develop multimedia presentations used in business.
Oman College of Management and Technology Course – MM Topic 7 Production and Distribution of Multimedia Titles CS/MIS Department.
Video Data Topic 4: Multimedia Technology. What is Video? A video is just a collection of bit-mapped images that when played quickly one after another.
Introduction to MPEG  Moving Pictures Experts Group,  Geneva based working group under the ISO/IEC standards.  In charge of developing standards for.
Ontology-based Automatic Video Annotation Technique in Smart TV Environment Jin-Woo Jeong, Hyun-Ki Hong, and Dong-Ho Lee IEEE Transactions on Consumer.
MPEG 7 &MPEG 21.
Digital Video Library - Jacky Ma.
Visual Information Retrieval
Application Software Chapter 6.
Automatic Video Shot Detection from MPEG Bit Stream
Introduction Multimedia initial focus
Presenter: Ibrahim A. Zedan
Chapter Lessons Understand the Macromedia Flash workspace
Chapter 6: Video.
Paper Reading part Seo Seok Jun.
N7 Graphic Communication
Multimedia Information Retrieval
Multimedia Content Description Interface
Multimedia Information Retrieval
Presentation transcript:

Using Multiple Synchronized Views Presenter: Teklu Urgessa Efficient Video Browsing

Authors Arnon Amir, Savitha Srinivasan and Dulce Ponceleon IBM Almaden Research Center

Key Words for Publication Video Retrieval Multimedia Browsing Video Browsing Synchronized Views Audio Time Scale Modification(TSM) Fast Playback Video Browser Anima-visualization Techniques Slide Show Adaptive Accelerating ……

Table of contents Introduction Traditional Methods Problems with Traditional Methods Advanced Technology/Methods Technology for Visual Technology for Audio Summary Reference

1. Introduction

Text Browsing Vs MMB Browsing Text Documents: Simple and fast Browsing multimedia documents is not as easy text browning It is complex and time consuming Production and application of video contents is increasing from time to me. The need for efficient way of video browsing is very crucial The paper deals with different methods of efficient video browsing.

Growth of Digital Contents

Increasing Demand in Video Content

Factors for the Fast Growth DC Digital Video Becomes common From Our Smart phones Notebooks Webcams Digital camera and camcorders Security and monitoring cameras Advanced Streaming Technology Fast Internet Access MPEG-4 format Diversity in the application areas of Video

Application Areas: Where Videos are Important Entertainment Education and Training Distance Learning: Online Distance Learning Medical and Technical Manuals Advertisements.

Problem/Challenges As the amount of video-rich (multimedia) data grows: Finding and accessing becomes critical problem from large video repositories Given Need of Users: Quick and Efficient Retrieval

Need of Research in the Area of the Efficient Video Retrieval Major research activities/efforts were underway in the last decade to find out best and efficient methods of video indexing, searching and retrieval.

Nature of Video Retrieval Research: Multidisciplinary Areas of research: Computer Vision Pattern Recognition Speech Recognition Information Retrieval …

Basic Concepts: Searching and Browsing Both Activities are tightly Coupled Searching: needs specific entries i.e. you can search for specific company or a person Browsing: A generic approach; Eg. Korean Foods or Houses A combination of both can also happen: First search the broader concept and the browse to reach at the specific concept and vice versa.

2.Traditional Methods Finding a Video Data Search through categories Similar to Internet shopping mall We search for big categories Then smaller categories …and so on… User should choose which to browse Should check whether the selected data matches what user needs Manual categorization and annotation One by one? Time consuming!

Problem with Traditional video search and browsing technologies The Authors stated that Too complicated Lack of efficient algorithm Time consuming Multimedia calculation are complex and demanding Inaccuracy Video data is increasing exponentially manual Cataloging is a big limitation Manual cataloging is error prone: lacks accuracy due subjectivity

3.Advanced Technologies for Image and Video retrieval MPEG-7 Standards Speech indexing Shot Boundary Detection Time Scale Modification of Audio Signals Storyboards, Moving Storyboards and Animation Adaptive Accelerating Fast Playback Streaming Synchronized Views

MPEG-7: Multimedia Description Standard Standardized by : International Standard Organization (ISO) International Electro-technical Commission (IEC) ISO/IEC (Multimedia content description interface) Not a video encoding format of moving pic like MPEG-1-4 MPEG uses XML to store metadata/description The description can be attached to timecode in multimedia in order to tag particular event. By this tag Able to index and search efficiently Yet, improvement is needed

Illustration: Independence between Description and Content Source :

How it works Source :

Speech Indexing Search through speech transcripts Finds familiar metaphor of free text search Automatic speech recognition (ASR) Indexed transcript → semantic information Main advantage : Representation Speech is built of words

Shot Boundary Detection Shot Boundary Detection(SBD) algorithm Completely automatic Key frames are selected and extracted Saved as JPEG files High Accuracy and Efficiency Still, fault detection problem is unsolved

Definitions Basic Concepts Frame: composed of picture elements just like a chess board Key frame: Represents shots Shot: Group of frames which represents similar frames Start key frame End key frame Animation

3 levels of Video Browsing Browsing a large Collection of Videos Browsing a ranked list of videos Browsing a single video to find relevant segments The concern is the second and the third one to extract the most important segment from the video content.

SBD  Key to Efficient Video Visualization is accurate detection of boundaries  A shot is continues Sequence of frames as captured by the camera  Often represented by single key frame in the storyboard  Shot Boundaries: Changes between shots  Created during editing phase (Hard cut, Fade, Dissolves)  Can be gradual or abrupt

SBD Algorithms Four shot boundary detection algorithms 1.Color Histogram Differences: the best and most balanced “older” algorithm: Hard Cut editing 2.Edge Change Ratio: the recently proposed algorithm: used for Hard cut, Fade and Editing 3. Standard Deviation of Pixel Intensities: For fade 4.Contrast: For dissolve

Time Scale Modification of Audio Signals Efficient video browsing needs efficient audio browsing Except images, most digital contents are audible Faster audio browsing is necessary TSM : allow speeding up or slowing down audio w/t noticeable distortion By skip pitch periods to speed up duplicate when you want to slow down Human speech signals are quasi-periodic Changing total play time: deleting or inserting small audio segment

Improvement of TSM Time Scale Modification (TSM) algorithm Waveform Synchronous Overlap(WSOLA) Time-Domain Harmonic Scaling(TDHS) technique Time-Domain, Pitch Synchronous Overlap Add Foundation and general formulation Simple time Domain Modern speech TSM algorithm Pointer Interval Controlled Overlap Add Optional and applicable to all MPEG4 audio coding Scheme Used in the paper

Synchronous Overlap-Add SOLA

Storyboards, Moving Storyboards and Animation Storyboard a set of one or more pages, each consists of a two dimensional array of key-frames, sorted in chronological order. Animation a quick slide show, where each of the key-frames is shown for a fixed short period (e.g., 0.6 seconds) Moving Storyboard (MSB) the animated key frames, fully synchronized with the original audio track. Each key-frame is shown for the entire duration of the associated shot. Example.

Example of Storyboard

Adaptive Accelerating Fast Playback Very fast video playback (without audio) Ordinary fast forward depends only on speed There is a chance to miss important scene Accelerates until new scene is met Requires less computation load

Conclusion Multimedia Browsing not as simple text browsing Studies on efficient video browsing is still underway Active accelerating fast playback Most useful at analyzing surveillance videos SBD: Useful for visual contents TSM: Useful for Audio contents Efficient Video retrieval implements the above technologies

Questions 1. Explain The different Levels of MPEG-7 description method of Visual Content 2. What Method is appropriate for Efficient Audio Retrieval 3. Is MPEG-7 a content compressing tool? If No why?, Who standardized it what is the name of its 4. What Method is efficient way visual content retrieval 5. Explain the difference that exists among Shot, Key frame and shot boundary

References Shot Boundary Detection Key frame Synchronous Overlap-Add Growth of Digital Information Created and Replicated MPEG-7 standard PSOLA (Pitch Synchronous Overlap and Add)