Video Table-of-Contents: Construction and Matching Master of Philosophy 3 rd Term Presentation - Presented by Ng Chung Wing.

Slides:



Advertisements
Similar presentations
AVATAR: Advanced Telematic Search of Audivisual Contents by Semantic Reasoning Yolanda Blanco Fernández Department of Telematic Engineering University.
Advertisements

Fast Algorithms For Hierarchical Range Histogram Constructions
Multi-Model Digital Video Library Professor: Michael Lyu Member: Jacky Ma Joan Chung Multi-Model Digital Video Library LYU9904 Multi-Model Digital Video.
LYU0101 Wireless Digital Library on PDA Lam Yee Gordon Yeung Kam Wah Supervisor Prof. Michael Lyu First semester FYP Presentation 2001~2002.
Multiple People Detection and Tracking with Occlusion Presenter: Feifei Huo Supervisor: Dr. Emile A. Hendriks Dr. A. H. J. Stijn Oomes Information and.
A Novel Scheme for Video Similarity Detection Chu-Hong Hoi, Steven March 5, 2003.
Video Summarization Using Mutual Reinforcement Principle and Shot Arrangement Patterns Lu Shi Oct. 4, 2004.
ICIP 2000, Vancouver, Canada IVML, ECE, NTUA Face Detection: Is it only for Face Recognition?  A few years earlier  Face Detection Face Recognition 
Web Categorization Crawler – Part I Mohammed Agabaria Adam Shobash Supervisor: Victor Kulikov Winter 2009/10 Final Presentation Sep Web Categorization.
LYU0101 Wireless Digital Information System Lam Yee Gordon Yeung Kam Wah Supervisor Prof. Michael Lyu Second semester FYP Presentation 2001~2002.
1 Content Based Image Retrieval Using MPEG-7 Dominant Color Descriptor Student: Mr. Ka-Man Wong Supervisor: Dr. Lai-Man Po MPhil Examination Department.
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
LYU0101 Wireless Digital Information System Lam Yee Gordon Yeung Kam Wah Supervisor Prof. Michael Lyu Second semester FYP Presentation 2001~2002.
ADVISE: Advanced Digital Video Information Segmentation Engine
Video summarization by video structure analysis and graph optimization M. Phil 2 nd Term Presentation Lu Shi Dec 5, 2003.
Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,
1 Draft of a Matchmaking Service Chuang liu. 2 Matchmaking Service Matchmaking Service is a service to help service providers to advertising their service.
Multimedia Search and Retrieval Presented by: Reza Aghaee For Multimedia Course(CMPT820) Simon Fraser University March.2005 Shih-Fu Chang, Qian Huang,
Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.
LYU 0102 : XML for Interoperable Digital Video Library Recent years, rapid increase in the usage of multimedia information, Recent years, rapid increase.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System.
Architecture & Data Management of XML-Based Digital Video Library System Jacky C.K. Ma Michael R. Lyu.
Real-time and Retrospective Analysis of Video Streams and Still Image Collections using MPEG-7 Ganesh Gopalan, College of Oceanic and Atmospheric Sciences,
Web Mining Research: A Survey
Video summarization by graph optimization Lu Shi Oct. 7, 2003.
Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang National Central University
1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal Video Conference Archives Indexing System Supervisor: Prof Michael Lyu Presented by: Lewis Ng,
A fuzzy video content representation for video summarization and content-based retrieval Anastasios D. Doulamis, Nikolaos D. Doulamis, Stefanos D. Kollias.
Overview of Search Engines
Information Retrieval in Practice
GMD German National Research Center for Information Technology Innovation through Research Jörg M. Haake Applying Collaborative Open Hypermedia.
DHTML. What is DHTML?  DHTML is the combination of several built-in browser features in fourth generation browsers that enable a web page to be more.
TEMPORAL VIDEO BOUNDARIES -PART ONE- SNUEE KIM KYUNGMIN.
Aurora: A Conceptual Model for Web-content Adaptation to Support the Universal Accessibility of Web-based Services Anita W. Huang, Neel Sundaresan Presented.
Chapter 7 Web Content Mining Xxxxxx. Introduction Web-content mining techniques are used to discover useful information from content on the web – textual.
A Proposal for a Video Modeling for Composing Multimedia Document Cécile ROISIN - Tien TRAN_THUONG - Lionel VILLARD Presented by: Tien TRAN THUONG Project.
Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
2004, 9/1 1 Optimal Content-Based Video Decomposition for Interactive Video Navigation Anastasios D. Doulamis, Member, IEEE and Nikolaos D. Doulamis, Member,
RELATIONAL FAULT TOLERANT INTERFACE TO HETEROGENEOUS DISTRIBUTED DATABASES Prof. Osama Abulnaja Afraa Khalifah
PMLAB Finding Similar Image Quickly Using Object Shapes Heng Tao Shen Dept. of Computer Science National University of Singapore Presented by Chin-Yi Tsai.
Department of Computer Science and Engineering, CUHK 1 Final Year Project 2003/2004 LYU0302 PVCAIS – Personal VideoConference Archives Indexing System.
Non-Photorealistic Rendering and Content- Based Image Retrieval Yuan-Hao Lai Pacific Graphics (2003)
Johannes Kepler University Linz Department of Business Informatics Data & Knowledge Engineering Altenberger Str. 69, 4040 Linz Austria/Europe
1 Applications of video-content analysis and retrieval IEEE Multimedia Magazine 2002 JUL-SEP Reporter: 林浩棟.
Digital Libraries1 David Rashty. Digital Libraries2 “A library is an arsenal of liberty” Anonymous.
Bachelor of Engineering In Image Processing Techniques For Video Content Extraction Submitted to the faculty of Engineering North Maharashtra University,
Search Engine using Web Mining COMS E Web Enhanced Information Mgmt Prof. Gail Kaiser Presented By: Rupal Shah (UNI: rrs2146)
M4 / September Integrating multimodal descriptions to index large video collections M4 meeting – Munich Nicolas Moënne-Loccoz, Bruno Janvier,
SSQSA present and future Gordana Rakić, Zoran Budimac Department of Mathematics and Informatics Faculty of Sciences University of Novi Sad
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
Using Cross-Media Correlation for Scene Detection in Travel Videos.
Soon Joo Hyun Database Systems Research and Development Lab. US-KOREA Joint Workshop on Digital Library t Introduction ICU Information and Communication.
Concepts and Realization of a Diagram Editor Generator Based on Hypergraph Transformation Author: Mark Minas Presenter: Song Gu.
Effective Anomaly Detection with Scarce Training Data Presenter: 葉倚任 Author: W. Robertson, F. Maggi, C. Kruegel and G. Vigna NDSS
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
MMM2005The Chinese University of Hong Kong MMM2005 The Chinese University of Hong Kong 1 Video Summarization Using Mutual Reinforcement Principle and Shot.
MULTIMEDIA DATA MODELS AND AUTHORING
Similarity Measurement and Detection of Video Sequences Chu-Hong HOI Supervisor: Prof. Michael R. LYU Marker: Prof. Yiu Sang MOON 25 April, 2003 Dept.
General Architecture of Retrieval Systems 1Adrienn Skrop.
1 Copyright © 2008, Oracle. All rights reserved. Repository Basics.
GUILLOU Frederic. Outline Introduction Motivations The basic recommendation system First phase : semantic similarities Second phase : communities Application.
DHTML.
Digital Video Library - Jacky Ma.
Supervisor: Prof Michael Lyu Presented by: Lewis Ng, Philip Chan
Video Summarization by Spatial-Temporal Graph Optimization
Ying Dai Faculty of software and information science,
Ying Dai Faculty of software and information science,
Ying Dai Faculty of software and information science,
Ying Dai Faculty of software and information science,
Presentation transcript:

Video Table-of-Contents: Construction and Matching Master of Philosophy 3 rd Term Presentation - Presented by Ng Chung Wing

Outline Overview of Research Previous Work ADVISE Advanced Digital Video Information Segmentation Engine Future Work and Conclusion

Overview of Research Situation A large volume of video contents on the Internet Problems Not enough information to describe the video contents Difficult to search for videos with similar contents

Overview of Research Web-based Video Retrieval System Provides a Video Table-of-Contents to describe the structure of video Applies Tree Matching Algorithms to measure the similarity between videos Allows retrieval of similar videos according to tree matching results

Overview of Research (Cont’d) In the last semester Definition of Video Tree Structure Video Tree Matching Algorithms In this semester ADVISE  Generation of Video Tree Structure  Web-based presentation of the structure as a Video Table-of-Contents In the coming semester Video Retrieval

Review on Previous Work: Video Tree Structure Decompose a video into 5 levels: Video Frames Video Shots Video Groups Video Scenes Whole Video Hierarchical Representation of a Video

Review on Previous Work: Video Tree Structure (Cont’d) Example: Group 1 Group 3 Group 2 Scene 1 Scene 2 Video Shots:

Review on Previous Work: Video Tree Structure (Cont’d) 4 levels tree structure Regarded as: Video Table-of-Contents (V-ToC)

Review on Previous Work: Video Tree Matching Algorithms Measure the similarity of videos Matching on their video tree structures Two approaches: Ordered Tree Matching Algorithm  Constrained by temporal ordering Non-ordered Tree Matching Algorithm  Not constrained by temporal ordering Video feature used Color histograms of video frames

Review on Previous Work: Video Tree Matching Algorithms (Cont’d) Similarities in Algorithm Top-down manner (Scenes, Groups, Shots) Video features extraction at shot level Propagates result from bottom-up Differences in Algorithm Non-ordered Tree Matching Algorithm  Calculates the score of the best match at video, scene and group levels Ordered Tree Matching Algorithm  Calculates the score of the best ordered match using dynamic programming

ADVISE Advanced Digital Video Information Segmentation Engine 3 modules: Generates V-ToC to describe videos Presents V-ToC on the Internet using XML Allows video customization according to the V-ToC with SMIL

ADVISE: Video Structure Construction Video Shots Detection Color histogram based method with weighted regions  5 regional and 1 overall color histograms for each video frame  Catch local color features in video frame  Different weights to regions according to importance 6

ADVISE: Video Structure Construction (Cont’d)  Calculate the frame-to-frame color difference: where Hist i,t (k) denotes the k-th color value in the histogram for region i in frame t. W RDX are weights to regions.

ADVISE: Video Structure Construction (Cont’d)  Find the sudden change in color contents as the video shot boundaries  Need a threshold to determine the shot boundaries:  Not suitable to assign a fixed threshold to different videos Use adaptive threshold

ADVISE: Video Structure Construction (Cont’d)  Employed an entropic thresholding method Divide the frame-to-frame histograms difference into two populations at a threshold point Measure the entropies of the populations Find the maximum sum of two entropies at different threshold point  Distribution of histogram differences: 0Max. Difference Optimal Threshold (with most informative entropies) Shot Breaks Non-Shot Breaks

ADVISE: Video Structure Construction (Cont’d) Video Groups Formation For each shot s, compare its key frame (the first frame) with the key frame from most recent shot in group g.

ADVISE: Video Structure Construction (Cont’d) Shots closer in time are more similar  Consider the temporal factor:  where Temporal ts,tg is the temporal factor. AvgShotLength is the average length of video shots. K is a predefined value which controls the change in the temporal factor

ADVISE: Video Structure Construction (Cont’d) After comparing all groups, we assign the shot to a group if:  Difference is smallest amongst groups  Difference is smaller than the calculated threshold  The shot is temporally not far apart from the group

ADVISE: Video Structure Construction (Cont’d) Video Scenes Formation Construct a continuous video sequence from video groups Video scenes  For each group with the first and the last shots m and n  If m is within a scene, add the group to the scene and extend the scene to n if necessary. Case (i) & (ii)  If m is not within any scene, add it to a new scene Case (iii)

ADVISE: Video Structure Construction (Cont’d) User Interface of implemented system

ADVISE: Video Structure Construction (Cont’d) Experiments: Evaluate 4 different settings of video structure construction  Single color histogram, and fixed threshold  Single color histogram, and adaptive threshold  Weighted regional color histograms, and fixed threshold  Weighted regional color histograms, and adaptive threshold Compare the generated video structure with human judgments

ADVISE: Video Structure Construction (Cont’d) The setting with  Weighted regional color histograms  Adaptive threshold generates the most accurate video structure.

ADVISE: XML Presentation 4 benefits to store the video structure in XML Nested hierarchy  Fit into our video tree structure Plain-text format  Easy to search and modify Extensibility  Available to extend the video structure Application to the Internet

ADVISE: XML Presentation (Cont’d) Defined XML grammar for video structure in DTD DTD for XML Video Tree Structure XML Video Tree Structure

ADVISE: XML Presentation (Cont’d) Web-based presentation of XML using XSL Transformation to HTML Sorting and filtering of XML data at sec

ADVISE: XML Presentation (Cont’d) An example XML presentation V-ToC describes the structure of a video

ADVISE: SMIL Generation Video customization Allow user to pick some video segments that they are interested from the V-ToC SMIL Designed for performing synchronized multimedia presentation on the Internet Use RealPlayer to browse Benefits  Easy to generate because of the XML plain-text property  Dynamically adapt to different network and client condition

ADVISE: SMIL Generation (Cont’d) Defined a SMIL template:... Define the Layout <video src="rtsp:// source video on server" clip-begin="3s" clip-end="15s" region="video" fill="freeze"/> <textstream src="desc.rt" clip-begin="3s" clip-end="15s" region="description" fill="freeze"/> <video src="rtsp:// source video on server" clip-begin="35s" clip-end="50s" region="video" fill="freeze"/> <textstream src="desc.rt" clip-begin="35s" clip-end="50s" region="description" fill="freeze"/>

ADVISE: SMIL Generation (Cont’d) Customized SMIL Video Presentation Script on Web Server 1.Interpret request 2.Select video segments according to XML video structure 3.Generate customized SMIL presentation User Interface for Customization Submit request Return SMIL

Future Work Video retrieval system framework Integrates ADVISE and video tree matching algorithms Explore the capability of using the video tree matching on video retrieval Video clustering Efficient retrieval of video using XML Hierarchy of V-ToC Textual search of video information

Conclusion Overview of the research on video retrieval system Based on the structure of video (V-ToC) Described ADVISE Generates video tree structure (V-ToC) Provides V-ToC in XML as descriptions of videos on the Internet Enables video customization based on V-ToC using SMIL