MMM2005The Chinese University of Hong Kong MMM2005 The Chinese University of Hong Kong 1 Video Summarization Using Mutual Reinforcement Principle and Shot.

Slides:



Advertisements
Similar presentations
A Comparison of Implicit and Explicit Links for Web Page Classification Dou Shen 1 Jian-Tao Sun 2 Qiang Yang 1 Zheng Chen 2 1 Department of Computer Science.
Advertisements

How to Write a Review Article
Processing XML Keyword Search by Constructing Effective Structured Queries Jianxin Li, Chengfei Liu, Rui Zhou and Bo Ning Swinburne University of Technology,
A New Scheme For Robust Blind Digital Video Watermarking Supervised by Prof. LYU, Rung Tsong Michael Presented by Chan Pik Wah, Pat Mar 5, 2002 Department.
Patch to the Future: Unsupervised Visual Prediction
GENERATING AUTOMATIC SEMANTIC ANNOTATIONS FOR RESEARCH DATASETS AYUSH SINGHAL AND JAIDEEP SRIVASTAVA CS DEPT., UNIVERSITY OF MINNESOTA, MN, USA.
Personalized Abstraction of Broadcasted American Football Video by Highlight Selection Noboru Babaguchi (Professor at Osaka Univ.) Yoshihiko Kawai and.
CIKM’2008 Presentation Oct. 27, 2008 Napa, California
A Novel Scheme for Video Similarity Detection Chu-Hong Hoi, Steven March 5, 2003.
Video Summarization Using Mutual Reinforcement Principle and Shot Arrangement Patterns Lu Shi Oct. 4, 2004.
Relevance Feedback Content-Based Image Retrieval Using Query Distribution Estimation Based on Maximum Entropy Principle Irwin King and Zhong Jin Nov
Funding Networks Abdullah Sevincer University of Nevada, Reno Department of Computer Science & Engineering.
Video Table-of-Contents: Construction and Matching Master of Philosophy 3 rd Term Presentation - Presented by Ng Chung Wing.
1 Content Analysis and Summarization for Video Documents Oral Defense for the degree of Master of Philosophy Presented by Lu Shi Committee Members Prof.
ADVISE: Advanced Digital Video Information Segmentation Engine
ACM Multimedia th Annual Conference, October , 2004
Video summarization by video structure analysis and graph optimization M. Phil 2 nd Term Presentation Lu Shi Dec 5, 2003.
Supervised by Prof. LYU, Rung Tsong Michael Department of Computer Science & Engineering The Chinese University of Hong Kong Prepared by: Chan Pik Wah,
Multimedia Search and Retrieval Presented by: Reza Aghaee For Multimedia Course(CMPT820) Simon Fraser University March.2005 Shih-Fu Chang, Qian Huang,
Relevance Feedback based on Parameter Estimation of Target Distribution K. C. Sia and Irwin King Department of Computer Science & Engineering The Chinese.
Feature vs. Model Based Vocal Tract Length Normalization for a Speech Recognition-based Interactive Toy Jacky CHAU Department of Computer Science and Engineering.
1 Integrating User Feedback Log into Relevance Feedback by Coupled SVM for Content-Based Image Retrieval 9-April, 2005 Steven C. H. Hoi *, Michael R. Lyu.
Video summarization by graph optimization Lu Shi Oct. 7, 2003.
Domain Testing Based on Character String Predicate Ruilian Zhao Computer Science Dept. Beijing University of Chemical Technology Michael R. Lyu Computer.
1 PageSim: A Link-based Similarity Measure for the World Wide Web Zhenjiang Lin, Irwin King, and Michael, R., Lyu Computer Science & Engineering, The Chinese.
A fuzzy video content representation for video summarization and content-based retrieval Anastasios D. Doulamis, Nikolaos D. Doulamis, Stefanos D. Kollias.
Relevance Feedback Content-Based Image Retrieval Using Query Distribution Estimation Based on Maximum Entropy Principle Irwin King and Zhong Jin The Chinese.
6 Systems Analysis and Design in a Changing World, Fourth Edition.
Information Retrieval in Practice
Query session guided multi- document summarization THESIS PRESENTATION BY TAL BAUMEL ADVISOR: PROF. MICHAEL ELHADAD.
Chapter 6: The Traditional Approach to Requirements
System Analysis Overview Document functional requirements by creating models Two concepts help identify functional requirements in the traditional approach.
SIGIR’09 Boston 1 Entropy-biased Models for Query Representation on the Click Graph Hongbo Deng, Irwin King and Michael R. Lyu Department of Computer Science.
Temporal Event Map Construction For Event Search Qing Li Department of Computer Science City University of Hong Kong.
2008/06/06 Y.H.Chang Towards Effective Browsing of Large Scale Social Annotations1 Towards Effective Browsing of Large Scale Social Annotations WWW 2007.
We introduce the use of Confidence c as a weighted vote for the voting machine to avoid low confidence Result r of individual expert from affecting the.
Università degli Studi di Modena and Reggio Emilia Dipartimento di Ingegneria dell’Informazione Prototypes selection with.
Table 3:Yale Result Table 2:ORL Result Introduction System Architecture The Approach and Experimental Results A Face Processing System Based on Committee.
Incident Threading for News Passages (CIKM 09) Speaker: Yi-lin,Hsu Advisor: Dr. Koh, Jia-ling. Date:2010/06/14.
April 14, 2003Hang Cui, Ji-Rong Wen and Tat- Seng Chua 1 Hierarchical Indexing and Flexible Element Retrieval for Structured Document Hang Cui School of.
Dimitrios Skoutas Alkis Simitsis
Understanding The Semantics of Media Chapter 8 Camilo A. Celis.
You Are What You Tag Yi-Ching Huang and Chia-Chuan Hung and Jane Yung-jen Hsu Department of Computer Science and Information Engineering Graduate Institute.
Binxing Jiao et. al (SIGIR ’10) Presenter : Lin, Yi-Jhen Advisor: Dr. Koh. Jia-ling Date: 2011/4/25 VISUAL SUMMARIZATION OF WEB PAGES.
A Scalable Machine Learning Approach for Semi-Structured Named Entity Recognition Utku Irmak(Yahoo! Labs) Reiner Kraft(Yahoo! Inc.) WWW 2010(Information.
C. Lawrence Zitnick Microsoft Research, Redmond Devi Parikh Virginia Tech Bringing Semantics Into Focus Using Visual.
Exploit of Online Social Networks with Community-Based Graph Semi-Supervised Learning Mingzhen Mo and Irwin King Department of Computer Science and Engineering.
Date : 2013/03/18 Author : Jeffrey Pound, Alexander K. Hudek, Ihab F. Ilyas, Grant Weddell Source : CIKM’12 Speaker : Er-Gang Liu Advisor : Prof. Jia-Ling.
2005/12/021 Content-Based Image Retrieval Using Grey Relational Analysis Dept. of Computer Engineering Tatung University Presenter: Tienwei Tsai ( 蔡殿偉.
1 A Web Search Engine-Based Approach to Measure Semantic Similarity between Words Presenter: Guan-Yu Chen IEEE Trans. on Knowledge & Data Engineering,
Summarizing Encyclopedic Term Descriptions on the Web from Coling 2004 Atsushi Fujii and Tetsuya Ishikawa Graduate School of Library, Information and Media.
2004 謝俊瑋 NTU, CSIE, CMLab 1 A Rule-Based Video Annotation System Andres Dorado, Janko Calic, and Ebroul Izquierdo, Senior Member, IEEE.
Multi-level Bootstrapping for Extracting Parallel Sentence from a Quasi-Comparable Corpus Pascale Fung and Percy Cheung Human Language Technology Center,
Hongbo Deng, Michael R. Lyu and Irwin King
Semantic Extraction and Semantics-Based Annotation and Retrieval for Video Databases Authors: Yan Liu & Fei Li Department of Computer Science Columbia.
Event-Based Extractive Summarization E. Filatova and V. Hatzivassiloglou Department of Computer Science Columbia University (ACL 2004)
6 Systems Analysis and Design in a Changing World, Fourth Edition.
Analyzing and Predicting Question Quality in Community Question Answering Services Baichuan Li, Tan Jin, Michael R. Lyu, Irwin King, and Barley Mak CQA2012,
Similarity Measurement and Detection of Video Sequences Chu-Hong HOI Supervisor: Prof. Michael R. LYU Marker: Prof. Yiu Sang MOON 25 April, 2003 Dept.
哈工大信息检索研究室 HITIR ’ s Update Summary at TAC2008 Extractive Content Selection Using Evolutionary Manifold-ranking and Spectral Clustering Reporter: Ph.d.
How to Write a Review Article
Visual Information Retrieval
Designing Cross-Language Information Retrieval System using various Techniques of Query Expansion and Indexing for Improved Performance  Hello everyone,
Semantic Processing with Context Analysis
WSRec: A Collaborative Filtering Based Web Service Recommender System
Video Summarization by Spatial-Temporal Graph Optimization
Multimedia Information Retrieval
Ying Dai Faculty of software and information science,
Ying Dai Faculty of software and information science,
Ying Dai Faculty of software and information science,
Presentation transcript:

MMM2005The Chinese University of Hong Kong MMM2005 The Chinese University of Hong Kong 1 Video Summarization Using Mutual Reinforcement Principle and Shot Arrangement Patterns Shi Lu, Michael R. Lyu and Irwin King {slu, lyu, Department of computer science and Engineering The Chinese University of Hong Kong Shatin N.T. Hong Kong Jan. 12, 2005

MMM 2005The Chinese University of Hong Kong MMM 2005 The Chinese University of Hong Kong 2 Outline Introduction Background and motivation Goals The Proposed Method Video structure analysis Video shot arrangement patterns Mutual reinforcement principle Video skim selection Experiment results Conclusion

MMM 2005The Chinese University of Hong Kong MMM 2005 The Chinese University of Hong Kong 3 Background and Motivation Huge volume of video data are distributed over the Web Browsing and managing the huge video database are time consuming Video summarization helps the user to quickly grasp the content of a video Two kinds of applications: Dynamic video skimming Static video summary We mainly focus on generating dynamic video skimming for movies

MMM 2005The Chinese University of Hong Kong MMM 2005 The Chinese University of Hong Kong 4 Goals Goals for video summarization Conciseness  Given the target length of the video skim Content coverage  Visual diversity and temporal coverage  Balanced structural coverage Visual coherence

MMM 2005The Chinese University of Hong Kong MMM 2005 The Chinese University of Hong Kong 5 Workflow

MMM 2005The Chinese University of Hong Kong MMM 2005 The Chinese University of Hong Kong 6 Video Structure Video narrates a story just like an article does Video (story) Video scenes (paragraph) Video shot groups Video shots (sentence) Video frames Can be built from bottom to up

MMM 2005The Chinese University of Hong Kong MMM 2005 The Chinese University of Hong Kong 7 Video Scene Formation Loop scenes and progressive scenes Group the visually similar video shots into groups  ToC method by Y. Rui, et al  Spectral graph partitioning by J. B. Shi, et al Intersected groups forms loop scenes Loop scenes depict an event happened at a place Progressive scenes: “ transition ” between events or dynamic events Summarize each video scene respectively

MMM 2005The Chinese University of Hong Kong MMM 2005 The Chinese University of Hong Kong 8 Video Scene Analysis Scene importance: length and complexity Content entropy for loop scenes Measure the complexity for a loop scene For progressive scenes, we only consider its length Length of a member video shot group Total length of the video scene

MMM 2005The Chinese University of Hong Kong MMM 2005 The Chinese University of Hong Kong 9 Skim Length Distribution Determine each video scene ’ s target skim length, given Determine each progressive scenes ’ skim length  If, discard it, else Determine each loop scenes ’ skim length  If,discard it  Redistribute to remaining scenes

MMM 2005The Chinese University of Hong Kong MMM 2005 The Chinese University of Hong Kong 10 Shot Arrangement Patterns The way the director arrange the video shots conveys his intention For each scene, video shot group labels form a string (e.g …… ) K-Non-Repetitive String (k-nrs) Minimal content redundancy and visually coherent — good video skim candidates String coverage {3124} covers {312,124,31,12,24,3,1,2,4}

MMM 2005The Chinese University of Hong Kong MMM 2005 The Chinese University of Hong Kong 11 Shot Arrangement Patterns Several detected nrs strings

MMM 2005The Chinese University of Hong Kong MMM 2005 The Chinese University of Hong Kong 12 Video Semantics Low level features and high level concepts: semantic gap Summary based on low level features is not able to ensure the perceived quality Solution: obtain video semantic information by manual/semi-automatic annotation Usage: Retrieval Summary

MMM 2005The Chinese University of Hong Kong MMM 2005 The Chinese University of Hong Kong 13 Video Semantics Concept representation for a video shot The most popular question: who has done what? The two major contexts: who, what action Concept term and video shot description (user editable and reusable)

MMM 2005The Chinese University of Hong Kong MMM 2005 The Chinese University of Hong Kong 14 Video Semantics Concept term and video shot description Term (key word): denote an entity, e.g. “ Joe ”, “ talking ”, “ in the bank ” Context: “ who ”, “ what action ”… Shot description: the set comprising all the concept terms that is related to the shot Obtained by semi-automatic or video annotation

MMM 2005The Chinese University of Hong Kong MMM 2005 The Chinese University of Hong Kong 15 Mutual Reinforcement How to measure the priority for a set of concept terms and a set of descriptions? A more important description should contain more important terms; A more important term should be contained by more important descriptions Mutual reinforcement principle

MMM 2005The Chinese University of Hong Kong MMM 2005 The Chinese University of Hong Kong 16 Mutual Reinforcement Let W be the weight matrix describes the relationship between the term set and shot description set (elements in W can have various definitions, e.g. the number of occurrence of a term in a description) Let U,V be the vector of the importance value of the concept term set and video shot description set We have Where and are constants. U and V can be calculated by SVD of W

MMM 2005The Chinese University of Hong Kong MMM 2005 The Chinese University of Hong Kong 17 Mutual Reinforcement For each semantic context: We choose the singular vectors correspond to W ’ s largest singular value as the importance vector for concept terms and sentences Since W is non-negative, the first singular vector V will be non-negative

MMM 2005The Chinese University of Hong Kong MMM 2005 The Chinese University of Hong Kong 18 Mutual Reinforcement Importance calculation on 76 video shots Based on context “ who ”

MMM 2005The Chinese University of Hong Kong MMM 2005 The Chinese University of Hong Kong 19 Video Summarization Based on the result of mutual reinforcement, we can determine the relational priority between video shots The generated skim can ensure the semantic contents coverage

MMM 2005The Chinese University of Hong Kong MMM 2005 The Chinese University of Hong Kong 20 Video Skim Selection Input: the decomposed nrs string set from a scene and the importance scores do Select the most important k-nrs string into the skim shot set Remove those nrs strings from the original set covered by the selected string Until the target skim length is reached

MMM 2005The Chinese University of Hong Kong MMM 2005 The Chinese University of Hong Kong 21 Video Skim Selection

MMM 2005The Chinese University of Hong Kong MMM 2005 The Chinese University of Hong Kong 22 Evaluation Subjective experiment:15 people were invited to watch video skims generated from 4 videos with skim rate 0.15 and 0.30 Questions about main actors and key events: Who has done What? (Meaningfulness score) Which skim looks better? (favorite score) Compared with our previous graph based algorithm Achieve better coherency

MMM 2005The Chinese University of Hong Kong MMM 2005 The Chinese University of Hong Kong 23 Summary A novel dynamic video summarization method is proposed Video structure analysis  Determine video scene boundaries  Analyze the shot arrangement patterns  Scene complexity and target skim length Mutual reinforcement  Utilizing the semantic information  An importance measure for video shot patterns Video skim selection