Automatic Generation of Personalized Music Sports Video ACM MM’2005

Slides:



Advertisements
Similar presentations
Generation of Multimedia TV News Contents for WWW Hsin Chia Fu, Yeong Yuh Xu, and Cheng Lung Tseng Department of computer science, National Chiao-Tung.
Advertisements

A Human-Centered Computing Framework to Enable Personalized News Video Recommendation (Oh Jun-hyuk)
Trajectory Analysis of Broadcast Soccer Videos Computer Science and Engineering Department Indian Institute of Technology, Kharagpur by Prof. Jayanta Mukherjee.
MULTIMEDIA DEVELOPMENT 4.3 : AUTHORING TOOLS. At the end of the lesson, students should be able to: 1. Describe different types of authoring tools Learning.
SmartPlayer: User-Centric Video Fast-Forwarding K.-Y. Cheng, S.-J. Luo, B.-Y. Chen, and H.-H. Chu ACM CHI 2009 (international conference on Human factors.
Automatic Soccer Video Analysis and Summarization
1 A scheme for racquet sports video analysis with the combination of audio-visual information Visual Communication and Image Processing 2005 Liyuan Xing,
Personalized Abstraction of Broadcasted American Football Video by Highlight Selection Noboru Babaguchi (Professor at Osaka Univ.) Yoshihiko Kawai and.
ICME 2008 Huiying Liu, Shuqiang Jiang, Qingming Huang, Changsheng Xu.
ACM Multimedia 2008 Feng Liu 1, Yuhen-Hu 1,2 and Michael Gleicher 1.
ADVISE: Advanced Digital Video Information Segmentation Engine
Segmentation and Event Detection in Soccer Audio Lexing Xie, Prof. Dan Ellis EE6820, Spring 2001 April 24 th, 2001.
 a web site usually maintained by an individual with regular entries of commentary, descriptions of events, or other material such as graphics or video.
John Fennekohl CSC 101. Blogs A blog is a web site, usually maintained by an individual with regular entries of commentary, descriptions of events, or.
Intro Alexei Miagkov: researching GUI networking sound aspects of Java Walter Kammerer: researching networking concepts documenting real-time media concepts.
Support Vector Machine based Logo Detection in Broadcast Soccer Videos Hossam M. Zawbaa Cairo University, Faculty of Computers and Information; ABO Research.
DVMM Lab, Columbia UniversityVideo Event Recognition Video Event Recognition: Multilevel Pyramid Matching Dong Xu and Shih-Fu Chang Digital Video and Multimedia.
History of FIFA The modern football was born 1863 when English football team was founded it.The first FIFA cup was 18 July 1930 on. Over the 25 years.
Multimedia and Interactivity. Interactivity Allows users to manipulate information and to contribute to the story Promotes user involvement and understanding.
Multimedia and Interactivity. Interactivity Allows users to manipulate information and to contribute to the story Promotes user involvement and understanding.
Fundamentals of Game Design, 2 nd Edition by Ernest Adams Chapter 16: Sports Games.
Chapter 11-Multimedia Authoring Tools. Overview Introduction to multimedia authoring tools. Types of authoring tools. Cross-platform authoring notes.
Video Classification By: Maryam S. Mirian
WP5.4 - Introduction  Knowledge Extraction from Complementary Sources  This activity is concerned with augmenting the semantic multimedia metadata basis.
Multimedia Databases (MMDB)
A Generic Virtual Content Insertion System Based on Visual Attention Analysis H. Liu 1, 2, S. Jiang 1, Q. Huang 1, 2, C. Xu 2, 3 1 Institute of Computing.
An Overview of MPEG-21 Cory McKay. Introduction Built on top of MPEG-4 and MPEG-7 standards Much more than just an audiovisual standard Meant to be a.
Web Resources Caroline Pierce CSC 101 Spring 2008.
Player Action Recognition in Broadcast Tennis Video with Applications to Semantic Analysis of Sport Game Guangyu Zhu, Changsheng Xu Qingming Huang, Wen.
An Architecture for Mining Resources Complementary to Audio-Visual Streams J. Nemrava, P. Buitelaar, N. Simou, D. Sadlier, V. Svátek, T. Declerck, A. Cobet,
A Novel Framework for Semantic Annotation and Personalized Retrieval of Sports Video IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 10, NO. 3, APRIL 2008.
Objective Understand digital video production methods, software, and hardware. Course Weight : 15%
1 A Unified Relevance Model for Opinion Retrieval (CIKM 09’) Xuanjing Huang, W. Bruce Croft Date: 2010/02/08 Speaker: Yu-Wen, Hsu.
Tactic Analysis in Football Instructors: Nima Najafzadeh Mahdi Oraei Spring
MULTIMEDIA DEFINITION OF MULTIMEDIA
Day Podcast content is consumed on the user’s personal computers or portable devices 2. Can be used with RSS feeds to be automatically downloaded.
Writing a report for a postgraduate course N. Desypris March 2011.
Research Projects 6v81 Multimedia Database Yohan Jin, T.A.
Levi Smith.  Reading papers  Getting data set together  Clipping videos to form the training and testing data for our classifier  Project separation.
Multimedia By: Marcus Bobian Multimedia period 1.
UFCFS D Technologies for the Web An Introduction to the Module.
Videography: The Basics. Project 2 Timeline Instructional Design Report (January 3rd, Monday) Instructional Design Report (January 3rd, Monday) Final.
Using Webcast Text for Semantic Event Detection in Broadcast Sports Video IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 10, NO. 7, NOVEMBER 2008.
Case Study 1 Semantic Analysis of Soccer Video Using Dynamic Bayesian Network C.-L Huang, et al. IEEE Transactions on Multimedia, vol. 8, no. 4, 2006 Fuzzy.
1 Applications of video-content analysis and retrieval IEEE Multimedia Magazine 2002 JUL-SEP Reporter: 林浩棟.
Creating Streaming Video Clips for Web-based Instruction Jay Cofield, Ph.D. The university of Montevallo July 9, 2002.
Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.
Improved Video Categorization from Text Metadata and User Comments ACM SIGIR 2011:Research and development in Information Retrieval - Katja Filippova -
Date: 2012/11/29 Author: Chen Wang, Keping Bi, Yunhua Hu, Hang Li, Guihong Cao Source: WSDM’12 Advisor: Jia-ling, Koh Speaker: Shun-Chen, Cheng.
Levi Smith Christian Weigandt.  Getting data set together  Clipping videos to form the training and testing data for our classifier  Looking at code.
Soon Joo Hyun Database Systems Research and Development Lab. US-KOREA Joint Workshop on Digital Library t Introduction ICU Information and Communication.
Web Design, 5 th Edition 3 Planning a Successful Website: Part 1.
MMM2005The Chinese University of Hong Kong MMM2005 The Chinese University of Hong Kong 1 Video Summarization Using Mutual Reinforcement Principle and Shot.
Narration/dialogue: Camera motion: Video effect: Audio effect: Shot duration: Transition to next scene: Storyboard Panel #
Trajectory-Based Ball Detection and Tracking with Aid of Homography in Broadcast Tennis Video Xinguo Yu, Nianjuan Jiang, Ee Luang Ang Present by komod.
Event Tactic Analysis Based on Broadcast Sports Video Guangyu Zhu, Changsheng Xu, Senior Member, IEEE, Qingming Huang, Member, IEEE, Yong Rui, Senior Member,
Digital Video Library - Jacky Ma.
Visual Information Retrieval
  A preliminary study: Perceptions of aviation maintenance students related to the use of Augmented Reality maintenance instructions Amadou Anne, Yu Wang.
ROBUST FACE NAME GRAPH MATCHING FOR MOVIE CHARACTER IDENTIFICATION
CHAPTER 8 Multimedia Authoring Tools
                      Digital Audio 1.
An Overview of MPEG-21 Cory McKay.
A User Attention Based Visible Watermarking Scheme
Football Video Segmentation Based on Video Production Strategy
Multimedia Information Retrieval
Presented by: Cynthia Balderas
A maximum likelihood estimation and training on the fly approach
IEEE bc Use Case Document
Soccer Analyzer Introduction to Computational and Biological Vision
Presentation transcript:

Automatic Generation of Personalized Music Sports Video ACM MM’2005 Jinjun Wang, Changshenf Xu, Engsiong Chng, Lingyu Duan, Kongwah Wan, Qi Tian 2018/12/9 by pj

Outline Introduction System framework Video content selection 3.1 Video/Audio analysis 3.2 Text analysis 3.3 Align the text event with A/V stream Automatic video composition Experiment results 2018/12/9 by pj

Introduction Sports broadcasting is more and more popular. One major advantage of digital broadcast is the possibility of delivering customized and interactive TV programs. 2018/12/9 by pj

Introduction However, current production of Music Video is very labor-intensive and inflexible. 無法自動化產生,要有專人剪接。 不能符合不同使用者的愛好。Ex. 喜歡看特定球員、球隊。 2018/12/9 by pj

Introduction The two challenge for automatic generation of personalized MSV is: Semantic sports video content selection by “event” – goal, injury, card, … “player/team” – Ronaldo, Germany, … ”topic” – the happiness of teams when winning, … Automatic video composition 2018/12/9 by pj

Introduction The contributions of this paper The use of TWB(text web broadcast) improves event detection Enable sports video content selection by player/team Align TWB text event with video event Propose video-centric and music-centric schemes to automatically generate MSV(music sport video). 2018/12/9 by pj

System framework 2018/12/9 by pj

3.1 Video/Audio analysis 3.1 Video/Audio analysis Shot boundary selection(F1) Shot is a basic analysis unit Using M2-Edit Pro software 2018/12/9 by pj

3.1 Video/Audio analysis 3.1 Video/Audio analysis Semantic shot classification(F2) The shots transition reveals the state of the gtame far view, in-field medium view, in-field close-up view, out-field medium view, out-field close-up view Reference “Soccer replay detection using scene transition structure analysis”, ICASSP, March2005 (J. Wang) 2018/12/9 by pj

3.1 Video/Audio analysis 3.1 Video/Audio analysis Replay detection(F3) The director launch a replay for interest event Detect flying logo, slow motion. Nowadays, above 95% broadcast sports video use flying-logo to launch replays. 2018/12/9 by pj

3.1 Video/Audio analysis 3.1 Video/Audio analysis Camera motion(F4) The camera motion provides a useful cue to represent the activity of the game “average motion magnitude”, “motion entropy”, “dominant motion direction”, “camera pan/tilt/zoom factor” Reference “Automatic replay generation for soccer video broadcasting”, ACM MM’04. ( J.Wang) 2018/12/9 by pj

3.1 Video/Audio analysis 3.1 Video/Audio analysis Audio keyword(F5) There are some signi¯cant game-speci¯c sounds that have strong relationships to the action of players, referees, commentators and audience in sports videos. “whistle”, “acclaim”, “noise” Reference “Automatic replay generation for soccer video broadcasting”, ACM MM’04. ( J.Wang) 2018/12/9 by pj

3.2 Text analysis 3.2 Text analysis It can increase the accuracy of video event detection It can detect “red/yellow card”, “player”, … event player time team 2018/12/9 by pj

3.2 Text analysis 3.2 Text analysis Keyword definition 2018/12/9 by pj

3.2 Text analysis 3.2 Text analysis Text event detection Keyword might have different apperance. “goal”, “g-o-a-l”, “gooooaaaaal” The software dtSearch supports fuzzy – 可以漏字, ex gooal v.s goal stemming – 文法變化, ex foul v.s fouling phonic – 聽起來像的單字, ex smith v.s smithe Player/team extraction 一開始要建好 database,用來做 string matching。 2018/12/9 by pj

3.3 Align the text event with A/V stream Q: The inaccuracy of the time-stamp in TWB abouts 2-3 minutes. S: 2018/12/9 by pj

3.3 Align the text event with A/V stream HMM Maximum evaluate function Weight: wn = 0.2, we= 0.8 G(M): shot count for different events. By training 2018/12/9 by pj

Automatic video composition 4.1 Video-Centric MSV In our implementation for this scheme, personalized video contents are first selected from the prepared video content selection pool in chronological order and multiplexed with music clips. For the video-centric case, there is no necessary to align the video shot boundaries with music structures boundaries. 2018/12/9 by pj

Automatic video composition 4.2 Music-Centric MSV Analyzing the semantic music structure Semantic: Intro前奏, Verse主歌, Chrous副歌, Ending, Bridge過門音樂 Reference “Content-based music structure analysis with applications to music semantics understanding” ACM MM’04 2018/12/9 by pj

Automatic video composition 4.2 Music-Centric MSV Content matching Far view for “Intro”, closeup view for “Chorus”, … User define Tempo matching Hence the tempo matching module performs the alignment between shot boundaries and music structure boundaries. 2018/12/9 by pj

Automatic video composition 4.2 Music-Centric MSV Select: 符合 user defined rules Event 和 music 的 duration & motion 相差不多的 Evaluate function T’ = [duration motion] 所以相差越少, v越大 符合rule? 0 or 1? event i 有 k 個shots 2018/12/9 by pj

Experiment results Dataset: Accuracy of A/V and text alignment 7 World-Cup 2002, 4 Euro-Cup 2004 About 16 hours Accuracy of A/V and text alignment Boundary decision accuracy 不好的原因: (1) The error of A/V feature. (2) Inaccuracy of TWB timestamp. 2018/12/9 by pj

Experiment results 主題清不清楚 夠不夠簡潔 能不能代表original video This is mainly because our current system is unable to identify whether every single shot in an event is related to the required player/team or not. video 跟 music配合的好不好 2018/12/9 by pj

Experiment results 分數不高的原因: (1) Our music-centric MSV contains several event types which makes it difficult to understand and thus lowering the Clarity score. (2) Because of the requirement to match the music boundary, shots within an event is sometimes discarded. 2018/12/9 by pj