Football Video Segmentation Based on Video Production Strategy

Slides:

Advertisements

Similar presentations

Generation of Multimedia TV News Contents for WWW Hsin Chia Fu, Yeong Yuh Xu, and Cheng Lung Tseng Department of computer science, National Chiao-Tung.

Advertisements

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki

Trajectory Analysis of Broadcast Soccer Videos Computer Science and Engineering Department Indian Institute of Technology, Kharagpur by Prof. Jayanta Mukherjee.

SmartPlayer: User-Centric Video Fast-Forwarding K.-Y. Cheng, S.-J. Luo, B.-Y. Chen, and H.-H. Chu ACM CHI 2009 (international conference on Human factors.

Finding Structure in Home Videos by Probabilistic Hierarchical Clustering Daniel Gatica-Perez, Alexander Loui, and Ming-Ting Sun.

Visual Event Detection & Recognition Filiz Bunyak Ersoy, Ph.D. student Smart Engineering Systems Lab.

1 A scheme for racquet sports video analysis with the combination of audio-visual information Visual Communication and Image Processing 2005 Liyuan Xing,

Personalized Abstraction of Broadcasted American Football Video by Highlight Selection Noboru Babaguchi (Professor at Osaka Univ.) Yoshihiko Kawai and.

Broadcast News Parsing Using Visual Cues: A Robust Face Detection Approach Yannis Avrithis, Nicolas Tsapatsoulis and Stefanos Kollias Image, Video & Multimedia.

ICME 2008 Huiying Liu, Shuqiang Jiang, Qingming Huang, Changsheng Xu.

1 CS 430: Information Discovery Lecture 22 Non-Textual Materials 2.

LYU0103 Speech Recognition Techniques for Digital Video Library Supervisor : Prof Michael R. Lyu Students: Gao Zheng Hong Lei Mo.

Multimedia Search and Retrieval: New Concepts, System Implementation, and Application Qian Huang, Atul Puri, Zhu Liu IEEE TRANSACTION ON CIRCUITS AND SYSTEMS.

ADVISE: Advanced Digital Video Information Segmentation Engine

Segmentation and Event Detection in Soccer Audio Lexing Xie, Prof. Dan Ellis EE6820, Spring 2001 April 24 th, 2001.

Multimedia Search and Retrieval Presented by: Reza Aghaee For Multimedia Course(CMPT820) Simon Fraser University March.2005 Shih-Fu Chang, Qian Huang,

Presentation Outline  Project Aims  Introduction of Digital Video Library  Introduction of Our Work  Considerations and Approach  Design and Implementation.

LYU 0102 : XML for Interoperable Digital Video Library Recent years, rapid increase in the usage of multimedia information, Recent years, rapid increase.

Visual Information Retrieval Chapter 1 Introduction Alberto Del Bimbo Dipartimento di Sistemi e Informatica Universita di Firenze Firenze, Italy.

Presented by Zeehasham Rasheed

LYU0103 Speech Recognition Techniques for Digital Video Library Supervisor : Prof Michael R. Lyu Students: Gao Zheng Hong Lei Mo.

A fuzzy video content representation for video summarization and content-based retrieval Anastasios D. Doulamis, Nikolaos D. Doulamis, Stefanos D. Kollias.

Jacinto C. Nascimento, Member, IEEE, and Jorge S. Marques

Low-level Motion Activity Features for Semantic Characterization of Video Kadir A. Peker, A. Aydin Alatan, Ali N. Akansu International Conference on Multimedia.

Real Time Abnormal Motion Detection in Surveillance Video Nahum Kiryati Tammy Riklin Raviv Yan Ivanchenko Shay Rochel Vision and Image Analysis Laboratory.

Digital Forensics Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #12 Intelligent Digital Forensics September 30, 2009.

Using Multimedia on the Web

CONTI’2008, 5-6 June 2008, TIMISOARA 1 Towards a digital content management system Gheorghe Sebestyen-Pal, Tünde Bálint, Bogdan Moscaliuc, Agnes Sebestyen-Pal.

TEMPORAL VIDEO BOUNDARIES -PART ONE- SNUEE KIM KYUNGMIN.

Multimedia Databases (MMDB)

© 2011 The McGraw-Hill Companies, Inc. All rights reserved Chapter 6: Video.

Player Action Recognition in Broadcast Tennis Video with Applications to Semantic Analysis of Sport Game Guangyu Zhu, Changsheng Xu Qingming Huang, Wen.

Università degli Studi di Modena and Reggio Emilia Dipartimento di Ingegneria dell’Informazione Prototypes selection with.

CS654: Digital Image Analysis Lecture 3: Data Structure for Image Analysis.

DIGITAL Video. Video Creation Video captures the real world therefore video cannot be created in the same sense that images can be created video must.

MULTIMEDIA DEFINITION OF MULTIMEDIA

MUMT611: Music Information Acquisition, Preservation, and Retrieval Presentation on Timbre Similarity Alexandre Savard March 2006.

Understanding The Semantics of Media Chapter 8 Camilo A. Celis.

Stylization and Abstraction of Photographs Doug Decarlo and Anthony Santella.

Levi Smith.  Reading papers  Getting data set together  Clipping videos to form the training and testing data for our classifier  Project separation.

PSEUDO-RELEVANCE FEEDBACK FOR MULTIMEDIA RETRIEVAL Seo Seok Jun.

Case Study 1 Semantic Analysis of Soccer Video Using Dynamic Bayesian Network C.-L Huang, et al. IEEE Transactions on Multimedia, vol. 8, no. 4, 2006 Fuzzy.

1 Applications of video-content analysis and retrieval IEEE Multimedia Magazine 2002 JUL-SEP Reporter: 林浩棟.

Data and Applications Security Developments and Directions Dr. Bhavani Thuraisingham The University of Texas at Dallas Lecture #15 Secure Multimedia Data.

Image and Video Retrieval INST 734 Doug Oard Module 13.

Digital Video Library Network Supervisor: Prof. Michael Lyu Student: Ma Chak Kei, Jacky.

MMM2005The Chinese University of Hong Kong MMM2005 The Chinese University of Hong Kong 1 Video Summarization Using Mutual Reinforcement Principle and Shot.

MULTIMEDIA DATA MODELS AND AUTHORING

[1] National Institute of Science & Technology Technical Seminar Presentation 2004 Suresh Chandra Martha National Institute of Science & Technology Audio.

Visual Information Retrieval

Designing Cross-Language Information Retrieval System using various Techniques of Query Expansion and Indexing for Improved Performance Hello everyone,

Automatic Video Shot Detection from MPEG Bit Stream

Introduction Multimedia initial focus

Presenter: Ibrahim A. Zedan

Saliency-guided Video Classification via Adaptively weighted learning

Multimedia Content-Based Retrieval

CHAPTER 8 Multimedia Authoring Tools

Chapter 6: Video.

User-Oriented Approach in Spatial and Temporal Domain Video Coding

An Enhanced Support Vector Machine Model for Intrusion Detection

Overview What is Multimedia? Characteristics of multimedia

Unit 02 – Creative Multimedia - DA202

A User Attention Based Visible Watermarking Scheme

Automatic Generation of Personalized Music Sports Video ACM MM’2005

Course: Module: Lesson # & Name Instructional Material 1 of 32 Lesson Delivery Mode: Lesson Duration: Document Name: 1. Professional Diploma in ERP Systems.

Multimedia (CoSc4151) Chapter One : Introduction to Multimedia

Dr. Debaleena Chattopadhyay Department of Computer Science

Digital Video Faraz Khan.

Presentation transcript:

Football Video Segmentation Based on Video Production Strategy Authors: Reede Ren and Joemon M. Jose Conference paper 2005 Noureddine Ghoggali 12/9/2018 Noureddine Ghoggali

Attack structure definition Attack structure detection Data set Outline Introduction Goal of the paper Methodology Attack structure definition Attack structure detection Data set Application Conclusions 12/9/2018 Noureddine Ghoggali

Introduction The amount of audiovisual information available in digital format has grown exponentially in recent years. Gigabytes of new images, audio and video clips are generated and stored everyday. This has led to a huge distributed and mostly unstructured repository of multimedia information. In order to realize the full potential of these databases, tools for automated indexing and intelligent search engines are urgently needed. Indeed, image and video cataloguing, indexing and retrieval are the subject of active research in industry and academia across the world. This reflects the commercial importance of such technology. The aim of this presentation is to give an overview of Football video segmentation. 12/9/2018 Noureddine Ghoggali

Basic concepts Cognitively, the predominant feature in video is its higher-level temporal structure. People are unable to perceive millions of individual frames, but they can perceive episodes, scenes, and moving objects. A scene in a video is a sequence of frames that are considered to be semantically consistent. Scene changes therefore demarcate changes in semantic context. Segmenting a video into its constituent scenes permits it to be accessed in terms of meaningful units. A video is physically formed by shots and semantically described by scenes. shot is a sequence of frames representing continuous action in time and space. scene is a story unit and consists of a sequence of connected or unconnected shots. 12/9/2018 Noureddine Ghoggali

Basic concepts Most of the current research efforts are devoted to shot-based video segmentation. Algorithms for scene change detection can be classified according to the features used for processing into uncompressed and compressed domain algorithms. Temporal segmentation is the process of decomposing video streams into these syntactic elements. Shots are a sequence of frames recorded continuously by one camera and scenes are composed of a small number of interrelated shots that are unified by a given event. 12/9/2018 Noureddine Ghoggali

Video production strategy Video Production Strategy in Football Broadcasting A football game is made up by a series of team movements called attack in sports jargon. They are mostly independent and sorted by time throughout the game. In some sense, attack decides broadcasting strategy. During broad-casting, video reporters focus on two issues, how to record the game or attack how to avoid missing interesting issues in an attack. They employed view to describe team tactics and middle view or close-up view to catch players detailed movement. When an important event or highlight takes place, such as goal, it will be replayed. 12/9/2018 Noureddine Ghoggali

Video production strategy The strategy can be stated as following, When an attack begins, a global view will be used until the ball passes the centre circle. When the ball comes into front field, a middle view is going to be employed to show how groups of players attack and defend. When the ball come into or close to the penalty area, a close-up view is here to catch possible highlights and players action in detail. When there is a highlight, such as shoot and foul, a close-up slow motion replay will come to state the event. 12/9/2018 Noureddine Ghoggali

Video production strategy With these observations, we conjecture: Video making methods in football game dictate the structure of video and compose semantics. As a time sequence, a football game can be modeled by Hidden Markov Model with attack video structure. Attack is an independent semantic video unit, which can be treated as a scene in football video domain. 12/9/2018 Noureddine Ghoggali

Goal of paper Parse continuous video stream into a sequence of ‘attack’. Set up a hierarchical video content index to summarize the game and allow a non linear navigation of video content. 12/9/2018 Noureddine Ghoggali

Methodology 12/9/2018 Noureddine Ghoggali

Attack definition Attack takes the role of scene in the framework. To detect it, the authors define a new video structure layer between shot and attack It includes four mutually exclusive video structures in broadcasting video data (play, focus, replay and break). Play: video makers convey global status of the game and employ long and medium shots or field view . Focus: short stop of game, in which the video maker traces a player to show his or her detailed actions. Replay: is for slow-replays. Break : includes non-game video clips. 12/9/2018 Noureddine Ghoggali

Attack structure detection A two pass classification is proposed to identifies video structures and labels video frames with their video structure type: play, focus, replay and break by a GMM classifier and its output is smoothed by dynamic programming process Second pass detects replay. Replay is a slow motion video clip sandwiched by editing effects. 12/9/2018 Noureddine Ghoggali

Attack structure detection Three salient features are computed for the first pass GMM classification to parse video in attack structure Field ratio: play field area ratio over image. Zoom size: to detect football uniform. Edit effect detection: detecting logos. 12/9/2018 Noureddine Ghoggali

Attack structure detection 12/9/2018 Noureddine Ghoggali

Attack structure detection Field ratio:  its reasonable to expect grass color to be the dominent color. 12/9/2018 Noureddine Ghoggali

Attack structure detection Zoom size: Football uniform is an oblivious domain feature compared with human face. It has the following merits: its with bright color and special pattern it associates with the appearance of player only. It is rotation robust. 12/9/2018 Noureddine Ghoggali

Attack structure detection Zoom size: use the Foley-Sammon Transform algorithm to build the training set Edit effect detection: 12/9/2018 Noureddine Ghoggali

Attack scene construction After attack structure classification, we get the video structure label sequence "...BPFPFPRP...", where B is the abridgement for `Break', P for `Play', F for `Focus' and R for `Replay'. The string records the process of video making and keeps the information of àttack'. So the job of àttack' construction is to divide it into a serials of substrings, which contain only one àttack' sequence each. But the string is too long for the attack. 12/9/2018 Noureddine Ghoggali

Attack scene construction 12/9/2018 Noureddine Ghoggali

Experiments Data Set: The data set includes two MPEG-1 broadcasting videos in World Cup 2002, the final game and the one Japan vs Turkey. It is about 320 minutes, containing interview, celebration and commercial clips. Both games are divided into halves, Final I, Final II, Japan-Turkey I and Japan-Turkey II. The first half of Japan-Turkey and final game are labelled manually to set up ground truth. 4535 play frames (33.7%), 4253 focus frames(31.6%) and 4674 `break' frames (34.6%). There are 33 replay in the final game and 34 in the Japan-Turkey game. Training set: includes 2000 frames 400 from play 1000 from focus, 600 from break, which are randomly selected from marked samples. Remaining frames are kept for test. The grass hue model is automatically calculated for every game. 12/9/2018 Noureddine Ghoggali

Experiments 12/9/2018 Noureddine Ghoggali

Application Related video browser (a) retrieves replay' and its `play' and `focus' segments. It includes two regions in the panel, replay segment list and related segments panel. The related segments panel displays top n (n=3) closest `play' and `focus‘ to the selected `replay'. User chooses `replay' from the `replay' segment list and decides whether to contain it and its related video segments in summary or not. A double-click on icons will play the video clip by a stand alone window. Summary browser (b) shows all video segments in the proposed summary, and grantees user the ability to insert and remove shots. 12/9/2018 Noureddine Ghoggali

Conclusions football videos. It is based on video production conventions and helps in video summarization and indexing. attack is a semantic unit of football game and is an equivalent of scene in other video domains. The result shows those high-level video structures can be computed with high accuracy. We focus on video structure identification and how to merge these structures into attack scene. future work: measure accuracy of attack boundary. The algorithm leaves much space for improvements: Audio event detectors, such as goal and whistle detection, can be integrated; Improve GST algorithm to search more embedded video structure; 12/9/2018 Noureddine Ghoggali

Comments The major difficulty with the statistical approaches in that a large number of hand labeled training samples are required to learn correctly. This problem becoming crucial. And thus the expert are required to derive the semantic labels for a large number of training samples. Given this cost problem its very attractive to design more effective classifier. Training techniques that can take advantage of the unlabeled samples There are three primary genetic operators that govern the producing of the new offspring in the next generation 12/9/2018 Noureddine Ghoggali

THANK’S 12/9/2018 Noureddine Ghoggali There are three primary genetic operators that govern the producing of the new offspring in the next generation 12/9/2018 Noureddine Ghoggali