Mega-boundaries TEMPORAL VIDEO BOUNDARY DETECTION.

Slides:



Advertisements
Similar presentations
Autonomic Scaling of Cloud Computing Resources
Advertisements

Complex Networks for Representation and Characterization of Images For CS790g Project Bingdong Li 9/23/2009.
Decoding of Convolutional Codes  Let C m be the set of allowable code sequences of length m.  Not all sequences in {0,1}m are allowable code sequences!
TRACKING THE INVISIBLE: LEARNING WHERE THE OBJECT MIGHT BE Helmut Grabner1, Jiri Matas2, Luc Van Gool1,3, Philippe Cattin4 1ETH-Zurich, 2Czech Technical.
Bayesian Network and Influence Diagram A Guide to Construction And Analysis.
BIOINFORMATICS GENE DISCOVERY BIOINFORMATICS AND GENE DISCOVERY Iosif Vaisman 1998 UNIVERSITY OF NORTH CAROLINA AT CHAPEL HILL Bioinformatics Tutorials.
Unsupervised Detection of Regions of Interest Using Iterative Link Analysis Gunhee Kim 1 Antonio Torralba 2 1: SCS, CMU 2: CSAIL, MIT Neural Information.
Computer Science Engineering Lee Sang Seon.  Introduction  Basic notions for temporal video boundaries  Micro-Boundaries  Macro-Boundaries  Mega-Boundaries.
Computer Vision Lecture 16: Region Representation
Face Recognition & Biometric Systems Support Vector Machines (part 2)
Introduction of Probabilistic Reasoning and Bayesian Networks
Qualifying Exam: Contour Grouping Vida Movahedi Supervisor: James Elder Supervisory Committee: Minas Spetsakis, Jeff Edmonds York University Summer 2009.
Introduction to probability theory and graphical models Translational Neuroimaging Seminar on Bayesian Inference Spring 2013 Jakob Heinzle Translational.
Learning Causality Some slides are from Judea Pearl’s class lecture
A Novel Scheme for Video Similarity Detection Chu-Hong Hoi, Steven March 5, 2003.
Object Recognition with Invariant Features n Definition: Identify objects or scenes and determine their pose and model parameters n Applications l Industrial.
Locally Constraint Support Vector Clustering
HMM-BASED PATTERN DETECTION. Outline  Markov Process  Hidden Markov Models Elements Basic Problems Evaluation Optimization Training Implementation 2-D.
1 CS 430: Information Discovery Lecture 22 Non-Textual Materials 2.
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
1. Elements of the Genetic Algorithm  Genome: A finite dynamical system model as a set of d polynomials over  2 (finite field of 2 elements)  Fitness.
Multimedia Search and Retrieval Presented by: Reza Aghaee For Multimedia Course(CMPT820) Simon Fraser University March.2005 Shih-Fu Chang, Qian Huang,
Abstract We present a model of curvilinear grouping using piecewise linear representations of contours and a conditional random field to capture continuity.
Part 2 of 3: Bayesian Network and Dynamic Bayesian Network.
Graphical Models Lei Tang. Review of Graphical Models Directed Graph (DAG, Bayesian Network, Belief Network) Typically used to represent causal relationship.
Simulation and Application on learning gene causal relationships Xin Zhang.
Chinese Character Recognition for Video Presented by: Vincent Cheung Date: 25 October 1999.
Face Processing System Presented by: Harvest Jang Group meeting Fall 2002.
Jacinto C. Nascimento, Member, IEEE, and Jorge S. Marques
1 Bayesian Networks Chapter ; 14.4 CS 63 Adapted from slides by Tim Finin and Marie desJardins. Some material borrowed from Lise Getoor.
Learning the Common Structure of Data Kristina Lerman and Steven Minton Presentation by Jeff Roth.
Machine Learning CUNY Graduate Center Lecture 21: Graphical Models.
EE513 Audio Signals and Systems Statistical Pattern Classification Kevin D. Donohue Electrical and Computer Engineering University of Kentucky.
Made by: Maor Levy, Temple University  Probability expresses uncertainty.  Pervasive in all of Artificial Intelligence  Machine learning 
Soft Computing Lecture 17 Introduction to probabilistic reasoning. Bayesian nets. Markov models.
Slides are based on Negnevitsky, Pearson Education, Lecture 12 Hybrid intelligent systems: Evolutionary neural networks and fuzzy evolutionary systems.
Hierarchical Dirichlet Process (HDP) A Dirichlet process (DP) is a discrete distribution that is composed of a weighted sum of impulse functions. Weights.
© 2005 Ritsumeikan Univ. All Rights Reserved. Context Aware Operation Reproduction for Safety Driving Satoshi Kaede Ritsumeikan University Graduate School.
Object Stereo- Joint Stereo Matching and Object Segmentation Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on Michael Bleyer Vienna.
The Science of Prediction Location Intelligence Conference April 4, 2006 How Next Generation Traffic Services Will Impact Business Dr. Oliver Downs, Chief.
Intelligent Vision Systems ENT 496 Object Shape Identification and Representation Hema C.R. Lecture 7.
Multimodal Information Analysis for Emotion Recognition
Automatic Image Annotation by Using Concept-Sensitive Salient Objects for Image Content Representation Jianping Fan, Yuli Gao, Hangzai Luo, Guangyou Xu.
Chapter 4: Pattern Recognition. Classification is a process that assigns a label to an object according to some representation of the object’s properties.
Ch 8. Graphical Models Pattern Recognition and Machine Learning, C. M. Bishop, Revised by M.-O. Heo Summarized by J.W. Nam Biointelligence Laboratory,
Learning the Structure of Related Tasks Presented by Lihan He Machine Learning Reading Group Duke University 02/03/2006 A. Niculescu-Mizil, R. Caruana.
Stable Multi-Target Tracking in Real-Time Surveillance Video
Using Webcast Text for Semantic Event Detection in Broadcast Sports Video IEEE TRANSACTIONS ON MULTIMEDIA, VOL. 10, NO. 7, NOVEMBER 2008.
We are learning to write expressions using variables. (8-1)
October Andrew C. Gallagher, Jiebo Luo, Wei Hao Improved Blue Sky Detection Using Polynomial Model Fit Andrew C. Gallagher, Jiebo Luo, Wei Hao Presented.
 Present by 陳群元.  Introduction  Previous work  Predicting motion patterns  Spatio-temporal transition distribution  Discerning pedestrians  Experimental.
Video Tracking G. Medioni, Q. Yu Edwin Lei Maria Pavlovskaia.
Maximum Entropy Model, Bayesian Networks, HMM, Markov Random Fields, (Hidden/Segmental) Conditional Random Fields.
ECE 471/571 - Lecture 19 Review 11/12/15. A Roadmap 2 Pattern Classification Statistical ApproachNon-Statistical Approach SupervisedUnsupervised Basic.
1 CMSC 671 Fall 2001 Class #20 – Thursday, November 8.
A Cooperative Coevolutionary Genetic Algorithm for Learning Bayesian Network Structures Arthur Carvalho
Semantic Alignment Spring 2009 Ben-Gurion University of the Negev.
Computational methods for inferring cellular networks II Stat 877 Apr 17 th, 2014 Sushmita Roy.
CSC400W Honors Project Proposal Understanding ocean surface features from satellite images Jared Tilanus Nemanja Spasic.
Topological Sort. Sorting technique over DAGs (Directed Acyclic Graphs) It creates a linear sequence (ordering) for the nodes such that: –If u has an.
Visual Information Retrieval
Automatic Video Shot Detection from MPEG Bit Stream
Presenter: Ibrahim A. Zedan
Mean Shift Segmentation
Features & Decision regions
Topological Sort.
Topological Sort.
Presented by: Yang Yu Spatiotemporal GMM for Background Subtraction with Superpixel Hierarchy Mingliang Chen, Xing Wei, Qingxiong.
Pattern Recognition and Image Analysis
Class #16 – Tuesday, October 26
Presentation transcript:

Mega-boundaries TEMPORAL VIDEO BOUNDARY DETECTION

Mega-boundaries  Mega boundaries are defined between macro segments that exhibit different structural and feature consistency.  A good example of mega boundaries application is commercial detection

Commercial detection  Common method is detection of high activity rate and black frame detection coupled with silence detection  Lienhart 1997  Use of monochrome images, scene breaks, and action.  Blum 1992  Use of black frames and activity detector.  Iggulden 1997  Distance between black frame sequences.  Dimitrova 2002  Automatically spots repetitive patterns. Must be identified before recognizing

Commercial detection  Nafeh 1994  Learning and discerning of broadcast using Neural Network  Bonner 1982  McGee 1999  Novak 1988  Y. Li 2000  Agnihotri 2003

Features for Commercial detection  Mega boundaries detection method’s are based on what features we have on the test video  Unicolor Frame for commercial break  High visual activity  Letterbox format  Dataset of 8 hours of video from TV programs  Feature data consists of frames

Triggers and Verifiers  Trigger - Features that can aid in determining the location of the commercial break  Verifier – Features that can determine boundaries of the commercial break  We use the time interval between detected unicolor frame as triggers  Presence of a letterbox change or high cut rate expressed in terms of low cut used as verifiers.  Constrains on commercial breaks are longer than 1 minute and shorter than 6 minute.

Bayesian Belief Network Model  Directed acyclical graph (DAG)  The nodes correspond to variables  The arcs describe direct casual relationship between linked variables  The strength of these links is given by conditional probability distributions  P(x 1,..,x n )=P(X n |X n-1,..,x 1 )*... *P(x 2 |x 1 )P(x 1 )  Ω(x 1,..x n ) - Variables define as DAG  P(x i |∏ i )=P(x i |x 1,..,x n )  P(.|.) is a cpd (conditional probability density)  Using probability density function and chain rule

Bayesian Belief Network Model Probability for the verification node using pdf and chain rule Probability for potential commercial Probability for separator

Bayesian Belief Network Model Probability for sequence of black frames Probability for key frame distance

Evolved Algorithm  Challenge to create algorithm for all countries broadcast difference  Genetic algorithms implement a form Darwinian evolution.  Uses chromosome etc..  Eshelman’s CHC algorithm  CHC is general algorithm with 3 features  Monotonic  CHC prevents parents from mating if their genetic is too similar.  CHC uses soft restart

CHC for our Experiment  Default Parameters.  50 chromosomes, divergence rate of 35%.  Each parameter was coded as a binary string.  Each chromosome was decoded into set of parameters for the commercial detector and this detector was given a test video stream.  Correct label for the video frames were detected by human  Highest precision and recall was achieved with precision + recall

Results  First data set  8 hours of TV broad cast consisting of 13 TV programs  1.5 hours of 28 different commercials  Second dataset  4 hours of TV broad cast consisting of 11 TV programs  1 hour of 35 different commercials  FN,FP,TP,TN  Recall=TP/(TP+FN)  Precision=TP/(TP+FP)

Results  Using First data set  Results from first 4 experiment (recall and precision)  80.8% and 92.6%, 80.8% and 92.6%, 79.7% and 87.4%, 81.3% and 94.3%  Experiment 5 used experiment 4 and result was 88% and 90 %  Using second data set  This dataset was acquired after the algorithm  Test to the Genetic Algorithm  Results are shown in Figure.

Conclusion  Boundary segmentation in video  Visual scene segmentation  Multimodal story segmentation  Commercial detection  Questions ?