Learning Techniques for Video Shot Detection Under the guidance of Prof. Sharat Chandran by M. Nithya.

Slides:

Advertisements

Similar presentations

Image Retrieval With Relevant Feedback Hayati Cam & Ozge Cavus IMAGE RETRIEVAL WITH RELEVANCE FEEDBACK Hayati CAM Ozge CAVUS.

Advertisements

Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki

Relevance Feedback and User Interaction for CBIR Hai Le Supervisor: Dr. Sid Ray.

Context-based object-class recognition and retrieval by generalized correlograms by J. Amores, N. Sebe and P. Radeva Discussion led by Qi An Duke University.

Interactively Co-segmentating Topically Related Images with Intelligent Scribble Guidance Dhruv Batra, Carnegie Mellon University Adarsh Kowdle, Cornell.

CIS 581 Course Project Heshan Lin

Automatic Video Shot Detection from MPEG Bit Stream Jianping Fan Department of Computer Science University of North Carolina at Charlotte Charlotte, NC.

Automated Shot Boundary Detection in VIRS DJ Park Computer Science Department The University of Iowa.

Taxonomic classification for web- based videos Author: Yang Song et al. (Google) Presenters: Phuc Bui & Rahul Dhamecha.

Patch to the Future: Unsupervised Visual Prediction

Detecting Cartoons a Case Study in Automatic Video-Genre Classification Tzvetanka Ianeva Arjen de Vries Hein Röhrig.

COLORCOLOR A SET OF CODES GENERATED BY THE BRAİN How do you quantify? How do you use?

Video Shot Boundary Detection at RMIT University Timo Volkmer, Saied Tahaghoghi, and Hugh E. Williams School of Computer Science & IT, RMIT University.

10/11/2001Random walks and spectral segmentation1 CSE 291 Fall 2001 Marina Meila and Jianbo Shi: Learning Segmentation by Random Walks/A Random Walks View.

A Novel Scheme for Video Similarity Detection Chu-Hong Hoi, Steven March 5, 2003.

1 Content-Based Retrieval (CBR) -in multimedia systems Presented by: Chao Cai Date: March 28, 2006 C SC 561.

Lecture 6 Image Segmentation

HCI Final Project Robust Real Time Face Detection Paul Viola, Michael Jones, Robust Real-Time Face Detetion, International Journal of Computer Vision,

Morris LeBlanc.  Why Image Retrieval is Hard?  Problems with Image Retrieval  Support Vector Machines  Active Learning  Image Processing ◦ Texture.

1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.

1 Color Segmentation: Color Spaces and Illumination Mohan Sridharan University of Birmingham

Generic Object Detection using Feature Maps Oscar Danielsson Stefan Carlsson

Robust Object Segmentation Using Adaptive Thresholding Xiaxi Huang and Nikolaos V. Boulgouris International Conference on Image Processing 2007.

Ensemble Tracking Shai Avidan IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE February 2007.

Spatial Semi- supervised Image Classification Stuart Ness G07 - Csci 8701 Final Project 1.

LYU 0102 : XML for Interoperable Digital Video Library Recent years, rapid increase in the usage of multimedia information, Recent years, rapid increase.

Adaboost and its application

CSE 291 Final Project: Adaptive Multi-Spectral Differencing Andrew Cosand UCSD CVRR.

A Novel 2D To 3D Image Technique Based On Object- Oriented Conversion.

Smart Traveller with Visual Translator for OCR and Face Recognition LYU0203 FYP.

Video Trails: Representing and Visualizing Structure in Video Sequences Vikrant Kobla David Doermann Christos Faloutsos.

Foundations of Computer Vision Rapid object / face detection using a Boosted Cascade of Simple features Presented by Christos Stoilas Rapid object / face.

Face Detection and Neural Networks Todd Wittman Math 8600: Image Analysis Prof. Jackie Shen December 2001.

FACE DETECTION AND RECOGNITION By: Paranjith Singh Lohiya Ravi Babu Lavu.

Exploring the Parameter Space of Image Segmentation Algorithms Talk at NCHU p 1 TexPoint fonts used in EMF. Read the TexPoint manual before you.

Introduction to Computer Vision Olac Fuentes Computer Science Department University of Texas at El Paso El Paso, TX, U.S.A.

Visual Tracking with Online Multiple Instance Learning

Chapter 14: SEGMENTATION BY CLUSTERING 1. 2 Outline Introduction Human Vision & Gestalt Properties Applications – Background Subtraction – Shot Boundary.

Implementing Codesign in Xilinx Virtex II Pro Betim Çiço, Hergys Rexha Department of Informatics Engineering Faculty of Information Technologies Polytechnic.

Window-based models for generic object detection Mei-Chen Yeh 04/24/2012.

DIEGO AGUIRRE COMPUTER VISION INTRODUCTION 1. QUESTION What is Computer Vision? 2.

IEEE Int'l Symposium on Signal Processing and its Applications 1 An Unsupervised Learning Approach to Content-Based Image Retrieval Yixin Chen & James.

PSEUDO-RELEVANCE FEEDBACK FOR MULTIMEDIA RETRIEVAL Seo Seok Jun.

Tijana Janjusevic Multimedia and Vision Group, Queen Mary, University of London Clustering of Visual Data using Ant-inspired Methods Supervisor: Prof.

CS654: Digital Image Analysis

Advances in digital image compression techniques Guojun Lu, Computer Communications, Vol. 16, No. 4, Apr, 1993, pp

1 Research Question  Can a vision-based mobile robot  with limited computation and memory,  and rapidly varying camera positions,  operate autonomously.

Levels of Image Data Representation 4.2. Traditional Image Data Structures 4.3. Hierarchical Data Structures Chapter 4 – Data structures for.

Project by: Cirill Aizenberg, Dima Altshuler Supervisor: Erez Berkovich.

Boosted Particle Filter: Multitarget Detection and Tracking Fayin Li.

Chittampally Vasanth Raja vasanthexperiments.wordpress.com.

Bachelor of Engineering In Image Processing Techniques For Video Content Extraction Submitted to the faculty of Engineering North Maharashtra University,

Content-Based Image Retrieval QBIC Homepage The State Hermitage Museum db2www/qbicSearch.mac/qbic?selLang=English.

Image and Video Retrieval INST 734 Doug Oard Module 13.

Text From Corners: A Novel Approach to Detect Text and Caption in Videos Xu Zhao, Kai-Hsiang Lin, Yun Fu, Member, IEEE, Yuxiao Hu, Member, IEEE, Yuncai.

Learning video saliency from human gaze using candidate selection CVPR2013 Poster.

Cell Segmentation in Microscopy Imagery Using a Bag of Local Bayesian Classifiers Zhaozheng Yin RI/CMU, Fall 2009.

Implementing the By: Matthew Marsh Supervisors: Prof Shaun Bangay Mrs Adele Lobb segmentation technique as a plugin for the GIMP.

Max-Confidence Boosting With Uncertainty for Visual tracking WEN GUO, LIANGLIANG CAO, TONY X. HAN, SHUICHENG YAN AND CHANGSHENG XU IEEE TRANSACTIONS ON.

Color Image Segmentation Mentor : Dr. Rajeev Srivastava Students: Achit Kumar Ojha Aseem Kumar Akshay Tyagi.

Introduction To Computational and Biological Vision Max Binshtok Ohad Greenshpan March 2006 Shot Detection in video.

Shadow Detection in Remotely Sensed Images Based on Self-Adaptive Feature Selection Jiahang Liu, Tao Fang, and Deren Li IEEE TRANSACTIONS ON GEOSCIENCE.

Adaboost (Adaptive boosting) Jo Yeong-Jun Schapire, Robert E., and Yoram Singer. "Improved boosting algorithms using confidence- rated predictions."

Machine learning & object recognition Cordelia Schmid Jakob Verbeek.

Student Gesture Recognition System in Classroom 2.0 Chiung-Yao Fang, Min-Han Kuo, Greg-C Lee, and Sei-Wang Chen Department of Computer Science and Information.

Automatic Video Shot Detection from MPEG Bit Stream

Presenter: Ibrahim A. Zedan

R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.

Color Image Retrieval based on Primitives of Color Moments

Presentation transcript:

Learning Techniques for Video Shot Detection Under the guidance of Prof. Sharat Chandran by M. Nithya

Outline Introduction Types of Shot-break Previous approaches to Shot Detection  General Approach - pixel comparison, histogram comparison…  Recent Work – Temporal slice analysis, Cue Video Our Proposed approaches  Supervised Learning using AdaBoost algorithm  Unsupervised Learning using clustering  Semi-supervised Learning combining AdaBoost & clustering Conclusion

Introduction  9,000 hours of motion pictures are produced around the world every year.  3,000 television stations broadcasting for twenty-four hours a day produce eight million hours of video per year. Problems: Searching the video Retrieving the relevant information Solution: Break down the video into smaller manageable parts called “Shots”

What is Shot?  Shot is the result of uninterrupted camera work  Shot-break is the transition from one shot to the next

Types of Shot-Break

Shot-Break Hard Cut Fade DissolveWipe

Hard Cut

Fade

Dissolve

Wipe

Shot Detection Methods

Goal: To segment video into shots Two ways: Cluster the similar frames to identify shots Find the shots that differ and declare it as shot-break

Pervious Approaches to Shot Detection General Approaches –Pixel Comparison –Block-based approach –Histogram Comparison –Edge Change Ratio Recent Work –Temporal Slice Analysis –Cue Video

Pixel Comparison Frame N Frame N + 1  x=1  y=1 | P i (x,y) – P i+1 (x,y) | D(i,i+1)= X Y XY

Block – Based Approach Frame N Frame N + 1 Compares statistics of the corresponding blocks Counts the number of significantly different blocks

Histogram Comparison

Edge Change Ratio

Comparison… MethodAdvantagesDisadvantages Pixel-Comparison Simple, easy to implement Computationally heavy, Very sensitive to moving object or camera motion Block basedPerforms better than pixel Can’t identify dissolve, fade, fast moving objects Histogram comparisonPerformance is better Detects hard-cut, fade, wipe and dissolve Fails if the two successive shots have same histogram. Can’t distinguish fast object or camera motion Edge Change RatiosDetects hard-cut, fade, wipe and dissolve Computationally heavy Fails when there is large amount of motion

Problems with previous approaches  Can’t distinguish shot-breaks with Fast object motion or Camera motion Fast Illumination changes Reflections from glass, water Flash photography  Fails to detect long and short gradual transitions

Temporal – Slice Analysis

Cue Video

Temporal – Slice Analysis

Cue Video Graph based approach Each frame maps to a node Connected upto 1, 3 or 7 frames apart Each node is associated with – color Histogram – Edge Histogram Weights of the edges represent similarity measure between the two frames Graph partitioning will segment the video into shots

Proposed Approaches

Use learning techniques to distinguish between shot-break and Fast object motion or Camera motion Fast Illumination changes Reflections from glass, water Flash photography

Supervised Learning

Feature Extraction 25 Primitive features like edge, color are extracted directly from the image These 25 features are used as input to next round of feature extraction yielding 25 x 25 = 625 features This 625 features can be used as input to compute 625 x 625 = 15, 625 features

How these features can be used to classify images?

Solution : Use AdaBoost to select these features. Oops!! There are 15, 625 features! Applying them to red, green and blue separately will result in 46, 875 features! Can we find few important features that will help to distinguish the images?

Input: (x 1,y 1 ) (x 2,y 2 ) …(x m,y m ) where x 1,x 2,…x m are the images y i = 0,1 for negative and positive examples Let n and p be the number of positive and negative examples Initial weight w 1,i = 1/2n if y i = 0 and w 1,I = 1/2p if y i = 1 For t= 1,…T: Train one hypothesis h i (x) for each feature and find the error Choose the hypothesis with low error value update the weight: w t+1,i = w t,i *  t 1-e t where e i =0,1for x i classified incorrectly or correctly  t =e t /(1-e t ) Normalize w t+1,I so that it is a distribution Final hypothesis is calculated as AdaBoost Algorithm

Supervised Learning Extract Highly selective features AdaBoost algorithm to select few important features Train the method to detect different shot-breaks

Unsupervised techniques Clustering

Unsupervised technique - clustering

Hard Cut Dissolve

Unsupervised technique Clustering method to cluster into shots Relevance Feedback

Semi-supervised Learning

 Combination of Supervised and Unsupervised  Few labeled data are available, using which it works on large unlabeled video Steps: AdaBoost algorithm to select features Clustering method to cluster into shots Relevance Feedback

Conclusion…

Problems with previous approaches: Can’t distinguish shot-breaks with –Fast object motion or Camera motion –Fast Illumination changes –Reflections from glass, water –Flash photography  Fails to detect long and short gradual transitions Planning to use AdaBoost learning based clustering scheme for shot-detection

Thank you…