Video Summarization via Determinantal Point Processes (DPP)

Slides:



Advertisements
Similar presentations
Université du Québec École de technologie supérieure Face Recognition in Video Using What- and-Where Fusion Neural Network Mamoudou Barry and Eric Granger.
Advertisements

Context-based object-class recognition and retrieval by generalized correlograms by J. Amores, N. Sebe and P. Radeva Discussion led by Qi An Duke University.
Human Action Recognition across Datasets by Foreground-weighted Histogram Decomposition Waqas Sultani, Imran Saleemi CVPR 2014.
Structured SVM Chen-Tse Tsai and Siddharth Gupta.
Machine learning continued Image source:
Designing Facial Animation For Speaking Persian Language Hadi Rahimzadeh June 2005.
An Overview of Machine Learning
Supervised Learning Recap
Patch to the Future: Unsupervised Visual Prediction
Multiple People Detection and Tracking with Occlusion Presenter: Feifei Huo Supervisor: Dr. Emile A. Hendriks Dr. A. H. J. Stijn Oomes Information and.
Software Quality Ranking: Bringing Order to Software Modules in Testing Fei Xing Michael R. Lyu Ping Guo.
Personalizing Web Page Recommendation via Collaborative Filtering and Topic-Aware Markov Model Qingyan Yang, Ju Fan, Jianyong Wang, Lizhu Zhou Database.
A Novel Scheme for Video Similarity Detection Chu-Hong Hoi, Steven March 5, 2003.
Personalized Search Result Diversification via Structured Learning
Machine Vision and Dig. Image Analysis 1 Prof. Heikki Kälviäinen C50A6100 Lectures 12: Object Recognition Professor Heikki Kälviäinen Machine Vision and.
Discriminative Naïve Bayesian Classifiers Kaizhu Huang Supervisors: Prof. Irwin King, Prof. Michael R. Lyu Markers: Prof. Lai Wan Chan, Prof. Kin Hong.
Dept. of Computer Science & Engineering, CUHK Pseudo Relevance Feedback with Biased Support Vector Machine in Multimedia Retrieval Steven C.H. Hoi 14-Oct,
Presented by Zeehasham Rasheed
J Cheng et al,. CVPR14 Hyunchul Yang( 양현철 )
DVMM Lab, Columbia UniversityVideo Event Recognition Video Event Recognition: Multilevel Pyramid Matching Dong Xu and Shih-Fu Chang Digital Video and Multimedia.
Machine Learning Usman Roshan Dept. of Computer Science NJIT.
Dimensionality reduction Usman Roshan CS 675. Supervised dim reduction: Linear discriminant analysis Fisher linear discriminant: –Maximize ratio of difference.
This week: overview on pattern recognition (related to machine learning)
Overcoming Dataset Bias: An Unsupervised Domain Adaptation Approach Boqing Gong University of Southern California Joint work with Fei Sha and Kristen Grauman.
Bayesian Sets Zoubin Ghahramani and Kathertine A. Heller NIPS 2005 Presented by Qi An Mar. 17 th, 2006.
Jifeng Dai 2011/09/27.  Introduction  Structural SVM  Kernel Design  Segmentation and parameter learning  Object Feature Descriptors  Experimental.
Summary  The task of extractive speech summarization is to select a set of salient sentences from an original spoken document and concatenate them to.
Data Mining: Classification & Predication Hosam Al-Samarraie, PhD. Centre for Instructional Technology & Multimedia Universiti Sains Malaysia.
Particle Filters for Shape Correspondence Presenter: Jingting Zeng.
A Sparse Modeling Approach to Speech Recognition Based on Relevance Vector Machines Jon Hamaker and Joseph Picone Institute for.
Competence Centre on Information Extraction and Image Understanding for Earth Observation 29/03/07 Blind city classification using aggregation of clusterings.
Chengjie Sun,Lei Lin, Yuan Chen, Bingquan Liu Harbin Institute of Technology School of Computer Science and Technology 1 19/11/ :09 PM.
Learning Geographical Preferences for Point-of-Interest Recommendation Author(s): Bin Liu Yanjie Fu, Zijun Yao, Hui Xiong [KDD-2013]
Greedy is not Enough: An Efficient Batch Mode Active Learning Algorithm Chen, Yi-wen( 陳憶文 ) Graduate Institute of Computer Science & Information Engineering.
Sharing Features Between Objects and Their Attributes Sung Ju Hwang 1, Fei Sha 2 and Kristen Grauman 1 1 University of Texas at Austin, 2 University of.
Geodesic Flow Kernel for Unsupervised Domain Adaptation Boqing Gong University of Southern California Joint work with Yuan Shi, Fei Sha, and Kristen Grauman.
Interactive Learning of the Acoustic Properties of Objects by a Robot
CHAPTER 8 DISCRIMINATIVE CLASSIFIERS HIDDEN MARKOV MODELS.
Topic Models Presented by Iulian Pruteanu Friday, July 28 th, 2006.
Intelligent Database Systems Lab Advisor : Dr.Hsu Graduate : Keng-Wei Chang Author : Lian Yan and David J. Miller 國立雲林科技大學 National Yunlin University of.
Image Classification over Visual Tree Jianping Fan Dept of Computer Science UNC-Charlotte, NC
Unsupervised Mining of Statistical Temporal Structures in Video Liu ze yuan May 15,2011.
Presented by: Fang-Hui Chu Discriminative Models for Speech Recognition M.J.F. Gales Cambridge University Engineering Department 2007.
Dimensionality Reduction in Unsupervised Learning of Conditional Gaussian Networks Authors: Pegna, J.M., Lozano, J.A., Larragnaga, P., and Inza, I. In.
Supervised Random Walks: Predicting and Recommending Links in Social Networks Lars Backstrom (Facebook) & Jure Leskovec (Stanford) Proc. of WSDM 2011 Present.
Identifying “Best Bet” Web Search Results by Mining Past User Behavior Author: Eugene Agichtein, Zijian Zheng (Microsoft Research) Source: KDD2006 Reporter:
6.S093 Visual Recognition through Machine Learning Competition Image by kirkh.deviantart.com Joseph Lim and Aditya Khosla Acknowledgment: Many slides from.
Multi-label Prediction via Sparse Infinite CCA Piyush Rai and Hal Daume III NIPS 2009 Presented by Lingbo Li ECE, Duke University July 16th, 2010 Note:
NN k Networks for browsing and clustering image collections Daniel Heesch Communications and Signal Processing Group Electrical and Electronic Engineering.
Dan Roth University of Illinois, Urbana-Champaign 7 Sequential Models Tutorial on Machine Learning in Natural.
Collaborative Deep Learning for Recommender Systems
1 Dongheng Sun 04/26/2011 Learning with Matrix Factorizations By Nathan Srebro.
Experience Report: System Log Analysis for Anomaly Detection
Recent Trends in Text Mining
Saliency-guided Video Classification via Adaptively weighted learning
Query-Focused Video Summarization – Week 1
CFA: A Practical Prediction System for Video Quality Optimization
Multimodal Learning with Deep Boltzmann Machines
Tracking parameter optimization
J. Zhu, A. Ahmed and E.P. Xing Carnegie Mellon University ICML 2009
An Introduction to Support Vector Machines
Probabilistic Models with Latent Variables
Learning Markov Networks
Overview of Machine Learning
Oral presentation for ACM International Conference on Multimedia, 2014
Ying Dai Faculty of software and information science,
View Inter-Prediction GAN: Unsupervised Representation Learning for 3D Shapes by Learning Global Shape Memories to Support Local View Predictions 1,2 1.
Machine Learning Support Vector Machine Supervised Learning
Christoph F. Eick: A Gentle Introduction to Machine Learning
Semi-Supervised Learning
Presentation transcript:

Video Summarization via Determinantal Point Processes (DPP) Boqing Gong University of Southern California Joint work with Wei-Lun Chao, Kristen Grauman, and Fei Sha

Background Basic idea of DPP Sequential DPP (NIPS 2014) Large-margin training of DPP Conclusion

Background Motivation: Representation: Subset Selection problem Indispensable for fast video browsing and retrieval Representation: Key frames / segments extraction Subset Selection problem

Background Video summarization is hard: Naïve solution: Clustering Individual selected frame: Representativeness Selected frames as a whole: Diversity Naïve solution: Clustering Competing !

Background Clustering works?

Video summarization: an overview Video summarization is hard: What criteria lead to user perspective? What kind of models: Supervised learning ! Diverse subset with representative items

Background How to model subset selection problem? Structured prediction, submodular functions Determinantal Point Processes (DPPs) [Alex Kulesza and Ben Taskar, 2012]

Background Basic idea of DPP Sequential DPP (NIPS 2014) Large-margin training of DPP Conclusion

Basic idea of DPP Idea: A point process based on matrix determinant. Formulation: M discrete items (binary decision)

Basic idea of DPP Why diverse? Extreme cases:

Basic idea of DPP Learning in DPP: 11

Background Basic idea of DPP Sequential DPP (NIPS 2014) Large-margin training of DPP Conclusion

Sequential DPP Motivation: Proposed Idea: The temporal structure of video is missing Proposed Idea: Sequential DPP via Markov properties

… …

Sequential DPP Modeling the sequential structure: Conditional DPP: still a DPP !

Sequential DPP Parameterization:

Inference and Learning Allow brute-force search in small chunks Optimization:

Sequential DPP Experimental setting: 3 datasets: OVP (50), Youtube (39), Kodak (18) Fisher vectors + Saliency + Contextual features Evaluation: Recall, Precision, and F1 score Comparison: unsupervised methods & vanilla DPP

Sequential DPP Experimental Results:

Sequential DPP Experimental Results:

Sequential DPP Experimental Results:

Background Basic idea of DPP Sequential DPP (NIPS 2014) Large-margin training of DPP Conclusion

Learning parameters in DPP Maximum likehood estimation Focuses on observed data only Large-margin training Maximizes margin between observed and undesired data Discriminative learning More flexible: incorporating evaluation metrics

Large-margin training of DPP More discriminative and flexible

Conclusion Supervised learning for video summarization DPPs: modeling diversity subset selection Video structure: Sequential DPP Parameterization: Neural networks Future work Better inference algorithms Models beyond DPP (submodular)