ACADS-SVMConclusions Introduction CMU-MMAC Unsupervised and weakly-supervised discovery of events in video (and audio) Fernando De la Torre.

Slides:



Advertisements
Similar presentations
Pseudo-Relevance Feedback For Multimedia Retrieval By Rong Yan, Alexander G. and Rong Jin Mwangi S. Kariuki
Advertisements

The Extended Cohn-Kanade Dataset(CK+):A complete dataset for action unit and emotion-specified expression Author:Patrick Lucey, Jeffrey F. Cohn, Takeo.
DONG XU, MEMBER, IEEE, AND SHIH-FU CHANG, FELLOW, IEEE Video Event Recognition Using Kernel Methods with Multilevel Temporal Alignment.
Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?
Foreground Focus: Finding Meaningful Features in Unlabeled Images Yong Jae Lee and Kristen Grauman University of Texas at Austin.
INTRODUCTION TO MACHINE LEARNING David Kauchak CS 451 – Fall 2013.
Machine learning continued Image source:
Image Congealing (batch/multiple) image (alignment/registration) Advanced Topics in Computer Vision (048921) Boris Kimelman.
Patch to the Future: Unsupervised Visual Prediction
1 Part 1: Classical Image Classification Methods Kai Yu Dept. of Media Analytics NEC Laboratories America Andrew Ng Computer Science Dept. Stanford University.
Lecture 31: Modern object recognition
Intelligent Systems Lab. Recognizing Human actions from Still Images with Latent Poses Authors: Weilong Yang, Yang Wang, and Greg Mori Simon Fraser University,
Transferable Dictionary Pair based Cross-view Action Recognition Lin Hong.
Object-centric spatial pooling for image classification Olga Russakovsky, Yuanqing Lin, Kai Yu, Li Fei-Fei ECCV 2012.
Data Visualization STAT 890, STAT 442, CM 462
Discriminative and generative methods for bags of features
Content Based Image Clustering and Image Retrieval Using Multiple Instance Learning Using Multiple Instance Learning Xin Chen Advisor: Chengcui Zhang Department.
Retrieving Actions in Group Contexts Tian Lan, Yang Wang, Greg Mori, Stephen Robinovitch Simon Fraser University Sept. 11, 2010.
Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?
Locally Constraint Support Vector Clustering
A Study of Approaches for Object Recognition
Support Vector Machines Pattern Recognition Sergios Theodoridis Konstantinos Koutroumbas Second Edition A Tutorial on Support Vector Machines for Pattern.
Image Categorization by Learning and Reasoning with Regions Yixin Chen, University of New Orleans James Z. Wang, The Pennsylvania State University Published.
Dept. of Computer Science & Engineering, CUHK Pseudo Relevance Feedback with Biased Support Vector Machine in Multimedia Retrieval Steven C.H. Hoi 14-Oct,
Selective Transfer Machine for Personalized Facial Action Unit Detection Wen-Sheng Chu, Fernando De la Torre and Jeffery F. Cohn Robotics Institute, Carnegie.
Bag of Video-Words Video Representation
Object Bank Presenter : Liu Changyu Advisor : Prof. Alex Hauptmann Interest : Multimedia Analysis April 4 th, 2013.
Player Action Recognition in Broadcast Tennis Video with Applications to Semantic Analysis of Sport Game Guangyu Zhu, Changsheng Xu Qingming Huang, Wen.
Graphite 2004 Statistical Synthesis of Facial Expressions for the Portrayal of Emotion Lisa Gralewski Bristol University United Kingdom
Marcin Marszałek, Ivan Laptev, Cordelia Schmid Computer Vision and Pattern Recognition, CVPR Actions in Context.
Building local part models for category-level recognition C. Schmid, INRIA Grenoble Joint work with G. Dorko, S. Lazebnik, J. Ponce.
IEEE TRANSSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE
Lecture 31: Modern recognition CS4670 / 5670: Computer Vision Noah Snavely.
Pedestrian Detection and Localization
Representations for object class recognition David Lowe Department of Computer Science University of British Columbia Vancouver, Canada Sept. 21, 2006.
Classifiers Given a feature representation for images, how do we learn a model for distinguishing features from different classes? Zebra Non-zebra Decision.
Deformable Part Model Presenter : Liu Changyu Advisor : Prof. Alex Hauptmann Interest : Multimedia Analysis April 11 st, 2013.
Beyond Sliding Windows: Object Localization by Efficient Subwindow Search The best paper prize at CVPR 2008.
Efficient Subwindow Search: A Branch and Bound Framework for Object Localization ‘PAMI09 Beyond Sliding Windows: Object Localization by Efficient Subwindow.
Transfer Learning for Image Classification Group No.: 15 Group member : Feng Cai Sauptik Dhar Sauptik.
Deformable Part Models (DPM) Felzenswalb, Girshick, McAllester & Ramanan (2010) Slides drawn from a tutorial By R. Girshick AP 12% 27% 36% 45% 49% 2005.
A Novel Local Patch Framework for Fixing Supervised Learning Models Yilei Wang 1, Bingzheng Wei 2, Jun Yan 2, Yang Hu 2, Zhi-Hong Deng 1, Zheng Chen 2.
Visual Categorization With Bags of Keypoints Original Authors: G. Csurka, C.R. Dance, L. Fan, J. Willamowski, C. Bray ECCV Workshop on Statistical Learning.
K ERNEL - BASED W EIGHTED M ULTI - VIEW C LUSTERING Grigorios Tzortzis and Aristidis Likas Department of Computer Science, University of Ioannina, Greece.
Data Mining, ICDM '08. Eighth IEEE International Conference on Duy-Dinh Le National Institute of Informatics Hitotsubashi, Chiyoda-ku Tokyo,
Discriminative Sub-categorization Minh Hoai Nguyen, Andrew Zisserman University of Oxford 1.
Image Classification over Visual Tree Jianping Fan Dept of Computer Science UNC-Charlotte, NC
GENDER AND AGE RECOGNITION FOR VIDEO ANALYTICS SOLUTION PRESENTED BY: SUBHASH REDDY JOLAPURAM.
Skeleton Based Action Recognition with Convolutional Neural Network
Discovering Objects and their Location in Images Josef Sivic 1, Bryan C. Russell 2, Alexei A. Efros 3, Andrew Zisserman 1 and William T. Freeman 2 Goal:
Locally Linear Support Vector Machines Ľubor Ladický Philip H.S. Torr.
9.913 Pattern Recognition for Vision Class9 - Object Detection and Recognition Bernd Heisele.
Finding Clusters within a Class to Improve Classification Accuracy Literature Survey Yong Jae Lee 3/6/08.
Hierarchical Motion Evolution for Action Recognition Authors: Hongsong Wang, Wei Wang, Liang Wang Center for Research on Intelligent Perception and Computing,
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Nonparametric Bayesian Learning of Switching Dynamical Processes
The topic discovery models
Supervised Time Series Pattern Discovery through Local Importance
The topic discovery models
Using Transductive SVMs for Object Classification in Images
Thesis Advisor : Prof C.V. Jawahar
Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science
The Open World of Micro-Videos
Domingo Mery Department of Computer Science
The topic discovery models
AHED Automatic Human Emotion Detection
Outline Background Motivation Proposed Model Experimental Results
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Data Mining, Machine Learning, Data Analysis, etc. scikit-learn
Paper Reading Dalong Du April.08, 2011.
Presentation transcript:

ACADS-SVMConclusions Introduction CMU-MMAC Unsupervised and weakly-supervised discovery of events in video (and audio) Fernando De la Torre

ACADS-SVMConclusions Introduction CMU-MMAC A dream

ACADS-SVMConclusions Introduction CMU-MMAC Outline Introduction CMU-Multimodal Activity database Unsupervised discovery of video events Aligned Cluster Analysis (ACA) Weakly-supervised discovery of video events Detection-Segmentation SVMs Conclusions

ACADS-SVMConclusions Introduction CMU-MMAC Quality of life technologies (QLoT)

ACADS-SVMConclusions Introduction CMU-MMAC Multimodal data collection 40 subjects, 5 recipes

ACADS-SVMConclusions Introduction CMU-MMAC Multimodal data collection 40 subjects, 5 recipes

ACADS-SVMConclusions Introduction CMU-MMAC Anomalous dataset

ACADS-SVMConclusions Introduction CMU-MMAC Time series analysis Anomalous detection formulated as detecting outliers in multimodal time series. – Supervised – Unsupervised – Semi-supervised or weakly supervised

ACADS-SVMConclusions Introduction CMU-MMAC Time series analysis Anomalous detection formulated as detecting outliers in multimodal time series. – Supervised – Unsupervised – Semi-supervised or weakly supervised

ACADS-SVMConclusions Introduction CMU-MMAC Unsupervised discovery of events in video

ACADS-SVMConclusions Introduction CMU-MMAC Motivation Mining facial expression for one subject

ACADS-SVMConclusions Introduction CMU-MMAC Mining facial expression for one subject Motivation Mining facial expression for one subject Summarization Visualization Indexing

ACADS-SVMConclusions Introduction CMU-MMAC Mining facial expression for one subject Looking up SleepingSmiling Looking forward Waking up Motivation Summarization Visualization Indexing

ACADS-SVMConclusions Introduction CMU-MMAC Mining facial expression of one subject Motivation Summarization Embedding Indexing

ACADS-SVMConclusions Introduction CMU-MMAC Mining facial expression for one subject Motivation Summarization Embedding Indexing

ACADS-SVMConclusions Introduction CMU-MMAC Related work in time series Change point detection (e.g. Page ‘54, Stephens 94’, Lai ‘95, Ge and Smyth ‘00, Steyvers & Brown ’05, Murphy et al. ‘07, Harchaoui et al. ‘08) Segmental HMMs (e.g. Ge and Smith ‘00, Kohlmoren et al. ’01, Ding & Fan ‘07) Mixtures of HMMs (e.g. Fine et al. ‘98, Murphy & Paskin ‘01, Oliver et al. ’02, Alon et al. ‘03) Switching LDS (e.g. Pavolvic et al. ‘00, Oh et al. ‘08, Turaga et al. ‘09) Hierarchical Dirichelet Process (e.g. Beal et al. ‘02, Fox et al. ‘08) Aligned Cluster Analysis (ACA)

ACADS-SVMConclusions Introduction CMU-MMAC Summarization with ACA

ACADS-SVMConclusions Introduction CMU-MMAC x y Kernel k-means and spectral clustering (Ding et al. ‘02, Dhillon et al. ‘04, Zass and Shashua ‘05, De la Torre ‘06) x y x y x y

ACADS-SVMConclusions Introduction CMU-MMAC Problem formulation for ACA Labels (G) Start and end of the segments (h) Dynamic Time Alignment Kernel (Shimodaira et al. 01)

ACADS-SVMConclusions Introduction CMU-MMAC Dynamic Time Alignment Kernel (Shimodaira et al. 01) X [S i, S i+1 ) mcmc X mcmc Problem formulation for ACA

ACADS-SVMConclusions Introduction CMU-MMAC Matrix formulation for ACA samples segments clusters segments Dynamic Time Alignment Kernel (Shimodaira et al. 01) 23 frames, 3 clusters

ACADS-SVMConclusions Introduction CMU-MMAC Facial image features Appearance Active Appearance Models (Baker and Matthews ‘04) Upper face Lower face Shape Image features

ACADS-SVMConclusions Introduction CMU-MMAC Unsupervised facial event discovery

ACADS-SVMConclusions Introduction CMU-MMAC Cohn-Kanade: 30 people and five different expressions (surprise, joy, sadness, fear, anger) Facial event discovery across subjects

ACADS-SVMConclusions Introduction CMU-MMAC ACASpectral Clustering (SC) 0.87(.05)0.56(.04) Cohn-Kanade: 30 people and five different expressions (surprise, joy, sadness, fear, anger) Facial event discovery across subjects 10 sets of 30 people

ACADS-SVMConclusions Introduction CMU-MMAC Honey bee dance (Oh et al. ‘08) Seq 1Seq 2Seq 3Seq 4Seq 5Seq 6 ACA PS- SLDS (Oh et al. ‘08) HDP- VAR(1)-HMM (Fox et al. ‘08) Spectral Clustering Three behaviors: 1-waggling 2-turning left 3-turning right

ACADS-SVMConclusions Introduction CMU-MMAC Clustering human motion

ACADS-SVMConclusions Introduction CMU-MMAC Weakly supervised discovery of events in images and video

ACADS-SVMConclusions Introduction CMU-MMAC Spot the differences!

ACADS-SVMConclusions Introduction CMU-MMAC What distinguish these images?

ACADS-SVMConclusions Introduction CMU-MMAC Classification of time series

ACADS-SVMConclusions Introduction CMU-MMAC Similarity of these problems? Global statistics are not distinctive enough! Better understanding of the discriminative regions or events

ACADS-SVMConclusions Introduction CMU-MMAC Image Bag of ‘regions’ At least one positive All negative

ACADS-SVMConclusions Introduction CMU-MMAC Support vector machines (SVMs)

ACADS-SVMConclusions Introduction CMU-MMAC Learning formulation Standard SVM (Andrews et. al. ’03, Felzenszwalb et al. ‘08)

ACADS-SVMConclusions Introduction CMU-MMAC Optimization all possible subwindows 100ms/image (480*640 pixels) (Lampert et al. CVPR08) 1) 2) ) SVM with QP

ACADS-SVMConclusions Introduction CMU-MMAC Discriminative patterns in time series At most k disjoint intervals We name it: k-segmentation Efficient search: Global optimum guaranteed! 10ms/sequence (15000 frames)

ACADS-SVMConclusions Introduction CMU-MMAC Representation of signals Training data Compute frame-level feature vectors IDs of visual words Visual dictionary clustering

ACADS-SVMConclusions Introduction CMU-MMAC K-segmentation Original signal IDs of visual words Histogram of visual words We need:

ACADS-SVMConclusions Introduction CMU-MMAC What is ? SVM parameters Original signal (x) IDs of visual words m-segmentation  (m+1)-segmentation Consider m-segmentation: Situation 1: Situation 2:

ACADS-SVMConclusions Introduction CMU-MMAC Experiment 1 – glasses vs. no-glasses 624 images, 20 people under different expression/pose 8 people training (126 sunglasses, 128 no glasses), 12 testing (185 sunglasses and 185 no glasses)

ACADS-SVMConclusions Introduction CMU-MMAC Localization result

ACADS-SVMConclusions Introduction CMU-MMAC Experiment 2 – car vs. no car 400 images, half contains cars and other half no cars. Each image 10,000 SIFT descriptors and a vocabulary of 1,000 visual words.

ACADS-SVMConclusions Introduction CMU-MMAC Localization result

ACADS-SVMConclusions Introduction CMU-MMAC Bad localization cases

ACADS-SVMConclusions Introduction CMU-MMAC Classification performance Human labels Our method outperforms SVM with human labels!!! whole image discriminative regions

ACADS-SVMConclusions Introduction CMU-MMAC Experiment 3 – synthetic data Positive class Negative class Result k: maximum number of disjoint intervals. Accuracy

ACADS-SVMConclusions Introduction CMU-MMAC Experiment 4 – mouse activity Mouse activities: – Drinking, eating, exploring, grooming, sleeping

ACADS-SVMConclusions Introduction CMU-MMAC Result – F1 scores

ACADS-SVMConclusions Introduction CMU-MMAC Conclusions CMU Multimodal Activity database Unsupervised discovery of events in time-series – Aligned Cluster Analysis for summarization, indexing and visualization of time-series – Code online ( – Open problems: automatic selection of number of clusters Weakly-supervised discovery of events in time-series – DS-SVM – Novel & efficient algorithm for time series – Outperform methods with human labeled data Kernel methods a fundamental framework for multimodal data fusion.

ACADS-SVMConclusions Introduction CMU-MMAC Thanks Questions?