Chao-Yeh Chen and Kristen Grauman, University of Texas at Austin. Efficient Activity Detection with Max-Subgraph Search.



Outline: Introduction; Approach (define weighted nodes, link nodes, search for the maximum-weight subgraph); Experimental results; Conclusion.

Introduction: Existing methods tend to separate activity detection into two distinct stages: 1. generate space-time candidate regions of interest from the test video; 2. score each candidate according to how well it matches a given activity model (often a classifier).

How to detect human activity in continuous video? Status quo approaches:

Introduction We pose activity detection as a maximum-weight connected subgraph problem over a learned space-time graph constructed on the test sequence.

Approach

Classifier training for feature weights: Learn a linear SVM from training data to obtain a scoring function over the bag-of-features histogram $h(S)$ of a subvolume $S$. Let $h^j(S)$ denote the j-th bin count of histogram $h(S)$. The j-th word is associated with a weight $w^j$ for j = 1,…,K, where K is the dimension of histogram h.

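Classifier training for feature weights: Thus the classifier response for a candidate subvolume $S$ is $f(S) = \beta + \sum_{j=1}^{K} w^j h^j(S) = \beta + \sum_{i} w^{c_i}$, where $\beta$ is the SVM bias, $w^j$ is the SVM weight for the j-th word, $h^j(S)$ is the number of occurrences of the j-th word in $S$, and $w^{c_i}$ is the SVM weight for the i-th feature's word (with $c_i$ the index of the word that the i-th feature in $S$ is assigned to).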
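
The equivalence between the per-word sum and the per-feature sum can be checked with a minimal sketch. The function names, the toy weights, and the default bias of 0 are illustrative assumptions, not taken from the slides.

```python
from collections import Counter


def response_from_histogram(word_ids, weights, bias=0.0):
    """Score a subvolume as bias + sum_j w^j * h^j(S).

    word_ids: word index c_i of every local feature inside the subvolume.
    weights:  per-word linear SVM weights w^j (toy values in this sketch).
    """
    hist = Counter(word_ids)  # h^j(S): occurrences of word j in S
    return bias + sum(weights[j] * count for j, count in hist.items())


def response_from_features(word_ids, weights, bias=0.0):
    """Score the same subvolume as bias + sum_i w^{c_i}, one term per feature."""
    return bias + sum(weights[c] for c in word_ids)


# Hypothetical example: 3 visual words, 4 features inside the subvolume.
toy_weights = [0.5, -1.0, 2.0]
toy_word_ids = [0, 2, 2, 1]
score_hist = response_from_histogram(toy_word_ids, toy_weights)
score_feat = response_from_features(toy_word_ids, toy_weights)
```

Both functions return the same value because a linear classifier over a histogram is additive over the individual features. That additivity is what lets the score of a subvolume be accumulated from smaller pieces.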

Bag-of-features (BoF)

Localized space-time features. Low-level descriptors: we use HoG and HoF computed in local space-time cubes [14, 10]. These descriptors capture the appearance and motion in the video. High-level descriptors

Define weighted nodes: Divide the space-time volume into frame-level or space-time nodes. Compute the weight of each node from the features inside it.
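
A minimal sketch of the frame-level case follows. It assumes that a node's weight is the sum of the per-word SVM weights of the features falling inside it, which is consistent with the additive classifier response but is not spelled out on the slide. The function name and the `(frame_index, word_id)` feature format are hypothetical.

```python
def frame_node_weights(features, word_weights, num_frames):
    """Accumulate one weight per frame-level node.

    features:     list of (frame_index, word_id) pairs, one per local feature.
    word_weights: per-word linear SVM weights (toy values in this sketch).
    num_frames:   number of frame-level nodes in the test sequence.
    """
    node_w = [0.0] * num_frames
    for t, c in features:
        node_w[t] += word_weights[c]  # assumed: node weight = sum of feature weights
    return node_w


# Hypothetical example: 3 frames, 3 features.
toy_word_weights = [0.5, -1.0, 2.0]
toy_features = [(0, 0), (0, 1), (2, 2)]
toy_node_w = frame_node_weights(toy_features, toy_word_weights, num_frames=3)
```

Space-time nodes would work the same way, with each feature assigned to a spatial cell as well as a frame.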

Link nodes: Two different link strategies: 1. Neighbors only, for frame-level nodes (T-Subgraph) or space-time nodes (ST-Subgraph). 2. First two neighbors, for frame-level nodes (T-Jump-Subgraph).
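
The two temporal link strategies can be sketched as edge lists over frame-level nodes. The function name and the `max_jump` parameter are illustrative assumptions; the space-time (ST-Subgraph) case, which also links spatial neighbors, is not shown.

```python
def temporal_edges(num_nodes, max_jump=1):
    """Link each frame-level node to its next `max_jump` temporal neighbors.

    max_jump=1 links adjacent frames only (T-Subgraph style).
    max_jump=2 also links frames two steps apart (T-Jump-Subgraph style),
    so a connected subgraph may skip a single frame.
    """
    return [
        (t, t + d)
        for t in range(num_nodes)
        for d in range(1, max_jump + 1)
        if t + d < num_nodes
    ]


t_subgraph_edges = temporal_edges(4, max_jump=1)
t_jump_edges = temporal_edges(4, max_jump=2)
```

With the extra jump edges, a selected subgraph can skip an isolated frame with a negative weight while remaining connected.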

Search for the maximum-weight subgraph: Transform the max-weight subgraph problem into a prize-collecting Steiner tree problem, then solve it efficiently with the branch-and-cut method from [15]. [15] An algorithmic framework for the exact solution of the prize-collecting Steiner tree problem. Math. Prog., 2006.
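
The branch-and-cut solver of [15] is not reproduced here. As an illustration of the objective only, the sketch below handles the simplest special case: frame-level nodes linked to adjacent neighbors only form a chain, every connected subgraph of a chain is a contiguous interval, and so the maximum-weight connected subgraph reduces to the maximum-sum contiguous run (Kadane's algorithm). The function name is hypothetical, and this swapped-in technique does not cover the jump or space-time graphs.

```python
def max_weight_interval(node_weights):
    """Return (best_sum, start, end) of the max-weight contiguous run of nodes.

    Assumes a non-empty list of node weights on a chain graph.
    start and end are inclusive node indices.
    """
    best_sum = cur_sum = node_weights[0]
    best_start = best_end = cur_start = 0
    for t in range(1, len(node_weights)):
        if cur_sum < 0:
            # A negative running sum can only hurt: restart the run at t.
            cur_sum = node_weights[t]
            cur_start = t
        else:
            cur_sum += node_weights[t]
        if cur_sum > best_sum:
            best_sum, best_start, best_end = cur_sum, cur_start, t
    return best_sum, best_start, best_end


# Hypothetical node weights for six frames.
toy_chain = [-1.0, 2.0, -0.5, 3.0, -4.0, 1.0]
detection = max_weight_interval(toy_chain)
```

In the toy chain, the selected interval keeps a slightly negative frame because it bridges two strongly positive ones. General graphs lose this linear-time structure, which is why the slide turns to a prize-collecting Steiner tree formulation.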

Experimental Result Datasets

Baselines: T-Sliding, ST-Cube-Sliding, and ST-Cube-Subvolume [29]. [29] J. Yuan, Z. Liu, and Y. Wu. Discriminative subvolume search for efficient action detection. In CVPR, 2009.

UCF Sports data

Hollywood data

MSR dataset

Example of ST-Subgraph

Overview of all methods on the three datasets

High-level vs Low-level descriptors

Conclusion: Compared to sliding-window search, max-subgraph search significantly reduces computation time. The flexible node structure offers more robust detection in noisy backgrounds. The high-level descriptor shows promise for complex activities by incorporating semantic relationships between humans and objects in video.