Download presentation
Presentation is loading. Please wait.
Published byAlannah Gardner Modified over 9 years ago
1
Chao-Yeh Chen and Kristen Grauman University of Texas at Austin Efficient Activity Detection with Max- Subgraph Search
2
Outline Introduction Approach Define weighted nodes Link nodes Search for the maximum-weight graph Experimental Result Conclusion
3
Introduction Existing methods tend to separate activity detection into two distinct stages: 1. generates space-time candidate regions of interest from the test video 2. scores each candidate according to how well it matches a given activity model (often a classifier).
4
How to detect human activity in continuous video? Status quo approaches:
5
Introduction We pose activity detection as a maximum-weight connected subgraph problem over a learned space-time graph constructed on the test sequence.
6
Approach
7
Classifier training for feature weights Learn a linear SVM from training data, the scoring function would have the form: let denote j-th bin count for histogram h(S), the j-th word is associated with a weight for j = 1,…,K, where K is the dimension of histogram h.
8
Classifier training for feature weights Thus the classifier response for subvolume S is: Candidate subvolume SVM weight for j-th word Num occurrences of j-th word SVM weight for i-th feature's word
9
Bag-of-feature(Bof)
12
Localized SpaceTime Features Low-level descriptors we use HoG and HoF computed in local space-time cubes [14, 10]. These descriptors capture the appearance and motion in the video. High-level descriptors
13
Define weighted nodes Divide space-time volume into frame-level or space-time nodes. Compute the weight of nodes from the features inside them.
14
Link nodes Two different link strategies: 1. Neighbors only for frame-level nodes(T-Subgraph) or space- time nodes(ST-Subgraph). 2. First two neighbors for frame-level nodes(T-Jump-Subgraph).
15
Search for the maximum-weight graph Transform max-weight subgraph problem into a prize- collecting Steiner tree problem. Solve efficiently with branch and cut method from [15]. [15]An algorithmic framework for the exact solution of the prize-collecting Steiner tree problem. Math. Prog., 2006.
16
Experimental Result Datasets
17
Baselines T-Sliding ST-Cube-Sliding ST-Cube-Subvolume[29] J. Yuan, Z. Liu, and Y. Wu. Discriminative subvolume search for efficient action detection. In CVPR, 2009.
18
UCF Sports data
19
Hollywood data
20
MSR dataset
21
Example of ST-Subgraph
22
Overview of all methods on the three datasets
23
High-level vs Low-level descriptors
24
Conclusion Compare to sliding window search,it significantly reduces computation time. Flexible node structure offers more robust detection in noisy backgrounds. High-level descriptor shows promise for complex activities by incorporating semantic relationships between humans and objects in video.
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.