Download presentation
Presentation is loading. Please wait.
Published bySharleen Phillips Modified over 9 years ago
1
Bangpeng Yao Li Fei-Fei Computer Science Department, Stanford University, USA
2
Introduction Modeling mutual context of object and pose Model learning Model inference, object detection, and human pose estimation Experiments Conclusion
3
Introduction Modeling mutual context of object and pose Model learning Model inference, object detection, and human pose estimation Experiments Conclusion
4
Human pose estimation & Object detection Right-arm Left-arm Torso Right-leg Left-leg Tennis racket
5
Challenging :
6
Mutual context : Human pose estimation & Object detection - facilitate the recognition of each other
7
Mutual context V.S no mutual context
8
Introduction Modeling mutual context of object and pose Model learning Model inference, object detection, and human pose estimation Experiments Conclusion
10
A : Activity class, ex : tennis server, volleyball smash O : Object, ex : tennis racket, volleyball H : Human pose P : Body parts f : visual feature Each A have more than one type of H
11
: edge of the model : potential function : weight : Freguencies of co- occurrence between A, O, and H,, : Spatial relationship among object and body parts, compute by : (position, orientation, scale)
12
: model the dependence of the object and a body part with their corresponding image evidence
13
Co-occurrence context for the activity class, object, and human pose Multiple types of human pose for each activity Spatial context between object and body parts
14
Introduction Modeling mutual context of object and pose Model learning Model inference, object detection, and human pose estimation Experiments Conclusion
15
Learning step needs to achieve two goals : structure learning & parameter estimation Structure learning : discover the hidden human pose and the connectivity among the object, human pose, and body parts Parameter estimation : for the potential weight to maximize the discrimination between different activities
16
Objective : Connectivity pattern between the object, the human pose, and the body parts Method : hill-climbing approach with tabu list
17
Hill-climbing approach adds or removes edges one at a time until maximum is reached Human pose
18
Objective : obtain a set of potential weight that maximize the discrimination between different classes of activities Training sample : : is potential function value, disconnected edge set 0 : is the human pose H : is the class label A If, then : is a weight vector for the r-th sub-class
19
: is L2 norm : normalization constant
20
Using only one human pose for each HOI class is not enough to characterize well all the image in this class
21
Introduction Modeling mutual context of object and pose Model learning Model inference, object detection, and human pose estimation Experiments Conclusion
22
Given a new testing image, our objective is : - estimate the pose of the human - detect the object that is interacting with the human
23
Introduction Modeling mutual context of object and pose Model learning Model inference, object detection, and human pose estimation Experiments Conclusion
24
Cricket - defensive shot (player and cricket bat) Cricket - bowling (player and cricket ball) Croquet - shot (player and croquet mallet) Tennis - forehand (player and tennis racket) Tennis – serve (player and tennis racket) Volleyball - smash (player and volleyball) 30 images for training, 20 for testing
26
Sliding window Pedestrian as context Our method detector
27
Pose estimation still difficult Multiple pose is better than only one pose
28
Upper : our method Lower left : object detection by a scanning window Lower right : pose estimation by the state-of-art pictorial structure method
29
Note Gupta et.al. uses predominantly the background scene context
30
Introduction Modeling mutual context of object and pose Model learning Model inference, object detection, and human pose estimation Experiments Conclusion
31
Treat object and human pose as the context of each other in different HOI activity classes Structure learning method - connectivity important patterns between objects and human pose Further improve : - incorporate useful background scene context to facilitate the recognition of foreground object and activity - deal with more than one object
Similar presentations
© 2024 SlidePlayer.com. Inc.
All rights reserved.