Data Driven Attributes for Action Detection

Slides:



Advertisements
Similar presentations
Max-Margin Additive Classifiers for Detection
Advertisements

Human Detection Phanindra Varma. Detection -- Overview  Human detection in static images is based on the HOG (Histogram of Oriented Gradients) encoding.
Learning Shared Body Plans Ian Endres University of Illinois work with Derek Hoiem, Vivek Srikumar and Ming-Wei Chang.
Adding Unlabeled Samples to Categories by Learned Attributes Jonghyun Choi Mohammad Rastegari Ali Farhadi Larry S. Davis PPT Modified By Elliot Crowley.
Pose Estimation and Segmentation of People in 3D Movies Karteek Alahari, Guillaume Seguin, Josef Sivic, Ivan Laptev Inria, Ecole Normale Superieure ICCV.
Recognizing Human Actions by Attributes CVPR2011 Jingen Liu, Benjamin Kuipers, Silvio Savarese Dept. of Electrical Engineering and Computer Science University.
Limin Wang, Yu Qiao, and Xiaoou Tang
Ľubor Ladický1 Phil Torr2 Andrew Zisserman1
Shape Sharing for Object Segmentation
Human Action Recognition across Datasets by Foreground-weighted Histogram Decomposition Waqas Sultani, Imran Saleemi CVPR 2014.
Lecture 31: Modern object recognition
Many slides based on P. FelzenszwalbP. Felzenszwalb General object detection with deformable part-based models.
Activity Recognition Aneeq Zia. Agenda What is activity recognition Typical methods used for action recognition “Evaluation of local spatio-temporal features.
Global spatial layout: spatial pyramid matching Spatial weighting the features Beyond bags of features: Adding spatial information.
More sliding window detection: Discriminative part-based models Many slides based on P. FelzenszwalbP. Felzenszwalb.
Good morning, everyone, thank you for coming to my presentation.
COS 429 PS5: Finding Nemo. Exemplar -SVM Still a rigid template,but train a separate SVM for each positive instance For each category it can has exemplar.
Student: Kylie Gorman Mentor: Yang Zhang COLOR-ATTRIBUTES- RELATED IMAGE RETRIEVAL.
Bag of Video-Words Video Representation
Object Bank Presenter : Liu Changyu Advisor : Prof. Alex Hauptmann Interest : Multimedia Analysis April 4 th, 2013.
Watch, Listen and Learn Sonal Gupta, Joohyun Kim, Kristen Grauman and Raymond Mooney -Pratiksha Shah.
An Example of Course Project Face Identification.
“Secret” of Object Detection Zheng Wu (Summer intern in MSRNE) Sep. 3, 2010 Joint work with Ce Liu (MSRNE) William T. Freeman (MIT) Adam Kalai (MSRNE)
Week 9 Presented by Christina Peterson. Recognition Accuracies on UCF Sports data set Method Accuracy (%)DivingGolfingKickingLiftingRidingRunningSkating.
Efficient Region Search for Object Detection Sudheendra Vijayanarasimhan and Kristen Grauman Department of Computer Science, University of Texas at Austin.
Lecture 31: Modern recognition CS4670 / 5670: Computer Vision Noah Snavely.
Mentor: Salman Khokhar Action Recognition in Crowds Week 7.
Beyond Sliding Windows: Object Localization by Efficient Subwindow Search The best paper prize at CVPR 2008.
Efficient Subwindow Search: A Branch and Bound Framework for Object Localization ‘PAMI09 Beyond Sliding Windows: Object Localization by Efficient Subwindow.
Deformable Part Models (DPM) Felzenswalb, Girshick, McAllester & Ramanan (2010) Slides drawn from a tutorial By R. Girshick AP 12% 27% 36% 45% 49% 2005.
Kylie Gorman WEEK 1-2 REVIEW. CONVERTING AN IMAGE FROM RGB TO HSV AND DISPLAY CHANNELS.
VIP: Finding Important People in Images Clint Solomon Mathialagan Andrew C. Gallagher Dhruv Batra CVPR
E XEMPLAR -SVM FOR A CTION R ECOGNITION Week 11 Presented by Christina Peterson.
Category Independent Region Proposals Ian Endres and Derek Hoiem University of Illinois at Urbana-Champaign.
Recognition Using Visual Phrases
Object Recognizing. Object Classes Individual Recognition.
D ATA D RIVEN A TTRIBUTES FOR A CTION R ECOGNITION Week 7 Presented by Christina Peterson.
REU Week 1 Presented by Christina Peterson. Edge Detection Sobel ◦ Convolve image with derivative masks:  x:  y: ◦ Calculate gradient magnitude ◦ Apply.
Week 3 Emily Hand UNR. Online Multiple Instance Learning The goal of MIL is to classify unseen bags, instances, by using the labeled bags as training.
1 Bilinear Classifiers for Visual Recognition Computational Vision Lab. University of California Irvine To be presented in NIPS 2009 Hamed Pirsiavash Deva.
Course Project Lists for ITCS6157 Jianping Fan. Project Implementation Lists Automatic Image Clustering You can download 1,000,000 images from You can.
Cascade for Fast Detection
Object detection with deformable part-based models
Data Driven Attributes for Action Detection
Finding Things: Image Parsing with Regions and Per-Exemplar Detectors
Lit part of blue dress and shadowed part of white dress are the same color
Action Recognition ECE6504 Xiao Lin.
Exemplar-SVM for Action Recognition
Object Localization Goal: detect the location of an object within an image Fully supervised: Training data labeled with object category and ground truth.
Learning to Detect a Salient Object
Object detection as supervised classification
Enhanced-alignment Measure for Binary Foreground Map Evaluation
A Tutorial on HOG Human Detection
HOGgles Visualizing Object Detection Features
A Convolutional Neural Network Cascade For Face Detection
Action Recognition in Temporally Untrimmed Videos
Rob Fergus Computer Vision
Figure 4. Testing minimal configurations with existing models for spatiotemporal recognition. (A-B) A binary classifier is trained to separate a positive.
Video understanding using part based object detection models
Mentor: Salman Khokhar
Week 6 Fatemeh Yazdiananari.
AHED Automatic Human Emotion Detection
Human Action Recognition Week 8
Weakly Supervised Action Recognition
Exemplar-SVM for Action Recognition
University of Central Florida
Visual Attributes in Video
Week 3 Volodymyr Bobyr.
Volodymyr Bobyr Supervised by Aayushjungbahadur Rana
Presentation transcript:

Data Driven Attributes for Action Detection Week 5 Presented by Christina Peterson

Bag of Words Implemented the Bag of Words in Matlab For each video, collect low level features within the ground truth bounding box At each frame, the box is divided into 3 x 3 cells For each cell, create a histogram for each feature (STIP, color, texture) For each bounding box, create a histogram for each feature Feature Vector is the concatenation of the histograms One Feature Vector for every bounding box

Exemplar-SVM Obtained source code from Tomasz Malisiewicz’s website for ‘Ensemble of Exemplar SVMS for Object Detection and Beyond’ Do implementation based on this code, but designed for the UCF Sports dataset For each action class, create Exemplar-SVM for each bounding box of every video Optimize by reducing bounding box to one cycle of the action

Calibration Run each Exemplar-SVM on a validation set Use non-maximum suppression to remove redundant detections Compute the overlap score between resulting detections and ground-truth bounding-boxes Detections that overlap by more than 0.5 with ground truth are positive Detections with overlap by less than 0.2 with ground truth are negative Fit a logistic function to these scores

Goals Implement the Exemplar-SVMs Implement the calibration step as detailed in Malisiewicz et al. [1] Research other calibration methods to improve this step

References [1] T. Malisiewicz, A. Gupta, and A. A. Efros. Ensemble of Exemplar SVMS for Object Detection and Beyond. In Proc. ICCV, 2011. [2] Y. Tian, R. Sukthankar, and M. Shah. Spatiotemporal Deformable Part Models for Action Detection. In CVPR, 2013.