Data Driven Attributes for Action Detection

Slides:

Advertisements

Similar presentations

Max-Margin Additive Classifiers for Detection

Advertisements

Human Detection Phanindra Varma. Detection -- Overview  Human detection in static images is based on the HOG (Histogram of Oriented Gradients) encoding.

Learning Shared Body Plans Ian Endres University of Illinois work with Derek Hoiem, Vivek Srikumar and Ming-Wei Chang.

Adding Unlabeled Samples to Categories by Learned Attributes Jonghyun Choi Mohammad Rastegari Ali Farhadi Larry S. Davis PPT Modified By Elliot Crowley.

Pose Estimation and Segmentation of People in 3D Movies Karteek Alahari, Guillaume Seguin, Josef Sivic, Ivan Laptev Inria, Ecole Normale Superieure ICCV.

Recognizing Human Actions by Attributes CVPR2011 Jingen Liu, Benjamin Kuipers, Silvio Savarese Dept. of Electrical Engineering and Computer Science University.

Limin Wang, Yu Qiao, and Xiaoou Tang

Ľubor Ladický1 Phil Torr2 Andrew Zisserman1

Shape Sharing for Object Segmentation

Human Action Recognition across Datasets by Foreground-weighted Histogram Decomposition Waqas Sultani, Imran Saleemi CVPR 2014.

Lecture 31: Modern object recognition

Many slides based on P. FelzenszwalbP. Felzenszwalb General object detection with deformable part-based models.

Activity Recognition Aneeq Zia. Agenda What is activity recognition Typical methods used for action recognition “Evaluation of local spatio-temporal features.

Global spatial layout: spatial pyramid matching Spatial weighting the features Beyond bags of features: Adding spatial information.

More sliding window detection: Discriminative part-based models Many slides based on P. FelzenszwalbP. Felzenszwalb.

Good morning, everyone, thank you for coming to my presentation.

COS 429 PS5: Finding Nemo. Exemplar -SVM Still a rigid template,but train a separate SVM for each positive instance For each category it can has exemplar.

Student: Kylie Gorman Mentor: Yang Zhang COLOR-ATTRIBUTES- RELATED IMAGE RETRIEVAL.

Bag of Video-Words Video Representation

Object Bank Presenter ： Liu Changyu Advisor ： Prof. Alex Hauptmann Interest ： Multimedia Analysis April 4 th, 2013.

Watch, Listen and Learn Sonal Gupta, Joohyun Kim, Kristen Grauman and Raymond Mooney -Pratiksha Shah.

An Example of Course Project Face Identification.

“Secret” of Object Detection Zheng Wu (Summer intern in MSRNE) Sep. 3, 2010 Joint work with Ce Liu (MSRNE) William T. Freeman (MIT) Adam Kalai (MSRNE)

Week 9 Presented by Christina Peterson. Recognition Accuracies on UCF Sports data set Method Accuracy (%)DivingGolfingKickingLiftingRidingRunningSkating.

Efficient Region Search for Object Detection Sudheendra Vijayanarasimhan and Kristen Grauman Department of Computer Science, University of Texas at Austin.

Lecture 31: Modern recognition CS4670 / 5670: Computer Vision Noah Snavely.

Mentor: Salman Khokhar Action Recognition in Crowds Week 7.

Beyond Sliding Windows: Object Localization by Efficient Subwindow Search The best paper prize at CVPR 2008.

Efficient Subwindow Search: A Branch and Bound Framework for Object Localization ‘PAMI09 Beyond Sliding Windows: Object Localization by Efficient Subwindow.

Deformable Part Models (DPM) Felzenswalb, Girshick, McAllester & Ramanan (2010) Slides drawn from a tutorial By R. Girshick AP 12% 27% 36% 45% 49% 2005.

Kylie Gorman WEEK 1-2 REVIEW. CONVERTING AN IMAGE FROM RGB TO HSV AND DISPLAY CHANNELS.

VIP: Finding Important People in Images Clint Solomon Mathialagan Andrew C. Gallagher Dhruv Batra CVPR

E XEMPLAR -SVM FOR A CTION R ECOGNITION Week 11 Presented by Christina Peterson.

Category Independent Region Proposals Ian Endres and Derek Hoiem University of Illinois at Urbana-Champaign.

Recognition Using Visual Phrases

Object Recognizing. Object Classes Individual Recognition.

D ATA D RIVEN A TTRIBUTES FOR A CTION R ECOGNITION Week 7 Presented by Christina Peterson.

REU Week 1 Presented by Christina Peterson. Edge Detection Sobel ◦ Convolve image with derivative masks:  x:  y: ◦ Calculate gradient magnitude ◦ Apply.

Week 3 Emily Hand UNR. Online Multiple Instance Learning The goal of MIL is to classify unseen bags, instances, by using the labeled bags as training.

1 Bilinear Classifiers for Visual Recognition Computational Vision Lab. University of California Irvine To be presented in NIPS 2009 Hamed Pirsiavash Deva.

Course Project Lists for ITCS6157 Jianping Fan. Project Implementation Lists Automatic Image Clustering You can download 1,000,000 images from You can.

Cascade for Fast Detection

Object detection with deformable part-based models

Data Driven Attributes for Action Detection

Finding Things: Image Parsing with Regions and Per-Exemplar Detectors

Lit part of blue dress and shadowed part of white dress are the same color

Action Recognition ECE6504 Xiao Lin.

Exemplar-SVM for Action Recognition

Object Localization Goal: detect the location of an object within an image Fully supervised: Training data labeled with object category and ground truth.

Learning to Detect a Salient Object

Object detection as supervised classification

Enhanced-alignment Measure for Binary Foreground Map Evaluation

A Tutorial on HOG Human Detection

HOGgles Visualizing Object Detection Features

A Convolutional Neural Network Cascade For Face Detection

Action Recognition in Temporally Untrimmed Videos

Rob Fergus Computer Vision

Figure 4. Testing minimal configurations with existing models for spatiotemporal recognition. (A-B) A binary classifier is trained to separate a positive.

Video understanding using part based object detection models

Mentor: Salman Khokhar

Week 6 Fatemeh Yazdiananari.

AHED Automatic Human Emotion Detection

Human Action Recognition Week 8

Weakly Supervised Action Recognition

Exemplar-SVM for Action Recognition

University of Central Florida

Visual Attributes in Video

Week 3 Volodymyr Bobyr.

Volodymyr Bobyr Supervised by Aayushjungbahadur Rana

Presentation transcript:

Data Driven Attributes for Action Detection Week 5 Presented by Christina Peterson

Bag of Words Implemented the Bag of Words in Matlab For each video, collect low level features within the ground truth bounding box At each frame, the box is divided into 3 x 3 cells For each cell, create a histogram for each feature (STIP, color, texture) For each bounding box, create a histogram for each feature Feature Vector is the concatenation of the histograms One Feature Vector for every bounding box

Exemplar-SVM Obtained source code from Tomasz Malisiewicz’s website for ‘Ensemble of Exemplar SVMS for Object Detection and Beyond’ Do implementation based on this code, but designed for the UCF Sports dataset For each action class, create Exemplar-SVM for each bounding box of every video Optimize by reducing bounding box to one cycle of the action

Calibration Run each Exemplar-SVM on a validation set Use non-maximum suppression to remove redundant detections Compute the overlap score between resulting detections and ground-truth bounding-boxes Detections that overlap by more than 0.5 with ground truth are positive Detections with overlap by less than 0.2 with ground truth are negative Fit a logistic function to these scores

Goals Implement the Exemplar-SVMs Implement the calibration step as detailed in Malisiewicz et al. [1] Research other calibration methods to improve this step

References [1] T. Malisiewicz, A. Gupta, and A. A. Efros. Ensemble of Exemplar SVMS for Object Detection and Beyond. In Proc. ICCV, 2011. [2] Y. Tian, R. Sukthankar, and M. Shah. Spatiotemporal Deformable Part Models for Action Detection. In CVPR, 2013.