Object Classification through Deconvolutional Neural Networks

Slides:



Advertisements
Similar presentations
Limin Wang, Yu Qiao, and Xiaoou Tang
Advertisements

Motion Coherent Tracking with Multi-Label MRF Optimization David Tsai Matthew Flagg James M. Rehg Computational Perception Lab School of Interactive Computing.
1 Week 6 Fatemeh Yazdiananari. 2 Feature Extraction 75 Validation Videos Were sent to the cluster for DTF feature extraction The extracted features are.
Activity Recognition Aneeq Zia. Agenda What is activity recognition Typical methods used for action recognition “Evaluation of local spatio-temporal features.
Personal Driving Diary: Constructing a Video Archive of Everyday Driving Events IEEE workshop on Motion and Video Computing ( WMVC) 2011 IEEE Workshop.
Image Subtraction for Real Time Moving Object Extraction Shahbe Mat Desa, Qussay A. Salih, CGIV’04.
Marcin Marszałek, Ivan Laptev, Cordelia Schmid Computer Vision and Pattern Recognition, CVPR Actions in Context.
Mentor: Salman Khokhar Action Recognition in Crowds Week 7.
Human Action Recognition from RGB-D Videos Oliver MacNeely YSP 2015.
Levi Smith.  Reading papers  Getting data set together  Clipping videos to form the training and testing data for our classifier  Project separation.
Human pose recognition from depth image MS Research Cambridge.
Students: Meera & Si Mentor: Afshin Dehghan WEEK 4: DEEP TRACKING.
Kylie Gorman WEEK 1-2 REVIEW. CONVERTING AN IMAGE FROM RGB TO HSV AND DISPLAY CHANNELS.
Segmentation of Vehicles in Traffic Video Tun-Yu Chiang Wilson Lau.
Skeleton Based Action Recognition with Convolutional Neural Network
Semantic Extraction and Semantics-Based Annotation and Retrieval for Video Databases Authors: Yan Liu & Fei Li Department of Computer Science Columbia.
Motion Features for Action Recognition YeHao 3/11/2014.
Logan Lebanoff Mentor: Haroon Idrees
Introduction to Convolutional Neural Networks
Lecture 4b Data augmentation for CNN training
Learning color and locality cues for moving object detection and segmentation Yuan-Hao Lai Feng Liu and Michael Gleicher University of Wisconsin-Madison.
A Hierarchical Deep Temporal Model for Group Activity Recognition
Cancer Metastases Classification in Histological Whole Slide Images
Unsupervised Learning of Video Representations using LSTMs
CS 4501: Introduction to Computer Vision Object Localization, Detection, Semantic Segmentation Connelly Barnes Some slides from Fei-Fei Li / Andrej Karpathy.
Automatic Lung Cancer Diagnosis from CT Scans (Week 1)
Object Classification through Deconvolutional Neural Networks
Week 3 (June 6 – June10 , 2016) Summary :
Presenter: Ibrahim A. Zedan
Object Detection with Bootstrapping Carlos Rubiano Mentor: Oliver Nina
Ajita Rattani and Reza Derakhshani,
Training Techniques for Deep Neural Networks
Project Implementation for ITCS4122
Deep Learning and Newtonian Physics
Textual Video Prediction
A New Approach to Track Multiple Vehicles With the Combination of Robust Detection and Two Classifiers Weidong Min , Mengdan Fan, Xiaoguang Guo, and Qing.
Disguised Face Identification (DFI) with Facial KeyPoints using Spatial Fusion Convolutional Network Nathan Sun CIS601.
A Convolutional Neural Network Cascade For Face Detection
Multiple Organ Detection in CT Volumes using CNN Week 4
Tremor Detection Using Motion Filtering and SVM Bilge Soran, Jenq-Neng Hwang, Linda Shapiro, ICPR, /16/2018.
Computer Vision James Hays
Figure 4. Testing minimal configurations with existing models for spatiotemporal recognition. (A-B) A binary classifier is trained to separate a positive.
Two-Stream Convolutional Networks for Action Recognition in Videos
Video understanding using part based object detection models
Counting in Dense Crowds using Deep Learning
RGB-D Image for Scene Recognition by Jiaqi Guo
The Open World of Micro-Videos
INF 5860 Machine learning for image classification
Applications of Cellular Neural Networks to Image Understanding
Multiple Feature Learning for Action Classification
Object Classification through Deconvolutional Neural Networks
Object Classification through Deconvolutional Neural Networks
Oral presentation for ACM International Conference on Multimedia, 2014
Lecture: Deep Convolutional Neural Networks
Recurrent Encoder-Decoder Networks for Time-Varying Dense Predictions
CNN-based Action Recognition Using Adaptive Multiscale Depth Motion Maps And Stable Joint Distance Maps Junyou He, Hailun Xia, Chunyan Feng, Yunfei Chu.
Neural Network Pipeline CONTACT & ACKNOWLEDGEMENTS
Sequence to Sequence Video to Text
Chuan Wang1, Haibin Huang1, Xiaoguang Han2, Jue Wang1
Compare Against Analytic KB
Multi-UAV to UAV Tracking
Report 7 Brandon Silva.
CRCV REU 2019 Kara Schatz.
Week 3 Volodymyr Bobyr.
Self-Supervised Cross-View Action Synthesis
Week 7 Presentation Ngoc Ta Aidean Sharghi
Multi-Target Detection and Tracking of UAVs from a UAV
Truman Action Recognition Status update
Truman Action Recognition Status update
Nguyen Ngoc Hoang, Guee-Sang Lee, Soo-Hyung Kim, Hyung-Jeong Yang
Presentation transcript:

Object Classification through Deconvolutional Neural Networks Student: Carlos Rubiano Mentor: Oliver Nina

ChaLearn Looking at People Action/interaction spotting on RGB data Recognize actions using 235 performances of 11 action classes recorded and manually labeled in continuous RGB sequences of people performing natural isolated and collaborative behaviors

Video Classification with CNN Use CNN structure to classify actions in videos Using similar network that classifies CIFAR and takes inputs of 32 x 32 x 3 Also do this using ImageNet network and takes inputs of 256 x 256 x 3 Extract the frames from the videos Applied optical flow on the frames of the videos Resized the data for faster performance, and match inputs of the network Optical flow gives temporal information across two frames

ChaLearn Looking at People Dataset Optical Flow Optical Flow

Model

Results from ChaLearn Looking at People 2014 With automatic segmentation

Results on Validation Set Spatial DCNN Stacks Randomly Initialized Pretrained with Fine tuning 1 frame 0.53 0.57 3 frames 0.55 5 frames 0.6 Temporal DCNN Stacks Randomly Initialized Pretrained with Fine tuning 1 frame 0.58 0.5 3 frames 0.54 - 5 frames With manual segmentation

Also… Add: - motion boundary histograms - Dense SIFT - HOG