Mentor Prof. Amitabha Mukerjee Deepak Pathak Kaustubh Tapi 10346

Slides:



Advertisements
Similar presentations
NEURAL NETWORKS Backpropagation Algorithm
Advertisements

Neural networks Introduction Fitting neural networks
Learning Functions and Neural Networks II Lecture 9 Luoting Fu Spring 2012.
What is the Best Multi-Stage Architecture for Object Recognition? Ruiwen Wu [1] Jarrett, Kevin, et al. "What is the best multi-stage architecture for object.
Machine Learning Neural Networks
Lecture 14 – Neural Networks
Neural Networks Chapter Feed-Forward Neural Networks.
LOGO Classification III Lecturer: Dr. Bo Yuan
K-means Based Unsupervised Feature Learning for Image Recognition Ling Zheng.
Artificial Neural Networks KONG DA, XUEYU LEI & PAUL MCKAY.
Overview of Back Propagation Algorithm
Face Recognition Using Neural Networks Presented By: Hadis Mohseni Leila Taghavi Atefeh Mirsafian.
SE367 Project Final Presentation By: Sujith Thomas Parimi Krishna Chaitanya In charge:- Prof Amitabha Mukerjee.
Convolutional Neural Networks for Image Processing with Applications in Mobile Robotics By, Sruthi Moola.
Presented by: Kamakhaya Argulewar Guided by: Prof. Shweta V. Jain
CSSE463: Image Recognition Day 21 Upcoming schedule: Upcoming schedule: Exam covers material through SVMs Exam covers material through SVMs.
Artificial Neural Networks (ANN). Output Y is 1 if at least two of the three inputs are equal to 1.
Multi Layer NN and Bit-True Modeling of These Networks SILab presentation Ali Ahmadi September 2007.
Neural Networks Chapter 6 Joost N. Kok Universiteit Leiden.
Chapter 9 Neural Network.
Appendix B: An Example of Back-propagation algorithm
Backpropagation An efficient way to compute the gradient Hung-yi Lee.
COMPARISON OF IMAGE ANALYSIS FOR THAI HANDWRITTEN CHARACTER RECOGNITION Olarik Surinta, chatklaw Jareanpon Department of Management Information System.
Yang, Luyu.  Postal service for sorting mails by the postal code written on the envelop  Bank system for processing checks by reading the amount of.
Handwritten Recognition with Neural Network Chatklaw Jareanpon, Olarik Surinta Mahasarakham University.
Handwritten Hindi Numerals Recognition Kritika Singh Akarshan Sarkar Mentor- Prof. Amitabha Mukerjee.
Analysis of Classification Algorithms In Handwritten Digit Recognition Logan Helms Jon Daniele.
Procedure for Training a Child to Identify a Cat using 10,000 Example Cats For Cat_index  1 to Show cat and describe catlike features (Cat_index)
Neural Networks Teacher: Elena Marchiori R4.47 Assistant: Kees Jong S2.22
C - IT Acumens. COMIT Acumens. COM. To demonstrate the use of Neural Networks in the field of Character and Pattern Recognition by simulating a neural.
Neural Networks 2nd Edition Simon Haykin
Object Recognizing. Deep Learning Success in 2012 DeepNet and speech processing.
Joe Bradish Parallel Neural Networks. Background  Deep Neural Networks (DNNs) have become one of the leading technologies in artificial intelligence.
Logan Lebanoff Mentor: Haroon Idrees
Neural Networks Lecture 11: Learning in recurrent networks Geoffrey Hinton.
Digit Recognition Using SIS Testbed Mengjie Mao. Overview Cycle 1: sequential component AAM training Cycle 2: sequential components Identifier 0 Ten perfect.
Neural networks (2) Reminder Avoiding overfitting Deep neural network Brief summary of supervised learning methods.
1 Convolutional neural networks Abin - Roozgard. 2  Introduction  Drawbacks of previous neural networks  Convolutional neural networks  LeNet 5 
Deep Learning Overview Sources: workshop-tutorial-final.pdf
Feature selection using Deep Neural Networks March 18, 2016 CSI 991 Kevin Ham.
CSE343/543 Machine Learning Mayank Vatsa Lecture slides are prepared using several teaching resources and no authorship is claimed for any slides.
Tofik AliPartha Pratim Roy Department of Computer Science and Engineering Indian Institute of Technology Roorkee CVIP-WM 2017 Paper ID 172 Word Spotting.
Big data classification using neural network
Deep Learning Amin Sobhani.
Data Mining, Neural Network and Genetic Programming
Computer Science and Engineering, Seoul National University
Convolutional Neural Fabrics by Shreyas Saxena, Jakob Verbeek
Combining CNN with RNN for scene labeling (segmentation)
Neural Networks 2 CS446 Machine Learning.
Prof. Carolina Ruiz Department of Computer Science
Multiple Organ Detection in CT Volumes using CNN Week 1
Dog/Cat Classifier Christina Stiff.
Deep learning Introduction Classes of Deep Learning Networks
Smart Robots, Drones, IoT
network of simple neuron-like computing elements
Convolutional neural networks Abin - Roozgard.
Age and Gender Classification using Convolutional Neural Networks
Forward and Backward Max Pooling
Neural Networks II Chen Gao Virginia Tech ECE-5424G / CS-5824
Convolutional Neural Networks
Face Recognition: A Convolutional Neural Network Approach
Neural Networks II Chen Gao Virginia Tech ECE-5424G / CS-5824
CSC321: Neural Networks Lecture 11: Learning in recurrent networks
Automatic Handwriting Generation
Debasis Bhattacharya, JD, DBA University of Hawaii Maui College
Example of training and deployment of deep convolutional neural networks. Example of training and deployment of deep convolutional neural networks. During.
Pattern Recognition: Statistical and Neural
Prof. Carolina Ruiz Department of Computer Science
Presented By: Firas Gerges (fg92)
Presentation transcript:

Mentor Prof. Amitabha Mukerjee Deepak Pathak Kaustubh Tapi

Objective is to classify human actions from the video dataset. Motivation: Current methods are highly image processing based and highly problem dependent. We’ll use 3-D Convolutional Neural networks which extracts and learns the features to classify different set of actions. Implemented on Weizmann Dataset of human actions which is classified into 10 actions. [CREDIT: WEIZMANN DATASET]

Firstly we break each video into its constituent frames and apply bounding box on each frame to reduce input dimension size. Dataset of 226 videos (10 classifications) was divided into training(181 videos) and testing part(45 videos). A subsequence of 13(64X48X13) consecutive frames with 12 frames overlap is given as input to 3-D Convolutional Neural Networks. Till now we have tested on silhouette frames of videos.

CREDIT: “Sequential Deep Learning for Human Action Recognition” Paper by: Baccouche, M., Mamalet, F., Wolf, C., Garcia, C., Baskurt, A. [2011] For training the neural network, inputs from training set are forward propagated till the last layer[FORWARD PROPAGATION]. Our 3-D Convolutional Neural Network undergoes supervised training.

Error is computed in last layer and then propagated backwards to all previous layers.(BACKPROPAGATION). Weight updation in layers depends on eta(learning rate). Weights will converge after a number of epochs. (Hessian Back-propagation used to reduce number of epochs) Learned feature maps seem to capture visually relevant information (person/background segmentation, limbs involved during the action, edge information... ) Same learning algorithm used for entire 3-D Convolutional Neural Networks

Input Video Silhouette Frames Convolved feature maps Sub-sampled feature maps (with bias) Recurrent Neural Network Output layer (10 classifications)

 We obtained code for 2-D Convolutional Neural Network for MNIST digit recognition(C++ Implementation) ) by Mike O’ Neill [3]  We modified the code to construct 3-D Convolutional Neural Network for Human action recognition on WEIZMANN DATASET.  Our code can be implemented from command line and number of nodes, layers and kernels can be modified easily.

 Accuracy of 88%-90% was obtained on WEIZMANN DATASET(181 videos for training and 45 videos for testing) after a training of 8 epochs.

 [1] Baccouche M., Mamalet F., Wolf C., Garcia C., Baskurt A. : “Sequential Deep Learning for Human Action Recognition”. In: Salah, A.A., Lepri, B. (eds.) HBU LNCS, vol. 7065, pp. 29–39. Springer, Heidelberg [2011].  [2] Weizmann Dataset (std.).  [3] Code for 2-D Convolutional Neural Network for MNIST digit recognition(C++ Implementation) by Mike O’ Neill presented in paper by- Patrice Y. Simard, Dave Steinkraus, John Platt, "Best Practices for Convolutional Neural Networks Applied to Visual Document Analysis," International Conference on Document Analysis and Recognition (ICDAR), IEEE Computer Society, Los Alamitos, pp [2003].