Faster R-CNN – Concepts

Slides:



Advertisements
Similar presentations
Rich feature Hierarchies for Accurate object detection and semantic segmentation Ross Girshick, Jeff Donahue, Trevor Darrell, Jitandra Malik (UC Berkeley)
Advertisements

A brief review of non-neural-network approaches to deep learning
R-CNN By Zhang Liliang.
Spatial Pyramid Pooling in Deep Convolutional
From R-CNN to Fast R-CNN
Convolutional Neural Networks for Image Processing with Applications in Mobile Robotics By, Sruthi Moola.
Fully Convolutional Networks for Semantic Segmentation
Cascade Region Regression for Robust Object Detection
Rich feature hierarchies for accurate object detection and semantic segmentation 2014 IEEE Conference on Computer Vision and Pattern Recognition Ross Girshick,
Spatial Localization and Detection
Deep Residual Learning for Image Recognition
Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition arXiv: v4 [cs.CV(CVPR)] 23 Apr 2015 Kaiming He, Xiangyu Zhang, Shaoqing.
A Sentence Interaction Network for Modeling Dependence between Sentences Biao Liu, Minlie Huang Tsinghua University.
Cancer Metastases Classification in Histological Whole Slide Images
When deep learning meets object detection: Introduction to two technologies: SSD and YOLO Wenchi Ma.
Pining for Data II: The Empirical Results Strike Back
Recent developments in object detection
Deep Residual Learning for Image Recognition
Convolutional Sequence to Sequence Learning
CS 4501: Introduction to Computer Vision Object Localization, Detection, Semantic Segmentation Connelly Barnes Some slides from Fei-Fei Li / Andrej Karpathy.
CNN-RNN: A Unified Framework for Multi-label Image Classification
Demo.
CS 4501: Introduction to Computer Vision Computer Vision + Natural Language Connelly Barnes Some slides from Fei-Fei Li / Andrej Karpathy / Justin Johnson.
Object Detection based on Segment Masks
Convolutional Neural Fabrics by Shreyas Saxena, Jakob Verbeek
Matt Gormley Lecture 16 October 24, 2016
Combining CNN with RNN for scene labeling (segmentation)
Ajita Rattani and Reza Derakhshani,
Synthesis of X-ray Projections via Deep Learning
Lecture 5 Smaller Network: CNN
Neural Networks 2 CS446 Machine Learning.
Training Techniques for Deep Neural Networks
Convolutional Networks
CS6890 Deep Learning Weizhen Cai
Deep Residual Learning for Image Recognition
R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.
Object detection.
Layer-wise Performance Bottleneck Analysis of Deep Neural Networks
Bird-species Recognition Using Convolutional Neural Network
Introduction to Neural Networks
A Comparative Study of Convolutional Neural Network Models with Rosenblatt’s Brain Model Abu Kamruzzaman, Atik Khatri , Milind Ikke, Damiano Mastrandrea,
Counting in Dense Crowds using Deep Learning
Deep learning Introduction Classes of Deep Learning Networks
March 2017 Project: IEEE P Working Group for Wireless Personal Area Networks (WPANs) Submission Title: Deep Leaning Method for OWC Date Submitted:
Object Detection + Deep Learning
Object Classification through Deconvolutional Neural Networks
Hairong Qi, Gonzalez Family Professor
Faster R-CNN By Anthony Martinez.
On Convolutional Neural Network
Lecture: Deep Convolutional Neural Networks
Papers 15/08.
Outline Background Motivation Proposed Model Experimental Results
Recurrent Encoder-Decoder Networks for Time-Varying Dense Predictions
Object Tracking: Comparison of
RCNN, Fast-RCNN, Faster-RCNN
Neural Network Pipeline CONTACT & ACKNOWLEDGEMENTS
Heterogeneous convolutional neural networks for visual recognition
Course Recap and What’s Next?
Attention for translation
CIS 519 Recitation 11/15/18.
Department of Computer Science Ben-Gurion University of the Negev
Automatic Handwriting Generation
Human-object interaction
Deep Object Co-Segmentation
Natalie Lang Tomer Malach
CS295: Modern Systems: Application Case Study Neural Network Accelerator Sang-Woo Jun Spring 2019 Many slides adapted from Hyoukjun Kwon‘s Gatech “Designing.
Neural Machine Translation using CNN
Object Detection Implementations
End-to-End Facial Alignment and Recognition
Jiahe Li
Presentation transcript:

Faster R-CNN – Concepts Student Presentation by: Assaf Livne Based on the work of: Ross Girdhick, Shaoqing Ren, Kaiming He אוניברסיטת בן-גוריון בנגב Ben-Gurion University of the Negev Faculty of Engineering Sciences Department of Electrical Engineering

Introduction R-CNN Concepts Fast R-CNN Concepts Faster R-CNN Concepts 5/12/2016 Introduction R-CNN Concepts Fast R-CNN Concepts Faster R-CNN Concepts Conclusion

17/11/2016 Introduction

Introduction ImageNet 17/11/2016 https://www.youtube.com/watch?v=n5uP_LP9SmM

17/11/2016 Introduction Kitti https://www.youtube.com/watch?v=QtgIyJlIn44 https://www.youtube.com/watch?v=Vj17JK3J1JU https://www.youtube.com/watch?v=s4gY6pw_mPQ

17/11/2016 R-CNN

17/11/2016 R-CNN concepts Lets combine Localization NN and classification NN in the simplest way.

17/11/2016 http://www.robots.ox.ac.uk/~tvg/publications/talks/fast-rcnn-slides.pdf

17/11/2016 https://webcourse.cs.technion.ac.il/236815/Spring2016/ho/WCFiles/RCNN_X3_6pp.pdf

Training Process Take a pre-trained classification network. 17/11/2016 Training Process Take a pre-trained classification network. Re-train the last fully connected layer with the objects that need to be detected + "no-object" class. Get all proposals(=~2000 p/image), resize them to match the cnn input, then save to disk. Train SVM to classify between object and background. BB Regression: Train a linear regression classifier that will output some correction factor. https://leonardoaraujosantos.gitbooks.io/artificial-inteligence/content/object_localization_and_detection.html

17/11/2016 R-CNN drawbacks Numerous Candidate object locations must be processed – slow training time and test time Ad hoc training objective. correlation between image space location and detected class is developing.

17/11/2016 Fast R-CNN

17/11/2016 Fast R-CNN concepts Instead of running the CNN on every proposal lets try to save some resources. Train all the layers in a single stage. No memory consumption. Inspired from the VGG16 concept - Very Deep CNN.

17/11/2016 http://www.robots.ox.ac.uk/~tvg/publications/talks/fast-rcnn-slides.pdf, 33

17/11/2016 http://www.robots.ox.ac.uk/~tvg/publications/talks/fast-rcnn-slides.pdf

17/11/2016 ROI pooling Type of max-pooling with a pool size dependent on the input, so that the output always has the same size. This is done because fully connected layer always expected the same input size. https://leonardoaraujosantos.gitbooks.io/artificial-inteligence/content/object_localization_and_detection.html

17/11/2016

17/11/2016

17/11/2016

R-CNN vs Fast R-CNN Testing time: mAP (VOC 2007): 49s 2.32s 66% 66.9% 17/11/2016 R-CNN vs Fast R-CNN Testing time: 49s 2.32s mAP (VOC 2007): 66% 66.9%

17/11/2016 Fast R-CNN drawbacks Still depends on an external object proposal system . Which is the major bottleneck from computing resources point of view.

17/11/2016 Faster R-CNN

17/11/2016 Faster R-CNN concepts Using the already running CNN to infer region proposals.

Faster R-CNN Pipe-Lines 17/11/2016 Faster R-CNN Pipe-Lines Get feature maps from the deep convolution layers. Train a Region Proposal Network (RPN). Give proposals to the ROI pooling layer. Send proposals to a fully connected layer to finish the classification.

17/11/2016 https://leonardoaraujosantos.gitbooks.io/artificial-inteligence/content/object_localization_and_detection.html

Region proposal Network (RPN) 17/11/2016 Region proposal Network (RPN) Basically the RPN is a sliding window which slides on the feature map. Sends as an output the locations of the proposals windows.

Fast R-CNN vs Faster R-CNN 17/11/2016 Fast R-CNN vs Faster R-CNN Testing time: 2.32s 0.2s mAP (VOC 2007): 66.9% 66.9%

17/11/2016 Conclusion

17/11/2016 Conclusion “Using the recently popular terminology of neural networks with ’attention’ mechanisms, the RPN module tells the Fast R-CNN module where to look.”

’Attention’ Mechanisms 17/11/2016 ’Attention’ Mechanisms Rather than using all available information, we need to select the most pertinent piece of information. https://www.youtube.com/watch?v=IGQmdoK_ZfY

RNN – Encoder Decoder Model 17/11/2016 RNN – Encoder Decoder Model https://talbaumel.github.io/attention/

RNN - Attention Model 17/11/2016 https://talbaumel.github.io/attention/

Natural Language Processing (NLP) 17/11/2016 Natural Language Processing (NLP) Show, Attend and Tell – Kelvin Xu et al 2015

Natural Language Processing (NLP) 17/11/2016 Natural Language Processing (NLP) Show, Attend and Tell – Kelvin Xu et al 2015

Thank you for your attention! 17/11/2016 Thank you for your attention!