Efficient Deep Model for Monocular Road Segmentation

Slides:



Advertisements
Similar presentations
ImageNet Classification with Deep Convolutional Neural Networks
Advertisements

From R-CNN to Fast R-CNN
Fully Convolutional Networks for Semantic Segmentation
Feedforward semantic segmentation with zoom-out features
Deep Residual Learning for Image Recognition
Deep Learning Overview Sources: workshop-tutorial-final.pdf
Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition arXiv: v4 [cs.CV(CVPR)] 23 Apr 2015 Kaiming He, Xiangyu Zhang, Shaoqing.
Lecture 4b Data augmentation for CNN training
Parsing Natural Scenes and Natural Language with Recursive Neural Networks INTERNATIONAL CONFERENCE ON MACHINE LEARNING (ICML 2011) RICHARD SOCHER CLIFF.
Lecture 3a Analysis of training of NN
Convolutional Neural Networks at Constrained Time Cost (CVPR 2015) Authors : Kaiming He, Jian Sun (MSR) Presenter : Hyunjun Ju 1.
Cancer Metastases Classification in Histological Whole Slide Images
When deep learning meets object detection: Introduction to two technologies: SSD and YOLO Wenchi Ma.
Deeply-Recursive Convolutional Network for Image Super-Resolution
Wenchi MA CV Group EECS,KU 03/20/2017
Recent developments in object detection
Learning to Compare Image Patches via Convolutional Neural Networks
Analysis of Sparse Convolutional Neural Networks
Convolutional Neural Network
Summary of “Efficient Deep Learning for Stereo Matching”
Object Detection based on Segment Masks
Compact Bilinear Pooling
[Ran Manor and Amir B.Geva] Yehu Sapir Outlines Review
Convolutional Neural Fabrics by Shreyas Saxena, Jakob Verbeek
The Problem: Classification
CAMELYON16 Challenge Matt Berseth, NLP Logix Jacksonville FL
Training Techniques for Deep Neural Networks
CS6890 Deep Learning Weizhen Cai
Deep Residual Learning for Image Recognition
Dynamic Routing Using Inter Capsule Routing Protocol Between Capsules
Adversarially Tuned Scene Generation
Image Question Answering
Deep Learning Convoluted Neural Networks Part 2 11/13/
Fully Convolutional Networks for Semantic Segmentation
Bird-species Recognition Using Convolutional Neural Network
Computer Vision James Hays
CNNs and compressive sensing Theoretical analysis
Introduction to Neural Networks
Face Recognition with Deep Learning Method
Image Classification.
A Comparative Study of Convolutional Neural Network Models with Rosenblatt’s Brain Model Abu Kamruzzaman, Atik Khatri , Milind Ikke, Damiano Mastrandrea,
Two-Stream Convolutional Networks for Action Recognition in Videos
Counting in Dense Crowds using Deep Learning
CS 4501: Introduction to Computer Vision Training Neural Networks II
Deep Learning Hierarchical Representations for Image Steganalysis
Very Deep Convolutional Networks for Large-Scale Image Recognition
Smart Robots, Drones, IoT
Object Detection Creation from Scratch Samsung R&D Institute Ukraine
A Proposal Defense On Deep Residual Network For Face Recognition Presented By SAGAR MISHRA MECE
LECTURE 35: Introduction to EEG Processing
Convolutional networks
Neural Networks Geoff Hulten.
Lecture: Deep Convolutional Neural Networks
LECTURE 33: Alternative OPTIMIZERS
Visualizing and Understanding Convolutional Networks
Object Tracking: Comparison of
Analysis of Trained CNN (Receptive Field & Weights of Network)
边缘检测年度进展概述 Ming-Ming Cheng Media Computing Lab, Nankai University
Deep Learning Authors: Yann LeCun, Yoshua Bengio, Geoffrey Hinton
Department of Computer Science Ben-Gurion University of the Negev
Deep Object Co-Segmentation
CS295: Modern Systems: Application Case Study Neural Network Accelerator Sang-Woo Jun Spring 2019 Many slides adapted from Hyoukjun Kwon‘s Gatech “Designing.
VERY DEEP CONVOLUTIONAL NETWORKS FOR LARGE-SCALE IMAGE RECOGNITION
Bug Localization with Combination of Deep Learning and Information Retrieval A. N. Lam et al. International Conference on Program Comprehension 2017.
Motivation State-of-the-art two-stage instance segmentation methods depend heavily on feature localization to produce masks.
Learning and Memorization
Object Detection Implementations
Learning Deconvolution Network for Semantic Segmentation
Presented By: Firas Gerges (fg92)
Shengcong Chen, Changxing Ding, Minfeng Liu 2018
Presentation transcript:

Efficient Deep Model for Monocular Road Segmentation 10/12/2018 Paper by: Gabriel L. Oliveira, Wolfram Burgard, Thomas Brox (University of Freiburg, Germany) Presented by: Samin Khan

Intro convolution layers: high-res image => low-res output up-convolution layers (de-convolutions): low-res, abstract features => high-res image prediction state-of-the-art semantic segmentation networks have inference times that are >= 150 ms (upwards of a second in some cases; not real-time) proposed parameter optimization achieves 83 ms inference time on 500x500 images Efficient Model for Road Segmentation

Related Work Fully Convolutional Network (FCN): - take feature extractor layers of a classification network such as VGG - attach up-convolution layers that create coarse score maps for pixel-level classification - successive up-convolution “refinements” increase resolution of these score maps Efficient Model for Road Segmentation

Methodology: Architecture Efficient Model for Road Segmentation

Methodology: Architecture Efficient Model for Road Segmentation

Methodology: Parameter Optimization parameter reduction: - base FC-conv layers have 4096 filters of 7x7 size - this paper proposes 1024 filters of 3x3 size new parameters in the up-conv layers: - substantial reduction in FC-conv layers means parameters needed elsewhere to preserve accuracy - an empirical scalar C is introduced in all intermediate up-conv layers Efficient Model for Road Segmentation

Methodology: Data Augmentation scaling: - training images scaled by a factor between 0.7 and 1.4 color: - value between -0.1 and 0.1 added to the hue channel of the input images’ HSV representation rotation and cropping: - NOT used because these potentially reduce network’s learning of spatial priors Efficient Model for Road Segmentation

Methodology: Hyper-parameters weights initialization: Xavier (mean = 0; variance = 1/nin) momentum: 0.95 (effect of previous state of weights) C: 5 (empirical parameter; problem dependent) Ncl: 3 (output classes: road, fake road, non-road) learning rate: 1e-9 (size of jump towards gradient descent) learning rate decay policy: poly (reduction in jump size as learning progresses) power: 0.9 (decay factor of learning rate) Efficient Model for Road Segmentation

Methodology: Hyper-parameters https://github.com/BVLC/caffe/blob/master/src/caffe/proto/caffe.proto Efficient Model for Road Segmentation

Methodology: Architecture Efficient Model for Road Segmentation

Methodology: Architecture Efficient Model for Road Segmentation

Methodology: Training Refinement 1: conv layers fc-CONV1 + relu dropout fc-CONV2 + relu dropout SCORE DECONV_big_score crop SOFTMAX loss Efficient Model for Road Segmentation

Methodology: Training Refinement 2: conv layers sum crop  conv_score_pool4   DEconv_score fc-CONV1 + relu dropout fc-CONV2 + relu dropout SCORE DECONV_big_score crop SOFTMAX loss Efficient Model for Road Segmentation

Methodology: Training Refinement 3: conv layers sum crop  conv_score_pool4   DEconv_score fc-CONV1 + relu dropout fc-CONV2 + relu sum crop  conv_score_pool3   DEconv_score dropout SCORE DECONV_big_score crop SOFTMAX loss Efficient Model for Road Segmentation

Methodology: Training Refinement 4: conv layers sum crop  conv_score_pool4   DEconv_score fc-CONV1 + relu dropout fc-CONV2 + relu sum crop  conv_score_pool3   DEconv_score dropout SCORE DECONV_big_score sum crop  conv_score_pool2   DEconv_score crop SOFTMAX loss Efficient Model for Road Segmentation

Methodology: Training sum crop  conv_score_pool4   DEconv_score Refinement 5: conv layers fc-CONV1 + relu sum crop  conv_score_pool3   DEconv_score dropout fc-CONV2 + relu dropout sum crop  conv_score_pool2   DEconv_score SCORE DECONV_big_score sum crop  conv_score_pool1   DEconv_score crop SOFTMAX loss Efficient Model for Road Segmentation

Methodology: Training sum crop  conv_score_pool4   DEconv_score + ReLU Refinement 6: DROPOUT conv layers fc-CONV1 + relu sum crop  conv_score_pool3   DEconv_score + ReLU dropout fc-CONV2 + relu dropout sum crop  conv_score_pool2   DEconv_score + ReLU SCORE DECONV_big_score sum crop  conv_score_pool1   DEconv_score + ReLU crop SOFTMAX loss Efficient Model for Road Segmentation

Methodology: Training sum crop  conv_score_pool4   DEconv_score + ReLU Refinement 7: DROPOUT conv layers fc-CONV1 + relu sum crop  conv_score_pool3   DEconv_score + ReLU dropout fc-CONV2 + relu dropout sum crop  conv_score_pool2   DEconv_score + ReLU SCORE FINAL_score DECONV_big_score sum crop  conv_score_pool1   DEconv_score + ReLU crop SOFTMAX loss Efficient Model for Road Segmentation

Experiment: KITTI Road/Lane Dataset Training: 95 Testing: 96 Training: 96 Testing: 94 Training: 98 Testing: 100 Efficient Model for Road Segmentation

Experiment: KITTI Road/Lane Dataset Training: 95 Testing: 96 Training: 96 Testing: 94 Training: 98 Testing: 100 Efficient Model for Road Segmentation

Experiment: KITTI Road/Lane Dataset Training: 95 Testing: 96 Training: 96 Testing: 94 Training: 98 Testing: 100 Efficient Model for Road Segmentation

Experiment: KITTI Road/Lane Dataset Training: 95 Testing: 96 Training: 96 Testing: 94 Training: 98 Testing: 100 Efficient Model for Road Segmentation

Experiment: KITTI Road/Lane Dataset Training: 95 Testing: 96 Training: 96 Testing: 94 Training: 98 Testing: 100 Efficient Model for Road Segmentation

Experiment: KITTI Road/Lane Dataset Efficient Model for Road Segmentation

Experiment: Performance (300x300 image) (Titan X GPU) Efficient Model for Road Segmentation

Experiment: Impact of Parameter Reduction and New Refinement - performed on PASCAL Parts dataset (due to restriction on number of submissions to KITTI) Efficient Model for Road Segmentation

Conclusion changing filter size of fully convolutional layers dramatically improves performance, as long as new parameters are introduced elsewhere (scalar “C” increased filter space of intermediate up-convolutional layers) to preserve network accuracy training up-convolutional networks can be painful Efficient Model for Road Segmentation