Efficient Deep Model for Monocular Road Segmentation

Slides:

Advertisements

Similar presentations

ImageNet Classification with Deep Convolutional Neural Networks

Advertisements

From R-CNN to Fast R-CNN

Fully Convolutional Networks for Semantic Segmentation

Feedforward semantic segmentation with zoom-out features

Deep Residual Learning for Image Recognition

Deep Learning Overview Sources: workshop-tutorial-final.pdf

Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition arXiv: v4 [cs.CV(CVPR)] 23 Apr 2015 Kaiming He, Xiangyu Zhang, Shaoqing.

Lecture 4b Data augmentation for CNN training

Parsing Natural Scenes and Natural Language with Recursive Neural Networks INTERNATIONAL CONFERENCE ON MACHINE LEARNING (ICML 2011) RICHARD SOCHER CLIFF.

Lecture 3a Analysis of training of NN

Convolutional Neural Networks at Constrained Time Cost (CVPR 2015) Authors : Kaiming He, Jian Sun (MSR) Presenter : Hyunjun Ju 1.

Cancer Metastases Classification in Histological Whole Slide Images

When deep learning meets object detection: Introduction to two technologies: SSD and YOLO Wenchi Ma.

Deeply-Recursive Convolutional Network for Image Super-Resolution

Wenchi MA CV Group EECS,KU 03/20/2017

Recent developments in object detection

Learning to Compare Image Patches via Convolutional Neural Networks

Analysis of Sparse Convolutional Neural Networks

Convolutional Neural Network

Summary of “Efficient Deep Learning for Stereo Matching”

Object Detection based on Segment Masks

Compact Bilinear Pooling

[Ran Manor and Amir B.Geva] Yehu Sapir Outlines Review

Convolutional Neural Fabrics by Shreyas Saxena, Jakob Verbeek

The Problem: Classification

CAMELYON16 Challenge Matt Berseth, NLP Logix Jacksonville FL

Training Techniques for Deep Neural Networks

CS6890 Deep Learning Weizhen Cai

Deep Residual Learning for Image Recognition

Dynamic Routing Using Inter Capsule Routing Protocol Between Capsules

Adversarially Tuned Scene Generation

Image Question Answering

Deep Learning Convoluted Neural Networks Part 2 11/13/

Fully Convolutional Networks for Semantic Segmentation

Bird-species Recognition Using Convolutional Neural Network

Computer Vision James Hays

CNNs and compressive sensing Theoretical analysis

Introduction to Neural Networks

Face Recognition with Deep Learning Method

Image Classification.

A Comparative Study of Convolutional Neural Network Models with Rosenblatt’s Brain Model Abu Kamruzzaman, Atik Khatri , Milind Ikke, Damiano Mastrandrea,

Two-Stream Convolutional Networks for Action Recognition in Videos

Counting in Dense Crowds using Deep Learning

CS 4501: Introduction to Computer Vision Training Neural Networks II

Deep Learning Hierarchical Representations for Image Steganalysis

Very Deep Convolutional Networks for Large-Scale Image Recognition

Smart Robots, Drones, IoT

Object Detection Creation from Scratch Samsung R&D Institute Ukraine

A Proposal Defense On Deep Residual Network For Face Recognition Presented By SAGAR MISHRA MECE

LECTURE 35: Introduction to EEG Processing

Convolutional networks

Neural Networks Geoff Hulten.

Lecture: Deep Convolutional Neural Networks

LECTURE 33: Alternative OPTIMIZERS

Visualizing and Understanding Convolutional Networks

Object Tracking: Comparison of

Analysis of Trained CNN (Receptive Field & Weights of Network)

边缘检测年度进展概述 Ming-Ming Cheng Media Computing Lab, Nankai University

Deep Learning Authors: Yann LeCun, Yoshua Bengio, Geoffrey Hinton

Department of Computer Science Ben-Gurion University of the Negev

Deep Object Co-Segmentation

CS295: Modern Systems: Application Case Study Neural Network Accelerator Sang-Woo Jun Spring 2019 Many slides adapted from Hyoukjun Kwon‘s Gatech “Designing.

VERY DEEP CONVOLUTIONAL NETWORKS FOR LARGE-SCALE IMAGE RECOGNITION

Bug Localization with Combination of Deep Learning and Information Retrieval A. N. Lam et al. International Conference on Program Comprehension 2017.

Motivation State-of-the-art two-stage instance segmentation methods depend heavily on feature localization to produce masks.

Learning and Memorization

Object Detection Implementations

Learning Deconvolution Network for Semantic Segmentation

Presented By: Firas Gerges (fg92)

Shengcong Chen, Changxing Ding, Minfeng Liu 2018

Presentation transcript:

Efficient Deep Model for Monocular Road Segmentation 10/12/2018 Paper by: Gabriel L. Oliveira, Wolfram Burgard, Thomas Brox (University of Freiburg, Germany) Presented by: Samin Khan

Intro convolution layers: high-res image => low-res output up-convolution layers (de-convolutions): low-res, abstract features => high-res image prediction state-of-the-art semantic segmentation networks have inference times that are >= 150 ms (upwards of a second in some cases; not real-time) proposed parameter optimization achieves 83 ms inference time on 500x500 images Efficient Model for Road Segmentation

Related Work Fully Convolutional Network (FCN): - take feature extractor layers of a classification network such as VGG - attach up-convolution layers that create coarse score maps for pixel-level classification - successive up-convolution “refinements” increase resolution of these score maps Efficient Model for Road Segmentation

Methodology: Architecture Efficient Model for Road Segmentation

Methodology: Architecture Efficient Model for Road Segmentation

Methodology: Parameter Optimization parameter reduction: - base FC-conv layers have 4096 filters of 7x7 size - this paper proposes 1024 filters of 3x3 size new parameters in the up-conv layers: - substantial reduction in FC-conv layers means parameters needed elsewhere to preserve accuracy - an empirical scalar C is introduced in all intermediate up-conv layers Efficient Model for Road Segmentation

Methodology: Data Augmentation scaling: - training images scaled by a factor between 0.7 and 1.4 color: - value between -0.1 and 0.1 added to the hue channel of the input images’ HSV representation rotation and cropping: - NOT used because these potentially reduce network’s learning of spatial priors Efficient Model for Road Segmentation

Methodology: Hyper-parameters weights initialization: Xavier (mean = 0; variance = 1/nin) momentum: 0.95 (effect of previous state of weights) C: 5 (empirical parameter; problem dependent) Ncl: 3 (output classes: road, fake road, non-road) learning rate: 1e-9 (size of jump towards gradient descent) learning rate decay policy: poly (reduction in jump size as learning progresses) power: 0.9 (decay factor of learning rate) Efficient Model for Road Segmentation

Methodology: Hyper-parameters https://github.com/BVLC/caffe/blob/master/src/caffe/proto/caffe.proto Efficient Model for Road Segmentation

Methodology: Architecture Efficient Model for Road Segmentation

Methodology: Architecture Efficient Model for Road Segmentation

Methodology: Training Refinement 1: conv layers fc-CONV1 + relu dropout fc-CONV2 + relu dropout SCORE DECONV_big_score crop SOFTMAX loss Efficient Model for Road Segmentation

Methodology: Training Refinement 2: conv layers sum crop conv_score_pool4 DEconv_score fc-CONV1 + relu dropout fc-CONV2 + relu dropout SCORE DECONV_big_score crop SOFTMAX loss Efficient Model for Road Segmentation

Methodology: Training Refinement 3: conv layers sum crop conv_score_pool4 DEconv_score fc-CONV1 + relu dropout fc-CONV2 + relu sum crop conv_score_pool3 DEconv_score dropout SCORE DECONV_big_score crop SOFTMAX loss Efficient Model for Road Segmentation

Methodology: Training Refinement 4: conv layers sum crop conv_score_pool4 DEconv_score fc-CONV1 + relu dropout fc-CONV2 + relu sum crop conv_score_pool3 DEconv_score dropout SCORE DECONV_big_score sum crop conv_score_pool2 DEconv_score crop SOFTMAX loss Efficient Model for Road Segmentation

Methodology: Training sum crop conv_score_pool4 DEconv_score Refinement 5: conv layers fc-CONV1 + relu sum crop conv_score_pool3 DEconv_score dropout fc-CONV2 + relu dropout sum crop conv_score_pool2 DEconv_score SCORE DECONV_big_score sum crop conv_score_pool1 DEconv_score crop SOFTMAX loss Efficient Model for Road Segmentation

Methodology: Training sum crop conv_score_pool4 DEconv_score + ReLU Refinement 6: DROPOUT conv layers fc-CONV1 + relu sum crop conv_score_pool3 DEconv_score + ReLU dropout fc-CONV2 + relu dropout sum crop conv_score_pool2 DEconv_score + ReLU SCORE DECONV_big_score sum crop conv_score_pool1 DEconv_score + ReLU crop SOFTMAX loss Efficient Model for Road Segmentation

Methodology: Training sum crop conv_score_pool4 DEconv_score + ReLU Refinement 7: DROPOUT conv layers fc-CONV1 + relu sum crop conv_score_pool3 DEconv_score + ReLU dropout fc-CONV2 + relu dropout sum crop conv_score_pool2 DEconv_score + ReLU SCORE FINAL_score DECONV_big_score sum crop conv_score_pool1 DEconv_score + ReLU crop SOFTMAX loss Efficient Model for Road Segmentation

Experiment: KITTI Road/Lane Dataset Training: 95 Testing: 96 Training: 96 Testing: 94 Training: 98 Testing: 100 Efficient Model for Road Segmentation

Experiment: KITTI Road/Lane Dataset Training: 95 Testing: 96 Training: 96 Testing: 94 Training: 98 Testing: 100 Efficient Model for Road Segmentation

Experiment: KITTI Road/Lane Dataset Training: 95 Testing: 96 Training: 96 Testing: 94 Training: 98 Testing: 100 Efficient Model for Road Segmentation

Experiment: KITTI Road/Lane Dataset Training: 95 Testing: 96 Training: 96 Testing: 94 Training: 98 Testing: 100 Efficient Model for Road Segmentation

Experiment: KITTI Road/Lane Dataset Training: 95 Testing: 96 Training: 96 Testing: 94 Training: 98 Testing: 100 Efficient Model for Road Segmentation

Experiment: KITTI Road/Lane Dataset Efficient Model for Road Segmentation

Experiment: Performance (300x300 image) (Titan X GPU) Efficient Model for Road Segmentation

Experiment: Impact of Parameter Reduction and New Refinement - performed on PASCAL Parts dataset (due to restriction on number of submissions to KITTI) Efficient Model for Road Segmentation

Conclusion changing filter size of fully convolutional layers dramatically improves performance, as long as new parameters are introduced elsewhere (scalar “C” increased filter space of intermediate up-convolutional layers) to preserve network accuracy training up-convolutional networks can be painful Efficient Model for Road Segmentation