Annual Progress Overview of Edge Detection
Ming-Ming Cheng, Media Computing Lab, Nankai University
Email: cmm@nankai.edu.cn  URL: http://mmcheng.net/
Early pioneering methods
Locate sharp changes in the intensity function: Sobel, Canny, Laplacian, …
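For concreteness, a minimal OpenCV sketch of these intensity-gradient operators; the input path, kernel sizes, and thresholds below are illustrative assumptions rather than values from the slides.

```python
import cv2
import numpy as np

img = cv2.imread("input.jpg", cv2.IMREAD_GRAYSCALE)  # hypothetical input path

# Sobel: first-order derivatives along x and y, combined into a gradient magnitude.
gx = cv2.Sobel(img, cv2.CV_64F, 1, 0, ksize=3)
gy = cv2.Sobel(img, cv2.CV_64F, 0, 1, ksize=3)
magnitude = np.sqrt(gx ** 2 + gy ** 2)

# Canny: gradient estimation + non-maximum suppression + hysteresis thresholding.
edges = cv2.Canny(img, threshold1=100, threshold2=200)

# Laplacian: second-order derivative; zero crossings mark intensity changes.
lap = cv2.Laplacian(img, cv2.CV_64F, ksize=3)
```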
Data-driven methods
Learn the probability distributions of edge cues; Konishi et al. [1] proposed the first data-driven method.
Low-level cues such as color, intensity, gradient, texture, etc.; employ various classification paradigms.
Popular methods: Pb, gPb, Structured Edges. Benchmark: Berkeley Segmentation Dataset.
[1] Konishi S, Yuille A L, Coughlan J M, et al. Statistical edge detection: Learning and evaluating edge cues[J]. IEEE TPAMI, 2003, 25(1): 57-74.
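As a rough illustration of this statistical idea (not Konishi et al.'s exact formulation), one can estimate the distribution of a low-level cue separately for on-edge and off-edge pixels and score new pixels by the log-likelihood ratio; the cue choice, bin count, and synthetic placeholder data below are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
# Placeholder training data: a normalized gradient-magnitude map and a binary
# ground-truth edge mask (in practice these come from annotated datasets).
grad_mag = rng.random((240, 320))
edge_gt = rng.random((240, 320)) < 0.05

def fit_histogram(values, bins):
    """Estimate a discretized probability density of the cue values."""
    hist, _ = np.histogram(values, bins=bins, density=True)
    return hist + 1e-8                                   # avoid log(0)

bins = np.linspace(0.0, 1.0, 33)                         # 32 bins over the cue
p_on = fit_histogram(grad_mag[edge_gt], bins)            # P(cue | edge)
p_off = fit_histogram(grad_mag[~edge_gt], bins)          # P(cue | no edge)

# Score a test image's cue map by the per-pixel log-likelihood ratio.
grad_mag_test = rng.random((240, 320))
idx = np.clip(np.digitize(grad_mag_test, bins) - 1, 0, len(p_on) - 1)
edge_score = np.log(p_on[idx]) - np.log(p_off[idx])
```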
45 years of boundary detection (source: Arbelaez)
Limitations of previous methods
Semantically meaningful edge detection requires object-level (high-level) information.
Recent CNN-based methods: N4-Fields (ACCV 2014), DeepEdge (CVPR 2015), DeepContour (CVPR 2015), HFL (ICCV 2015), HED (ICCV 2015), RCF (CVPR 2017)
N4-Fields: neural network nearest neighbor fields. ODS F-measure on BSDS500: 0.75.
Ganin Y, Lempitsky V. N4-Fields: Neural network nearest neighbor fields for image transforms[C]//ACCV. Springer International Publishing, 2014: 536-551.
DeepEdge
Candidate contour points from the Canny edge detector; patches at 4 scales run through the CNN; two branches for classification & regression.
They run the Canny edge detector to extract candidate contour points. Around each candidate point, they extract patches at four different scales and simultaneously run them through the KNet. The convolutional layers are connected to two separately trained network branches: the first branch is trained for classification, while the second is trained as a regressor. At test time, the scalar outputs of these two sub-networks are averaged to produce the final score.
Bertasius G, Shi J, Torresani L. DeepEdge: A multi-scale bifurcated deep network for top-down contour detection[C]//CVPR. 2015: 4380-4389.
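A hedged PyTorch sketch of this two-branch scoring of multi-scale patches; the tiny backbone, feature sizes, and the pooling over scales are placeholders and do not reproduce the paper's KNet architecture.

```python
import torch
import torch.nn as nn

class TwoBranchScorer(nn.Module):
    """Scores patches cropped at several scales around a candidate contour point."""
    def __init__(self, feat_dim=256):
        super().__init__()
        self.backbone = nn.Sequential(                 # stand-in for the shared conv layers
            nn.Conv2d(3, 64, 5, stride=2, padding=2), nn.ReLU(),
            nn.Conv2d(64, feat_dim, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten())
        self.cls = nn.Sequential(nn.Linear(feat_dim, 1), nn.Sigmoid())  # edge vs. non-edge
        self.reg = nn.Linear(feat_dim, 1)                               # edge-strength regressor

    def forward(self, patches):
        # patches: (N points, S scales, 3, H, W), all scales resized to a common H x W.
        n, s = patches.shape[:2]
        feats = self.backbone(patches.flatten(0, 1)).view(n, s, -1).mean(dim=1)
        return 0.5 * (self.cls(feats) + self.reg(feats))  # average the two branch scores

# Usage sketch: run Canny, crop patches at 4 scales around each candidate point,
# then scores = TwoBranchScorer()(patch_batch).
```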
DeepContour
Multi-class shape classification; a new positive-sharing CNN loss for positive vs. negative samples. ODS F-measure on BSDS500: 0.76.
Shen W, Wang X, Wang Y, et al. DeepContour: A deep convolutional feature learned by positive-sharing loss for contour detection[C]//CVPR. 2015: 3982-3991.
HFL (High-for-Low)
Candidate contour points; the image is upsampled and fed through a VGG net that was pre-trained for a high-level task. ODS F-measure on BSDS500: 0.77.
Bertasius G, Shi J, Torresani L. High-for-low and low-for-high: Efficient boundary detection from deep object features and its applications to high-level vision[C]//ICCV. 2015: 504-512.
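A hedged sketch of this high-for-low feature extraction: the image is upsampled, passed through an ImageNet-pretrained VGG16, and deep features are read off at the candidate contour point locations; the upsampling factor, feature layer, and downstream linear classifier are assumptions.

```python
import torch
import torch.nn.functional as F
from torchvision.models import vgg16, VGG16_Weights

backbone = vgg16(weights=VGG16_Weights.IMAGENET1K_V1).features.eval()

def candidate_features(image, points):
    """image: (1, 3, H, W) float tensor; points: (N, 2) long tensor of (y, x) coords."""
    up = F.interpolate(image, scale_factor=2, mode="bilinear", align_corners=False)
    with torch.no_grad():
        fmap = backbone(up)                              # coarse deep feature map
    fmap = F.interpolate(fmap, size=image.shape[-2:], mode="bilinear",
                         align_corners=False)            # back to image resolution
    feats = fmap[0]                                      # (C, H, W)
    return feats[:, points[:, 0], points[:, 1]].t()      # (N, C) feature vectors

# A simple binary classifier (e.g., logistic regression) on these features then
# labels each candidate point as boundary vs. non-boundary.
```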
Results of making local decisions
HED: Holistically-Nested Edge Detection
Holistic: works in an image-to-image fashion.
Xie S, Tu Z. Holistically-nested edge detection[C]//ICCV. 2015: 1395-1403.
HED: Holistically-Nested Edge Detection
Holistic: works in an image-to-image fashion; multi-scale and multi-level feature learning.
Side-output layers are inserted after convolutional layers. Deep supervision is imposed at each side-output layer, guiding the side outputs towards edge predictions with the desired characteristics. One weighted-fusion layer is added to automatically learn how to combine outputs from multiple scales. The entire network is trained with multiple error-propagation paths (dashed lines in the figure).
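A hedged PyTorch sketch of this structure: 1x1 side-output convolutions after the five VGG16 stages, bilinear upsampling in place of the deconvolution-and-crop used in the paper, and a learned 1x1 fusion; the stage split indices follow the standard torchvision VGG16 layout and are assumptions here.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F
from torchvision.models import vgg16

class HEDSketch(nn.Module):
    def __init__(self):
        super().__init__()
        feats = vgg16(weights=None).features
        # Five stages ending at conv1_2, conv2_2, conv3_3, conv4_3, conv5_3.
        self.stages = nn.ModuleList([feats[:4], feats[4:9], feats[9:16],
                                     feats[16:23], feats[23:30]])
        self.sides = nn.ModuleList([nn.Conv2d(c, 1, 1) for c in (64, 128, 256, 512, 512)])
        self.fuse = nn.Conv2d(5, 1, 1)        # weighted fusion of the 5 side outputs

    def forward(self, x):
        h, w = x.shape[-2:]
        side_outs = []
        for stage, side in zip(self.stages, self.sides):
            x = stage(x)
            s = F.interpolate(side(x), size=(h, w), mode="bilinear", align_corners=False)
            side_outs.append(s)
        fused = self.fuse(torch.cat(side_outs, dim=1))
        # Deep supervision: at training time a (class-balanced) BCE loss is applied
        # to every side output and to the fused map.
        return [torch.sigmoid(s) for s in side_outs] + [torch.sigmoid(fused)]
```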
HED
Two examples illustrating how deep supervision helps side-output layers to produce multi-scale dense predictions. Note that in the left column, the side outputs become progressively coarser and more "global", while critical object boundaries are preserved.
Results
RCF: Richer Convolutional Features
We build a simple network based on VGG16 to produce side outputs from conv3_1, conv3_2, conv3_3, conv4_1, conv4_2, and conv4_3. One can clearly see that the convolutional features gradually become coarser, and that the intermediate layers conv3_1, conv3_2, conv4_1, and conv4_2 contain lots of useful fine details that do not appear in the other layers.
Motivation: intermediate layers contain lots of useful fine details.
Key idea: combine ALL the meaningful convolutional features.
Liu Y, Cheng M M, Hu X, et al. Richer Convolutional Features for Edge Detection[C]//CVPR. 2017.
RCF: Richer Convolutional Features
Use ALL convolutional layers, instead of only the last layer of each stage.
Utilizing the CNN features from all the conv layers can potentially be generalized to other vision tasks.
Liu Y, Cheng M M, Hu X, et al. Richer Convolutional Features for Edge Detection[C]//CVPR. 2017.
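A hedged sketch of combining all conv layers within one stage: each conv layer gets its own 1x1 reduction, the reductions are summed, and a 1x1 conv produces that stage's side output; the 21-channel reduction mirrors a common RCF setting but, like the rest of the layout, is an assumption here.

```python
import torch
import torch.nn as nn

class RCFStageSketch(nn.Module):
    """One VGG16-like stage whose side output uses features from ALL its conv layers."""
    def __init__(self, conv_layers):
        super().__init__()
        self.convs = nn.ModuleList(conv_layers)
        self.reduce = nn.ModuleList(
            [nn.Conv2d(c.out_channels, 21, 1) for c in conv_layers])
        self.side = nn.Conv2d(21, 1, 1)
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x):
        acc = 0
        for conv, red in zip(self.convs, self.reduce):
            x = self.relu(conv(x))
            acc = acc + red(x)          # every conv layer contributes, not only the last
        return self.side(acc), x        # stage side output + features for the next stage

# Usage sketch, e.g. for a conv3-like stage:
# stage3 = RCFStageSketch([nn.Conv2d(128, 256, 3, padding=1),
#                          nn.Conv2d(256, 256, 3, padding=1),
#                          nn.Conv2d(256, 256, 3, padding=1)])
```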
Multiscale Fusion
Explicit multi-scale processing.
Liu Y, Cheng M M, Hu X, et al. Richer Convolutional Features for Edge Detection[C]//CVPR. 2017.
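A hedged sketch of this multi-scale testing: run the detector on rescaled copies of the image, resize each edge map back to the original resolution, and average. Here `model` is assumed to map an image tensor to a single-channel edge map, and the scale set (0.5, 1.0, 1.5) matches the setting commonly reported for RCF but is configurable.

```python
import torch
import torch.nn.functional as F

def multiscale_edges(model, image, scales=(0.5, 1.0, 1.5)):
    """image: (1, 3, H, W) tensor; returns the averaged (1, 1, H, W) edge map."""
    h, w = image.shape[-2:]
    fused = torch.zeros(1, 1, h, w, device=image.device)
    with torch.no_grad():
        for s in scales:
            resized = F.interpolate(image, scale_factor=s, mode="bilinear",
                                    align_corners=False)
            edges = model(resized)                # assumed: single-channel edge map
            fused += F.interpolate(edges, size=(h, w), mode="bilinear",
                                   align_corners=False)
    return fused / len(scales)
```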
Samples
Some examples of RCF. From top to bottom: BSDS500, NYUD, Multicue-Boundary, and Multicue-Edge. From left to right: original image, ground truth, and RCF edge map (two groups per row).
Evaluation on BSDS500
Open source code: https://github.com/yun-liu/rcf