边缘检测年度进展概述 Ming-Ming Cheng Media Computing Lab, Nankai University

Slides:



Advertisements
Similar presentations
Patch to the Future: Unsupervised Visual Prediction
Advertisements

1 P. Arbelaez, M. Maire, C. Fowlkes, J. Malik. Contour Detection and Hierarchical image Segmentation. IEEE Trans. on PAMI, Student: Hsin-Min Cheng.
Large-Scale Object Recognition with Weak Supervision
Ghunhui Gu, Joseph J. Lim, Pablo Arbeláez, Jitendra Malik University of California at Berkeley Berkeley, CA
Presented by: Kamakhaya Argulewar Guided by: Prof. Shweta V. Jain
Geodesic Minimal Paths Vida Movahedi Elder Lab, January 2010.
Detection, Segmentation and Fine-grained Localization
Fully Convolutional Networks for Semantic Segmentation
Spatial Localization and Detection
Gaussian Conditional Random Field Network for Semantic Segmentation
When deep learning meets object detection: Introduction to two technologies: SSD and YOLO Wenchi Ma.
Deeply-Recursive Convolutional Network for Image Super-Resolution
Recent developments in object detection
Deep Learning for Dual-Energy X-Ray
Summary of “Efficient Deep Learning for Stereo Matching”
Object Detection based on Segment Masks
HFS: Hierarchical Feature Selection for Efficient Image Segmentation
Deep Learning Amin Sobhani.
ISBI Camelyon16 Challenge Prague, April 13, 2016
Data Mining, Neural Network and Genetic Programming
Convolutional Neural Fabrics by Shreyas Saxena, Jakob Verbeek
Data Driven Attributes for Action Detection
Krishna Kumar Singh, Yong Jae Lee University of California, Davis
Understanding and Predicting Image Memorability at a Large Scale
Announcements Project proposal due tomorrow
Regularizing Face Verification Nets To Discrete-Valued Pain Regression
Combining CNN with RNN for scene labeling (segmentation)
Compositional Human Pose Regression
Intelligent Information System Lab
Synthesis of X-ray Projections via Deep Learning
Supervised Training of Deep Networks
Efficient Deep Model for Monocular Road Segmentation
CS6890 Deep Learning Weizhen Cai
Deeply Supervised Salient Object Detection with Short Connections Qibin Hou, Ming-Ming Cheng, Xiaowei Hu, Ali Borji, Zhuowen Tu, Philip H. S. Torr Observations:
Object detection as supervised classification
R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.
Dynamic Routing Using Inter Capsule Routing Protocol Between Capsules
Adversarially Tuned Scene Generation
Enhanced-alignment Measure for Binary Foreground Map Evaluation
Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science
A Convolutional Neural Network Cascade For Face Detection
By: Kevin Yu Ph.D. in Computer Engineering
Zan Gao, Deyu Wang, Xiangnan He, Hua Zhang
Bird-species Recognition Using Convolutional Neural Network
Computer Vision James Hays
Image Classification.
Convolutional Neural Networks for Visual Tracking
Counting in Dense Crowds using Deep Learning
Object Detection + Deep Learning
Very Deep Convolutional Networks for Large-Scale Image Recognition
CornerNet: Detecting Objects as Paired Keypoints
Semantic segmentation
Lecture: Deep Convolutional Neural Networks
Outline Background Motivation Proposed Model Experimental Results
RCNN, Fast-RCNN, Faster-RCNN
Iterative Crowd Counting
Heterogeneous convolutional neural networks for visual recognition
Deep Learning Authors: Yann LeCun, Yoshua Bengio, Geoffrey Hinton
Attention for translation
Department of Computer Science Ben-Gurion University of the Negev
Human-object interaction
Deep Object Co-Segmentation
Word representations David Kauchak CS158 – Fall 2016.
Winter in Kraków photographed by Marcin Ryczek
VERY DEEP CONVOLUTIONAL NETWORKS FOR LARGE-SCALE IMAGE RECOGNITION
Semantic Segmentation
Week 3 Volodymyr Bobyr.
Volodymyr Bobyr Supervised by Aayushjungbahadur Rana
Directional Occlusion with Neural Network
Shengcong Chen, Changxing Ding, Minfeng Liu 2018
Presentation transcript:

边缘检测年度进展概述 Ming-Ming Cheng Media Computing Lab, Nankai University Email: cmm@nankai.edu.cn URL: http://mmcheng.net/

Early pioneering methods Locate sharp changes in the intensity function Sobel, Canny, Laplacian, …

Data driven methods Learning the probability distributions of features Konishi et al. [1] proposed the first data-driven method Low-level cues such as color, intensity, gradient, texture, etc. Employ various classification paradigm Popular method: Pb, gPb, StructuredEdge Berkeley segmentation Dataset [1] Konishi S, Yuille A L, Coughlan J M, et al. Statistical edge detection: Learning and evaluating edge cues[J]. IEEE TPAMI, 2003, 25(1): 57-74.

45 years of boundary detection Source: Arbelaez

Limitations of previous methods Semantically meaningful edge detection Requires object-level (high level) information

Recent CNN based methods N4-Fields, ACCV 2014 DeepEdge, CVPR 2015 DeepContour, CVPR 2015 HFL, ICCV 2015 HED, ICCV 2015 RCF, CVPR 2017

N4-Fields 0.75 Ganin Y, Lempitsky V. N4-fields: Neural network nearest neighbor fields for image transforms[C]//ACCV. Springer International Publishing, 2014: 536-551.

DeepEdge Candidate contour points  Canny edge Patches at 4 scales run through the CNN Two branches for: classification & regressor. They run the Canny edge detector to extract candidate contour points. Then, around each candidate point, they extract patches at four different scales and simultaneously run them through the KNet. They connect the convolutional layers to two separately-trained network branches. The first branch is trained for classification, while the second branch is trained as a regressor. At testing time, the scalar outputs from these two sub-networks are averaged to produce the final score. Bertasius G, Shi J, Torresani L. Deepedge: A multi-scale bifurcated deep network for top-down contour detection[C]//CVPR. 2015: 4380-4389.

DeepContour Multi-class shape classification New CNN loss for positive vs. negative 0.76 Shen W, Wang X, Wang Y, et al. Deepcontour: A deep convolutional feature learned by positive-sharing loss for contour detection[C]//CVPR. 2015: 3982-3991.

HFL Candidate contour points Unsample image to feed through VGG net, which was pre-trained for high level task 0.77 Bertasius G, Shi J, Torresani L. High-for-low and low-for-high: Efficient boundary detection from deep object features and its applications to high-level vision[C]//ICCV. 2015: 504-512.

Results of making local decisions

HED Holistically-Nested Edge Detection Holistic: image to image fashion Xie S, Tu Z. Holistically-nested edge detection[C]//ICCV. 2015: 1395-1403.

HED Holistically-Nested Edge Detection Holistic: image to image fashion Multi-scale and multi-level feature learning Side-output layers are inserted after convolutional layers. Deep supervision is imposed at each side-output layer, guiding the side-outputs towards edge predictions with the characteristics. One weighted-fusion layer is added to automatically learn how to combine outputs from multiple scales. The entire network is trained with multiple error propagation paths (dashed lines).

HED Two examples illustrating how deep supervision helps side-output layers to produce multi-scale dense predictions. Note that in the left column, the side outputs become progressively coarser and more “global”, while critical object boundaries are preserved. Illustration: how deep supervision helps side-output layers to produce multi-scale dense predictions.

Results

RCF: Richer Convolutional Features We build a simple network based on VGG16 to produce side outputs of conv3_1, conv3_2, conv3_3, conv4_1, conv4_2 and conv4_3. One can clearly see that convolutional features become coarser gradually, and the intermediate layers conv3_1, conv3_2, conv4_1, and conv4_2 contain lots of useful fine details that do not appear in other layers. Motivation: intermediate layers contain lots of useful fine details Key idea: combining ALL the meaningful convolutional features Liu Y, Cheng M M, Hu X, et al. Richer Convolutional Features for Edge Detection[C]//CVPR. 2017.

RCF: Richer Convolutional Features Use ALL convolutional layer, instead of last layer of each stage. Utilize the CNN features from all the conv layers can be potentially generalized to other vision tasks. Liu Y, Cheng M M, Hu X, et al. Richer Convolutional Features for Edge Detection[C]//CVPR. 2017.

Explicit multi-scale processing. Multiscale Fusion Explicit multi-scale processing. Liu Y, Cheng M M, Hu X, et al. Richer Convolutional Features for Edge Detection[C]//CVPR. 2017.

Samples image G-Truth results Some examples of RCF. From top to bottom: BSDS500, NYUD, Multicue-Boundary, and Multicue-Edge. From left to right: origin image, ground truth, RCF edge map, origin image, ground truth, and RCF edge map. image G-Truth results

Evaluation on BSDS500 Open Source Source code: https://github.com/yun-liu/rcf