Edge boxes: Locating object proposals from edges

Slides:



Advertisements
Similar presentations
BRISK (Presented by Josh Gleason)
Advertisements

Ming-Ming Cheng 1 Ziming Zhang 2 Wen-Yan Lin 3 Philip H. S. Torr 1 1 Oxford University, 2 Boston University 3 Brookes Vision Group Training a generic objectness.
MIT CSAIL Vision interfaces Approximate Correspondences in High Dimensions Kristen Grauman* Trevor Darrell MIT CSAIL (*) UT Austin…
MPEG-4 Objective Standardize algorithms for audiovisual coding in multimedia applications allowing for Interactivity High compression Scalability of audio.
Interest points CSE P 576 Larry Zitnick Many slides courtesy of Steve Seitz.
Real-time Embedded Face Recognition for Smart Home Fei Zuo, Student Member, IEEE, Peter H. N. de With, Senior Member, IEEE.
1 Image Recognition - I. Global appearance patterns Slides by K. Grauman, B. Leibe.
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
Distinctive image features from scale-invariant keypoints. David G. Lowe, Int. Journal of Computer Vision, 60, 2 (2004), pp Presented by: Shalomi.
1 Accurate Object Detection with Joint Classification- Regression Random Forests Presenter ByungIn Yoo CS688/WST665.
R-CNN By Zhang Liliang.
On the Object Proposal Presented by Yao Lu
LOCUS Demo Stefan Zickler. Two “different” classes Class “Car Side Views” Class “Car Rears”
Learning and Recognizing Activities in Streams of Video Dinesh Govindaraju.
Computer Vision - A Modern Approach Set: Segmentation Slides by D.A. Forsyth Segmentation and Grouping Motivation: not information is evidence Obtain a.
Multiple Organ detection in CT Volumes - Week 2 Daniel Donenfeld.
From Edges to Objects Piotr Dollár and Larry Zitnick.
A Scale and Rotation Invariant Approach to Tracking Human Body Part Regions in Videos Yihang BoHao Jiang Institute of Automation, CAS Boston College.
New Segmentation Methods Advisor : 丁建均 Jian-Jiun Ding Presenter : 蔡佳豪 Chia-Hao Tsai Date: Digital Image and Signal Processing Lab Graduate Institute.
“Secret” of Object Detection Zheng Wu (Summer intern in MSRNE) Sep. 3, 2010 Joint work with Ce Liu (MSRNE) William T. Freeman (MIT) Adam Kalai (MSRNE)
Window-based models for generic object detection Mei-Chen Yeh 04/24/2012.
Lecture 29: Face Detection Revisited CS4670 / 5670: Computer Vision Noah Snavely.
Object Detection with Discriminatively Trained Part Based Models
Lecture 6: Edge Detection CAP 5415: Computer Vision Fall 2008.
Structured Forests for Fast Edge Detection
BING: Binarized Normed Gradients for Objectness Estimation at 300fps
Data Extraction using Image Similarity CIS 601 Image Processing Ajay Kumar Yadav.
CS654: Digital Image Analysis Lecture 30: Clustering based Segmentation Slides are adapted from:
Modern Boundary Detection II Computer Vision CS 143, Brown James Hays Many slides Michael Maire, Jitendra Malek Szeliski 4.2.
Expectation-Maximization (EM) Case Studies
CVPR2013 Poster Detecting and Naming Actors in Movies using Generative Appearance Models.
Category Independent Region Proposals Ian Endres and Derek Hoiem University of Illinois at Urbana-Champaign.
Lecture 08 27/12/2011 Shai Avidan הבהרה: החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.
Recognition Using Visual Phrases
Text From Corners: A Novel Approach to Detect Text and Caption in Videos Xu Zhao, Kai-Hsiang Lin, Yun Fu, Member, IEEE, Yuxiao Hu, Member, IEEE, Yuncai.
Regionlets for Generic Object Detection IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 37, NO. 10, OCTOBER 2015 Xiaoyu Wang, Ming.
Lecture 30: Segmentation CS4670 / 5670: Computer Vision Noah Snavely From Sandlot ScienceSandlot Science.
Instructor: Mircea Nicolescu Lecture 10 CS 485 / 685 Computer Vision.
Rich feature hierarchies for accurate object detection and semantic segmentation 2014 IEEE Conference on Computer Vision and Pattern Recognition Ross Girshick,
1 Shape Descriptors for Maximally Stable Extremal Regions Per-Erik Forss´en and David G. Lowe Department of Computer Science University of British Columbia.
Spatial Localization and Detection
April 21, 2016Introduction to Artificial Intelligence Lecture 22: Computer Vision II 1 Canny Edge Detector The Canny edge detector is a good approximation.
Winter in Kraków photographed by Marcin Ryczek
When deep learning meets object detection: Introduction to two technologies: SSD and YOLO Wenchi Ma.
Recent developments in object detection
CS 4501: Introduction to Computer Vision Object Localization, Detection, Semantic Segmentation Connelly Barnes Some slides from Fei-Fei Li / Andrej Karpathy.
A. M. R. R. Bandara & L. Ranathunga
Object Detection based on Segment Masks
HFS: Hierarchical Feature Selection for Efficient Image Segmentation
CS 4501: Introduction to Computer Vision Sparse Feature Detectors: Harris Corner, Difference of Gaussian Connelly Barnes Slides from Jason Lawrence, Fei.
TP12 - Local features: detection and description
A Forest of Sensors: Using adaptive tracking to classify and monitor activities in a site Eric Grimson AI Lab, Massachusetts Institute of Technology
Mean Shift Segmentation
Huazhong University of Science and Technology
R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.
Introduction of Pedestrian Detection
Vehicle Segmentation and Tracking in the Presence of Occlusions
Text Detection in Images and Video
Object Detection + Deep Learning
On-going research on Object Detection *Some modification after seminar
(Hopefully) Real-time Multi Object Tracking
Progress report 2019/1/14 PHHung.
Local features and image matching
Related Work in Camera Network Tracking
Lecture 29: Face Detection Revisited
Saliency Optimization from Robust Background Detection
Computer and Robot Vision I
Winter in Kraków photographed by Marcin Ryczek
Initial Progress Report
Presentation transcript:

Edge boxes: Locating object proposals from edges 2014. 9. 23. Mooyeol Baek Zitnick, C. Lawrence, and Piotr Dollár. "Edge boxes: Locating object proposals from edges." Computer Vision–ECCV 2014. Springer International Publishing, 2014. 391-405.

Object proposals Finds general candidates of object efficiently 2014-09-23 CV lab. seminar

Low level components for object proposal Segmentation for detection Superpixel Edge map Hard to find Whole-video processing은 영상이 길면 메모리를 너무 많이 필요로 한다. 그래서 긴 비디오에서는 차라리 frame-by-frame processing을 이용하기도 한다. (Lee, Y.J., Kim, J., Grauman, K.: Key-segments for video object segmentation. In: ICCV (2011)) Dense Not robust High order information Fast to calculate Sparse 2014-09-23 CV lab. seminar

Key idea Propose objectness score that counts the number of contours wholly enclosed by a bounding box 2014-09-23 CV lab. seminar

Model outline Extract edge map Edge set clustering Find object proposals 2014-09-23 CV lab. seminar

Structured edge prediction[ICCV13Dollar] [ICCV13Dollar] Dollár, Piotr, and C. Lawrence Zitnick. "Structured forests for fast edge detection." Computer Vision (ICCV), 2013 IEEE International Conference on. IEEE, 2013. 2014-09-23 CV lab. seminar

Edge groups and affinities Clustering edge into edge groups Affinity between two edge groups 𝑎 𝑠 𝑖, , 𝑠 𝑗 =0 if two groups are separated by more than two pixels. Otherwise, Markov assumption 직전 time slice의 subsequence와 segmentation result만이 영향을 미친다. 이 식의 계산에 dynamic programming을 사용할 수 없다. Hierarchical video segmentation에서는 보통 explicit energy function이 없고, S_i의 space가 너무 크기 때문이다. Strong Markov approximation 각 시점의 segmentation result는 과거의 segmentation에 independent하다. 2014-09-23 CV lab. seminar

Bounding box scoring Enclosedness coefficient 𝑤 𝑏 𝑠 𝑖 =0 𝑖𝑓 𝑠 𝑖 𝑖𝑠 𝑜𝑢𝑡 𝑜𝑓 𝑡ℎ𝑒 𝑏𝑜𝑥 𝑤 𝑏 𝑠 𝑖 =0 𝑖𝑓 𝑠 𝑖 𝑖𝑠 𝑜𝑛 𝑡ℎ𝑒 𝑏𝑜𝑢𝑛𝑑𝑎𝑟𝑦 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒, CV lab. seminar 2014-09-23

Bounding box scoring Bounding box scoring Reducing effects of edges near the center Using an integral image to speed computation 2014-09-23 CV lab. seminar

Finding intersecting edge groups Data structures for finding 𝑆 𝑏 efficiently 𝐿 𝑟 =[0, 4, 0, 2, 0, 3, 0, 7, 1, 0, 1, 0] 𝐾 𝑟 =[1, 1, 1, 2, 2, 3, 3,4, 4, 5, 5, …10, 11, 11, 12, 12] 𝑠 4 𝑠 2 𝑠 3 𝑠 7 𝑠 1 𝑦=𝑟 2014-09-23 CV lab. seminar

Search strategy Explicit control over diversity versus accuracy Desired Intersection over Union (IoU): 𝛿 Initial step size: 𝛼 (𝛿∝𝛼) Scale (==box area): 𝜎 ~ 𝑓𝑢𝑙𝑙 𝑖𝑚𝑎𝑔𝑒 (𝜎=1000𝑝𝑥) Aspect ratio: 1/𝜏 ~ 𝜏 (𝜏=3) Calculate score -> ½ step size -> Thresholding -> Calculate score -> ½ step size -> Thresholding -> … Non-maximal suppression 2014-09-23 CV lab. seminar

Benchmark[arXiv2014Hosang] Quality over Pascal VOC 2007 validation set Recall at IoU above 0.5 versus # of proposed windows Recall versus IoU threshold (for 1000 proposals per image) Bing CPMC EdgeBoxes Endres MCG Objectness Rahtu Rand.Prim Ranta.2014 Sel.Search Gaussian Sliding window Superpixels Uniform area under the curve (avg # of windows per image) [arXiv2014Hosang] Hosang, Jan, Rodrigo Benenson, and Bernt Schiele. "How good are detection proposals, really?." arXiv preprint arXiv:1406.6962 (2014). 2014-09-23 CV lab. seminar

Benchmark[arXiv2014Hosang] Quality over ImageNet 2013 validation set Recall at IoU above 0.5 versus # of proposed windows Recall versus IoU threshold (for 1000 proposals per image) Bing EdgeBoxes Endres MCG Rand.Prim Sel.Search Gaussian Sliding window Superpixels [arXiv2014Hosang] Hosang, Jan, Rodrigo Benenson, and Bernt Schiele. "How good are detection proposals, really?." arXiv preprint arXiv:1406.6962 (2014). 2014-09-23 CV lab. seminar

Comparison table[arXiv2014Hosang] 2014-09-23 CV lab. seminar

Thank you! 2014-09-23 CV lab. seminar