Edge boxes: Locating object proposals from edges

Slides:

Advertisements

Similar presentations

BRISK (Presented by Josh Gleason)

Advertisements

Ming-Ming Cheng 1 Ziming Zhang 2 Wen-Yan Lin 3 Philip H. S. Torr 1 1 Oxford University, 2 Boston University 3 Brookes Vision Group Training a generic objectness.

MIT CSAIL Vision interfaces Approximate Correspondences in High Dimensions Kristen Grauman* Trevor Darrell MIT CSAIL (*) UT Austin…

MPEG-4 Objective Standardize algorithms for audiovisual coding in multimedia applications allowing for Interactivity High compression Scalability of audio.

Interest points CSE P 576 Larry Zitnick Many slides courtesy of Steve Seitz.

Real-time Embedded Face Recognition for Smart Home Fei Zuo, Student Member, IEEE, Peter H. N. de With, Senior Member, IEEE.

1 Image Recognition - I. Global appearance patterns Slides by K. Grauman, B. Leibe.

1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.

Distinctive image features from scale-invariant keypoints. David G. Lowe, Int. Journal of Computer Vision, 60, 2 (2004), pp Presented by: Shalomi.

1 Accurate Object Detection with Joint Classification- Regression Random Forests Presenter ByungIn Yoo CS688/WST665.

R-CNN By Zhang Liliang.

On the Object Proposal Presented by Yao Lu

LOCUS Demo Stefan Zickler. Two “different” classes Class “Car Side Views” Class “Car Rears”

Learning and Recognizing Activities in Streams of Video Dinesh Govindaraju.

Computer Vision - A Modern Approach Set: Segmentation Slides by D.A. Forsyth Segmentation and Grouping Motivation: not information is evidence Obtain a.

Multiple Organ detection in CT Volumes - Week 2 Daniel Donenfeld.

From Edges to Objects Piotr Dollár and Larry Zitnick.

A Scale and Rotation Invariant Approach to Tracking Human Body Part Regions in Videos Yihang BoHao Jiang Institute of Automation, CAS Boston College.

New Segmentation Methods Advisor : 丁建均 Jian-Jiun Ding Presenter : 蔡佳豪 Chia-Hao Tsai Date: Digital Image and Signal Processing Lab Graduate Institute.

“Secret” of Object Detection Zheng Wu (Summer intern in MSRNE) Sep. 3, 2010 Joint work with Ce Liu (MSRNE) William T. Freeman (MIT) Adam Kalai (MSRNE)

Window-based models for generic object detection Mei-Chen Yeh 04/24/2012.

Lecture 29: Face Detection Revisited CS4670 / 5670: Computer Vision Noah Snavely.

Object Detection with Discriminatively Trained Part Based Models

Lecture 6: Edge Detection CAP 5415: Computer Vision Fall 2008.

Structured Forests for Fast Edge Detection

BING: Binarized Normed Gradients for Objectness Estimation at 300fps

Data Extraction using Image Similarity CIS 601 Image Processing Ajay Kumar Yadav.

CS654: Digital Image Analysis Lecture 30: Clustering based Segmentation Slides are adapted from:

Modern Boundary Detection II Computer Vision CS 143, Brown James Hays Many slides Michael Maire, Jitendra Malek Szeliski 4.2.

Expectation-Maximization (EM) Case Studies

CVPR2013 Poster Detecting and Naming Actors in Movies using Generative Appearance Models.

Category Independent Region Proposals Ian Endres and Derek Hoiem University of Illinois at Urbana-Champaign.

Lecture 08 27/12/2011 Shai Avidan הבהרה: החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.

Recognition Using Visual Phrases

Text From Corners: A Novel Approach to Detect Text and Caption in Videos Xu Zhao, Kai-Hsiang Lin, Yun Fu, Member, IEEE, Yuxiao Hu, Member, IEEE, Yuncai.

Regionlets for Generic Object Detection IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 37, NO. 10, OCTOBER 2015 Xiaoyu Wang, Ming.

Lecture 30: Segmentation CS4670 / 5670: Computer Vision Noah Snavely From Sandlot ScienceSandlot Science.

Instructor: Mircea Nicolescu Lecture 10 CS 485 / 685 Computer Vision.

Rich feature hierarchies for accurate object detection and semantic segmentation 2014 IEEE Conference on Computer Vision and Pattern Recognition Ross Girshick,

1 Shape Descriptors for Maximally Stable Extremal Regions Per-Erik Forss´en and David G. Lowe Department of Computer Science University of British Columbia.

Spatial Localization and Detection

April 21, 2016Introduction to Artificial Intelligence Lecture 22: Computer Vision II 1 Canny Edge Detector The Canny edge detector is a good approximation.

Winter in Kraków photographed by Marcin Ryczek

When deep learning meets object detection: Introduction to two technologies: SSD and YOLO Wenchi Ma.

Recent developments in object detection

CS 4501: Introduction to Computer Vision Object Localization, Detection, Semantic Segmentation Connelly Barnes Some slides from Fei-Fei Li / Andrej Karpathy.

A. M. R. R. Bandara & L. Ranathunga

Object Detection based on Segment Masks

HFS: Hierarchical Feature Selection for Efficient Image Segmentation

CS 4501: Introduction to Computer Vision Sparse Feature Detectors: Harris Corner, Difference of Gaussian Connelly Barnes Slides from Jason Lawrence, Fei.

TP12 - Local features: detection and description

A Forest of Sensors: Using adaptive tracking to classify and monitor activities in a site Eric Grimson AI Lab, Massachusetts Institute of Technology

Mean Shift Segmentation

Huazhong University of Science and Technology

R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.

Introduction of Pedestrian Detection

Vehicle Segmentation and Tracking in the Presence of Occlusions

Text Detection in Images and Video

Object Detection + Deep Learning

On-going research on Object Detection *Some modification after seminar

(Hopefully) Real-time Multi Object Tracking

Progress report 2019/1/14 PHHung.

Local features and image matching

Related Work in Camera Network Tracking

Lecture 29: Face Detection Revisited

Saliency Optimization from Robust Background Detection

Computer and Robot Vision I

Winter in Kraków photographed by Marcin Ryczek

Initial Progress Report

Presentation transcript:

Edge boxes: Locating object proposals from edges 2014. 9. 23. Mooyeol Baek Zitnick, C. Lawrence, and Piotr Dollár. "Edge boxes: Locating object proposals from edges." Computer Vision–ECCV 2014. Springer International Publishing, 2014. 391-405.

Object proposals Finds general candidates of object efficiently 2014-09-23 CV lab. seminar

Low level components for object proposal Segmentation for detection Superpixel Edge map Hard to find Whole-video processing은 영상이 길면 메모리를 너무 많이 필요로 한다. 그래서 긴 비디오에서는 차라리 frame-by-frame processing을 이용하기도 한다. (Lee, Y.J., Kim, J., Grauman, K.: Key-segments for video object segmentation. In: ICCV (2011)) Dense Not robust High order information Fast to calculate Sparse 2014-09-23 CV lab. seminar

Key idea Propose objectness score that counts the number of contours wholly enclosed by a bounding box 2014-09-23 CV lab. seminar

Model outline Extract edge map Edge set clustering Find object proposals 2014-09-23 CV lab. seminar

Structured edge prediction[ICCV13Dollar] [ICCV13Dollar] Dollár, Piotr, and C. Lawrence Zitnick. "Structured forests for fast edge detection." Computer Vision (ICCV), 2013 IEEE International Conference on. IEEE, 2013. 2014-09-23 CV lab. seminar

Edge groups and affinities Clustering edge into edge groups Affinity between two edge groups 𝑎 𝑠 𝑖, , 𝑠 𝑗 =0 if two groups are separated by more than two pixels. Otherwise, Markov assumption 직전 time slice의 subsequence와 segmentation result만이 영향을 미친다. 이 식의 계산에 dynamic programming을 사용할 수 없다. Hierarchical video segmentation에서는 보통 explicit energy function이 없고, S_i의 space가 너무 크기 때문이다. Strong Markov approximation 각 시점의 segmentation result는 과거의 segmentation에 independent하다. 2014-09-23 CV lab. seminar

Bounding box scoring Enclosedness coefficient 𝑤 𝑏 𝑠 𝑖 =0 𝑖𝑓 𝑠 𝑖 𝑖𝑠 𝑜𝑢𝑡 𝑜𝑓 𝑡ℎ𝑒 𝑏𝑜𝑥 𝑤 𝑏 𝑠 𝑖 =0 𝑖𝑓 𝑠 𝑖 𝑖𝑠 𝑜𝑛 𝑡ℎ𝑒 𝑏𝑜𝑢𝑛𝑑𝑎𝑟𝑦 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒, CV lab. seminar 2014-09-23

Bounding box scoring Bounding box scoring Reducing effects of edges near the center Using an integral image to speed computation 2014-09-23 CV lab. seminar

Finding intersecting edge groups Data structures for finding 𝑆 𝑏 efficiently 𝐿 𝑟 =[0, 4, 0, 2, 0, 3, 0, 7, 1, 0, 1, 0] 𝐾 𝑟 =[1, 1, 1, 2, 2, 3, 3,4, 4, 5, 5, …10, 11, 11, 12, 12] 𝑠 4 𝑠 2 𝑠 3 𝑠 7 𝑠 1 𝑦=𝑟 2014-09-23 CV lab. seminar

Search strategy Explicit control over diversity versus accuracy Desired Intersection over Union (IoU): 𝛿 Initial step size: 𝛼 (𝛿∝𝛼) Scale (==box area): 𝜎 ~ 𝑓𝑢𝑙𝑙 𝑖𝑚𝑎𝑔𝑒 (𝜎=1000𝑝𝑥) Aspect ratio: 1/𝜏 ~ 𝜏 (𝜏=3) Calculate score -> ½ step size -> Thresholding -> Calculate score -> ½ step size -> Thresholding -> … Non-maximal suppression 2014-09-23 CV lab. seminar

Benchmark[arXiv2014Hosang] Quality over Pascal VOC 2007 validation set Recall at IoU above 0.5 versus # of proposed windows Recall versus IoU threshold (for 1000 proposals per image) Bing CPMC EdgeBoxes Endres MCG Objectness Rahtu Rand.Prim Ranta.2014 Sel.Search Gaussian Sliding window Superpixels Uniform area under the curve (avg # of windows per image) [arXiv2014Hosang] Hosang, Jan, Rodrigo Benenson, and Bernt Schiele. "How good are detection proposals, really?." arXiv preprint arXiv:1406.6962 (2014). 2014-09-23 CV lab. seminar

Benchmark[arXiv2014Hosang] Quality over ImageNet 2013 validation set Recall at IoU above 0.5 versus # of proposed windows Recall versus IoU threshold (for 1000 proposals per image) Bing EdgeBoxes Endres MCG Rand.Prim Sel.Search Gaussian Sliding window Superpixels [arXiv2014Hosang] Hosang, Jan, Rodrigo Benenson, and Bernt Schiele. "How good are detection proposals, really?." arXiv preprint arXiv:1406.6962 (2014). 2014-09-23 CV lab. seminar

Comparison table[arXiv2014Hosang] 2014-09-23 CV lab. seminar

Thank you! 2014-09-23 CV lab. seminar