Huazhong University of Science and Technology

Slides:

Advertisements

Similar presentations

Foreground Focus: Finding Meaningful Features in Unlabeled Images Yong Jae Lee and Kristen Grauman University of Texas at Austin.

Advertisements

Ľubor Ladický1 Phil Torr2 Andrew Zisserman1

Many slides based on P. FelzenszwalbP. Felzenszwalb General object detection with deformable part-based models.

Large-Scale Object Recognition with Weak Supervision

More sliding window detection: Discriminative part-based models Many slides based on P. FelzenszwalbP. Felzenszwalb.

Quantifying and Transferring Contextual Information in Object Detection Professor: S. J. Wang Student : Y. S. Wang 1.

R-CNN By Zhang Liliang.

Spatial Pyramid Pooling in Deep Convolutional

From R-CNN to Fast R-CNN

Generic object detection with deformable part-based models

Hands segmentation Pat Jangyodsuk. Motivation Alternative approach of finding hands Instead of finding bounding box, classify each pixel whether they’re.

Detection, Segmentation and Fine-grained Localization

Learning Collections of Parts for Object Recognition and Transfer Learning University of Illinois at Urbana- Champaign.

Efficient Region Search for Object Detection Sudheendra Vijayanarasimhan and Kristen Grauman Department of Computer Science, University of Texas at Austin.

Object Detection with Discriminatively Trained Part Based Models

BING: Binarized Normed Gradients for Objectness Estimation at 300fps

Efficient Subwindow Search: A Branch and Bound Framework for Object Localization ‘PAMI09 Beyond Sliding Windows: Object Localization by Efficient Subwindow.

Tell Me What You See and I will Show You Where It Is Jia Xu 1 Alexander G. Schwing 2 Raquel Urtasun 2,3 1 University of Wisconsin-Madison 2 University.

Recognition Using Visual Phrases

Feedforward semantic segmentation with zoom-out features

Object Recognition as Ranking Holistic Figure-Ground Hypotheses Fuxin Li and Joao Carreira and Cristian Sminchisescu 1.

BOOTSTRAPPING INFORMATION EXTRACTION FROM SEMI-STRUCTURED WEB PAGES Andrew Carson and Charles Schafer.

Rich feature hierarchies for accurate object detection and semantic segmentation 2014 IEEE Conference on Computer Vision and Pattern Recognition Ross Girshick,

Strong Supervision from Weak Annotation: Interactive Training of Deformable Part Models S. Branson, P. Perona, S. Belongie.

Sparse Coding: A Deep Learning using Unlabeled Data for High - Level Representation Dr.G.M.Nasira R. Vidya R. P. Jaia Priyankka.

When deep learning meets object detection: Introduction to two technologies: SSD and YOLO Wenchi Ma.

Recent developments in object detection

Scalable Person Re-identification on Supervised Smoothed Manifold

Hybrid Deep Learning for Reflectance Confocal Microscopy Skin Images

CNN-RNN: A Uniﬁed Framework for Multi-label Image Classiﬁcation

Learning to Compare Image Patches via Convolutional Neural Networks

Object Detection based on Segment Masks

Compact Bilinear Pooling

Object detection with deformable part-based models

Sofus A. Macskassy Fetch Technologies

Krishna Kumar Singh, Yong Jae Lee University of California, Davis

Web News Sentence Searching Using Linguistic Graph Similarity

Object Localization Goal: detect the location of an object within an image Fully supervised: Training data labeled with object category and ground truth.

ICCV Hierarchical Part Matching for Fine-Grained Image Classification

CS6890 Deep Learning Weizhen Cai

R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.

Image Question Answering

Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science

Object detection.

Accounting for the relative importance of objects in image retrieval

Normalized Cut Loss for Weakly-supervised CNN Segmentation

Object-Graphs for Context-Aware Category Discovery

Computer Vision James Hays

Graph Based Multi-Modality Learning

Convolutional Neural Networks for Visual Tracking

Multiple Instance Learning: applications to computer vision

Vessel Extraction in X-Ray Angiograms Using Deep Learning

Find API Usage Patterns

Object Detection + Deep Learning

On-going research on Object Detection *Some modification after seminar

Fine-Grained Visual Categorization

CornerNet: Detecting Objects as Paired Keypoints

Lecture: Deep Convolutional Neural Networks

Outline Background Motivation Proposed Model Experimental Results

Deep Robust Unsupervised Multi-Modal Network

Actively Learning Ontology Matching via User Interaction

Department of Computer Science Ben-Gurion University of the Negev

Task Fashion Landmark Detection. Task Fashion Landmark Detection.

Semantic Segmentation

Object Detection Implementations

Weak-supervision based Multi-Object Tracking

Multi-Target Detection and Tracking of UAVs from a UAV

Volodymyr Bobyr Supervised by Aayushjungbahadur Rana

Introduction Face detection and alignment are essential to many applications such as face recognition, facial expression recognition, age identification,

Van-Thanh Hoang May 11, 2019 Improving Object Localization with Fitness NMS and Bounded IoU Loss Lachlan Tychsen-Smith, Lars.

Presentation transcript:

Huazhong University of Science and Technology Multiple Instance Detection Network with Online Instance Classifier Refinement Peng Tang pengtang@hust.edu.cn Huazhong University of Science and Technology

Weakly-supervised visual learning (WSVL) Weakly-supervised visual learning is a new trend in CVPR Search keyword “weakly supervised” (14 papers), “weakly-supervised” (5 papers), “multi-instance” (1 paper), and “multiple instance” (3 papers), 23/783 papers in total http://cvpr2017.thecvf.com/program/main_conference Huazhong University of Science and Technology

WSVL avoids the expensive human annotations Tasks Training info Testing output Weakly-supervised object detection (WSOD) Image-level Bounding box Weakly-supervised sematic segmentation Pixel-level Weakly-supervised instance segmentation Image-level/bounding box Instance pixel-level Semi-supervised object detection/segmentation Partial of fully-labeled data Bounding box/pixel-level Huazhong University of Science and Technology

WSVL avoids the expensive human annotations Tasks Training info Testing output Weakly-supervised object detection (WSOD) Image-level Bounding box Weakly-supervised sematic segmentation Pixel-level Weakly-supervised instance segmentation Image-level/bounding box Instance pixel-level Semi-supervised object detection/segmentation Partial of fully-labeled data Bounding box/pixel-level Huazhong University of Science and Technology

How to do WSOD Huazhong University of Science and Technology

How to do WSOD Possible solutions to this problem, clustering based, matching based, co-segmentation based, topic model based, multi-instance learning based method. Huazhong University of Science and Technology

Solving this problem by multiple instance learning How to do WSOD Possible solutions to this problem, clustering based, matching based, co-segmentation based, topic model based, multi-instance learning based method. Solving this problem by multiple instance learning Image as bag, since image label is given Proposals (Selective Search, EdgeBox, Bing) as instances Proposal descriptors: Deep CNN Features, Fisher Vectors Number of proposals: ~2k (SS), ~4k (EB) Huazhong University of Science and Technology

What is the core problem in WSOD Is it a bird? The answers are YES! Huazhong University of Science and Technology

What is the core problem in WSOD Is it a bird? The answers are YES! However, only some of them are correct detection results (IoU>0.5) Huazhong University of Science and Technology

What is the core problem in WSOD Is it a bird? The answers are YES! However, only some of them are correct detection results (IoU>0.5) Huazhong University of Science and Technology

What is the core problem in WSOD Is it a bird? ambiguity The answers are YES! However, only some of them are correct detection results (IoU>0.5) Huazhong University of Science and Technology

What is the core problem in WSOD Previous methods tend to localize parts of objects instead of whole objects. Result of MIDN/WSDDN [4] Huazhong University of Science and Technology

Multiple Instance Detection Network with Online Instance Classifier Refinement Huazhong University of Science and Technology

Result of MIDN/WSDDN [4] Motivation Proposals having high spatial overlaps with detected parts may cover the whole object, or at least contain larger portion of the object. Result of MIDN/WSDDN [4] Huazhong University of Science and Technology

Result of MIDN/WSDDN [4] Motivation Propagating the scores to the highly overlapped proposals to alleviate the problem cased by ambiguity Result of MIDN/WSDDN [4] Result of OICR Huazhong University of Science and Technology

The multi-instance detection network (MIDN) Huazhong University of Science and Technology

The multi-instance detection network (MIDN) The basic network of WSDDN [4] by H. Bilen and A. Vedaldi Single network, end to end training Huazhong University of Science and Technology

The multi-instance detection network (MIDN) Huazhong University of Science and Technology

The multi-instance detection network (MIDN) Huazhong University of Science and Technology

The network for OICR Huazhong University of Science and Technology

The network for OICR Additional blocks (instance classifiers) for score propagation In-network supervision Huazhong University of Science and Technology

Effective online training/refinement Huazhong University of Science and Technology

Effective online training/refinement The top scoring proposal can always detect at least parts of objects. Huazhong University of Science and Technology

Effective online training/refinement The top scoring proposal can always detect at least parts of objects. Proposals having high spatial overlaps with detected parts may cover larger portion of the object. Huazhong University of Science and Technology

Effective online training/refinement The top scoring proposal can always detect at least parts of objects. Proposals having high spatial overlaps with detected parts may cover larger portion of the object. Proposals with high spatial overlap could share similar label information. Huazhong University of Science and Technology

Effective online training/refinement Huazhong University of Science and Technology

Effective online training/refinement The loss weight controls the learning process. Huazhong University of Science and Technology

Experimental Results Huazhong University of Science and Technology

Experimental Results The influence of refinement times and different refinement strategies Huazhong University of Science and Technology

Experimental Results Detection results (mAP) on VOC 2007 test set Huazhong University of Science and Technology

Experimental Results Huazhong University of Science and Technology

Advantages Conclusion Our method can improve the detection results a lot through our OICR strategy, especially for rigid objects. The network can be trained in a very efficiently (online) way. Huazhong University of Science and Technology

Advantages Limitations Conclusion Our method can improve the detection results a lot through our OICR strategy, especially for rigid objects. The network can be trained in a very efficiently (online) way. Limitations The performance is poor for non-rigid objects, as these objects are always with great deformation and their representative parts are with much less deformation. Huazhong University of Science and Technology

Thank you! Preprint is available at https://arxiv.org/abs/1704.00138. Codes are available at https://github.com/ppengtang/oicr. Huazhong University of Science and Technology