“The Truth About Cats And Dogs”

Slides:



Advertisements
Similar presentations
O BJ C UT M. Pawan Kumar Philip Torr Andrew Zisserman UNIVERSITY OF OXFORD.
Advertisements

Semantic Contours from Inverse Detectors Bharath Hariharan et.al. (ICCV-11)
Presenter: Duan Tran (Part of slides are from Pedro’s)
Learning Shared Body Plans Ian Endres University of Illinois work with Derek Hoiem, Vivek Srikumar and Ming-Wei Chang.
Articulated People Detection and Pose Estimation: Reshaping the Future
Diagnosing Error in Object Detectors Department of Computer Science University of Illinois at Urbana-Champaign (UIUC) Derek Hoiem Yodsawalai Chodpathumwan.
Three things everyone should know to improve object retrieval
On-the-fly Specific Person Retrieval University of Oxford 24 th May 2012 Omkar M. Parkhi, Andrea Vedaldi and Andrew Zisserman.
Recovering Human Body Configurations: Combining Segmentation and Recognition Greg Mori, Xiaofeng Ren, and Jitentendra Malik (UC Berkeley) Alexei A. Efros.
Carolina Galleguillos, Brian McFee, Serge Belongie, Gert Lanckriet Computer Science and Engineering Department Electrical and Computer Engineering Department.
Weiwei Zhang, Jian Sun, and Xiaoou Tang, Fellow, IEEE.
Ivan Laptev IRISA/INRIA, Rennes, France September 07, 2006 Boosted Histograms for Improved Object Detection.
Lecture 31: Modern object recognition
Many slides based on P. FelzenszwalbP. Felzenszwalb General object detection with deformable part-based models.
Learning to estimate human pose with data driven belief propagation Gang Hua, Ming-Hsuan Yang, Ying Wu CVPR 05.
GrabCut Interactive Image (and Stereo) Segmentation Carsten Rother Vladimir Kolmogorov Andrew Blake Antonio Criminisi Geoffrey Cross [based on Siggraph.
Stephen J. Guy 1. Photomontage Photomontage GrabCut – Interactive Foreground Extraction 1.
GrabCut Interactive Image (and Stereo) Segmentation Joon Jae Lee Keimyung University Welcome. I will present Grabcut – an Interactive tool for foreground.
Advisers: Prof. C.V. Jawahar Prof. A. P.Zisserman 3rd August 2011
Enhancing Exemplar SVMs using Part Level Transfer Regularization 1.
More sliding window detection: Discriminative part-based models Many slides based on P. FelzenszwalbP. Felzenszwalb.
Recognition using Regions CVPR Outline Introduction Overview of the Approach Experimental Results Conclusion.
1. Introduction Humanising GrabCut: Learning to segment humans using the Kinect Varun Gulshan, Victor Lempitksy and Andrew Zisserman Dept. of Engineering.
TextonBoost : Joint Appearance, Shape and Context Modeling for Multi-Class Object Recognition and Segmentation J. Shotton*, J. Winn†, C. Rother†, and A.
Object Recognizing We will discuss: Features Classifiers Example ‘winning’ system.
Learning to Segment from Diverse Data M. Pawan Kumar Daphne KollerHaithem TurkiDan Preston.
What, Where & How Many? Combining Object Detectors and CRFs
Lecture 29: Recent work in recognition CS4670: Computer Vision Noah Snavely.
Generic object detection with deformable part-based models
The Three R’s of Vision Jitendra Malik.
CS55 Tianfan Xue Adviser: Bo Zhang, Jianmin Li.
EADS DS / SDC LTIS Page 1 7 th CNES/DLR Workshop on Information Extraction and Scene Understanding for Meter Resolution Image – 29/03/07 - Oberpfaffenhofen.
Shape-Based Human Detection and Segmentation via Hierarchical Part- Template Matching Zhe Lin, Member, IEEE Larry S. Davis, Fellow, IEEE IEEE TRANSACTIONS.
Object Recognizing. Recognition -- topics Features Classifiers Example ‘winning’ system.
Computer Vision CS 776 Spring 2014 Recognition Machine Learning Prof. Alex Berg.
“Secret” of Object Detection Zheng Wu (Summer intern in MSRNE) Sep. 3, 2010 Joint work with Ce Liu (MSRNE) William T. Freeman (MIT) Adam Kalai (MSRNE)
Object Stereo- Joint Stereo Matching and Object Segmentation Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on Michael Bleyer Vienna.
Marco Pedersoli, Jordi Gonzàlez, Xu Hu, and Xavier Roca
Learning Collections of Parts for Object Recognition and Transfer Learning University of Illinois at Urbana- Champaign.
Reading Between The Lines: Object Localization Using Implicit Cues from Image Tags Sung Ju Hwang and Kristen Grauman University of Texas at Austin Jingnan.
Object Detection with Discriminatively Trained Part Based Models
Lecture 31: Modern recognition CS4670 / 5670: Computer Vision Noah Snavely.
Deformable Part Model Presenter : Liu Changyu Advisor : Prof. Alex Hauptmann Interest : Multimedia Analysis April 11 st, 2013.
Deformable Part Models (DPM) Felzenswalb, Girshick, McAllester & Ramanan (2010) Slides drawn from a tutorial By R. Girshick AP 12% 27% 36% 45% 49% 2005.
MSRI workshop, January 2005 Object Recognition Collected databases of objects on uniform background (no occlusions, no clutter) Mostly focus on viewpoint.
Training and Evaluating of Object Bank Models Presenter : Changyu Liu Advisor : Prof. Alex Interest : Multimedia Analysis May 16 th, 2013.
Object detection, deep learning, and R-CNNs
Layered Object Detection for Multi-Class Image Segmentation UC Irvine Yi Yang Sam Hallman Deva Ramanan Charless Fowlkes.
O BJ C UT M. Pawan Kumar Philip Torr Andrew Zisserman UNIVERSITY OF OXFORD.
CS 1699: Intro to Computer Vision Detection II: Deformable Part Models Prof. Adriana Kovashka University of Pittsburgh November 12, 2015.
Pictorial Structures and Distance Transforms Computer Vision CS 543 / ECE 549 University of Illinois Ian Endres 03/31/11.
Category Independent Region Proposals Ian Endres and Derek Hoiem University of Illinois at Urbana-Champaign.
Recognition Using Visual Phrases
Object Recognition as Ranking Holistic Figure-Ground Hypotheses Fuxin Li and Joao Carreira and Cristian Sminchisescu 1.
Object Recognition by Integrating Multiple Image Segmentations Caroline Pantofaru, Cordelia Schmid, Martial Hebert ECCV 2008 E.
Object Recognizing. Object Classes Individual Recognition.
More sliding window detection: Discriminative part-based models
Rich feature hierarchies for accurate object detection and semantic segmentation 2014 IEEE Conference on Computer Vision and Pattern Recognition Ross Girshick,
Object detection with deformable part-based models
Data Driven Attributes for Action Detection
Object detection, deep learning, and R-CNNs
Object Localization Goal: detect the location of an object within an image Fully supervised: Training data labeled with object category and ground truth.
R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.
Cheng-Ming Huang, Wen-Hung Liao Department of Computer Science
HOGgles Visualizing Object Detection Features
An HOG-LBP Human Detector with Partial Occlusion Handling
Brief Review of Recognition + Context
Object Classes Most recent work is at the object level We perceive the world in terms of objects, belonging to different classes. What are the differences.
Outline Background Motivation Proposed Model Experimental Results
Presentation transcript:

“The Truth About Cats And Dogs” Omkar M. Parkhi1, Andrea Vedaldi1, C.V. Jawahar2, A. P. Zisserman1 Visual Geometry Group, Oxford University

Object Category Recognition Popular in the community since long time. Several datasets such as Pascal VOC, Caltech, Imagenet have have been introduced. People have been working on categories such as Flowers, Cars person etc. In this work we work with animal categories: cats and Dogs

Why Cats and Dogs? Tough to detect in images Pascal VOC 2010 detection challenge Category AP% Aero plane 58.4 Bicycle 55.3 Bus 55.5 Cat 47.7 Dog 37.2

Why Cats and Dogs? Popular pet animals - always found in images and videos besides humans Google images have about 260 million cat and 168 million dog images indexed. About 65% of United States household have pets. 38 million households have cats 46 million households have dogs This popularity provides an opportunity to collect large amount of data for machine learning.

Why Cats and Dogs? Social networks exists for people having these pets. Petfinder.com a pet adoption website has 3 milion images of cats and dogs. Fun to work with..!

Why Cats and Dogs? Difficulty in automatic classification of cats and dogs images was exploited to build a security system for web services.

Challenges: Deformations Objects appearing in different shapes and sizes Body parts not always visible Hard to model the shape of the object.

Challenges: Occlusion Some portion of the body is covered by other objects Hard to fit a shape model Hard to get information from pixels.

Dataset Evaluation protocols Classification: Average Precision computed as area under the Precision Recall curve is used to evaluate performance. Detection: Recall curve is used to evaluate performance. Detections overlapping 50% with groundtruth are considered true positives. Segmentation: Ratio of intersection over union of ground truth with output segmentation is used to evaluate the performance.

Object Detection: State of the Art “Object Detection with Discriminatively Trained Part Based Models.” P. Felzenszwalb, R. Girshick, D. McAllester and D. Ramanan. In PAMI 2010 System represents objects using mixtures of deformable part models. System consists of combination of Strong low-level features based on histograms of oriented gradients (HOG). Efficient matching algorithms for deformable part-based models (pictorial structures). Discriminative learning with latent variables (latent SVM). Winner of PASCAL VOC 2007 Lifetime achievement award in PASCAL VOC 2010.

Extending Deformable Parts Model for Animal Detection Object Head Torso Legs Legs Representing objects by collection of parts

Object Detection: State of the Art Good overall performance but fails on animal categories. Outperformed by Bag of Words based detectors on animal categories. Can this method be improved to get the state of the art results?

Distinctive Parts Model Model head of the animal How well does it work? Method AP Max. Recall HoG 0.45 0.52 HoG+LBP 0.49 0.58 HoG+LBP (less strict) 0.61 0.79

Distinctive Parts Model With head detected what more can be done? Method AP Max. Recall FGMR Model 0.28 0.55 Regression 0.31 0.56 Can anything better be done?

Distinctive Parts Model Is it possible to take any clues from detected head and segment the whole object?

Interactive Segmentation GrabCut Introduced by Rother et al. in SIGGRAPH 2004 Iteratively minimizes Graph Cut energy function Energy Data Term Pair wise Term Data terms are taken as posterior probabilities from a GMM. GMMs are updated after every iteration.

Segmenting the object Selecting Seeds Some foreground and background pixel (seeds) need to be specified for GMM initialization. Rectangle from the head region is taken as foreground seed. Boundary pixels are used as background seeds. Background is added while some foreground is missing

Segmenting the object Berkeley Edges Introduced in 2002, Berkeley Edge Detector provides edge response by considering context from the images. Response of the edge detector used to model pair wise terms. Cut is encouraged at places where there is high edge response.

Segmenting the object Posterior Probabilities GMMs often un capable of modeling color variations. Foreground and Background color histograms computed on training images. Posteriors are computed using these histograms. Global posteriors are mixed with image specific ones to achieve better modeling. After Before

Distinctive Parts Model (Results) Method AP FGMR Model 0.28 Basic GrabCut 0.37 Adding Global Posteriors 0.41 Adding Berkeley Edges 0.46 Re ranking the detections 0.48 State of the Art in VOC 2010 0.47 Distinctive part model improves AP by 20% over original method. Results comparable to state of the art method are obtained. Still lot of scope to improve results further.

Distinctive Parts Model(Results)

Distinctive Parts Model(Failure Cases)

Future Work Improving segmentations using super pixels. Using multiple segmentations to locate the object Improving head detection results using better features. Finding improved models for subcategory classification. Improving the dataset, adding more images and categories.