COS 429 PS5: Finding Nemo. Exemplar -SVM Still a rigid template,but train a separate SVM for each positive instance For each category it can has exemplar.

Slides:

Advertisements

Similar presentations

Semantic Contours from Inverse Detectors Bharath Hariharan et.al. (ICCV-11)

Advertisements

Combining Detectors for Human Hand Detection Antonio Hernández, Petia Radeva and Sergio Escalera Computer Vision Center, Universitat Autònoma de Barcelona,

Image classification Given the bag-of-features representations of images from different classes, how do we learn a model for distinguishing them?

BRISK (Presented by Josh Gleason)

Ľubor Ladický1 Phil Torr2 Andrew Zisserman1

Many slides based on P. FelzenszwalbP. Felzenszwalb General object detection with deformable part-based models.

Cos 429: Face Detection (Part 2) Viola-Jones and AdaBoost Guest Instructor: Andras Ferencz (Your Regular Instructor: Fei-Fei Li) Thanks to Fei-Fei Li,

Knowing a Good HOG Filter When You See It: Efficient Selection of Filters for Detection Ejaz Ahmed 1, Gregory Shakhnarovich 2, and Subhransu Maji 3 1 University.

Enhancing Exemplar SVMs using Part Level Transfer Regularization 1.

More sliding window detection: Discriminative part-based models Many slides based on P. FelzenszwalbP. Felzenszwalb.

Quantifying and Transferring Contextual Information in Object Detection Professor: S. J. Wang Student : Y. S. Wang 1.

1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.

Jeremy Wyatt Thanks to Gavin Brown

1 Accurate Object Detection with Joint Classification- Regression Random Forests Presenter ByungIn Yoo CS688/WST665.

CS4670: Computer Vision Kavita Bala Lecture 7: Harris Corner Detection.

Lecture 6: Feature matching and alignment CS4670: Computer Vision Noah Snavely.

Generic object detection with deformable part-based models

Multiclass object recognition

CS55 Tianfan Xue Adviser: Bo Zhang, Jianmin Li.

Object Bank Presenter ： Liu Changyu Advisor ： Prof. Alex Hauptmann Interest ： Multimedia Analysis April 4 th, 2013.

Reading Between The Lines: Object Localization Using Implicit Cues from Image Tags Sung Ju Hwang and Kristen Grauman University of Texas at Austin Jingnan.

Efficient Region Search for Object Detection Sudheendra Vijayanarasimhan and Kristen Grauman Department of Computer Science, University of Texas at Austin.

Object Detection with Discriminatively Trained Part Based Models

Lecture 31: Modern recognition CS4670 / 5670: Computer Vision Noah Snavely.

Pedestrian Detection and Localization

Latent SVM 1 st Frame: manually select target Find 6 highest weighted areas in template Area of 16 blocks Train 6 SVMs on those areas Train 1 SVM on entire.

Beyond Sliding Windows: Object Localization by Efficient Subwindow Search The best paper prize at CVPR 2008.

BING: Binarized Normed Gradients for Objectness Estimation at 300fps

Deformable Part Models (DPM) Felzenswalb, Girshick, McAllester & Ramanan (2010) Slides drawn from a tutorial By R. Girshick AP 12% 27% 36% 45% 49% 2005.

Recognition II Ali Farhadi. We have talked about Nearest Neighbor Naïve Bayes Logistic Regression Boosting.

Fitting: The Hough transform

Stable Multi-Target Tracking in Real-Time Surveillance Video

Face Detection Ying Wu Electrical and Computer Engineering Northwestern University, Evanston, IL

Handwritten digit recognition

Project 3 Results.

Object detection, deep learning, and R-CNNs

Robust Real Time Face Detection

Recognition Using Visual Phrases

CS-498 Computer Vision Week 9, Class 2 and Week 10, Class 1

Object Recognition as Ranking Holistic Figure-Ground Hypotheses Fuxin Li and Joao Carreira and Cristian Sminchisescu 1.

Carl Vondrick, Aditya Khosla, Tomasz Malisiewicz, Antonio Torralba Massachusetts Institute of Technology

Semantic Alignment Spring 2009 Ben-Gurion University of the Negev.

Cell Segmentation in Microscopy Imagery Using a Bag of Local Bayesian Classifiers Zhaozheng Yin RI/CMU, Fall 2009.

Strong Supervision From Weak Annotation Interactive Training of Deformable Part Models ICCV /05/23.

Week 4: 6/6 – 6/10 Jeffrey Loppert. This week.. Coded a Histogram of Oriented Gradients (HOG) Feature Extractor Extracted features from positive and negative.

Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition arXiv: v4 [cs.CV(CVPR)] 23 Apr 2015 Kaiming He, Xiangyu Zhang, Shaoqing.

When deep learning meets object detection: Introduction to two technologies: SSD and YOLO Wenchi Ma.

CS 4501: Introduction to Computer Vision Object Localization, Detection, Semantic Segmentation Connelly Barnes Some slides from Fei-Fei Li / Andrej Karpathy.

License Plate Detection

Object detection with deformable part-based models

Session 7: Face Detection (cont.)

Erasmus University Rotterdam

Lecture 24: Convolutional neural networks

Object Localization Goal: detect the location of an object within an image Fully supervised: Training data labeled with object category and ground truth.

Object detection as supervised classification

R-CNN region By Ilia Iofedov 11/11/2018 BGU, DNN course 2016.

Non-linear classifiers Neural networks

Action Recognition in Temporally Untrimmed Videos

Content-Based Image Retrieval

Content-Based Image Retrieval

Walter J. Scheirer, Samuel E. Anthony, Ken Nakayama & David D. Cox

On-going research on Object Detection *Some modification after seminar

Data Driven Attributes for Action Detection

CornerNet: Detecting Objects as Paired Keypoints

KFC: Keypoints, Features and Correspondences

Object Detection Creation from Scratch Samsung R&D Institute Ukraine

Img1 img3 img5.

Report 2 Brandon Silva.

Introduction Face detection and alignment are essential to many applications such as face recognition, facial expression recognition, age identification,

Presentation transcript:

COS 429 PS5: Finding Nemo

Exemplar -SVM Still a rigid template,but train a separate SVM for each positive instance For each category it can has exemplar with different size aspect ratio

Problem 1 Describe Steps Training : To train one exemplar SVM what we do Testing : For each image what we do ? In this assignment we use 6 images as training and others as testing.

Problem 2 why use image pyramid ? When we compute HoG for each image we compute the HoG for different scale of image. What problem we are trying to solve here? If there any alternative solution? Why not as good as this one ?

Problem 2 Why Non-Maximum Suppression? Look at the code see where we do non- maximum suppression (call the function nmsMe). What does it do? Why we do this?

Problem 3 Visualizing Examplars Visualize the exemplar image : Model.posRgb Visualize the exemplar HoG feature : showHoG(Model.posfeature) Visualize the trained svm weight : showHoG(Model.svm.w)

Problem 3 Visualizing Examplars Try to understand the correspondences among the three visualization. Describe the patterns in the HOG and SVM weight visualization.

Problem4 Testing Procedure Goal : complete the function bbdet = testfeaturePym(imageRgb,Model) What it does : slide one exemplar detector over the image to find Nemo Past your code in the report.

Problem4 Testing Procedure Basic steps (Listed in the code comments) 1. Compute feature pyramid 2.For each level of the feature pyramid – Do convolution get the detection score for each window – Convert the score to bounding box in image domain 3. Do non- maximum-suppression

Convolution get the detection score Dot product between trained svm weight and feature for each window score = fconv(one level of feature, {weight}, 1, 1); The score stores at the left top corner of the window in feature domain. If image at this level is too small, will not do convolution on this and smaller level

Convert the score to bounding box in image domain Use: – the scale of the this level – HOG bin size 8x8 pixels – size of this exemplar Example in white board

Problem6 draw box Paste result in report Can you find Nemo? When it fails ?

Problem 7 Training Procedure After you finish the function testfeaturePym Congratulations! You can train your own detectors. Run rumMe.m with step.train =1 To train 6 exemplars and test on the images What’s the difference of the results ?

[Extra Credit] Detecting Other Objects Find an interesting object to detect Get images Get ground truth box by label it by yourself

Evaluation: Precision & Recall Precision = TP / (TP + FP)Recall = TP / (TP + FN) Object No Object Truth Object No Object Algorithm Prediction True Positive True Negative False Positive False Negative Truth Prediction Intersection Over Union = / [Extra Credit ] Evaluation

X : recall = true positive / total ground truth Y : precision = true positive / total detection

Calibration Idea : make different exemplar detector's score compatible. Good exemplar vs Bad exemplar

Calibration Idea : make different exemplar detector's score compatible. Detection Score 1. On Validation set : Sort detection by score, classify true/false detection Good positive : Similar with exemplar negative: Detection Score Target Score 1 2. sigmoid function All detector’s scores are from 0 to 1. The calibrated score reflects each detector’s presicion

Extra Credit Label Transfer Hints: paper and codehttp://

iHoG Webpage : They have online demo and code for download >> feat = features(im, 8); >> ihog = invertHOG(feat); >> imagesc(ihog);