10/1/201615-494 Cognitive Robotics1 Object Recognition 15-494 Cognitive Robotics David S. Touretzky & Ethan Tira-Thompson Carnegie Mellon Spring 2008.

Slides:



Advertisements
Similar presentations
Distinctive Image Features from Scale-Invariant Keypoints
Advertisements

Feature Detection. Description Localization More Points Robust to occlusion Works with less texture More Repeatable Robust detection Precise localization.
Distinctive Image Features from Scale-Invariant Keypoints David Lowe.
Presented by Xinyu Chang
RGB-D object recognition and localization with clutter and occlusions Federico Tombari, Samuele Salti, Luigi Di Stefano Computer Vision Lab – University.
The SIFT (Scale Invariant Feature Transform) Detector and Descriptor
TP14 - Local features: detection and description Computer Vision, FCUP, 2014 Miguel Coimbra Slides by Prof. Kristen Grauman.
Object Recognition using Invariant Local Features Applications l Mobile robots, driver assistance l Cell phone location or object recognition l Panoramas,
Activity Recognition Aneeq Zia. Agenda What is activity recognition Typical methods used for action recognition “Evaluation of local spatio-temporal features.
Instructor: Mircea Nicolescu Lecture 13 CS 485 / 685 Computer Vision.
IBBT – Ugent – Telin – IPI Dimitri Van Cauwelaert A study of the 2D - SIFT algorithm Dimitri Van Cauwelaert.
Content Based Image Retrieval
Object Recognition with Invariant Features n Definition: Identify objects or scenes and determine their pose and model parameters n Applications l Industrial.
(1) Feature-point matching by D.J.Duff for CompVis Online: Feature Point Matching Detection,
A Study of Approaches for Object Recognition
Object Recognition with Invariant Features n Definition: Identify objects or scenes and determine their pose and model parameters n Applications l Industrial.
Distinctive Image Feature from Scale-Invariant KeyPoints
Feature extraction: Corners and blobs
Distinctive image features from scale-invariant keypoints. David G. Lowe, Int. Journal of Computer Vision, 60, 2 (2004), pp Presented by: Shalomi.
Feature-based object recognition Prof. Noah Snavely CS1114
Object Recognition Using Distinctive Image Feature From Scale-Invariant Key point D. Lowe, IJCV 2004 Presenting – Anat Kaspi.
Scale Invariant Feature Transform (SIFT)
Blob detection.
The SIFT (Scale Invariant Feature Transform) Detector and Descriptor
Lecture 6: Feature matching and alignment CS4670: Computer Vision Noah Snavely.
Scale-Invariant Feature Transform (SIFT) Jinxiang Chai.
Distinctive Image Features from Scale-Invariant Keypoints By David G. Lowe, University of British Columbia Presented by: Tim Havinga, Joël van Neerbos.
Computer vision.
Internet-scale Imagery for Graphics and Vision James Hays cs195g Computational Photography Brown University, Spring 2010.
Object Tracking/Recognition using Invariant Local Features Applications l Mobile robots, driver assistance l Cell phone location or object recognition.
Lecture 4: Feature matching CS4670 / 5670: Computer Vision Noah Snavely.
CVPR 2003 Tutorial Recognition and Matching Based on Local Invariant Features David Lowe Computer Science Department University of British Columbia.
CSCE 643 Computer Vision: Extractions of Image Features Jinxiang Chai.
Wenqi Zhu 3D Reconstruction From Multiple Views Based on Scale-Invariant Feature Transform.
Lecture 7: Features Part 2 CS4670/5670: Computer Vision Noah Snavely.
Puzzle Solver Sravan Bhagavatula EE 638 Project Stanford ECE.
Object Recognition in ROS Using Feature Extractors and Feature Matchers By Dolev Shapira.
Kylie Gorman WEEK 1-2 REVIEW. CONVERTING AN IMAGE FROM RGB TO HSV AND DISPLAY CHANNELS.
Jack Pinches INFO410 & INFO350 S INFORMATION SCIENCE Computer Vision I.
Feature extraction: Corners and blobs. Why extract features? Motivation: panorama stitching We have two images – how do we combine them?
A Tutorial on using SIFT Presented by Jimmy Huff (Slightly modified by Josiah Yoder for Winter )
Local features: detection and description
Line Matching Jonghee Park GIST CV-Lab..  Lines –Fundamental feature in many computer vision fields 3D reconstruction, SLAM, motion estimation –Useful.
Presented by David Lee 3/20/2006
776 Computer Vision Jan-Michael Frahm Spring 2012.
Recognizing specific objects Matching with SIFT Original suggestion Lowe, 1999,2004.
Distinctive Image Features from Scale-Invariant Keypoints Presenter :JIA-HONG,DONG Advisor : Yen- Ting, Chen 1 David G. Lowe International Journal of Computer.
Blob detection.
SIFT.
CSCI 631 – Foundations of Computer Vision March 15, 2016 Ashwini Imran Image Stitching Link: singhashwini.mesinghashwini.me.
776 Computer Vision Jan-Michael Frahm Spring 2012.
10/2/ Cognitive Robotics1 Object Recognition Cognitive Robotics David S. Touretzky & Ethan Tira-Thompson Carnegie Mellon Spring 2011.
SIFT Scale-Invariant Feature Transform David Lowe
CS262: Computer Vision Lect 09: SIFT Descriptors
Presented by David Lee 3/20/2006
Lecture 07 13/12/2011 Shai Avidan הבהרה: החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.
Scale Invariant Feature Transform (SIFT)
SIFT paper.
Object Recognition Cognitive Robotics David S. Touretzky &
Object Recognition Cognitive Robotics David S. Touretzky &
CS262 – Computer Vision Lect 08: SIFT Keypoint Detection
Car Recognition Through SIFT Keypoint Matching
Assistive System Progress Report 1
CAP 5415 Computer Vision Fall 2012 Dr. Mubarak Shah Lecture-5
Brief Review of Recognition + Context
The SIFT (Scale Invariant Feature Transform) Detector and Descriptor
SIFT keypoint detection
SIFT.
ECE734 Project-Scale Invariant Feature Transform Algorithm
Presented by Xu Miao April 20, 2005
Presentation transcript:

10/1/ Cognitive Robotics1 Object Recognition Cognitive Robotics David S. Touretzky & Ethan Tira-Thompson Carnegie Mellon Spring 2008

10/1/ Cognitive Robotics2 What Makes Object Recognition Hard? ● Translation invariance ● Scale invariance ● Rotation invariance (2D) ● Rotation invariance (3D) ● Occlusion ● Figure/ground segmentation (where is the object?) ● Articulated objects (limbs, scissors)

10/1/ Cognitive Robotics3 Template Matching ● Simplest possible object recognition scheme. ● Compare template pixels against image pixels at each image position. TemplateMatch Score

10/1/ Cognitive Robotics4 Sketch templateMatch(const Sketch &sketch, Sketch &kernel, int istart, int jstart, int width, int height) { Sketch result("templateMatch("+sketch->getName()+")",sketch); result->setColorMap(jetMapScaled); int const npix = width * height; int const di = - (int)(width/2); int const dj = - (int)(height/2); for (int si=0; si<sketch.width; si++) for (int sj=0; sj<sketch.height; sj++) { int sum = 0; for (int ki=0; ki<width; ki++) for (int kj=0; kj<height; kj++) { int k_pix = kernel(istart+ki,jstart+kj); if ( si+di+ki >= 0 && si+di+ki < sketch.width && sj+dj+kj >= 0 && sj+dj+kj < sketch.height ) { int s_pix = sketch(si+di+ki,sj+dj+kj); sum += (s_pix - k_pix) * (s_pix - k_pix); } else sum += k_pix * k_pix; } result(si,sj) = uint( sqrt(sum/float(npix))); } result -= result->min(); return result; } Template Matcher

10/1/ Cognitive Robotics5 Limited Invariance Properties OriginalOccludedRotated FlippedSidewaysDiagonal

10/1/ Cognitive Robotics6 Color Histograms (Swain) ● Invariant to translation, 2D rotation, and scale. ● Handles some occlusion. ● But assumes object has already been segmented.

10/1/ Cognitive Robotics7 Object Classes Test Images Figure from M. A. Stricker,

10/1/ Cognitive Robotics8 Blocks World Vision ● One of the earliest computer vision domains. – Roberts (1965) used line drawings of block scenes: the first “computer vision” program. ● Simplified problem because shapes were regular. – Occlusions could be handled. ● Still a hard problem. No standard blocks world vision package exists.

10/1/ Cognitive Robotics9 AIBO Blocks World ● Matt Carson's senior thesis (CMU CSD, 2006). ● Goal: recover positions, orientations, and sizes of blocks.

10/1/ Cognitive Robotics10 Find the Block Faces

10/1/ Cognitive Robotics11 Find the Block From the Faces

10/1/ Cognitive Robotics12 SIFT (Lowe, 2004) ● Scale-Invariant Feature Transform ● Can recognize objects independent of scale, translation, rotation, or occlusion. ● Can segment cluttered scenes. ● Slow training, but fast recognition.

10/1/ Cognitive Robotics13 How Does SIFT Work? ● Generate large numbers of features that densely cover each training object at various scales and orientations. ● A 500 x 500 pixel image may generate 2000 stable features. ● Store these features in a library. ● For recognition, find clusters of features present in the image that agree on the object position, orientation, and scale.

10/1/ Cognitive Robotics14 SIFT Feature Generation 1) Scale-space extrema detection ➔ Use differences of Gaussians to find potential interest points. 2) Keypoint localization ➔ Fit detailed model to determine location and scale. 3) Orientation assignment ➔ Assign orientations based on local image gradients. 4) Keypoint descriptor ➔ Extract description of local gradients at selected scale.

10/1/ Cognitive Robotics15 Gaussian Smoothing

10/1/ Cognitive Robotics16 Difference of Gaussians: Edge Detection Difference of Gaussians Zero Crossings

10/1/ Cognitive Robotics17 Scale Space

10/1/ Cognitive Robotics18 Scale Space Extrema

10/1/ Cognitive Robotics19 Filtering the Features

10/1/ Cognitive Robotics20 Keypoint Descriptors

10/1/ Cognitive Robotics21

10/1/ Cognitive Robotics22 Real-Time SIFT Example ● Fred Birchmore used SIFT to recognize soda cans. See demo videos on his blog.

10/1/ Cognitive Robotics23 SIFT in Tekkotsu ● Lionel Heng did a SIFT implementation as his class project in ● Xinghao Pan is implementing a SIFT tool for Tekkotsu: – Allow users to construct libraries of objects – Each object has a collection of representative images – User can control which SIFT features to use for matching – Java GUI provides for easy management of the library ● How to integrate SIFT with the dual coding system? – Object scale can be used to estimate distance – Match in camera space must be converted to local space

10/1/ Cognitive Robotics24 Object Recognition in the Brain

10/1/ Cognitive Robotics25 Object Recognition in the Brain ● Mishkin & Ungerleider: dual visual pathways. – The dorsal, “where” pathway lies in parietal cortex. – The ventral, “what” pathway lies in temporal cortex. – Lesions to these areas yield very specific effects.

10/1/ Cognitive Robotics26 Serre & Poggio (PAMI 2007): Model Based on Temporal Cortex

10/1/ Cognitive Robotics27 To Learn More About Computer and Biological Vision ● Take Tai Sing Lee's Computer Vision class, ● Take Tai Sing Lee's Computational Neuroscience class, ● Take Mike Lewicki's Computational Perception and Scene Analysis class, ● There are many books on this subject. One of the classics is “Vision” by David Marr.