Object Recognition Cognitive Robotics David S. Touretzky &

Slides:

Advertisements

Similar presentations

Distinctive Image Features from Scale-Invariant Keypoints

Advertisements

Feature Detection. Description Localization More Points Robust to occlusion Works with less texture More Repeatable Robust detection Precise localization.

Distinctive Image Features from Scale-Invariant Keypoints David Lowe.

Object Recognition from Local Scale-Invariant Features David G. Lowe Presented by Ashley L. Kapron.

Presented by Xinyu Chang

The SIFT (Scale Invariant Feature Transform) Detector and Descriptor

TP14 - Local features: detection and description Computer Vision, FCUP, 2014 Miguel Coimbra Slides by Prof. Kristen Grauman.

Object Recognition using Invariant Local Features Applications l Mobile robots, driver assistance l Cell phone location or object recognition l Panoramas,

Activity Recognition Aneeq Zia. Agenda What is activity recognition Typical methods used for action recognition “Evaluation of local spatio-temporal features.

Instructor: Mircea Nicolescu Lecture 13 CS 485 / 685 Computer Vision.

IBBT – Ugent – Telin – IPI Dimitri Van Cauwelaert A study of the 2D - SIFT algorithm Dimitri Van Cauwelaert.

Object Recognition with Invariant Features n Definition: Identify objects or scenes and determine their pose and model parameters n Applications l Industrial.

A Study of Approaches for Object Recognition

Object Recognition with Invariant Features n Definition: Identify objects or scenes and determine their pose and model parameters n Applications l Industrial.

Distinctive Image Feature from Scale-Invariant KeyPoints

Feature extraction: Corners and blobs

Distinctive image features from scale-invariant keypoints. David G. Lowe, Int. Journal of Computer Vision, 60, 2 (2004), pp Presented by: Shalomi.

Object Recognition Using Distinctive Image Feature From Scale-Invariant Key point D. Lowe, IJCV 2004 Presenting – Anat Kaspi.

Scale Invariant Feature Transform (SIFT)

Blob detection.

The SIFT (Scale Invariant Feature Transform) Detector and Descriptor

Lecture 6: Feature matching and alignment CS4670: Computer Vision Noah Snavely.

Distinctive Image Features from Scale-Invariant Keypoints By David G. Lowe, University of British Columbia Presented by: Tim Havinga, Joël van Neerbos.

Computer vision.

Internet-scale Imagery for Graphics and Vision James Hays cs195g Computational Photography Brown University, Spring 2010.

Object Tracking/Recognition using Invariant Local Features Applications l Mobile robots, driver assistance l Cell phone location or object recognition.

Lecture 4: Feature matching CS4670 / 5670: Computer Vision Noah Snavely.

CVPR 2003 Tutorial Recognition and Matching Based on Local Invariant Features David Lowe Computer Science Department University of British Columbia.

Lecture 7: Features Part 2 CS4670/5670: Computer Vision Noah Snavely.

Puzzle Solver Sravan Bhagavatula EE 638 Project Stanford ECE.

Object Recognition in ROS Using Feature Extractors and Feature Matchers By Dolev Shapira.

Kylie Gorman WEEK 1-2 REVIEW. CONVERTING AN IMAGE FROM RGB TO HSV AND DISPLAY CHANNELS.

Jack Pinches INFO410 & INFO350 S INFORMATION SCIENCE Computer Vision I.

Feature extraction: Corners and blobs. Why extract features? Motivation: panorama stitching We have two images – how do we combine them?

A Tutorial on using SIFT Presented by Jimmy Huff (Slightly modified by Josiah Yoder for Winter )

Local features: detection and description

Scale Invariant Feature Transform (SIFT)

Presented by David Lee 3/20/2006

Recognizing specific objects Matching with SIFT Original suggestion Lowe, 1999,2004.

Blob detection.

CSCI 631 – Foundations of Computer Vision March 15, 2016 Ashwini Imran Image Stitching Link: singhashwini.mesinghashwini.me.

776 Computer Vision Jan-Michael Frahm Spring 2012.

9/28/ Cognitive Robotics1 The Vision Pipeline and Color Image Segmentation Cognitive Robotics David S. Touretzky & Ethan Tira-Thompson.

10/1/ Cognitive Robotics1 Object Recognition Cognitive Robotics David S. Touretzky & Ethan Tira-Thompson Carnegie Mellon Spring 2008.

10/2/ Cognitive Robotics1 Object Recognition Cognitive Robotics David S. Touretzky & Ethan Tira-Thompson Carnegie Mellon Spring 2011.

SIFT Scale-Invariant Feature Transform David Lowe

Ullman's Visual Routines and Tekkotsu Sketches

CS262: Computer Vision Lect 09: SIFT Descriptors

Presented by David Lee 3/20/2006

Lecture 07 13/12/2011 Shai Avidan הבהרה: החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.

Scale Invariant Feature Transform (SIFT)

TP12 - Local features: detection and description

Object Recognition Cognitive Robotics David S. Touretzky &

CS262 – Computer Vision Lect 08: SIFT Keypoint Detection

Feature description and matching

Car Recognition Through SIFT Keypoint Matching

Assistive System Progress Report 1

CAP 5415 Computer Vision Fall 2012 Dr. Mubarak Shah Lecture-5

From a presentation by Jimmy Huff Modified by Josiah Yoder

Brief Review of Recognition + Context

The SIFT (Scale Invariant Feature Transform) Detector and Descriptor

Interest Points & Descriptors 3 - SIFT

SIFT keypoint detection

ECE734 Project-Scale Invariant Feature Transform Algorithm

Lecture VI: Corner and Blob Detection

Feature descriptors and matching

Presented by Xu Miao April 20, 2005

Presentation transcript:

Object Recognition 15-494 Cognitive Robotics David S. Touretzky & Ethan Tira-Thompson Carnegie Mellon Spring 2010 8/26/2018 15-494 Cognitive Robotics

What Makes Object Recognition Hard? Translation invariance Scale invariance Rotation invariance (2D) Rotation invariance (3D) Occlusion Figure/ground segmentation (where is the object?) Articulated objects (limbs, scissors) 8/26/2018 15-494 Cognitive Robotics

Template Matching Simplest possible object recognition scheme. Compare template pixels against image pixels at each image position. Template Match Score 8/26/2018 15-494 Cognitive Robotics

Template Matcher Sketch<uint> templateMatch(const Sketch<uchar> &sketch, Sketch<uchar> &kernel, int istart, int jstart, int width, int height) { Sketch<uint> result("templateMatch("+sketch->getName()+")",sketch); result->setColorMap(jetMapScaled); int const npix = width * height; int const di = - (int)(width/2); int const dj = - (int)(height/2); for (int si=0; si<sketch.width; si++) for (int sj=0; sj<sketch.height; sj++) { int sum = 0; for (int ki=0; ki<width; ki++) for (int kj=0; kj<height; kj++) { int k_pix = kernel(istart+ki,jstart+kj); if ( si+di+ki >= 0 && si+di+ki < sketch.width && sj+dj+kj >= 0 && sj+dj+kj < sketch.height ) { int s_pix = sketch(si+di+ki,sj+dj+kj); sum += (s_pix - k_pix) * (s_pix - k_pix); } else sum += k_pix * k_pix; result(si,sj) = uint(65535 - sqrt(sum/float(npix))); result -= result->min(); return result; 8/26/2018 15-494 Cognitive Robotics

Limited Invariance Properties Original Occluded Rotated Flipped Sideways Diagonal 8/26/2018 15-494 Cognitive Robotics

Color Histograms (Swain) Invariant to translation, 2D rotation, and scale. Handles some occlusion. But assumes object has already been segmented. 8/26/2018 15-494 Cognitive Robotics

Object Classes Test Images Figure from M. A. Stricker, http://www.cs.uchicago.edu/files/tr_authentic/TR-92-22.ps 8/26/2018 15-494 Cognitive Robotics

Blocks World Vision One of the earliest computer vision domains. Roberts (1965) used line drawings of block scenes: the first “computer vision” program. Simplified problem because shapes were regular. Occlusions could be handled. Still a hard problem. No standard blocks world vision package exists. 8/26/2018 15-494 Cognitive Robotics

AIBO Blocks World Matt Carson's senior thesis (CMU CSD, 2006). Goal: recover positions, orientations, and sizes of blocks. 8/26/2018 15-494 Cognitive Robotics

Find the Block Faces 8/26/2018 15-494 Cognitive Robotics

Find the Block From the Faces 8/26/2018 15-494 Cognitive Robotics

SIFT (Lowe, 2004) Scale-Invariant Feature Transform Can recognize objects independent of scale, translation, rotation, or occlusion. Can segment cluttered scenes. Slow training, but fast recognition. 8/26/2018 15-494 Cognitive Robotics

How Does SIFT Work? Generate large numbers of features that densely cover each training object at various scales and orientations. A 500 x 500 pixel image may generate 2000 stable features. Store these features in a library. For recognition, find clusters of features present in the image that agree on the object position, orientation, and scale. 8/26/2018 15-494 Cognitive Robotics

SIFT Feature Generation Scale-space extrema detection Use differences of Gaussians to find potential interest points. Keypoint localization Fit detailed model to determine location and scale. Orientation assignment Assign orientations based on local image gradients. Keypoint descriptor Extract description of local gradients at selected scale. 8/26/2018 15-494 Cognitive Robotics

Gaussian Smoothing 8/26/2018 15-494 Cognitive Robotics

Difference of Gaussians: Edge Detection Zero Crossings = Edges 8/26/2018 15-494 Cognitive Robotics

Scale Space 8/26/2018 15-494 Cognitive Robotics

Scale Space Extrema 8/26/2018 15-494 Cognitive Robotics

Filtering the Features 8/26/2018 15-494 Cognitive Robotics

Keypoint Descriptors 8/26/2018 15-494 Cognitive Robotics

8/26/2018 15-494 Cognitive Robotics

Real-Time SIFT Example Fred Birchmore used SIFT to recognize soda cans. http://eyecanseecan.blogspot.com/ See demo videos on his blog. 8/26/2018 15-494 Cognitive Robotics

SIFT in Tekkotsu Lionel Heng did a SIFT implementation as his class project in 2006. Xinghao Pan implemented a SIFT tool for Tekkotsu: Allow users to construct libraries of objects Each object has a collection of representative images User can control which SIFT features to use for matching Java GUI provides for easy management of the library How to integrate SIFT with the dual coding system? Object scale can be used to estimate distance Match in camera space must be converted to local space 8/26/2018 15-494 Cognitive Robotics

SIFT Tool 8/26/2018 15-494 Cognitive Robotics

Object Recognition in the Brain 8/26/2018 15-494 Cognitive Robotics

Object Recognition in the Brain Mishkin & Ungerleider: dual visual pathways. The dorsal, “where” pathway lies in parietal cortex. The ventral, “what” pathway lies in temporal cortex. Lesions to these areas yield very specific effects. 8/26/2018 15-494 Cognitive Robotics

The Macaque “Vision Pipeline” DJ Felleman and DC Van Essen (1991), Cerebral Cortex 1:1-47. RGC = retinal ganglion cells 8/26/2018 15-494 Cognitive Robotics

Serre & Poggio (PAMI 2007): Model Based on Temporal Cortex 8/26/2018 15-494 Cognitive Robotics

To Learn More About Computer and Biological Vision Take Tai Sing Lee's Computer Vision class, 15-385. Take Tai Sing Lee's Computational Neuroscience class, 15-490. There are many books on this subject. One of the classics is “Vision” by David Marr. 8/26/2018 15-494 Cognitive Robotics