CS 376b Introduction to Computer Vision 04/29/2008 Instructor: Michael Eckmann
Michael Eckmann - Skidmore College - CS 376b - Spring 2008 Today’s Topics Comments/Questions Look back on the course and what there is still to learn Chapter 11 – 2D matching –matching in 2D (models to images) local feature focus method pose clustering geometric hashing
Course Information (slide from 1st day) First week and a half to two weeks maximum –dive in and learn the major differences between C++ and Java so you can code the assignments in C++ using the OpenCV library. C++ programming knowledge is a great skill to have for any computer science major –quick overview of the OpenCV library –I will provide sample programs using the OpenCV library. Michael Eckmann - Skidmore College - CS 376b - Spring 2008
Course Information (slide from 1st day) Computer Vision topics to be covered –parts of chapters 1-7 and 9-11 in our textbook –additional material when our text doesn't go deep enough into a topic, e.g. image processing techniques Expect to have 4 programming assignments –1st one will make sure you understand certain important C++ concepts as well as how to use the OpenCV library in your code –2nd one will deal with image processing techniques –3rd and 4th will probably have to do with edge/feature detection and segmentation Michael Eckmann - Skidmore College - CS 376b - Spring 2008
What we accomplished You learned all the major differences between C++ and Java and should be comfortable programming in C++. We covered image processing techniques and the main operations that are used in many computer vision techniques (e.g. morphological operations, histograms, Fourier transforms). Filtering images and convolution / cross-correlation techniques (for smoothing, noise reduction/suppression, edge detection, etc.). We examined a few larger topics utilizing the lower-level techniques --- segmentation, texture measures, contour/line finding, Hough transforms, matching in 2D, etc. We also covered a good chunk of chapter 12 (Perceiving 3D from 2D). Michael Eckmann - Skidmore College - CS 376b - Spring 2008
Looking ahead A better name for the course might have been –C++ for Java programmers, some digital image processing, and low-level computer vision techniques We didn't cover 3D sensing in any depth, nor camera calibration. We didn't cover any complete application areas (e.g. robot navigation, iris recognition, surveillance, ...) –but you should know that all these applications use some of the various techniques we learned, as well as ones we haven't, combined in various ways to accomplish their tasks You now have a good background to allow for further study in computer vision/image processing. It is a field with plenty of unsolved problems and continues to have an active community of researchers. Michael Eckmann - Skidmore College - CS 376b - Spring 2008
Michael Eckmann - Skidmore College - CS Spring 2007 2D object recognition via affine mapping Our text describes 3 techniques to determine an affine transformation from a model to an image: the Local Feature Focus method, Pose Clustering, and Geometric Hashing.
Michael Eckmann - Skidmore College - CS Spring 2007 Local Feature Focus method This is a process to determine whether an object model appears in an image and, if so, what the general affine transform between the model and the image is. The model has a set of focus features, which are major features that should be easy to find in an image of this object (as long as they are not occluded). The model also has a set of nearby features for each focus feature, to allow verification of a correct focus feature match and to help determine position and orientation. Let's see the algorithm and figures from the text.
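As a concrete illustration of the pose-recovery step: once a focus feature and at least two of its nearby features have been matched, three point correspondences determine the affine map from model to image. The following minimal C++ sketch solves for the six affine parameters by Cramer's rule; it is not code from the text or from OpenCV, the names Point2, Affine, and fitAffineFrom3 are hypothetical, and exact (noise-free) correspondences are assumed.

    #include <array>
    #include <cstdio>

    // A 2D point (illustrative helper type, not an OpenCV type).
    struct Point2 { double x, y; };

    // Affine map: (u, v) = (a*x + b*y + c, d*x + e*y + f).
    struct Affine { double a, b, c, d, e, f; };

    // Solve for the affine map sending three non-collinear model points exactly
    // onto three image points.  The u- and v-equations share the same 3x3 matrix,
    // so each set of three parameters is solved by Cramer's rule.
    Affine fitAffineFrom3(const std::array<Point2,3>& model,
                          const std::array<Point2,3>& image)
    {
        auto det3 = [](double m[3][3]) {
            return m[0][0]*(m[1][1]*m[2][2] - m[1][2]*m[2][1])
                 - m[0][1]*(m[1][0]*m[2][2] - m[1][2]*m[2][0])
                 + m[0][2]*(m[1][0]*m[2][1] - m[1][1]*m[2][0]);
        };
        double M[3][3] = { {model[0].x, model[0].y, 1},
                           {model[1].x, model[1].y, 1},
                           {model[2].x, model[2].y, 1} };
        double D = det3(M);   // zero exactly when the model points are collinear

        // Cramer's rule: replace column 'col' of M with rhs and divide determinants.
        auto solve = [&](int col, const double rhs[3]) {
            double T[3][3];
            for (int r = 0; r < 3; ++r)
                for (int c = 0; c < 3; ++c)
                    T[r][c] = (c == col) ? rhs[r] : M[r][c];
            return det3(T) / D;
        };
        double u[3] = { image[0].x, image[1].x, image[2].x };
        double v[3] = { image[0].y, image[1].y, image[2].y };
        return { solve(0,u), solve(1,u), solve(2,u),    // a, b, c
                 solve(0,v), solve(1,v), solve(2,v) };  // d, e, f
    }

    int main() {
        // Model triangle mapped by a 90-degree rotation plus a (5,5) shift.
        std::array<Point2,3> model = {{ {0,0}, {1,0}, {0,1} }};
        std::array<Point2,3> image = {{ {5,5}, {5,6}, {4,5} }};
        Affine T = fitAffineFrom3(model, image);
        std::printf("a=%g b=%g c=%g d=%g e=%g f=%g\n", T.a, T.b, T.c, T.d, T.e, T.f);
    }

A real implementation would use more than three correspondences and a least-squares fit so that feature-location noise does not corrupt the recovered pose.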
Michael Eckmann - Skidmore College - CS Spring 2007 Pose Clustering This is another process to determine whether an object model appears in an image and, if so, what the general affine transform between the model and the image is. The model has a set of features and the image has a set of features. These need to be matched. The general idea of pose clustering is to take every possible pair of point correspondences (two model points matched to two image points), compute the RST transform each pair determines, and then check for clusters of RST transforms. To get less redundancy and more accuracy, instead of doing every possible pair we can –filter our features by type, where a certain type of feature will only match a feature of the same type (see the figure in the text)
Michael Eckmann - Skidmore College - CS Spring 2007 Pose Clustering To get less redundancy and more accuracy, instead of doing every possible pair we can –filter our features by type, where a certain type of feature will only match a feature of the same type –then only use pairs of matching points that satisfy the above Compute the RST transforms as before, but now for a smaller set of matching pairs. For each RST transform with specific computed parameters, count the number of other RST transforms that are within some distance of those transform parameters. There are n-1 distance computations for each of n parameter sets of RST transforms. Or we can use binning (like the Hough transform) --- this will be faster, but bins need to be chosen well to capture similar parameter sets in the same bin.
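A rough C++ sketch of the two steps just described: computing the RST transform determined by a single pair of correspondences, and Hough-style binning of the resulting parameter sets. This is an assumed structure rather than the text's code; rstFromTwoPairs, clusterPoses, and the bin widths are illustrative names/parameters, and candidate pairs are assumed to have already been filtered by feature type.

    #include <cmath>
    #include <map>
    #include <tuple>
    #include <vector>

    struct Point2 { double x, y; };

    // Rotation-Scale-Translation transform: image = s * R(theta) * model + t.
    struct RST { double theta, s, tx, ty; };

    // RST taking the model pair (p1, p2) onto the image pair (q1, q2).
    RST rstFromTwoPairs(Point2 p1, Point2 p2, Point2 q1, Point2 q2)
    {
        double dpx = p2.x - p1.x, dpy = p2.y - p1.y;
        double dqx = q2.x - q1.x, dqy = q2.y - q1.y;
        RST T;
        T.s     = std::hypot(dqx, dqy) / std::hypot(dpx, dpy);   // scale
        T.theta = std::atan2(dqy, dqx) - std::atan2(dpy, dpx);   // rotation
        double c = std::cos(T.theta), sn = std::sin(T.theta);
        T.tx = q1.x - T.s * (c * p1.x - sn * p1.y);              // t = q1 - s*R(theta)*p1
        T.ty = q1.y - T.s * (sn * p1.x + c * p1.y);
        return T;
    }

    // Hough-style pose clustering: quantize each transform's parameters into a
    // 4D bin and count votes; heavily voted bins indicate consistent poses.
    using Bin = std::tuple<int, int, int, int>;

    std::map<Bin, int> clusterPoses(const std::vector<RST>& transforms,
                                    double dTheta, double dScale, double dTrans)
    {
        std::map<Bin, int> votes;
        for (const RST& T : transforms) {
            Bin b { (int)std::floor(T.theta / dTheta),
                    (int)std::floor(T.s     / dScale),
                    (int)std::floor(T.tx    / dTrans),
                    (int)std::floor(T.ty    / dTrans) };
            ++votes[b];
        }
        return votes;   // caller scans for bins with high counts (pose clusters)
    }

As the slide notes, the bin widths have to be chosen carefully; a genuine cluster that straddles a bin boundary is the usual failure mode of this faster variant.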
Michael Eckmann - Skidmore College - CS Spring 2007 Geometric Hashing The last two procedures allowed us to determine if a particular model object was found in an image. What if we had many models that we wanted to check against our images? If we used pose clustering or the local feature focus method, each model would have to be checked separately to determine if it's in the image. Geometric hashing allows us to check among a large database of models to determine if any of them are in the image.
Michael Eckmann - Skidmore College - CS Spring 2007 Geometric Hashing It requires a large amount of offline preprocessing of the models as well as a fair amount of space, but this allows for fairly fast online recognition in the average case. Given: a large database of models described by feature points in some 2D coordinate system, and an image with features extracted from it. Assuming affine transformations only, we want to know which model(s) are in the image and in what position and orientation the models appear in the image.
Michael Eckmann - Skidmore College - CS Spring 2007 Geometric Hashing Each model M is stored as an ordered set of feature points. Any 3 non-collinear points E = (e00, e01, e10) define an affine coordinate frame. (Think of a plane. How many points define a plane? Any 3 non-collinear points define a plane.) D = xi*(e01 - e00) + eta*(e10 - e00) + e00 We can think of e00 as the origin and (e01 - e00) and (e10 - e00) as the coordinate axes of the affine coordinate system.
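Computing (xi, eta) for a point D amounts to solving a 2-by-2 linear system. Here is a minimal C++ sketch (illustrative names, not from the text; the basis points are assumed non-collinear so the determinant is nonzero):

    #include <utility>

    struct Point2 { double x, y; };

    // Affine coordinates (xi, eta) of point d w.r.t. the ordered basis
    // (e00, e01, e10):  solve  d = xi*(e01 - e00) + eta*(e10 - e00) + e00.
    std::pair<double, double> affineCoords(Point2 d, Point2 e00, Point2 e01, Point2 e10)
    {
        double ax = e01.x - e00.x, ay = e01.y - e00.y;   // first axis  (e01 - e00)
        double bx = e10.x - e00.x, by = e10.y - e00.y;   // second axis (e10 - e00)
        double rx = d.x - e00.x,   ry = d.y - e00.y;     // d relative to the origin e00
        double det = ax * by - bx * ay;                  // nonzero iff the basis is non-collinear
        double xi  = (rx * by - bx * ry) / det;
        double eta = (ax * ry - rx * ay) / det;
        return { xi, eta };
    }

Because affine maps preserve these coordinates, applying the same affine transform to d and to the three basis points and calling affineCoords again returns the same (xi, eta) --- exactly the invariance the next slide relies on.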
Michael Eckmann - Skidmore College - CS Spring 2007 Geometric Hashing Any point D in M can be represented as an (xi, eta) pair w.r.t. the points E. This (xi, eta) pair gives the affine coordinates of the point D. If we apply an affine transformation to all the points in M, (xi, eta) will be the same for each point in M, given the same (now transformed) points E defining the affine coordinate frame.
Michael Eckmann - Skidmore College - CS Spring 2007 Geometric Hashing Offline processing For each model M, choose an ordered set of three model points E = (e00, e01, e10) to form the equation D = xi*(e01 - e00) + eta*(e10 - e00) + e00; for each other point D in the model, compute (xi, eta) and put (M, E) in the hash table indexed on (xi, eta); do the above two steps for all possible ordered sets of three model points. The above gives us a hash table indexed on (xi, eta), which are affine-invariant coordinates. From these affine-invariant coordinates, we can get the model M and basis points E for which some D in M has affine coordinates (xi, eta) w.r.t. E.
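A sketch of this offline stage in C++, reusing Point2 and the affineCoords function from the sketch above. The quantization function keyOf and its bin size q are my own assumption (the text does not say how to discretize (xi, eta) for hashing); Entry, HashTable, and buildHashTable are illustrative names.

    #include <cmath>
    #include <cstdint>
    #include <unordered_map>
    #include <vector>

    struct Entry { int model; int basis; };   // which model, which ordered basis triple

    // Quantize (xi, eta) so that nearby affine coordinates land in the same bucket.
    std::uint64_t keyOf(double xi, double eta, double q = 0.1)
    {
        std::uint64_t i = (std::uint32_t)std::llround(xi  / q);   // wraps negatives into 32 bits
        std::uint64_t j = (std::uint32_t)std::llround(eta / q);
        return (i << 32) | j;
    }

    using HashTable = std::unordered_multimap<std::uint64_t, Entry>;

    // Offline preprocessing: for every model and every ordered, non-collinear basis
    // triple, store (model, basis) under the affine coordinates of each other point.
    HashTable buildHashTable(const std::vector<std::vector<Point2>>& models)
    {
        HashTable table;
        for (int m = 0; m < (int)models.size(); ++m) {
            const std::vector<Point2>& pts = models[m];
            std::size_t n = pts.size();
            for (std::size_t a = 0; a < n; ++a)
             for (std::size_t b = 0; b < n; ++b)
              for (std::size_t c = 0; c < n; ++c) {
                if (a == b || b == c || a == c) continue;
                double det = (pts[b].x - pts[a].x) * (pts[c].y - pts[a].y)
                           - (pts[c].x - pts[a].x) * (pts[b].y - pts[a].y);
                if (std::fabs(det) < 1e-9) continue;        // skip (nearly) collinear bases
                int basisId = (int)((a * n + b) * n + c);   // encodes the ordered triple
                for (std::size_t d = 0; d < n; ++d) {
                    if (d == a || d == b || d == c) continue;
                    auto [xi, eta] = affineCoords(pts[d], pts[a], pts[b], pts[c]);
                    table.insert({ keyOf(xi, eta), Entry{ m, basisId } });
                }
              }
        }
        return table;
    }

Note the cost: a model with n feature points contributes on the order of n^3 bases times (n - 3) other points, i.e. O(n^4) table entries --- the "fair amount of space" mentioned earlier.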
Michael Eckmann - Skidmore College - CS Spring 2007 Geometric Hashing Now, any (xi, eta) pair can be looked up in the hash table to get all the model/basis-point pairs for which some model point has the affine-invariant coordinates (xi, eta). If (xi1, eta1) are the affine coordinates of an image point, written in terms of some image basis (a set of 3 points), then (xi1, eta1) is in the hash table iff there is a legal affine transformation of the 4 model points (the 3 basis points plus the point D) that maps them onto the 4 image points.
Michael Eckmann - Skidmore College - CS Spring 2007 Geometric Hashing Online processing –Choose a set of 3 feature points in an image to form a basis; make one point the origin, and let the other two points minus the origin be the coordinate axes. –for each other feature point in the image, compute the affine coordinates (xi1, eta1) –look up (xi1, eta1) in the hash table. If it is there, then all the model/basis-point pairs stored at that entry are possible candidate matches (the 4 image points possibly come from the model stored there); increment a counter in a histogram for each of those model/basis-point pairs –peaks in the histogram are possible matches
Michael Eckmann - Skidmore College - CS Spring 2007 Geometric Hashing Online processing –peaks in the histogram are possible matches –as long as a peak is sufficiently high, it is a possible match. The entire model can then be transformed into image coordinates, and we can verify that enough model points actually appear in the image --- if so, the model appears, affinely transformed, in the image. –Do all the above steps for all triples of image feature points. A sketch of the voting step is given below.
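Continuing the sketches above (and reusing their Point2, HashTable, Entry, keyOf, and affineCoords definitions), the voting step for a single choice of image basis might look like this; it is an assumed structure, not the text's code:

    #include <cstddef>
    #include <map>
    #include <utility>
    #include <vector>

    // Vote for every (model, basis) entry whose stored affine coordinates match
    // some image point, using image features a, b, c as the trial basis.
    std::map<std::pair<int, int>, int>
    voteForBasis(const std::vector<Point2>& imageFeatures,
                 std::size_t a, std::size_t b, std::size_t c,
                 const HashTable& table)
    {
        std::map<std::pair<int, int>, int> votes;   // (model, basis) -> vote count
        for (std::size_t d = 0; d < imageFeatures.size(); ++d) {
            if (d == a || d == b || d == c) continue;
            auto [xi, eta] = affineCoords(imageFeatures[d], imageFeatures[a],
                                          imageFeatures[b], imageFeatures[c]);
            auto range = table.equal_range(keyOf(xi, eta));
            for (auto it = range.first; it != range.second; ++it)
                ++votes[{ it->second.model, it->second.basis }];
        }
        return votes;   // high-count entries are the candidate (model, basis) matches
    }

A complete recognizer would loop this over many trial image bases, take the high-vote (model, basis) pairs, compute the affine transform implied by the basis correspondence, and verify it by projecting all of the model's points into the image, as described above.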