Shape Matching Tuesday, Nov 18 Kristen Grauman UT-Austin.

Slides:



Advertisements
Similar presentations
Shape Context and Chamfer Matching in Cluttered Scenes
Advertisements

Object Recognition from Local Scale-Invariant Features David G. Lowe Presented by Ashley L. Kapron.
TP11 - Fitting: Deformable contours Computer Vision, FCUP, Miguel Coimbra Slides by Prof. Kristen Grauman.
Fitting: Deformable contours
Chamfer Distance for Handshape Detection and Recognition CSE 6367 – Computer Vision Vassilis Athitsos University of Texas at Arlington.
Recovering Human Body Configurations: Combining Segmentation and Recognition Greg Mori, Xiaofeng Ren, and Jitentendra Malik (UC Berkeley) Alexei A. Efros.
Recognizing hand-drawn images using shape context Gyozo Gidofalvi Computer Science and Engineering Department University of California, San Diego
Actions in video Monday, April 25 Kristen Grauman UT-Austin.
- Recovering Human Body Configurations: Combining Segmentation and Recognition (CVPR’04) Greg Mori, Xiaofeng Ren, Alexei A. Efros and Jitendra Malik -
TP14 - Local features: detection and description Computer Vision, FCUP, 2014 Miguel Coimbra Slides by Prof. Kristen Grauman.
Announcements Final Exam May 13th, 8 am (not my idea).
Database-Based Hand Pose Estimation CSE 6367 – Computer Vision Vassilis Athitsos University of Texas at Arlington.
A Low-cost Attack on a Microsoft CAPTCHA Yan Qiang,
Tuesday, Oct 7 Kristen Grauman UT-Austin
Instructor: Mircea Nicolescu Lecture 13 CS 485 / 685 Computer Vision.
Computer Vision Group University of California Berkeley Estimating Human Body Configurations using Shape Context Matching Greg Mori and Jitendra Malik.
Computer Vision Group University of California Berkeley Shape Matching and Object Recognition using Shape Contexts Jitendra Malik U.C. Berkeley (joint.
Announcements Final Exam May 16 th, 8 am (not my idea). Practice quiz handout 5/8. Review session: think about good times. PS5: For challenge problems,
1 Model Fitting Hao Jiang Computer Science Department Oct 6, 2009.
A Study of Approaches for Object Recognition
1 Model Fitting Hao Jiang Computer Science Department Oct 8, 2009.
CS335 Principles of Multimedia Systems Content Based Media Retrieval Hao Jiang Computer Science Department Boston College Dec. 4, 2007.
Human Posture Recognition with Convex Programming Hao Jiang, Ze-Nian Li and Mark S. Drew School of Computing Science Simon Fraser University Burnaby, BC,
Computer Vision Group University of California Berkeley Visual Grouping and Object Recognition Jitendra Malik * U.C. Berkeley * with S. Belongie, C. Fowlkes,
Video Google: Text Retrieval Approach to Object Matching in Videos Authors: Josef Sivic and Andrew Zisserman University of Oxford ICCV 2003.
Computer Vision Group University of California Berkeley Matching Shapes Serge Belongie *, Jitendra Malik and Jan Puzicha U.C. Berkeley * Present address:
Computer Vision Group University of California Berkeley Recognizing Objects in Adversarial Clutter: Breaking a Visual CAPTCHA Greg Mori and Jitendra Malik.
Automatic Image Alignment (feature-based) : Computational Photography Alexei Efros, CMU, Fall 2006 with a lot of slides stolen from Steve Seitz and.
Perceptual Organization: Segmentation and Optical Flow.
Human Computation CSC4170 Web Intelligence and Social Computing Tutorial 7 Tutor: Tom Chao Zhou
Abstract Overall Algorithm Target Matching Error Checking: By comparing what we transform from Kinect Camera coordinate to robot coordinate with what we.
Robust fitting Prof. Noah Snavely CS1114
CSE 185 Introduction to Computer Vision
Fitting: Deformable contours Tuesday, September 24 th 2013 Devi Parikh Virginia Tech 1 Slide credit: Kristen Grauman Disclaimer: Many slides have been.
CS 376b Introduction to Computer Vision 04 / 29 / 2008 Instructor: Michael Eckmann.
2D Shape Matching (and Object Recognition)
TP15 - Tracking Computer Vision, FCUP, 2013 Miguel Coimbra Slides by Prof. Kristen Grauman.
Topic 10 - Image Analysis DIGITAL IMAGE PROCESSING Course 3624 Department of Physics and Astronomy Professor Bob Warwick.
Recognizing Human Figures and Actions Greg Mori Simon Fraser University.
Lecture 4: Feature matching CS4670 / 5670: Computer Vision Noah Snavely.
Face detection Slides adapted Grauman & Liebe’s tutorial
Visual Grouping and Recognition Jitendra Malik University of California at Berkeley Jitendra Malik University of California at Berkeley.
Digital Image Processing CCS331 Relationships of Pixel 1.
Vision-based human motion analysis: An overview Computer Vision and Image Understanding(2007)
CS654: Digital Image Analysis Lecture 25: Hough Transform Slide credits: Guillermo Sapiro, Mubarak Shah, Derek Hoiem.
Fitting: The Hough transform
Computer Vision Group University of California Berkeley On Visual Recognition Jitendra Malik UC Berkeley.
Jack Pinches INFO410 & INFO350 S INFORMATION SCIENCE Computer Vision I.
1 Model Fitting Hao Jiang Computer Science Department Sept 30, 2014.
Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team
Pictorial Structures and Distance Transforms Computer Vision CS 543 / ECE 549 University of Illinois Ian Endres 03/31/11.
Supervisor: Nakhmani Arie Semester: Winter 2007 Target Recognition Harmatz Isca.
Visual Computing Computer Vision 2 INFO410 & INFO350 S2 2015
Local features: detection and description
Dynamic Programming (DP), Shortest Paths (SP)
Tracking Hands with Distance Transforms Dave Bargeron Noah Snavely.
COS 429 PS3: Stitching a Panorama Due November 10 th.
Invariant Local Features Image content is transformed into local feature coordinates that are invariant to translation, rotation, scale, and other imaging.
SANDEEP MEHTA (ECE, IV Year). CAPTCHA Completely Automated Public Turing test to tell Computers and Humans Apart Invented at CMU by Luis von Ahn, Manuel.
Another Example: Circle Detection
TP11 - Fitting: Deformable contours
CS262: Computer Vision Lect 09: SIFT Descriptors
Fitting: Voting and the Hough Transform
Miguel Tavares Coimbra
Lecture 26 Hand Pose Estimation Using a Database of Hand Images
Paper Presentation: Shape and Matching
Features Readings All is Vanity, by C. Allan Gilbert,
Local features and image matching
Grouping/Segmentation
Fourier Transform of Boundaries
Presentation transcript:

Shape Matching Tuesday, Nov 18 Kristen Grauman UT-Austin

Announcements This week : My office hours for Thurs (11/20) are moved to Friday (11/21) from 1 – 2 pm. Pset 3 returned today Check all semester grades on eGradebook

Where have we encountered shape before?

Low-level features EdgesSilhouettes

Fitting Want to associate a model with observed features [Fig from Marszalek & Schmid, 2007] For example, the model could be a line, a circle, or an arbitrary shape.

Deformable contours Visual Dynamics GroupVisual Dynamics Group, Dept. Engineering Science, University of Oxford. Traffic monitoring Human-computer interaction Animation Surveillance Computer Assisted Diagnosis in medical imaging Applications:

Role of shape Analysis of anatomical structures Figure from Grimson & Golland Morphology Pose Recognition, detection Fig from Opelt et al. Characteristic feature Fig from Belongie et al.

Shape in recognition

Questions What features? How to compare shapes?

Figure from Belongie et al.

Chamfer distance Average distance to nearest feature T: template shape  a set of points I: image to search  a set of points d I (t): min distance for point t to some point in I

Chamfer distance Average distance to nearest feature Edge image How is the measure different than just filtering with a mask having the shape points? How expensive is a naïve implementation?

Source: Yuri Boykov Distance transform Distance Transform Image features (2D) Distance Transform is a function that for each image pixel p assigns a non-negative number corresponding to distance from p to the nearest feature in the image I Features could be edge points, foreground points,…

Distance transform original distance transform edges Value at (x,y) tells how far that position is from the nearest edge point (or other binary mage structure) >> help bwdist

Distance transform (1D) Adapted from D. Huttenlocher // 0 if j is in P, infinity otherwise

Distance Transform (2D) Adapted from D. Huttenlocher

Chamfer distance Average distance to nearest feature Edge imageDistance transform image

Chamfer distance Fig from D. Gavrila, DAGM 1999 Edge imageDistance transform image

A limitation of active contours External energy: snake does not really “see” object boundaries in the image unless it gets very close to it. image gradients are large only directly on the boundary

Distance transform can help External image cost can also be taken from the distance transform of the edge image. original -gradientdistance transform edges

What limitations might we have using only edge points to represent a shape? How descriptive is a point?

Comparing shapes What points on these two sampled contours are most similar? How do you know?

Shape context descriptor Count the number of points inside each bin, e.g.: Count = 4 Count = Compact representation of distribution of points relative to each point Shape context slides from Belongie et al.

Shape context descriptor

Comparing shape contexts Compute matching costs using Chi Squared distance: Recover correspondences by solving for least cost assignment, using costs C ij (Then use a deformable template match, given the correspondences.)

Shape context matching with handwritten digits Only errors made out of 10,000 test examples

CAPTCHA’s CAPTCHA: Completely Automated Turing Test To Tell Computers and Humans Apart Luis von Ahn, Manuel Blum, Nicholas Hopper and John Langford, CMU,

Image-based CAPTCHA

Shape matching application: breaking a visual CAPTCHA Use shape matching to recognize characters, words in spite of clutter, warping, etc. Recognizing Objects in Adversarial Clutter: Breaking a Visual CAPTCHA, by G. Mori and J. Malik, CVPR 2003

Computer Vision Group University of California Berkeley Fast Pruning: Representative Shape Contexts Pick k points in the image at random –Compare to all shape contexts for all known letters –Vote for closely matching letters Keep all letters with scores under threshold dopdop

Computer Vision Group University of California Berkeley Algorithm A: bottom-up Look for letters –Representative Shape Contexts Find pairs of letters that are “consistent” –Letters nearby in space Search for valid words Give scores to the words Input Locations of possible letters Possible strings of letters Matching words

Computer Vision Group University of California Berkeley EZ-Gimpy Results with Algorithm A 158 of 191 images correctly identified: 83% –Running time: ~10 sec. per image (MATLAB, 1 Ghz P3) horse smile canvas spade join here

Computer Vision Group University of California Berkeley Gimpy Multiple words, task is to find 3 words in the image Clutter is other objects, not texture

Computer Vision Group University of California Berkeley Algorithm B: Letters are not enough Hard to distinguish single letters with so much clutter Find words instead of letters –Use long range info over entire word –Stretch shape contexts into ellipses Search problem becomes huge –# of words 600 vs. # of letters 26 –Prune set of words using opening/closing bigrams

Computer Vision Group University of California Berkeley Results with Algorithm B # Correct words% tests (of 24) 1 or more92% 2 or more75% 333% EZ-Gimpy92% dry clear medical door farm importantcard arch plate

Shape matching application II: silhouettes and body pose What kind of assumptions do we need? …

Example-based pose estimation …and animation Build a two-character “motion graph” from examples of people dancing with mocap Populate database with synthetically generated silhouettes in poses defined by mocap (behavior specific dynamics) Use silhouette features to identify similar examples in database Retrieve the pose stored for those similar examples to estimate user’s pose Animate user and hypothetical partner Ren, Shakhnarovich, Hodgins, Pfister, and Viola, 2005.

Fun with silhouettes Liu Ren, Gregory Shakhnarovich, Jessica Hodgins, Hanspeter Pfister and Paul Viola, Learning Silhouette Features for Control of Human MotionLearning Silhouette Features for Control of Human Motion

Summary Shape can be defining feature in recognition, useful for analysis tasks Chamfer measure to compare edge point sets –Distance transform for efficiency Isolated edges points ambiguous –Shape context : local shape neighborhood descriptor Example applications of shape matching

Next: Motion and tracking Tomas Izo