875: Recent Advances in Geometric Computer Vision & Recognition

Slides:



Advertisements
Similar presentations
CSE473/573 – Stereo and Multiple View Geometry
Advertisements

Stereo Vision Reading: Chapter 11
Public Library, Stereoscopic Looking Room, Chicago, by Phillips, 1923.
Gratuitous Picture US Naval Artillery Rangefinder from World War I (1918)!!
Stereo Many slides adapted from Steve Seitz. Binocular stereo Given a calibrated binocular stereo pair, fuse it to produce a depth image Where does the.
Stereo and Projective Structure from Motion
Recap from Previous Lecture Tone Mapping – Preserve local contrast or detail at the expense of large scale contrast. – Changing the brightness within.
Lecture 8: Stereo.
Stereo.
776 Computer Vision Jan-Michael Frahm, Enrique Dunn Spring 2012.
Last Time Pinhole camera model, projection
CS6670: Computer Vision Noah Snavely Lecture 17: Stereo
Multiple View Geometry : Computational Photography Alexei Efros, CMU, Fall 2005 © Martin Quinn …with a lot of slides stolen from Steve Seitz and.
Stereo Matching Vision for Graphics CSE 590SS, Winter 2001 Richard Szeliski.
Multi-view stereo Many slides adapted from S. Seitz.
Stereopsis Mark Twain at Pool Table", no date, UCR Museum of Photography.
The plan for today Camera matrix
CS 223b 1 More on stereo and correspondence. CS 223b 2 =?f g Mostpopular For each window, match to closest window on epipolar line in other image. (slides.
3D from multiple views : Rendering and Image Processing Alexei Efros …with a lot of slides stolen from Steve Seitz and Jianbo Shi.
Stereo and Structure from Motion
CSE473/573 – Stereo Correspondence
Announcements PS3 Due Thursday PS4 Available today, due 4/17. Quiz 2 4/24.
Multiple View Geometry : Computational Photography Alexei Efros, CMU, Fall 2006 © Martin Quinn …with a lot of slides stolen from Steve Seitz and.
Accurate, Dense and Robust Multi-View Stereopsis Yasutaka Furukawa and Jean Ponce Presented by Rahul Garg and Ryan Kaminsky.
Stereo matching Class 10 Read Chapter 7 Tsukuba dataset.
Review: Binocular stereo If necessary, rectify the two stereo images to transform epipolar lines into scanlines For each pixel x in the first image Find.
3-D Scene u u’u’ Study the mathematical relations between corresponding image points. “Corresponding” means originated from the same 3D point. Objective.
Epipolar Geometry and Stereo Vision Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 04/12/11 Many slides adapted from Lana Lazebnik,
Computer Vision Spring ,-685 Instructor: S. Narasimhan WH 5409 T-R 10:30am – 11:50am Lecture #15.
12/01/11 How the Kinect Works Computational Photography Derek Hoiem, University of Illinois Photo frame-grabbed from:
Epipolar Geometry and Stereo Vision Computer Vision CS 543 / ECE 549 University of Illinois Derek Hoiem 03/05/15 Many slides adapted from Lana Lazebnik,
Structure from images. Calibration Review: Pinhole Camera.
875: Recent Advances in Geometric Computer Vision & Recognition Jan-Michael Frahm Fall 2011.
Lecture 12 Stereo Reconstruction II Lecture 12 Stereo Reconstruction II Mata kuliah: T Computer Vision Tahun: 2010.
Recap from Monday Image Warping – Coordinate transforms – Linear transforms expressed in matrix form – Inverse transforms useful when synthesizing images.
Stereo Vision Reading: Chapter 11 Stereo matching computes depth from two or more images Subproblems: –Calibrating camera positions. –Finding all corresponding.
Stereo Readings Szeliski, Chapter 11 (through 11.5) Single image stereogram, by Niklas EenNiklas Een.
Stereo Many slides adapted from Steve Seitz.
Stereo Many slides adapted from Steve Seitz. Binocular stereo Given a calibrated binocular stereo pair, fuse it to produce a depth image image 1image.
Computer Vision, Robert Pless
Lec 22: Stereo CS4670 / 5670: Computer Vision Kavita Bala.
Computer Vision Lecture #10 Hossam Abdelmunim 1 & Aly A. Farag 2 1 Computer & Systems Engineering Department, Ain Shams University, Cairo, Egypt 2 Electerical.
CSE 185 Introduction to Computer Vision Stereo. Taken at the same time or sequential in time stereo vision structure from motion optical flow Multiple.
Bahadir K. Gunturk1 Phase Correlation Bahadir K. Gunturk2 Phase Correlation Take cross correlation Take inverse Fourier transform  Location of the impulse.
Lecture 16: Stereo CS4670 / 5670: Computer Vision Noah Snavely Single image stereogram, by Niklas EenNiklas Een.
stereo Outline : Remind class of 3d geometry Introduction
Digital Image Processing
776 Computer Vision Jan-Michael Frahm Spring 2012.
Solving for Stereo Correspondence Many slides drawn from Lana Lazebnik, UIUC.
Project 2 due today Project 3 out today Announcements TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AAAAA.
John Morris Stereo Vision (continued) Iolanthe returns to the Waitemata Harbour.
Multiple View Geometry and Stereo. Overview Single camera geometry – Recap of Homogenous coordinates – Perspective projection model – Camera calibration.
Correspondence and Stereopsis. Introduction Disparity – Informally: difference between two pictures – Allows us to gain a strong sense of depth Stereopsis.
Random Thoughts 2012 (COMP 066) Jan-Michael Frahm Jared Heinly.
CSE 185 Introduction to Computer Vision Stereo 2.
Stereo CS4670 / 5670: Computer Vision Noah Snavely Single image stereogram, by Niklas EenNiklas Een.
Noah Snavely, Zhengqi Li
Epipolar Geometry and Stereo Vision
제 5 장 스테레오.
CS4670 / 5670: Computer Vision Kavita Bala Lec 27: Stereo.
STEREOPSIS The Stereopsis Problem: Fusion and Reconstruction
How the Kinect Works Computational Photography
Geometry 3: Stereo Reconstruction
EECS 274 Computer Vision Stereopsis.
Announcements Midterms graded (handed back at end of lecture)
Thanks to Richard Szeliski and George Bebis for the use of some slides
What have we learned so far?
Chapter 11: Stereopsis Stereopsis: Fusing the pictures taken by two cameras and exploiting the difference (or disparity) between them to obtain the depth.
Stereo vision Many slides adapted from Steve Seitz.
Presentation transcript:

875: Recent Advances in Geometric Computer Vision & Recognition Jan-Michael Frahm Spring 2014

Introductions

Grade Requirements Presentation of 2 papers in class 30 min talk, 10 min questions Papers for selection must come from: top journals: IJCV, PAMI, CVIU, IVCJ top conferences: CVPR (2010,2011), ICCV (2011), ECCV (2010), approval for all other venues is needed Final project evaluation, extension of a recent method from the above

Grading 20% first presentation 20% second presentation 30% final project 30% attendance & class participation

Schedule Jan. 7th, Introduction Jan 7th, Uncertainty in Stereo (guest Philippos Mordohai) (substitute for Jan 13th class) Jan 15th , Large-scale image localization basic concepts, First paper selection (Large –scale localization) Jan 20th, MLK holiday no class Jan 22nd-29th, Large-scale localization basic concepts Feb. 3rd, 1. round of presentations starts Mar. 10th, 12th Spring break (no class) Mar. 17th, Modeling dynamic objects/scenes basic concepts, Second paper selection, final project definition Mar. 19st, Modeling dynamic objects Mar. 24th, 2. round of presentations starts Apr. 21st, 23rd , final project presentation

How to give a great presentation Structure of the talk: Motivation (motivate and explain the problem) Overview Related work (short concise discussion) Approach Experiments Conclusion and future work

How to give a great presentation Use large enough fonts 5-6 one line bullet items on a slide max Keep it simple No complex formulas in your talk Bad Powerpoint slides How to for presentations

How to give a great presentation Abstract the material of the talk provide understanding beyond details Use pictures to illustrate find pictures on the internet create a graphic (in ppt, graph tool) animate complex pictures

How to give a good presentation Avoid bad color schemes no red on blue looks awful Avoid using laser pointer (especially if you are nervous) Add pointing elements in your presentation Practice to stay within your time! Don’t rush through the talk!

Brush up on Stereo Reconstruction

Stereo Stereo Extraction of 3D information from 2D images Images 3D Point Cloud Stereo

Binocular stereo Given a calibrated binocular stereo pair, fuse it to produce a depth image Humans can do it Stereograms: Invented by Sir Charles Wheatstone, 1838

Depth Recovery by Stereo Search Space d7 d6 d5 d4 d3 d2 d1 reference image matching image Depth Photo-consistency Epipolar line

Depth Recovery from Stereo Depth Map d9 d8 Search Space d7 d6 d5 d4 Ground Truth Pixel Matching d3 d2 d1 reference image matching image Depth Epipolar line Matching Cost Pixel similarity: measured by color differences depth

Matching criteria Raw pixel values (correlation) Band-pass filtered images [Jones & Malik 92] “Corner” like features [Zhang, …] Edges [many people…] Gradients [Seitz 89; Scharstein 94] Rank statistics [Zabih & Woodfill 94] Intervals [Birchfield and Tomasi 96] Overview of matching metrics and their performance: H. Hirschmüller and D. Scharstein, “Evaluation of Stereo Matching Costs on Images with Radiometric Differences”, PAMI 2008 slide: R. Szeliski

Adaptive Weighting Boundary Preserving More Costly

Simplest Case: Parallel images Image planes of cameras are parallel to each other and to the baseline Camera centers are at same height Focal lengths are the same slide: S. Lazebnik

Simplest Case: Parallel images Image planes of cameras are parallel to each other and to the baseline Camera centers are at same height Focal lengths are the same Then, epipolar lines fall along the horizontal scan lines of the images slide: S. Lazebnik

Essential matrix for parallel images Epipolar constraint: R = I t = (T, 0, 0) x x’ t

Essential matrix for parallel images Epipolar constraint: R = I t = (T, 0, 0) x x’ t

Aggregation Structure Matching Cost depth Search Space Pixelwise Costs

Aggregation Structure Search Space Search Space Cost aggregation: cutting the cost volume. Cost Volume

Aggregation Structure Cost of the center pixel Treat neighbors equally Costs of neighboring pixels Fronto-Parallel Plane Sum of Absolute Differences (SAD) Depth Map Cost Volume

Aggregation Structure Weighted cost of the center pixel Weighted costs of neighboring pixels Color differences Spatial distances Adaptive Weight Yoon and Kweon, PAMI 2006 Depth Map Cost Volume

Aggregation Structure Adaptive Weight Oriented Plane Lu et al., CVPR 2013 Depth Map Cost Volume

Stereo: epipolar geometry Match features along epipolar lines epipolar plane viewing ray epipolar line slide: R. Szeliski

Your basic stereo algorithm Improvement: match windows This should look familar... For each epipolar line For each pixel in the left image compare with every pixel on same epipolar line in right image pick pixel with minimum match cost slide: R. Szeliski

Depth Map Computation Local methods Global methods Image Resolution : the total number of pixels Local methods Depth with the minimum cost Complexity: Global methods Pairwise interactions bN pixels aN pixels N pixels Scharstein and Szeliski, “A taxonomy and evaluation of dense two-frame stereo correspondence algorithms", IJCV 2002

Depth from disparity f x x’ Baseline B z O O’ X Disparity is inversely proportional to depth!

Quadratic precision loss with depth! Depth Sampling Depth sampling for integer pixel disparity Quadratic precision loss with depth!

Depth Sampling Depth sampling for wider baseline

Depth sampling is in O(resolution6) accuracy is closeness to solution

Finding correspondences apply feature matching criterion (e.g., correlation or Lucas-Kanade) at all pixels simultaneously search only over epipolar lines (many fewer candidate positions) slide: R. Szeliski

Correspondence search Left Right scanline Matching cost disparity Slide a window along the right scanline and compare contents of that window with the reference window in the left image Matching cost: SSD or normalized correlation slide: S. Lazebnik

Correspondence search Left Right scanline SSD slide: S. Lazebnik

Correspondence search Left Right scanline Norm. corr slide: S. Lazebnik

Neighborhood size Smaller neighborhood: more details Larger neighborhood: fewer isolated mistakes w = 3 w = 20 slide: R. Szeliski

Failures of correspondence search Occlusions, repetition Textureless surfaces Non-Lambertian surfaces, specularities slide: S. Lazebnik

How can we improve window-based matching? The similarity constraint is local (each reference window is matched independently) Need to enforce non-local correspondence constraints slide: S. Lazebnik

Non-local constraints Uniqueness For any point in one image, there should be at most one matching point in the other image slide: S. Lazebnik

Non-local constraints Uniqueness For any point in one image, there should be at most one matching point in the other image Ordering Corresponding points should be in the same order in both views slide: S. Lazebnik

Non-local constraints Uniqueness For any point in one image, there should be at most one matching point in the other image Ordering Corresponding points should be in the same order in both views Ordering constraint doesn’t hold slide: S. Lazebnik

Non-local constraints Uniqueness For any point in one image, there should be at most one matching point in the other image Ordering Corresponding points should be in the same order in both views Smoothness We expect disparity values to change slowly (for the most part) slide: S. Lazebnik

Multiple-baseline stereo results M. Okutomi and T. Kanade, “A Multiple-Baseline Stereo System,” IEEE Trans. on Pattern Analysis and Machine Intelligence, 15(4):353-363 (1993).

Plane Sweep Stereo Choose a reference view Sweep family of planes at different depths with respect to the reference camera input image input image reference camera Each plane defines a homography warping each input image into the reference view R. Collins. A space-sweep approach to true multi-image matching. CVPR 1996.

Real-time 3D reconstruction from video “Real-Time Plane-sweeping Stereo with Multiple Sweeping Directions", CVPR 2007 warped images 3D scene SAD as similarity (darker is higher similarity)

Real-time 3D reconstruction from video “Real-Time Plane-sweeping Stereo with Multiple Sweeping Directions", CVPR 2007 warped images 3D scene SAD as similarity (darker is higher similarity)

Real-time 3D reconstruction from video “Real-Time Plane-sweeping Stereo with Multiple Sweeping Directions", CVPR 2007 warped images not formal enough not general enough either more precise with sweeping multiple orthogonal directions 3D scene SAD as similarity (darker is higher similarity)

Real-time 3D reconstruction from video “Real-Time Plane-sweeping Stereo with Multiple Sweeping Directions", CVPR 2007 warped images Multi-way sweep not formal enough not general enough either more precise with sweeping multiple orthogonal directions 3D scene SAD as similarity (darker is higher similarity)

3D reconstruction from video view 1 view N

3D reconstruction from video