CS6670: Computer Vision Noah Snavely Lecture 17: Stereo

Slides:



Advertisements
Similar presentations
Stereo Vision Reading: Chapter 11
Advertisements

Public Library, Stereoscopic Looking Room, Chicago, by Phillips, 1923.
Gratuitous Picture US Naval Artillery Rangefinder from World War I (1918)!!
Stereo Many slides adapted from Steve Seitz. Binocular stereo Given a calibrated binocular stereo pair, fuse it to produce a depth image Where does the.
Recap from Previous Lecture Tone Mapping – Preserve local contrast or detail at the expense of large scale contrast. – Changing the brightness within.
Lecture 8: Stereo.
Stereo.
776 Computer Vision Jan-Michael Frahm, Enrique Dunn Spring 2012.
Multiple View Geometry : Computational Photography Alexei Efros, CMU, Fall 2005 © Martin Quinn …with a lot of slides stolen from Steve Seitz and.
Computer Vision : CISC 4/689 Adaptation from: Prof. James M. Rehg, G.Tech.
Announcements Project 2 questions Midterm out on Thursday
Stereo Binocular Stereo Calibration (finish up) Next Time Motivation
Stereopsis Mark Twain at Pool Table", no date, UCR Museum of Photography.
CS 223b 1 More on stereo and correspondence. CS 223b 2 =?f g Mostpopular For each window, match to closest window on epipolar line in other image. (slides.
Lecture 20: Light, color, and reflectance CS6670: Computer Vision Noah Snavely.
3D from multiple views : Rendering and Image Processing Alexei Efros …with a lot of slides stolen from Steve Seitz and Jianbo Shi.
CSCE 641 Computer Graphics: Image-based Modeling Jinxiang Chai.
Lecture 10: Stereo and Graph Cuts
Lecture 16: Single-view modeling, Part 2 CS6670: Computer Vision Noah Snavely.
CSE 576, Spring 2008Projective Geometry1 Lecture slides by Steve Seitz (mostly) Lecture presented by Rick Szeliski.
Announcements PS3 Due Thursday PS4 Available today, due 4/17. Quiz 2 4/24.
Stereo Guest Lecture by Li Zhang
Project 1 artifact winners Project 2 questions Project 2 extra signup slots –Can take a second slot if you’d like Announcements.
Midterm went out on Tuesday (due next Tuesday) Project 3 out today Announcements.
Multiple View Geometry : Computational Photography Alexei Efros, CMU, Fall 2006 © Martin Quinn …with a lot of slides stolen from Steve Seitz and.
VZvZ r t b image cross ratio Measuring height B (bottom of object) T (top of object) R (reference point) ground plane H C scene cross ratio  scene points.
Today: Calibration What are the camera parameters?
CSCE 641 Computer Graphics: Image-based Modeling Jinxiang Chai.
What have we leaned so far? Camera structure Eye structure Project 1: High Dynamic Range Imaging.
Computer Vision Spring ,-685 Instructor: S. Narasimhan WH 5409 T-R 10:30am – 11:50am Lecture #15.
Stereo Readings Trucco & Verri, Chapter 7 –Read through 7.1, 7.2.1, 7.2.2, 7.3.1, 7.3.2, and 7.4, –The rest is optional. Single image stereogram,
Projective Geometry and Single View Modeling CSE 455, Winter 2010 January 29, 2010 Ames Room.
Single-view modeling, Part 2 CS4670/5670: Computer Vision Noah Snavely.
Camera Calibration & Stereo Reconstruction Jinxiang Chai.
What have we leaned so far? Camera structure Eye structure Project 1: High Dynamic Range Imaging.
Announcements Project 1 artifact winners Project 2 questions
Structure from images. Calibration Review: Pinhole Camera.
Last Lecture (optical center) origin principal point P (X,Y,Z) p (x,y) x y.
Recap from Monday Image Warping – Coordinate transforms – Linear transforms expressed in matrix form – Inverse transforms useful when synthesizing images.
Stereo Vision Reading: Chapter 11 Stereo matching computes depth from two or more images Subproblems: –Calibrating camera positions. –Finding all corresponding.
Stereo Readings Szeliski, Chapter 11 (through 11.5) Single image stereogram, by Niklas EenNiklas Een.
Stereo Many slides adapted from Steve Seitz.
Project 2 code & artifact due Friday Midterm out tomorrow (check your ), due next Fri Announcements TexPoint fonts used in EMF. Read the TexPoint.
Stereo Many slides adapted from Steve Seitz. Binocular stereo Given a calibrated binocular stereo pair, fuse it to produce a depth image image 1image.
Computer Vision, Robert Pless
Project 2 –contact your partner to coordinate ASAP –more work than the last project (> 1 week) –you should have signed up for panorama kitssigned up –sign.
Lec 22: Stereo CS4670 / 5670: Computer Vision Kavita Bala.
Computer Vision Stereo Vision. Bahadir K. Gunturk2 Pinhole Camera.
CSE 185 Introduction to Computer Vision Stereo. Taken at the same time or sequential in time stereo vision structure from motion optical flow Multiple.
Bahadir K. Gunturk1 Phase Correlation Bahadir K. Gunturk2 Phase Correlation Take cross correlation Take inverse Fourier transform  Location of the impulse.
Lecture 16: Stereo CS4670 / 5670: Computer Vision Noah Snavely Single image stereogram, by Niklas EenNiklas Een.
Digital Image Processing
776 Computer Vision Jan-Michael Frahm Spring 2012.
Solving for Stereo Correspondence Many slides drawn from Lana Lazebnik, UIUC.
Paper presentation topics 2. More on feature detection and descriptors 3. Shape and Matching 4. Indexing and Retrieval 5. More on 3D reconstruction 1.
Project 2 due today Project 3 out today Announcements TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AAAAA.
Project 2 artifacts—vote now!! Project 3 questions? Start thinking about final project ideas, partners Announcements.
Project 3 code & artifact due Tuesday Final project proposals due noon Wed (by ) –One-page writeup (from project web page), specifying: »Your team.
Energy minimization Another global approach to improve quality of correspondences Assumption: disparities vary (mostly) smoothly Minimize energy function:
CSE 185 Introduction to Computer Vision Stereo 2.
Stereo CS4670 / 5670: Computer Vision Noah Snavely Single image stereogram, by Niklas EenNiklas Een.
Noah Snavely, Zhengqi Li
제 5 장 스테레오.
CS4670 / 5670: Computer Vision Kavita Bala Lec 27: Stereo.
Announcements Midterms graded (handed back at end of lecture)
Thanks to Richard Szeliski and George Bebis for the use of some slides
What have we learned so far?
Projective geometry Readings
Computer Vision Stereo Vision.
Stereo vision Many slides adapted from Steve Seitz.
Presentation transcript:

CS6670: Computer Vision Noah Snavely Lecture 17: Stereo Single image stereogram, by Niklas Een

Readings Szeliski, Chapter 10 (through 10.5)

Direct linear calibration

Direct linear calibration m23 Can solve for mij by linear least squares use eigenvector trick that we used for homographies

Direct linear calibration Advantage: Very simple to formulate and solve Disadvantages: Doesn’t tell you the camera parameters Doesn’t model radial distortion Hard to impose constraints (e.g., known f) Doesn’t minimize the right error function For these reasons, nonlinear methods are preferred Define error function E between projected 3D points and image positions E is nonlinear function of intrinsics, extrinsics, radial distortion Minimize E using nonlinear optimization techniques

Alternative: multi-plane calibration Images courtesy Jean-Yves Bouguet, Intel Corp. Advantage Only requires a plane Don’t have to know positions/orientations Good code available online! (including in OpenCV) Matlab version by Jean-Yves Bouget: http://www.vision.caltech.edu/bouguetj/calib_doc/index.html Zhengyou Zhang’s web site: http://research.microsoft.com/~zhang/Calib/

Some Related Techniques Image-Based Modeling and Photo Editing Mok et al., SIGGRAPH 2001 http://graphics.csail.mit.edu/ibedit/ Single View Modeling of Free-Form Scenes Zhang et al., CVPR 2001 http://grail.cs.washington.edu/projects/svm/ Tour Into The Picture Anjyo et al., SIGGRAPH 1997 http://koigakubo.hitachi.co.jp/little/DL_TipE.html

What’s the transformation? More than one view? We now know all about the geometry of a single view What happens if we have two cameras? What’s the transformation?

Public Library, Stereoscopic Looking Room, Chicago, by Phillips, 1923

Mark Twain at Pool Table", no date, UCR Museum of Photography

Epipolar geometry x2 -x1 = the disparity of pixel (x1, y1) (x1, y1) epipolar lines (x1, y1) (x2, y1) Two images captured by a purely horizontal translating camera (rectified stereo pair) x2 -x1 = the disparity of pixel (x1, y1)

Stereo matching algorithms Match Pixels in Conjugate Epipolar Lines Assume brightness constancy This is a tough problem Numerous approaches A good survey and evaluation: http://www.middlebury.edu/stereo/

Your basic stereo algorithm Improvement: match windows For each pixel in the left image For each epipolar line compare with every pixel on same epipolar line in right image pick pixel with minimum match cost

Window size W = 3 W = 20 Effect of window size Better results with adaptive window T. Kanade and M. Okutomi, A Stereo Matching Algorithm with an Adaptive Window: Theory and Experiment,, Proc. International Conference on Robotics and Automation, 1991. D. Scharstein and R. Szeliski. Stereo matching with nonlinear diffusion. International Journal of Computer Vision, 28(2):155-174, July 1998 Smaller window Larger window smaller window: more detail, more noise bigger window: less noise, more detail

Stereo results Scene Ground truth Data from University of Tsukuba Similar results on other images without ground truth Scene Ground truth

Results with window search Window-based matching (best window size) Ground truth

Better methods exist... State of the art method Ground truth Boykov et al., Fast Approximate Energy Minimization via Graph Cuts, International Conference on Computer Vision, September 1999. Ground truth For the latest and greatest: http://www.middlebury.edu/stereo/

Stereo as energy minimization What defines a good stereo correspondence? Match quality Want each pixel to find a good match in the other image Smoothness If two pixels are adjacent, they should (usually) move about the same amount

Stereo as energy minimization Expressing this mathematically Match quality Want each pixel to find a good match in the other image Smoothness If two pixels are adjacent, they should (usually) move about the same amount We want to minimize This is a special type of energy function known as an MRF (Markov Random Field) Effective and fast algorithms have been recently developed: Graph cuts, belief propagation…. for more details (and code): http://vision.middlebury.edu/MRF/ Great tutorials available online (including video of talks)

Depth from disparity f x x’ baseline z C C’ X

Real-time stereo Used for robot navigation (and other tasks) Nomad robot searches for meteorites in Antartica http://www.frc.ri.cmu.edu/projects/meteorobot/index.html Used for robot navigation (and other tasks) Several software-based real-time stereo techniques have been developed (most based on simple discrete search)

Stereo reconstruction pipeline Steps Calibrate cameras Rectify images Compute disparity Estimate depth What will cause errors? Camera calibration errors Poor image resolution Occlusions Violations of brightness constancy (specular reflections) Large motions Low-contrast image regions

Active stereo with structured light Li Zhang’s one-shot stereo camera 2 camera 1 projector camera 1 projector Project “structured” light patterns onto the object simplifies the correspondence problem

Digital Michelangelo Project Laser scanning Digital Michelangelo Project http://graphics.stanford.edu/projects/mich/ Optical triangulation Project a single stripe of laser light Scan it across the surface of the object This is a very precise version of structured light scanning

The Digital Michelangelo Project, Levoy et al. Laser scanned models The Digital Michelangelo Project, Levoy et al.

The Digital Michelangelo Project, Levoy et al. Laser scanned models The Digital Michelangelo Project, Levoy et al.

The Digital Michelangelo Project, Levoy et al. Laser scanned models The Digital Michelangelo Project, Levoy et al.

The Digital Michelangelo Project, Levoy et al. Laser scanned models The Digital Michelangelo Project, Levoy et al.