Image-based modeling Digital Visual Effects, Spring 2008 Yung-Yu Chuang 2008/5/6 with slides by Richard Szeliski, Steve Seitz and Alexei Efros.

Slides:

Advertisements

Similar presentations

The fundamental matrix F

Advertisements

Stereo Many slides adapted from Steve Seitz. Binocular stereo Given a calibrated binocular stereo pair, fuse it to produce a depth image Where does the.

Recap from Previous Lecture Tone Mapping – Preserve local contrast or detail at the expense of large scale contrast. – Changing the brightness within.

Camera calibration and epipolar geometry

A new approach for modeling and rendering existing architectural scenes from a sparse set of still photographs Combines both geometry-based and image.

Single-view metrology

Copyright  Philipp Slusallek Cs fall IBR: Model-based Methods Philipp Slusallek.

CS6670: Computer Vision Noah Snavely Lecture 17: Stereo

Multiple View Geometry : Computational Photography Alexei Efros, CMU, Fall 2005 © Martin Quinn …with a lot of slides stolen from Steve Seitz and.

Epipolar geometry. (i)Correspondence geometry: Given an image point x in the first view, how does this constrain the position of the corresponding point.

Scene Modeling for a Single View : Computational Photography Alexei Efros, CMU, Fall 2005 René MAGRITTE Portrait d'Edward James …with a lot of slides.

Lecture 11: Structure from motion CS6670: Computer Vision Noah Snavely.

Lecture 20: Light, color, and reflectance CS6670: Computer Vision Noah Snavely.

3D from multiple views : Rendering and Image Processing Alexei Efros …with a lot of slides stolen from Steve Seitz and Jianbo Shi.

CSCE 641 Computer Graphics: Image-based Modeling Jinxiang Chai.

Lecture 20: Two-view geometry CS6670: Computer Vision Noah Snavely.

Automatic Photo Popup Derek Hoiem Alexei A. Efros Martial Hebert Carnegie Mellon University.

May 2004Stereo1 Introduction to Computer Vision CS / ECE 181B Tuesday, May 11, 2004  Multiple view geometry and stereo  Handout #6 available (check with.

Lec 21: Fundamental Matrix

CSE473/573 – Stereo Correspondence

Stereo Guest Lecture by Li Zhang

Project 1 artifact winners Project 2 questions Project 2 extra signup slots –Can take a second slot if you’d like Announcements.

Scene Modeling for a Single View : Computational Photography Alexei Efros, CMU, Spring 2010 René MAGRITTE Portrait d'Edward James …with a lot of.

Midterm went out on Tuesday (due next Tuesday) Project 3 out today Announcements.

COMP322/S2000/L271 Stereo Imaging Ref.V.S.Nalwa, A Guided Tour of Computer Vision, Addison Wesley, (ISBN ) Slides are adapted from CS641.

Multiple View Geometry : Computational Photography Alexei Efros, CMU, Fall 2006 © Martin Quinn …with a lot of slides stolen from Steve Seitz and.

Accurate, Dense and Robust Multi-View Stereopsis Yasutaka Furukawa and Jean Ponce Presented by Rahul Garg and Ryan Kaminsky.

VZvZ r t b image cross ratio Measuring height B (bottom of object) T (top of object) R (reference point) ground plane H C scene cross ratio  scene points.

David Luebke Modeling and Rendering Architecture from Photographs A hybrid geometry- and image-based approach Debevec, Taylor, and Malik SIGGRAPH.

CSCE 641 Computer Graphics: Image-based Modeling Jinxiang Chai.

3D Scene Models Object recognition and scene understanding Krista Ehinger.

Multi-view geometry. Multi-view geometry problems Structure: Given projections of the same 3D point in two or more images, compute the 3D coordinates.

Computer Vision Spring ,-685 Instructor: S. Narasimhan WH 5409 T-R 10:30am – 11:50am Lecture #15.

Single-view metrology

Single-view modeling, Part 2 CS4670/5670: Computer Vision Noah Snavely.

Camera Calibration & Stereo Reconstruction Jinxiang Chai.

Lecture 11 Stereo Reconstruction I Lecture 11 Stereo Reconstruction I Mata kuliah: T Computer Vision Tahun: 2010.

Announcements Project 1 artifact winners Project 2 questions

Structure from images. Calibration Review: Pinhole Camera.

Multi-view geometry.

Lecture 12 Stereo Reconstruction II Lecture 12 Stereo Reconstruction II Mata kuliah: T Computer Vision Tahun: 2010.

Single-view 3D Reconstruction Computational Photography Derek Hoiem, University of Illinois 10/18/12 Some slides from Alyosha Efros, Steve Seitz.

Recap from Monday Image Warping – Coordinate transforms – Linear transforms expressed in matrix form – Inverse transforms useful when synthesizing images.

Stereo Vision Reading: Chapter 11 Stereo matching computes depth from two or more images Subproblems: –Calibrating camera positions. –Finding all corresponding.

Image stitching Digital Visual Effects Yung-Yu Chuang with slides by Richard Szeliski, Steve Seitz, Matthew Brown and Vaclav Hlavac.

Stereo Readings Szeliski, Chapter 11 (through 11.5) Single image stereogram, by Niklas EenNiklas Een.

Stereo Many slides adapted from Steve Seitz.

Project 2 code & artifact due Friday Midterm out tomorrow (check your ), due next Fri Announcements TexPoint fonts used in EMF. Read the TexPoint.

Stereo Many slides adapted from Steve Seitz. Binocular stereo Given a calibrated binocular stereo pair, fuse it to produce a depth image image 1image.

Single-view 3D Reconstruction Computational Photography Derek Hoiem, University of Illinois 10/12/10 Some slides from Alyosha Efros, Steve Seitz.

CSE 185 Introduction to Computer Vision Stereo. Taken at the same time or sequential in time stereo vision structure from motion optical flow Multiple.

Bahadir K. Gunturk1 Phase Correlation Bahadir K. Gunturk2 Phase Correlation Take cross correlation Take inverse Fourier transform  Location of the impulse.

Lecture 16: Stereo CS4670 / 5670: Computer Vision Noah Snavely Single image stereogram, by Niklas EenNiklas Een.

Single-view 3D Reconstruction

stereo Outline : Remind class of 3d geometry Introduction

776 Computer Vision Jan-Michael Frahm Spring 2012.

Solving for Stereo Correspondence Many slides drawn from Lana Lazebnik, UIUC.

Computer vision: models, learning and inference M Ahad Multiple Cameras

Paper presentation topics 2. More on feature detection and descriptors 3. Shape and Matching 4. Indexing and Retrieval 5. More on 3D reconstruction 1.

Project 2 due today Project 3 out today Announcements TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AAAAA.

12/15/ Fall 2006 Two-Point Perspective Single View Geometry Final Project Faustinus Kevin Gozali an extended tour.

Energy minimization Another global approach to improve quality of correspondences Assumption: disparities vary (mostly) smoothly Minimize energy function:

CSE 185 Introduction to Computer Vision Stereo 2.

Stereo CS4670 / 5670: Computer Vision Noah Snavely Single image stereogram, by Niklas EenNiklas Een.

Single-view metrology

Scene Modeling for a Single View

Announcements Midterms graded (handed back at end of lecture)

3D Photography: Epipolar geometry

Stereo vision Many slides adapted from Steve Seitz.

Presentation transcript:

Image-based modeling Digital Visual Effects, Spring 2008 Yung-Yu Chuang 2008/5/6 with slides by Richard Szeliski, Steve Seitz and Alexei Efros

Announcements Project #3 is extend to 5/16 Project #2 artifact voting results

Honorable mention (7): 張子捷游知澔

Honorable mention (9): 胡傳牲高紹航

Third place (10): 吳懿倫張哲瀚

Second place (13): 賴韻芊黃柏瑜

First place (24): 游名揚曾照涵

Outline Models from multiple (sparse) images –Structure from motion –Facade Models from single images –Tour into pictures –Single view metrology –Other approaches

Models from multiple images (Façade, Debevec et. al. 1996)

Facade Use a sparse set of images Calibrated camera (intrinsic only) Designed specifically for modeling architecture Use a set of blocks to approximate architecture Three components: –geometry reconstruction –texture mapping –model refinement

Idea

Geometric modeling A block is a geometric primitive with a small set of parameters Hierarchical modeling for a scene Rotation and translation could be constrained

Reasons for block modeling Architectural scenes are well modeled by geometric primitives. Blocks provide a high level abstraction, easier to manage and add constraints. No need to infer surfaces from discrete features; blocks essentially provide prior models for architectures. Hierarchical block modeling effectively reduces the number of parameters for robustness and efficiency.

Reconstruction minimize

Reconstruction

nonlinear w.r.t. camera and model

Results 3 of 12 photographs

Results

Texture mapping

Texture mapping in real world Demo movie Michael Naimark, San Francisco Museum of Modern Art, 1984

Texture mapping

View-dependent texture mapping

model VDTM single texture map

View-dependent texture mapping

Model-based stereo Use stereo to refine the geometry knowncameraviewpoints

Stereo scene point optical center image plane

Stereo Basic Principle: Triangulation –Gives reconstruction as intersection of two rays –Requires calibration point correspondence

Stereo correspondence Determine Pixel Correspondence –Pairs of points that correspond to same scene point Epipolar Constraint –Reduces correspondence problem to 1D search along conjugate epipolar lines epipolar plane epipolar line

Finding correspondences apply feature matching criterion (e.g., correlation or Lucas-Kanade) at all pixels simultaneously search only over epipolar lines (much fewer candidate positions)

Image registration (revisited) How do we determine correspondences? –block matching or SSD (sum squared differences) d is the disparity (horizontal motion) How big should the neighborhood be?

Neighborhood size Smaller neighborhood: more details Larger neighborhood: fewer isolated mistakes w = 3w = 20

Depth from disparity f xx’ baseline z CC’ X f input image (1 of 2) [Szeliski & Kang ‘95] depth map 3D rendering

–Camera calibration errors –Poor image resolution –Occlusions –Violations of brightness constancy (specular reflections) –Large motions –Low-contrast image regions Stereo reconstruction pipeline Steps –Calibrate cameras –Rectify images –Compute disparity –Estimate depth What will cause errors?

Model-based stereo key image offset image warped offset image

Epipolar geometry

Results

Comparisons single texture, flatVDTM, flat VDTM, model- based stereo

Final results Kite photography

Final results

Results

Commercial packages REALVIZ ImageModeler

The Matrix Cinefex #79, October 1999.

The Matrix Academy Awards for Scientific and Technical achievement for 2000 To George Borshukov, Kim Libreri and Dan Piponi for the development of a system for image-based rendering allowing choreographed camera movements through computer graphic reconstructed sets. This was used in The Matrix and Mission Impossible II; See The Matrix Disc #2 for more details

Models from single images

Vanishing points Vanishing point –projection of a point at infinity image plane camera center ground plane vanishing point

Vanishing points (2D) image plane camera center line on ground plane vanishing point

Vanishing points Properties –Any two parallel lines have the same vanishing point v –The ray from C through v is parallel to the lines –An image may have more than one vanishing point image plane camera center C line on ground plane vanishing point V line on ground plane

Vanishing lines Multiple Vanishing Points –Any set of parallel lines on the plane define a vanishing point –The union of all of these vanishing points is the horizon line also called vanishing line –Note that different planes define different vanishing lines v1v1 v2v2

Computing vanishing points Properties –P  is a point at infinity, v is its projection –They depend only on line direction –Parallel lines P 0 + tD, P 1 + tD intersect at P  V P0P0 D

Tour into pictures Create a 3D “ theatre stage ” of five billboards Specify foreground objects through bounding polygons Use camera transformations to navigate through the scene

Tour into pictures

The idea Many scenes (especially paintings), can be represented as an axis-aligned box volume (i.e. a stage) Key assumptions: –All walls of volume are orthogonal –Camera view plane is parallel to back of volume –Camera up is normal to volume bottom –Volume bottom is y=0 Can use the vanishing point to fit the box to the particular Scene!

Fitting the box volume User controls the inner box and the vanishing point placement (6 DOF)

Foreground Objects Use separate billboard for each For this to work, three separate images used: –Original image. –Mask to isolate desired foreground images. –Background with objects removed

Foreground Objects Add vertical rectangles for each foreground object Can compute 3D coordinates P0, P1 since they are on known plane. P2, P3 can be computed as before (similar triangles)

Example

glTip

Zhang et. al. CVPR 2001

Oh et. al. SIGGRAPH 2001

video

Automatic popup Input Ground Vertical Sky Geometric LabelsCut’n’Fold3D Model Image Learned Models

Geometric cues Color Location Texture Perspective

Automatic popup

Results Automatic Photo Pop-up Input Images

Results This approach works roughly for 35% of images.

Failures Labeling Errors

Failures Foreground Objects

References P. Debevec, C. Taylor and J. Malik. Modeling and Rendering Architecture from Photographs: A Hybrid Geometry- and Image-Based Approach, SIGGRAPH 1996.Modeling and Rendering Architecture from Photographs: A Hybrid Geometry- and Image-Based Approach Y. Horry, K. Anjyo and K. Arai. Tour Into the Picture: Using a Spidery Mesh Interface to Make Animation from a Single Image, SIGGRAPH 1997.Tour Into the Picture: Using a Spidery Mesh Interface to Make Animation from a Single Image L. Zhang, G. Dugas-Phocion, J.-S. Samson and S. Seitz. Single View Modeling of Free-Form Scenes, CVPR Single View Modeling of Free-Form Scenes B. Oh, M. Chen, J. Dorsey and F. Durand. Image-Based Modeling and Photo Editing, SIGGRAPH 2001.Image-Based Modeling and Photo Editing D. Hoiem, A. Efros and M. Hebert. Automatic Photo Pop-up, SIGGRAPH 2005.Automatic Photo Pop-up