Other 3D Modeling from images systems Dana Cobzas PIMS Postdoc.

Slides:



Advertisements
Similar presentations
Using Strong Shape Priors for Multiview Reconstruction Yunda SunPushmeet Kohli Mathieu BrayPhilip HS Torr Department of Computing Oxford Brookes University.
Advertisements

The fundamental matrix F
For Internal Use Only. © CT T IN EM. All rights reserved. 3D Reconstruction Using Aerial Images A Dense Structure from Motion pipeline Ramakrishna Vedantam.
Stereo Many slides adapted from Steve Seitz. Binocular stereo Given a calibrated binocular stereo pair, fuse it to produce a depth image Where does the.
MASKS © 2004 Invitation to 3D vision Lecture 7 Step-by-Step Model Buidling.
Stereo.
Silhouettes in Multiview Stereo Ian Simon. Multiview Stereo Problem Input: – a collection of images of a rigid object (or scene) – camera parameters for.
Structure from motion.
Last Time Pinhole camera model, projection
Adam Rachmielowski 615 Project: Real-time monocular vision-based SLAM.
SIGGRAPH Course 30: Performance-Driven Facial Animation Section: Markerless Face Capture and Automatic Model Construction Part 2: Li Zhang, Columbia University.
CSCE 641 Computer Graphics: Image-based Modeling Jinxiang Chai.
Multiple View Geometry : Computational Photography Alexei Efros, CMU, Fall 2005 © Martin Quinn …with a lot of slides stolen from Steve Seitz and.
Structure from motion. Multiple-view geometry questions Scene geometry (structure): Given 2D point matches in two or more images, where are the corresponding.
Multi-view stereo Many slides adapted from S. Seitz.
High-Quality Video View Interpolation
The plan for today Camera matrix
3D from multiple views : Rendering and Image Processing Alexei Efros …with a lot of slides stolen from Steve Seitz and Jianbo Shi.
CSCE 641 Computer Graphics: Image-based Modeling Jinxiang Chai.
Visual Odometry for Ground Vehicle Applications David Nister, Oleg Naroditsky, James Bergen Sarnoff Corporation, CN5300 Princeton, NJ CPSC 643, Presentation.
Assignment 2 Compute F automatically from image pair (putative matches, 8-point, 7-point, iterative, RANSAC, guided matching) (due by Wednesday 19/03/03)
Stereo and Structure from Motion
Visual motion Many slides adapted from S. Seitz, R. Szeliski, M. Pollefeys.
CSCE 641 Computer Graphics: Image-based Modeling (Cont.) Jinxiang Chai.
CSE473/573 – Stereo Correspondence
CSCE 641 Computer Graphics: Image-based Modeling (Cont.) Jinxiang Chai.
Multiple View Geometry : Computational Photography Alexei Efros, CMU, Fall 2006 © Martin Quinn …with a lot of slides stolen from Steve Seitz and.
Accurate, Dense and Robust Multi-View Stereopsis Yasutaka Furukawa and Jean Ponce Presented by Rahul Garg and Ryan Kaminsky.
Stereo matching Class 10 Read Chapter 7 Tsukuba dataset.
Review: Binocular stereo If necessary, rectify the two stereo images to transform epipolar lines into scanlines For each pixel x in the first image Find.
Other 3D Modeling from images systems Dana Cobzas PIMS Postdoc.
Computer Vision Spring ,-685 Instructor: S. Narasimhan WH 5409 T-R 10:30am – 11:50am Lecture #15.
Camera Calibration & Stereo Reconstruction Jinxiang Chai.
Reconstructing Relief Surfaces George Vogiatzis, Philip Torr, Steven Seitz and Roberto Cipolla BMVC 2004.
Lecture 11 Stereo Reconstruction I Lecture 11 Stereo Reconstruction I Mata kuliah: T Computer Vision Tahun: 2010.
Structure from images. Calibration Review: Pinhole Camera.
Peter Sturm INRIA Grenoble – Rhône-Alpes (Institut National de Recherche en Informatique et Automatique) An overview of multi-view stereo and other topics.
Lecture 12 Stereo Reconstruction II Lecture 12 Stereo Reconstruction II Mata kuliah: T Computer Vision Tahun: 2010.
Brief Introduction to Geometry and Vision
KinectFusion : Real-Time Dense Surface Mapping and Tracking IEEE International Symposium on Mixed and Augmented Reality 2011 Science and Technology Proceedings.
Xiaoguang Han Department of Computer Science Probation talk – D Human Reconstruction from Sparse Uncalibrated Views.
3D SLAM for Omni-directional Camera
International Conference on Computer Vision and Graphics, ICCVG ‘2002 Algorithm for Fusion of 3D Scene by Subgraph Isomorphism with Procrustes Analysis.
Recap from Monday Image Warping – Coordinate transforms – Linear transforms expressed in matrix form – Inverse transforms useful when synthesizing images.
Dynamic 3D Scene Analysis from a Moving Vehicle Young Ki Baik (CV Lab.) (Wed)
Visual motion Many slides adapted from S. Seitz, R. Szeliski, M. Pollefeys.
Stereo Many slides adapted from Steve Seitz.
Stereo Many slides adapted from Steve Seitz. Binocular stereo Given a calibrated binocular stereo pair, fuse it to produce a depth image image 1image.
Computer Vision, Robert Pless
Lec 22: Stereo CS4670 / 5670: Computer Vision Kavita Bala.
Communication Systems Group Technische Universität Berlin S. Knorr A Geometric Segmentation Approach for the 3D Reconstruction of Dynamic Scenes in 2D.
Bahadir K. Gunturk1 Phase Correlation Bahadir K. Gunturk2 Phase Correlation Take cross correlation Take inverse Fourier transform  Location of the impulse.
Geometric Computer Vision Review and Outlook. What is Computer Vision? Three Related fieldsThree Related fields –Image Processing: Changes 2D images into.
stereo Outline : Remind class of 3d geometry Introduction
Looking at people and Image-based Localisation Roberto Cipolla Department of Engineering Research team
3D reconstruction from uncalibrated images
776 Computer Vision Jan-Michael Frahm & Enrique Dunn Spring 2013.
Visual Odometry David Nister, CVPR 2004
Paper presentation topics 2. More on feature detection and descriptors 3. Shape and Matching 4. Indexing and Retrieval 5. More on 3D reconstruction 1.
Optical flow and keypoint tracking Many slides adapted from S. Seitz, R. Szeliski, M. Pollefeys.
Announcements Final is Thursday, March 18, 10:30-12:20 –MGH 287 Sample final out today.
Automatic 3D modelling of Architecture Anthony Dick 1 Phil Torr 2 Roberto Cipolla 1 1 Department of Engineering 2 Microsoft Research, University of Cambridge.
제 5 장 스테레오.
CS4670 / 5670: Computer Vision Kavita Bala Lec 27: Stereo.
EECS 274 Computer Vision Stereopsis.
Combining Geometric- and View-Based Approaches for Articulated Pose Estimation David Demirdjian MIT Computer Science and Artificial Intelligence Laboratory.
Filtering Things to take away from this lecture An image as a function
Computer Vision Stereo Vision.
Filtering An image as a function Digital vs. continuous images
Stereo vision Many slides adapted from Steve Seitz.
Presentation transcript:

Other 3D Modeling from images systems Dana Cobzas PIMS Postdoc

time Modeling dynamic scenes Large scale (city) modeling

Modeling (large scale) scenes [ Adam Rachmielowski ]

Reconstructing scenes ‘Small’ scenes (one, few buildings)  SFM + multi view stereo  man made scenes: prior on architectural elements  interactive systems City scenes (several streets, large area)  aerial images  ground plane, multi cameras SFM + stereo [+ GPS] depth map fusions

SFM + stereo Man-made environments :  straight edges  family of lines  vanishing points [Dellaert et al 3DPVT06 ] [Zisserman, Werner ECCV02 ]

SFM + stereo  dominant planes  plane sweep – homog between 3D pl. and camera pl.  one parameter search – voting for a plane [Zisserman, Werner ECCV02 ] [Bischof et al 3DPVT06 ]

SFM + stereo  refinement – architectural primitives [Zisserman, Werner ECCV02 … ]

SFM+stereo  Refinement – dense stereo [Pollefeys, Van Gool 98,00,01 ]

Façade – first system [Debevec, Taylor et al. Siggraph 96 ] Based on SFM (points, lines, stereo) Some manual modeling View dependent texture

Priors on architectural primitives θ – parameters for architectural priors type, shape, texture M – model D – data (images) I – reconstructed structures (planes, lines …) [Cipolla, Torr, … ICCV01 ] prior Occluded windows

Interactive systems [Torr et al. Eurogr.06, Siggraph07 ] Video, sparse 3D points, user input M – model primitives D- data I – reconstructed geometry Solved with graph cut

City modeling – aerial images [Heiko Hirschmuller et al - DLR ] Airborne pushbroom camera Semi-global stereo matching (based on mutual information)

City modeling – ground plane Camera cluster car + GPS Calibrated cameras – relative pose GPS – car position - 3D tracking 2D feature tracker 3D points Dense stereo+fusion Texture 3D MODEL [Nister, Pollefeys et al 3DPVT06, ICCV07 ] [ Cornelis, Van Gool CVPR06…] Video: Cannot do frame-frame correspondences SFM

City modeling - example [ Cornelis, Van Gool CVPR06…] 1. feature matching = tracking 2. SFM – camera pose + sparse 3D points 3. Façade reconstruction – rectification of the stereo images - vertical line correlation 4. Topological map generation - orthogonal proj. in the horiz. plane - voting based carving 5. Texture generation - each line segment – column in texture space VIDEO

On-line scene modeling : Adam’s project On-line modeling from video Model not perfect but enough for scene visualization Application predictive display Tracking and Modeling New image Detect fast corners (similar to Harris) SLAM (mono SLAM [Davison ICCV03]) Estimate camera pose Update visible structure Partial bundle adjustment – update all points Save image if keyframe (new view – for texture) Visualization New visual pose Compute closet view Triangulate Project images from closest views onto surface SLAM Camera pose 3D structure Noise model Extended Kalman Filter

Model refinement

Modeling dynamic scenes [Neil Birkbeck]

Multi-camera systems Several cameras mutually registered (precalibrated) Video sequence in each camera Moving object time

Techniques  Naïve : reconstruct shape every frame  Integrate stereo and image motion cues  Extend stereo in temporal domain  Estimate scene flow in 3D from optic flow and stereo Representations :  Disparity/depth  Voxels / level sets  Deformable mesh – hard to keep time consistency Knowledge:  Camera positions  Scene correspondences (structured light)

1. Stereo + motion flow [Zhang, Kambhamettu: On 3D scene flow and structure recovery from multi- view image sequences CVPR 2001, TransSMC 2003] Stereo d E d, E sd Motion flow (u,v,w=  d) E m, E sm

Motion and stereo constraints x t+1 xtxt x’ t x’ t+1 Motion flow Motion smoothness Motion flow Stereo Stereo smoothness

Zhang, Kambhamettu: Results Extensions :  Better motion model (ex. affine)

2. Spacetime stereo [Zhang, Curless, Seitz: Spacetime stereo, CVPR 2003] Extends stereo in time domain: assumes intra-frame correspondences Static scene: disparity Dynamic scene:

Spacetime stereo matching Disparity – linear model in the space-time window Energy Data term in a stereo reconstruction algorithm xlxl xrxr xlxl

Spacetime stereo: Results

Spacetime stereo:video

3. Scene flow [Vedula, Baker, Rander, Collins, Kanade: Three dimensional scene flow, ICCV 99] 2D Optic flow 3D Scene flow Scene flow on tangent plane Motion of x along a ray

Variational Scene Flow [Devernay, Huguet ICCV 2007 ] Extension of [Brox ECCV04] E fl motion flow E fr E st E s -stereo Scene flow: (u,v,d,d’) Data Regularization

Scene flow : results [Devernay, Huguet ICCV 2007 ] Vertical component Right t0 -> Left t0 Left t1 -> Left t0Right t1 -> Left t0

Scene flow: results [Vedula, et al. ICCV 99]

Scene flow: video [Vedula, et al. ICCV 99]

4. Carving in 6D [Vedula, Baker, Seitz, Kanade: Shape and motion carving in 6D] Hexel: 6D photo-consistency:

6D slab sweeping Slab = thickened plane (thikness = upper bound on the flow magnitude)  compute visibility for x 1  determine search region  compute all hexel photo-consistency  carving hexels  update visibility (Problem: visibility below the top layer in the slab before carving)

Carving in 6D: results

7. Surfel sampling [ Carceroni, Kutulakos: Multi-view scene capture by surfel samplig, ICCV01] Surfel: dynamic surface element  shape component : center, normal, curvature  motion component:  reflectance component: Phong parameters

Reconstruction algorithm visibility shadowPhong reflectance c i – camera i l l - light l

Surfel sampling : results

Modeling humans in motion GRIMAGE platform- INRIA Grenoble Multiple calibrated cameras Human in motion Goal: 3D model of the human Instantaneous model that can be viewed from different poses (‘Matrix’) and inserted in an artificial scene (tele- conferences) Our goal: 3D animated human model  capture model deformations and appearance change in motion  animated in a video game

Articulated model Geometric Model Skeleton + skinned mesh (bone weights ) 50+ DOF (CMU mocap data) Tracking  visual hull – bone weights by diffusion  refine mesh/weights Components  silhouette extraction  tracking the course model  learn deformations  learn appearance change [Neil Birkbeck]

Neil- tracking results