Semi-Global Matching with self-adjusting penalties

E. Karkalou, C. Stentoumis, G. Karras
Laboratory of Photogrammetry, National Technical University of Athens
7th International Workshop 3D ARCH: 3D Virtual Reconstruction and Visualization of Complex Architectures, 1-3 March 2017, Nafplion, Greece

Introduction
3D surface reconstruction
- Active (laser and optical) scanning
- Passive (image-based) approaches: competitive with laser scanning in terms of accuracy, cost and flexibility
Dense image matching
- a fundamental procedure: automatic determination of pixel correspondences among images for 3D surface reconstruction
- by using rectified images (epipolar geometry constraint), a disparity map is produced
- a core part of the majority of cultural heritage applications

Introduction
Stereo-matching advantages
- limited number of images, speed
- fixed geometry (e.g. stereo-camera)
Stereo-matching applications
- augmented reality through smart-phones
- reconstruction from a limited number of images (e.g. historical, aerial)
- robotics, autonomous navigation, …
Typical algorithm, after [Scharstein & Szeliski, 2002]
- cost computation: a dissimilarity measure is assigned to each pixel for every value in the disparity range (AD, SSD, NCC, filtered images, rank, census)
- cost aggregation: pixel cost is supported by the cost of neighbouring pixels
- disparity optimization: WTA (winner-takes-all) or an energy function
- disparity refinement: post-processing of the final disparity map
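The first and third steps of the typical algorithm can be illustrated on a single rectified scanline. The following is a minimal sketch, not code from the presentation: it assumes absolute differences (AD) as the cost, skips aggregation, and uses hypothetical names (`ad_cost_volume`, `winner_takes_all`) and an assumed constant cost for candidates that fall outside the image.

```python
def ad_cost_volume(left, right, max_disp):
    """Absolute-difference cost for one rectified scanline pair.

    Returns cost[x][d] = |left[x] - right[x - d]|; candidates whose match
    would fall outside the right scanline get a large constant cost.
    """
    big = 255  # assumed cost for out-of-range candidates (8-bit images)
    volume = []
    for x in range(len(left)):
        row = []
        for d in range(max_disp + 1):
            row.append(abs(left[x] - right[x - d]) if x - d >= 0 else big)
        volume.append(row)
    return volume


def winner_takes_all(volume):
    """Pick, per pixel, the disparity with the lowest cost."""
    return [min(range(len(costs)), key=costs.__getitem__) for costs in volume]


# Toy scanlines: the right line is the left line shifted by 2 pixels.
left = [10, 10, 50, 90, 200, 30, 10, 10]
right = [50, 90, 200, 30, 10, 10, 10, 10]
disparities = winner_takes_all(ad_cost_volume(left, right, max_disp=3))
```

In the textured middle of the scanline (pixels 2 to 5) WTA recovers the shift of 2. In the flat regions the per-pixel cost is ambiguous, which is the situation that aggregation and energy-based optimization are meant to handle.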

Semi-Global Matching (SGM)
- stereo algorithm for the optimization step
- originally proposed by Hirschmüller (2005, 2008)
- based on a global 2D energy function
- approximated by 1D cost aggregation along each of 8 directions (paths)
- advantages: accuracy, computational efficiency, simplicity
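The equations on this slide are not preserved in the transcript. For reference, the energy function and the per-path recursion as published by Hirschmüller (2008) are given below; the notation is that of the paper and may differ from the slide. Here C is the matching cost, D the disparity map, N_p the neighbourhood of pixel p, T[·] is 1 when its argument is true and 0 otherwise, and r is the path direction.

```latex
E(D) = \sum_{p} \Big( C(p, D_p)
       + \sum_{q \in N_p} P_1 \, T\big[\,|D_p - D_q| = 1\,\big]
       + \sum_{q \in N_p} P_2 \, T\big[\,|D_p - D_q| > 1\,\big] \Big)

L_r(p, d) = C(p, d)
       + \min\Big( L_r(p - r, d),\;
                   L_r(p - r, d - 1) + P_1,\;
                   L_r(p - r, d + 1) + P_1,\;
                   \min_{i} L_r(p - r, i) + P_2 \Big)
       - \min_{k} L_r(p - r, k)
```

The path costs L_r are summed over all directions r, and the disparity with the lowest summed cost is selected for each pixel.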

SGM Penalties
- Imposed on disparity changes between neighbouring pixels: P1 for changes of up to 1 pixel, P2 for larger changes
- P1 penalizes slightly slanted or curved surfaces; P2 penalizes depth discontinuities
- Penalties need to be adjusted for every different pair of images and, if a different matching cost method is used, even for the same stereo-pair
- If the parameters have not been properly tuned, the algorithm may not perform as well as expected
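How P1 and P2 enter the optimization can be seen in the cost recursion along a single path. The sketch below follows the standard recursion from Hirschmüller (2008), not code from the presentation; the function name `aggregate_path` and the toy costs are assumptions made for illustration.

```python
def aggregate_path(costs, p1, p2):
    """Aggregate matching costs along one 1D path.

    costs[i][d] is the matching cost of the i-th pixel on the path at
    disparity d. Returns the path costs with the same shape.
    """
    aggregated = [list(costs[0])]  # the first pixel has no predecessor
    for pixel_costs in costs[1:]:
        prev = aggregated[-1]
        prev_min = min(prev)
        row = []
        for d, cost in enumerate(pixel_costs):
            # Same disparity is free, a jump of any size costs P2.
            candidates = [prev[d], prev_min + p2]
            # A change of exactly one disparity level costs P1.
            if d > 0:
                candidates.append(prev[d - 1] + p1)
            if d + 1 < len(prev):
                candidates.append(prev[d + 1] + p1)
            # Subtracting prev_min keeps values bounded along the path.
            row.append(cost + min(candidates) - prev_min)
        aggregated.append(row)
    return aggregated


# Three pixels, three disparities; the middle pixel is ambiguous on its own.
costs = [[0, 9, 9], [5, 5, 9], [0, 9, 9]]
path = aggregate_path(costs, p1=2, p2=6)
```

The middle pixel has equal raw costs at disparities 0 and 1. After aggregation its costs are 5 and 7, so the smoothness term pulls it towards the disparity of its predecessor. Larger penalties strengthen this pull, which is why their values have to match the scale of the matching cost.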

Contribution
- Automatic estimation of SGM penalties from simple statistical properties of the DSI (Disparity Space Image) volume, which already exists from the previous step of the algorithm
  - Penalties are regarded as self-adjusted to the particular stereo-pair, in relation to the cost function used
  - No time-consuming tuning required
  - No ground truth disparity maps or multiple data for training are needed
  - Low computational requirements
- Evaluation of the method via the Middlebury benchmark and the EPFL dataset

Self-adjusting penalty values
- Penalty values influence pixel costs and should therefore be related to them
- It is proposed that penalties are derived from the DSI representation S(x,y,l) of the initial cost
- Penalty estimation without user intervention
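The transcript does not preserve how the penalties are computed from S(x,y,l), so the actual SGM-SAP rule cannot be reproduced here. Purely to illustrate what deriving penalties from simple statistics of the DSI could look like in code, the sketch below computes the mean and standard deviation of the initial costs and scales the mean by two made-up factors. The function names, the factors `k1` and `k2`, and the mapping itself are hypothetical placeholders, not the authors' method.

```python
from statistics import mean, pstdev


def dsi_cost_statistics(dsi):
    """Summarise the initial costs in a DSI given as dsi[y][x][l]."""
    values = [c for row in dsi for pixel in row for c in pixel]
    return {"mean": mean(values), "std": pstdev(values)}


def placeholder_penalties(dsi, k1=0.5, k2=2.0):
    """HYPOTHETICAL mapping from DSI statistics to (P1, P2).

    k1 and k2 are made-up illustration factors; this is not the
    SGM-SAP rule, which the transcript does not state.
    """
    stats = dsi_cost_statistics(dsi)
    return k1 * stats["mean"], k2 * stats["mean"]


# A 1 x 2 pixel image with 3 disparity levels.
dsi = [[[0, 4, 8], [2, 6, 4]]]
p1, p2 = placeholder_penalties(dsi)
```

Whatever the exact rule, the point made on the slide carries over: because the penalties are computed from the costs themselves, they follow the scale of the chosen cost function (for example Hamming distances versus intensity differences) without manual tuning.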

Final algorithm (SGM-SAP)
- Initial matching cost (Census transform & Hamming distance, or Absolute Differences of intensities)
- Penalty estimation
- SGM
- Disparity selection via WTA
- Disparity refinement [optional]: sub-pixel interpolation, left-right consistency, median filtering
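The first step of the pipeline, a Census transform compared with the Hamming distance, can be sketched as follows. This is a generic illustration rather than the authors' implementation; the window radius, the bit order, the handling of border pixels and the function names are all assumptions.

```python
def census_transform(image, radius=1):
    """Census code per pixel: one bit per neighbour, set if neighbour < centre.

    image is a list of rows of intensities. Border pixels are left as 0.
    """
    height, width = len(image), len(image[0])
    codes = [[0] * width for _ in range(height)]
    for y in range(radius, height - radius):
        for x in range(radius, width - radius):
            code = 0
            for dy in range(-radius, radius + 1):
                for dx in range(-radius, radius + 1):
                    if dy == 0 and dx == 0:
                        continue
                    code = (code << 1) | (image[y + dy][x + dx] < image[y][x])
            codes[y][x] = code
    return codes


def hamming_distance(a, b):
    """Number of differing bits between two census codes."""
    return bin(a ^ b).count("1")


image = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
brighter = [[v + 100 for v in row] for row in image]
cost = hamming_distance(census_transform(image)[1][1],
                        census_transform(brighter)[1][1])
```

Because the code only records whether each neighbour is darker than the centre, adding a constant brightness offset leaves it unchanged and the resulting matching cost is 0. The matching cost at disparity d would then be the Hamming distance between the code at (x, y) in the left image and the code at (x - d, y) in the right image.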

Middlebury – Version 3 stereo-pairs [figure: left image, right image, true disparity map]

Results (Middlebury – Version 3) [figure: raw results, sub-pixel interpolation, median filtering]

Results in the Middlebury benchmark
- Overall error 22.8%, 34th position (quarter-size training images, non-occluded regions, 2 pixel threshold)
- Lower error: Playtable, Vintage
- Higher error: ArtL, Pipes, PlaytableP
- Comparison with the original SGM algorithm: error higher by 1.8%
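The error figures above are percentages of bad pixels: pixels in the evaluated region whose disparity differs from the ground truth by more than the threshold. A minimal sketch of such a metric is given below, assuming flat sequences and a boolean validity mask; the function name and data layout are illustrative and are not taken from the Middlebury evaluation code.

```python
def bad_pixel_rate(estimated, ground_truth, valid, threshold=2.0):
    """Percentage of valid pixels whose disparity error exceeds threshold.

    All three arguments are flat, equally long sequences; valid[i] is True
    for pixels that count (e.g. non-occluded pixels with known ground truth).
    """
    errors = [abs(e - g) for e, g, v in zip(estimated, ground_truth, valid) if v]
    if not errors:
        raise ValueError("no valid pixels to evaluate")
    bad = sum(1 for err in errors if err > threshold)
    return 100.0 * bad / len(errors)


estimated = [10, 12.5, 30, 7, 5]
ground_truth = [10, 10, 31, 20, 5]
valid = [True, True, True, False, True]  # the fourth pixel is occluded
rate = bad_pixel_rate(estimated, ground_truth, valid)
```

In this toy case one of the four valid pixels is off by more than 2 pixels, so the rate is 25%. The occluded pixel is ignored even though its error is large.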

Comparison with original SGM
- Our method performs better (blue colour) on slightly slanted surfaces
- It performs less well (red colour) in areas of low texture
- Difference between the two methods: self-adjusting penalties (SGM-SAP) vs tuning-based penalties (original SGM)
- Note: original SGM employs more refinement processes

Results in Middlebury 2006
- Comparison of SGM results with and without automatic penalty estimation (the latter using the optimal parameters of a tuning process*)
- Error of our method over the 21 pairs is higher by only 0.87 percentage points (11.89% vs 11.02%) with the Census cost metric and by 2.27 percentage points (25.72% vs 23.45%) with Absolute Differences
- SGM-SAP is expected to work well for any matching cost function
* (Stentoumis et al., 2015)

Results for Herz-Jesu-K7 stereo-pair
Registration of the generated point cloud onto the ground truth data (from laser scanning) via ICP:
- Average distance: 25 mm (~1.1 pixel)
- Standard deviation: 20 mm (~0.9 pixel)

Results for Herz-Jesu-K7 stereo-pair [figure: registration of the generated point cloud onto the ground truth data]

3D models (Middlebury – Version 3), stereo-pair: Motorcycle [figure panels: ground truth for non-occluded regions; SGM-SAP with left-right consistency, subpixel interpolation and median filtering]

3D models (Middlebury – Version 3), stereo-pair: Piano [figure panels: ground truth for non-occluded regions; SGM-SAP with left-right consistency, subpixel interpolation and median filtering]

3D models (Middlebury – Version 3), stereo-pair: Playroom [figure panels: ground truth for non-occluded regions; SGM-SAP with left-right consistency, subpixel interpolation and median filtering]

3D models (Middlebury – Version 3), stereo-pair: Recycle [figure panels: ground truth for non-occluded regions; SGM-SAP with left-right consistency, subpixel interpolation and median filtering]

3D models (Middlebury – Version 3), stereo-pair: Djembe [figure panel: SGM-SAP with left-right consistency, subpixel interpolation and median filtering]

Distances of point clouds, stereo-pair: PlaytableP [figure panels: ground truth for non-occluded regions; SGM-SAP with left-right consistency, subpixel interpolation and median filtering]

Distances of point clouds

3D model (Kapnikarea church, Athens) [figure: fused model from two stereo-pairs]

3D model (Kapnikarea church, Athens)

Conclusions
- Presentation of a novel approach (SGM-SAP) for the self-adjustment of penalty values in Semi-Global Matching, for any image pair and any matching cost method
- Automatic estimation of the penalties through a simple process with low computational requirements, relying on the DSI volume (which is already computed in the previous step of the matching process)
- No tuning of penalties is needed
- No dataset of “similar” images with corresponding ground truth disparity maps has to be available
- Evaluation on Middlebury-Version 3 stereo-pairs: results competitive with those of the original SGM

Future work
- Testing with more cost functions and SGM-like approaches (“non-local methods”)
- Evaluation on the challenging KITTI dataset for autonomous driving
- Implementation in OpenCV

Thank You… for your attention!