Real-Time Accurate Stereo Matching using Modified Two-Pass Aggregation and Winner- Take-All Guided Dynamic Programming Xuefeng Chang, Zhong Zhou, Yingjie.

Slides:



Advertisements
Similar presentations
Feature Based Image Mosaicing
Advertisements

Improved Census Transforms for Resource-Optimized Stereo Vision
Spatial-Temporal Consistency in Video Disparity Estimation ICASSP 2011 Ramsin Khoshabeh, Stanley H. Chan, Truong Q. Nguyen.
Gratuitous Picture US Naval Artillery Rangefinder from World War I (1918)!!
Stereo Many slides adapted from Steve Seitz. Binocular stereo Given a calibrated binocular stereo pair, fuse it to produce a depth image Where does the.
Cuong Cao Pham and Jae Wook Jeon, Member, IEEE
M.S. Student, Hee-Jong Hong
Adviser : Ming-Yuan Shieh Student ID : M Student : Chung-Chieh Lien VIDEO OBJECT SEGMENTATION AND ITS SALIENT MOTION DETECTION USING ADAPTIVE BACKGROUND.
Stereo.
A KLT-Based Approach for Occlusion Handling in Human Tracking Chenyuan Zhang, Jiu Xu, Axel Beaugendre and Satoshi Goto 2012 Picture Coding Symposium.
Last Time Pinhole camera model, projection
Multi-resolution Real-time Stereo on Commodity Graphics Hardware 第 12 組 組員:吳冠璋 家庭自動化實驗室 Author: Ruigang Yang and Marc Pollefeys Publication: IEEE Computer.
Multiple View Geometry : Computational Photography Alexei Efros, CMU, Fall 2005 © Martin Quinn …with a lot of slides stolen from Steve Seitz and.
Stereo & Iterative Graph-Cuts Alex Rav-Acha Vision Course Hebrew University.
Stereopsis Mark Twain at Pool Table", no date, UCR Museum of Photography.
The plan for today Camera matrix
Virtual Control of Optical Axis of the 3DTV Camera for Reducing Visual Fatigue in Stereoscopic 3DTV Presenter: Yi Shi & Saul Rodriguez March 26, 2008.
Effective Gaussian mixture learning for video background subtraction Dar-Shyang Lee, Member, IEEE.
Stereo Computation using Iterative Graph-Cuts
Lecture 11: Stereo and optical flow CS6670: Computer Vision Noah Snavely.
Real-Time Stereo Matching on Programmable Graphics Hardware Liang Wei.
Aleixo Cambeiro Barreiro 광주과학기술원 컴퓨터 비전 연구실
CSE473/573 – Stereo Correspondence
IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 20, NO. 11, NOVEMBER 2011 Qian Zhang, King Ngi Ngan Department of Electronic Engineering, the Chinese university.
Announcements PS3 Due Thursday PS4 Available today, due 4/17. Quiz 2 4/24.
Multiple View Geometry : Computational Photography Alexei Efros, CMU, Fall 2006 © Martin Quinn …with a lot of slides stolen from Steve Seitz and.
On Building an Accurate Stereo Matching System on Graphics Hardware
Computer Vision Spring ,-685 Instructor: S. Narasimhan WH 5409 T-R 10:30am – 11:50am Lecture #15.
Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’04) /04 $20.00 c 2004 IEEE 1 Li Hong.
Joint Histogram Based Cost Aggregation For Stereo Matching Dongbo Min, Member, IEEE, Jiangbo Lu, Member, IEEE, Minh N. Do, Senior Member, IEEE IEEE TRANSACTION.
Introduction Belief propagation: known to produce accurate results for stereo processing/ motion estimation High storage requirements limit the use of.
Stereo Matching Information Permeability For Stereo Matching – Cevahir Cigla and A.Aydın Alatan – Signal Processing: Image Communication, 2013 Radiometric.
Michael Bleyer LVA Stereo Vision
3D Fingertip and Palm Tracking in Depth Image Sequences
Mutual Information-based Stereo Matching Combined with SIFT Descriptor in Log-chromaticity Color Space Yong Seok Heo, Kyoung Mu Lee, and Sang Uk Lee.
A Local Adaptive Approach for Dense Stereo Matching in Architectural Scene Reconstruction C. Stentoumis 1, L. Grammatikopoulos 2, I. Kalisperakis 2, E.
Joint Depth Map and Color Consistency Estimation for Stereo Images with Different Illuminations and Cameras Yong Seok Heo, Kyoung Mu Lee and Sang Uk Lee.
Takuya Matsuo, Norishige Fukushima and Yutaka Ishibashi
Object Stereo- Joint Stereo Matching and Object Segmentation Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on Michael Bleyer Vienna.
Cross-Based Local Multipoint Filtering
A Non-local Cost Aggregation Method for Stereo Matching
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 34, NO. 2, FEBRUARY Leonardo De-Maeztu, Arantxa Villanueva, Member, IEEE, and.
Stereo Many slides adapted from Steve Seitz.
CS 4487/6587 Algorithms for Image Analysis
Feature-Based Stereo Matching Using Graph Cuts Gorkem Saygili, Laurens van der Maaten, Emile A. Hendriks ASCI Conference 2011.
Stereo Many slides adapted from Steve Seitz. Binocular stereo Given a calibrated binocular stereo pair, fuse it to produce a depth image image 1image.
Computer Vision, Robert Pless
A Region Based Stereo Matching Algorithm Using Cooperative Optimization Zeng-Fu Wang, Zhi-Gang Zheng University of Science and Technology of China Computer.
1 Real-Time Stereo-Matching for Micro Air Vehicles Pascal Dufour Master Thesis Presentation.
Window-based Approach For Fast Stereo Correspondence Raj Kumar Gupta, Siu-Yeung Cho IET Computer Vision,
Bahadir K. Gunturk1 Phase Correlation Bahadir K. Gunturk2 Phase Correlation Take cross correlation Take inverse Fourier transform  Location of the impulse.
Lecture 16: Stereo CS4670 / 5670: Computer Vision Noah Snavely Single image stereogram, by Niklas EenNiklas Een.
Fast Cost-volume Filtering For Visual Correspondence and Beyond Asmaa Hosni, Member, IEEE, Christoph Rhemann, Michael Bleyer, Member, IEEE, Carsten Rother,
Digital Image Processing
Solving for Stereo Correspondence Many slides drawn from Lana Lazebnik, UIUC.
Improved Census Transforms for Resource-Optimized Stereo Vision
Jeong Kanghun CRV (Computer & Robot Vision) Lab..
Journal of Visual Communication and Image Representation
Efficient Stereo Matching Based on a New Confidence Metric
A global approach Finding correspondence between a pair of epipolar lines for all pixels simultaneously Local method: no guarantee we will have one to.
Project 2 due today Project 3 out today Announcements TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AAAAA.
Stereo Video 1. Temporally Consistent Disparity Maps from Uncalibrated Stereo Videos 2. Real-time Spatiotemporal Stereo Matching Using the Dual-Cross-Bilateral.
數位三維視訊 楊 家 輝 Jar-Ferr Yang 電腦與通信工程研究所 電機工程學系 國立成功大學 Institute of Computer and Communication Engineering Department of Electrical Engineering National Cheng.
Energy minimization Another global approach to improve quality of correspondences Assumption: disparities vary (mostly) smoothly Minimize energy function:
Local Stereo Matching Using Motion Cue and Modified Census in Video Disparity Estimation Zucheul Lee, Ramsin Khoshabeh, Jason Juang and Truong Q. Nguyen.
Stereo CS4670 / 5670: Computer Vision Noah Snavely Single image stereogram, by Niklas EenNiklas Een.
Video object segmentation and its salient motion detection using adaptive background generation Kim, T.K.; Im, J.H.; Paik, J.K.;  Electronics Letters 
SoC and FPGA Oriented High-quality Stereo Vision System
What have we learned so far?
Stereo vision Many slides adapted from Steve Seitz.
Presentation transcript:

Real-Time Accurate Stereo Matching using Modified Two-Pass Aggregation and Winner- Take-All Guided Dynamic Programming Xuefeng Chang, Zhong Zhou, Yingjie Shi, Qinping Zhao - State Key Laboratory of Virtual Reality Technology and Systems, Beihang University, Beijing , China Liang Wang -University of Kentucky, Lexington, KY, USA 2011 International Conference on 3D Imaging, Modeling, Processing, Visualization and Transmission (3DIMPVT) 1

Outline Introduction Framework Proposed Algorithm Weight computation Two-pass aggregation based on credibility estimation Winner-take-all guided DP Experimental Results Conclusion 2

Introduction 3

Background Global stereo algorithms: Minimize certain cost functions Belief propagation, Graph-cut High accuracy but low speed Local stereo algorithms : Based on correlation (in local support window) Fast implementation 4

Objective Present a real-time stereo algorithm Improve the accuracy over scanline-based approach Perform in real-time with high quality Related to [20] and inspired by [12] 5 [20] K.-J. Yoon and I.-S. Kweon, “Locally adaptive support-weight approach for visual correspondence search,” in Proc. of IEEE Conf. on Computer Vision and Pattern recognition, 2005, pp.924–931. [12] L. Wang, M. Liao, M. Gong, and R. Yang, “High-quality real-time stereo using adaptive cost aggregation and dynamic programming,” in Intl. Symposium on 3D Data Processing, Visualization and Transmission, 2006, pp. 798–805.

Locally Adaptive Support-Weight Approach [20] Fix-sized support window Based on color similarity and geometry similarity strong results but time consuming 6 [20] K.-J. Yoon and I.-S. Kweon, “Locally adaptive support-weight approach for visual correspondence search,” in Proc. of IEEE Conf. on Computer Vision and Pattern recognition, 2005, pp.924–931.

Locally Adaptive Support-Weight Approach [20] 7

Framework 8

9 Compute weight for each pixel By color similarity Weight Computation Aggregate matching cost 2D aggregation → two 1D windows O(S 2 ) → O(S) Two-pass aggregation Improve dynamic programming(DP) optimization technique Occlusion boundary improving Winner-take-all CPU and GPU in parallel Speed acceleration Acceleration using graphics hardware

Weight Computation 10

Weight Computation 11

Weight Computation 12

Weight Computation 13 Color Color + Geometry

Two-Pass Aggregation 14

Aggregation 15

Two-Pass Aggregation 2D aggregation → separate 1D windows Horizontal & vertical Complexity : O(S 2 ) → O(S) 16

Two-Pass Aggregation 17

Two-Pass Aggregation 18

Credibility Estimation 19

Credibility Estimation C’ C P

Credibility Estimation Compute support weight and its credibility : T(x) : Excludes points which may be unreliable from two-pass aggregation 21

Two-Pass Aggregation Judge ω’(c,p) : Aggregation matching cost: H c’ : the set off all pixels locate on the same line with c’ V c : the set off all pixels locate on the same column with c 22

Two-Pass Aggregation Judge ω’(c,p) : Aggregation matching cost: 23 c cpipi pixel-wise cost

Two-Pass Aggregation 24

Comparison 25 Without Credibility Estimation With Credibility Estimation

Winner-take-all guided DP 26

Winner-take-all guided DP Adopt amended scan-line optimization technique Combines - Winner-Take-All (WTA) Dynamic Programming (DP) Improving depth estimation at occlusion boundaries Better preserves depth discontinuities 27

Dynamic Programming (DP) Energy minimization framework Objective : find disparity function d 28 γ : penalize of depth discontinuities Width : image width Aggregate matching cost

Dynamic Programming (DP) Energy minimization framework Objective : find disparity function d 29 γ : penalize of depth discontinuities Width : image width

Scanline optomization : Dynamic Programming (DP) 30

Dynamic Programming (DP) Traverse the aggregated costs along each scan-line from left to right Maintain the minimal accumulated costs (up to current position) - p = (x,y), p ’ = (x-1,y) For pixel p Traverse the all the disparities d(p ’ ) Calculate the minimum energy 31 O(D 2 ) ( D : disparity search range) not suitable for real-time system Sum cost Minimize

Dynamic Programming (DP) Only consider d(p)-1, d(p), d(p)+1 as disparity smoothness constrain A pixel usually have similar disparity with surrounding pixels 32 O(D) ( D : disparity search range) disparity change slowly at depth discontinue areas blur the occlusion borders (over-smooth) WTA

Winner-Take-All (WTA) Combine WTA and scanline DP Better handle in depth discontinuity areas Fourth disparity candidate : 33

Comparison 34 DP method WTA DP + WTA Ground Truth

Experimental Results 35

Experimental Results Intel W3350 CPU with 3.0 GHZ Geforce GTX 285 graphics card Cost aggregation : using CUDA on the GPU support window (35*35) K=2, γ c =36, discontinuity cost ( γ =3.25 ) 36

37 Ground Truth Proposed

Experiment on dynamic scene Live videos captured by a bumblebee XB3 camera Achieve 20 fps when: handing stereo image pairs of 320×240 pixels with 24 disparity levels Equivalent to MDE/s 38 (MDE/s) : ‧ Million Disparity Evaluations per second ‧ (number of pixels) * (disparity range ) * (obtained frame-rate) ‧ captures the performance of a stereo algorithm in a single number

Experiment on dynamic scene 39

Experimental Results 40

Experimental Results Without & With Credibility Estimation DP vs. WTA vs. DP+WTA 41

Conclusion 42

Conclusion Propose a high quality real-time stereo algorithm Two-pass aggregation Aggregate matching cost WTA Improve DP optimization technique Improve depth estimation at occlusion boundaries CPU and GPU in parallel High-quality depth map at video frame rate Best accuracy among all real-time algorithms 43