Fast Cost-volume Filtering For Visual Correspondence and Beyond Asmaa Hosni, Member, IEEE, Christoph Rhemann, Michael Bleyer, Member, IEEE, Carsten Rother,

Slides:



Advertisements
Similar presentations
Change Detection C. Stauffer and W.E.L. Grimson, “Learning patterns of activity using real time tracking,” IEEE Trans. On PAMI, 22(8): , Aug 2000.
Advertisements

Analysis of Contour Motions Ce Liu William T. Freeman Edward H. Adelson Computer Science and Artificial Intelligence Laboratory Massachusetts Institute.
Spatial-Temporal Consistency in Video Disparity Estimation ICASSP 2011 Ramsin Khoshabeh, Stanley H. Chan, Truong Q. Nguyen.
Cuong Cao Pham and Jae Wook Jeon, Member, IEEE
M.S. Student, Hee-Jong Hong
Real-Time Accurate Stereo Matching using Modified Two-Pass Aggregation and Winner- Take-All Guided Dynamic Programming Xuefeng Chang, Zhong Zhou, Yingjie.
Foreground Background detection from video Foreground Background detection from video מאת : אבישג אנגרמן.
GrabCut Interactive Image (and Stereo) Segmentation Carsten Rother Vladimir Kolmogorov Andrew Blake Antonio Criminisi Geoffrey Cross [based on Siggraph.
Interactive Image Segmentation using Graph Cuts Mayuresh Kulkarni and Fred Nicolls Digital Image Processing Group University of Cape Town PRASA 2009.
Optimization & Learning for Registration of Moving Dynamic Textures Junzhou Huang 1, Xiaolei Huang 2, Dimitris Metaxas 1 Rutgers University 1, Lehigh University.
1 Minimum Ratio Contours For Meshes Andrew Clements Hao Zhang gruvi graphics + usability + visualization.
A New Block Based Motion Estimation with True Region Motion Field Jozef Huska & Peter Kulla EUROCON 2007 The International Conference on “Computer as a.
Modeling Pixel Process with Scale Invariant Local Patterns for Background Subtraction in Complex Scenes (CVPR’10) Shengcai Liao, Guoying Zhao, Vili Kellokumpu,
Boundary matting for view synthesis Samuel W. Hasinoff Sing Bing Kang Richard Szeliski Computer Vision and Image Understanding 103 (2006) 22–32.
Advanced Topics in Computer Vision Spring 2006 Video Segmentation Tal Kramer, Shai Bagon Video Segmentation April 30 th, 2006.
High-Quality Video View Interpolation
Optical flow and Tracking CISC 649/849 Spring 2009 University of Delaware.
Effective Gaussian mixture learning for video background subtraction Dar-Shyang Lee, Member, IEEE.
Interactive Matting Christoph Rhemann Supervised by: Margrit Gelautz and Carsten Rother.
An Iterative Optimization Approach for Unified Image Segmentation and Matting Hello everyone, my name is Jue Wang, I’m glad to be here to present our paper.
Perceptual Organization: Segmentation and Optical Flow.
100+ Times Faster Weighted Median Filter [cvpr ‘14]
IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 20, NO. 11, NOVEMBER 2011 Qian Zhang, King Ngi Ngan Department of Electronic Engineering, the Chinese university.
Presented By : Murad Tukan
Optical Flow Digital Photography CSE558, Spring 2003 Richard Szeliski (notes cribbed from P. Anandan)
On Building an Accurate Stereo Matching System on Graphics Hardware
Fast Cost-volume Filtering For Visual Correspondence and Beyond Asmaa Hosni, Member, IEEE, Christoph Rhemann, Michael Bleyer, Member, IEEE, Carsten Rother,
Image Segmentation Rob Atlas Nick Bridle Evan Radkoff.
Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’04) /04 $20.00 c 2004 IEEE 1 Li Hong.
Joint Histogram Based Cost Aggregation For Stereo Matching Dongbo Min, Member, IEEE, Jiangbo Lu, Member, IEEE, Minh N. Do, Senior Member, IEEE IEEE TRANSACTION.
Stereo Matching Information Permeability For Stereo Matching – Cevahir Cigla and A.Aydın Alatan – Signal Processing: Image Communication, 2013 Radiometric.
Tracking Pedestrians Using Local Spatio- Temporal Motion Patterns in Extremely Crowded Scenes Louis Kratz and Ko Nishino IEEE TRANSACTIONS ON PATTERN ANALYSIS.
Michael Bleyer LVA Stereo Vision
Mutual Information-based Stereo Matching Combined with SIFT Descriptor in Log-chromaticity Color Space Yong Seok Heo, Kyoung Mu Lee, and Sang Uk Lee.
Prakash Chockalingam Clemson University Non-Rigid Multi-Modal Object Tracking Using Gaussian Mixture Models Committee Members Dr Stan Birchfield (chair)
Surface Stereo with Soft Segmentation Michael Bleyer 1, Carsten Rother 2, Pushmeet Kohli 2 1 Vienna University of Technology, Austria 2 Microsoft Research.
Graph Cut & Energy Minimization
A Local Adaptive Approach for Dense Stereo Matching in Architectural Scene Reconstruction C. Stentoumis 1, L. Grammatikopoulos 2, I. Kalisperakis 2, E.
Optical Flow Donald Tanguay June 12, Outline Description of optical flow General techniques Specific methods –Horn and Schunck (regularization)
Tzu ming Su Advisor : S.J.Wang MOTION DETAIL PRESERVING OPTICAL FLOW ESTIMATION 2013/1/28 L. Xu, J. Jia, and Y. Matsushita. Motion detail preserving optical.
Graph Cut 韋弘 2010/2/22. Outline Background Graph cut Ford–Fulkerson algorithm Application Extended reading.
Takuya Matsuo, Norishige Fukushima and Yutaka Ishibashi
Object Stereo- Joint Stereo Matching and Object Segmentation Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on Michael Bleyer Vienna.
Cross-Based Local Multipoint Filtering
A Non-local Cost Aggregation Method for Stereo Matching
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 34, NO. 2, FEBRUARY Leonardo De-Maeztu, Arantxa Villanueva, Member, IEEE, and.
Visual motion Many slides adapted from S. Seitz, R. Szeliski, M. Pollefeys.
Stereo Many slides adapted from Steve Seitz.
Window-based Approach For Fast Stereo Correspondence Raj Kumar Gupta, Siu-Yeung Cho IET Computer Vision,
Effective Optical Flow Estimation
Computer Vision Lecture #10 Hossam Abdelmunim 1 & Aly A. Farag 2 1 Computer & Systems Engineering Department, Ain Shams University, Cairo, Egypt 2 Electerical.
1 Markov random field: A brief introduction (2) Tzu-Cheng Jen Institute of Electronics, NCTU
Segmentation of Vehicles in Traffic Video Tun-Yu Chiang Wilson Lau.
A Dynamic Conditional Random Field Model for Object Segmentation in Image Sequences Duke University Machine Learning Group Presented by Qiuhua Liu March.
Journal of Visual Communication and Image Representation
Stereo Video 1. Temporally Consistent Disparity Maps from Uncalibrated Stereo Videos 2. Real-time Spatiotemporal Stereo Matching Using the Dual-Cross-Bilateral.
Optical flow and keypoint tracking Many slides adapted from S. Seitz, R. Szeliski, M. Pollefeys.
Photoconsistency constraint C2 q C1 p l = 2 l = 3 Depth labels If this 3D point is visible in both cameras, pixels p and q should have similar intensities.
Motion Segmentation at Any Speed Shrinivas J. Pundlik Department of Electrical and Computer Engineering, Clemson University, Clemson, SC.
ICCV 2007 Optimization & Learning for Registration of Moving Dynamic Textures Junzhou Huang 1, Xiaolei Huang 2, Dimitris Metaxas 1 Rutgers University 1,
Motion estimation Parametric motion (image alignment) Tracking Optical flow.
Motion estimation Digital Visual Effects, Spring 2005 Yung-Yu Chuang 2005/3/23 with slides by Michael Black and P. Anandan.
Motion tracking TEAM D, Project 11: Laura Gui - Timisoara Calin Garboni - Timisoara Peter Horvath - Szeged Peter Kovacs - Debrecen.
Local Stereo Matching Using Motion Cue and Modified Census in Video Disparity Estimation Zucheul Lee, Ramsin Khoshabeh, Jason Juang and Truong Q. Nguyen.
Summary of “Efficient Deep Learning for Stereo Matching”
Secrets of GrabCut and Kernel K-means
PRAKASH CHOCKALINGAM, NALIN PRADEEP, AND STAN BIRCHFIELD
Analysis of Contour Motions
Optical flow and keypoint tracking
“Traditional” image segmentation
Presentation transcript:

Fast Cost-volume Filtering For Visual Correspondence and Beyond Asmaa Hosni, Member, IEEE, Christoph Rhemann, Michael Bleyer, Member, IEEE, Carsten Rother, Member, IEEE, and Margrit Gelautz,Senior Member, IEEE IEEE Transactions on Pattern Analysis and Machine Intelligence,2013 1

Outline Introduction Related Work Stereo Matching Optical Flow Interactive Image Segmentation Method : Cost-Volume Filtering Applications Experimental Results Conclusion 2

Introduction 3

Many computer vision tasks can be formulated as labeling problems. Solution : a spatially smooth labeling where label transitions are aligned with color edges. => very fast edge preserving filter A generic and simple framework (i) constructing a cost volume (ii) fast cost volume filtering (iii) winner-take-all label selection 4

Related Work 5

Related Work--Stereo Matching Global method Energy minimization process (GC,BP,DP,Cooperative) Per-processing (mean-sift) Accurate but slow Local method A local support region with winner take all Fast but inaccurate. 6

Related Work--Optical Flow Optical flow : the pattern of apparent motion caused by the relative motion between an observer and the scene. A vector field subject to Image Brightness Constancy Equation (IBCE) Application : motion detection, object segmentation, motion compensated encoding, and stereo disparity measurement

Related Work--Optical Flow Each flow vector is a label and subpixel accuracy further increases the label space Method : continuous optimization strategies (coarse-to-fine) SSD Local convolution with oriented Gaussians[13] Local convolution with bilateral filter[14] Adaptive support weights[15,16] Trade off search space (quality) against speed. 8 [13] D. Tschumperle` and R. Deriche, “Vector-Valued Image Regularization with PDE’s: A Common Framework for Different Applications,” CVPR, [14] J. Xiao, H. Cheng, H. Sawhney, C. Rao, and M. Isnardi, “Bilateral Filtering-Based Optical Flow Estimation with Occlusion Detection,” ECCV, [15] M. Werlberger, T. Pock, and H. Bischof, “Motion Estimation with Non-Local Total Variation Regularization,” CVPR, [16] D. Sun, S. Roth, and M. Black, “Secrets of Optical Flow Estimation and Their Principles,” CVPR, 2010.

Related Work--Interactive Image Segmentation Interactive Image Segmentation is a binary labeling problem. Goal : Separate the image into foreground and background regions given some hints by the user. Method : SNAKE Geodesic morphological operator [8,21] Alpha matte [5,22] 9 [5] K. He, J. Sun, and X. Tang, “Guided Image Filtering,” ECCV, [8] A. Criminisi, T. Sharp, and C. Rother, “Geodesic Image and Video Editing,” ACM Graphics, [21] A. Criminisi, T. Sharp, and A. Blake, “GeoS: Geodesic Image Segmentation,”ECCV, [22] C. Rhemann, C. Rother, J. Wang, M. Gelautz, P. Kohli, and P. Rott, “A Perceptually Motivated Online Benchmark for Image Matting,” CVPR, 2009.

Cost-volume Filtering 10

Edge-preserving filtering Edge-preserving filtering methods Weighted Least Square [Lagendijk et al. 1988] Anisotropic diffusion [Perona and Malik 1990] Bilateral filter [Aurich and Weule 95], [Tomasi and Manduchi 98] Digital TV (Total Variation) filter [Chan et al. 2001] 11

Bilateral filter [6] 12 [6] K. Yoon and S. Kweon, “Adaptive Support-Weight Approach for Correspondence Search,” IEEE Trans. Pattern Analysis and Machine Intelligence, Apr

Bilateral filter Advantages Preserve edges in the smoothing process Simple and intuitive Non-iterative Disadvantages Complexity O(Nr 2 ) Gradient distortion Preserves edges,but not gradients 13

Guided filter [5] 14 [5] K. He, J. Sun, and X. Tang, “Guided Image Filtering,” Proc. European Conf. Computer Vision, 2010.

Guided filter 15 Bilateral filter does not have this linear model

Guided filter 16

Bilateral filter V.S. Guided filter 17

Guided filter Edge-preserving filtering Non-iterative O(1) time, fast and non-approximate No gradient distortion 18 Advantages of bilateral filter Disadvantages of bilateral filter

Cost-volume Filtering Label l from C’ : the filtered cost volume i and j : pixel indices. W i,j : The filter weights depend on the guidance image I : the mean vector and covariance of I : a smoothness parameter U : Identity matrix Winner take all : 19 k r r w=(2r+1)^2

Cost-volume Filtering 20

[5] K. He, J. Sun, and X. Tang, “Guided Image Filtering,” Proc. European Conf. Computer Vision, [6] K. Yoon and S. Kweon, “Adaptive Support-Weight Approach for Correspondence Search,” IEEE Trans. Pattern Analysis and Machine Intelligence, Apr [9] F. Crow. Summed-area tables for texture mapping. SIGGRAPH, Cost-volume Filtering 21 (a)Zoom of the green line in the input image. (b)Slice of the cost volume for the line(white/black/red: high/low/lowest costs) (c)The box filter [9] (d)The joint bilateral filter [6] (e)The guided filter [5] (f)Ground-truth labeling

Application 22

Stereo Matching 23

Stereo Matching Cost computation : grayscale gradients in x direction : balances the color and gradient terms : truncation values 24

Stereo Matching Cost computation : grayscale gradients in x and y direction : balances the color and gradient terms : truncation values Occlusion detection : 25

Stereo Matching Cost computation : grayscale gradients in x and y direction : balances the color and gradient terms : truncation values Occlusion detection : Postprocessing 1.Scanline filling : the lowest disparity of the spatially closest nonoccluded pixel 2.Median filter : : adjust the spatial and color similarity : normalization factor 26

Stereo Matching Alternative—symmetric cost aggregation The cost aggregation can be formulated symmetrically for both input images. Replace the 3*1 vector I i in (3) with a 6*1 vector whose entries are given by the RGB color channels of both I i and I’ i-l. 27

Stereo Matching 28 Effect of postprocessing Disparity map with invalidated pixels in red After scanline-based fillingAfter median filtering

Optical flow Cost computation : grayscale gradients in x and y direction : balances the color and gradient terms : truncation values Postprocessing : Median filter and iteration 29

Optical flow Upscale : To find subpixel accurate flow vectors. Color of subpixel filled with bicubic interpolation. 30

Interactive Image Segmentation : foreground F or background B : background color histograms [25] b(i) : the bin of pixel i Cost computation : (binary labeling) => Five iterations [26] 31 [25] S. Vicente, V. Kolmogorov, and C. Rother, “Joint Optimization of Segmentation and Appearance Models,” Proc. IEEE Int’l Conf. Computer Vision, [26] C. Rother, V. Kolmogorov, and A. Blake, “Grabcut-Interactive Foreground Extraction Using Iterated Graph Cuts,” Proc. ACM Siggraph, 2004.

Experimental Results 32

Experimental Results Device : an Intel Core 2 Quad, 2.4 GHZ PC with GeForce GTX480 graphics card with 1.5GB of memory from NVIDIA. Settings parameters:  : depends on the signal-to-noise ratio of an image for stereo for optical flow  Source : Middlebury 33

Stereo Matching 34

Stereo Matching 35

Stereo Matching 36 disparity maps without occlusion filling and postprocessing (invalidated pixels are black)

Stereo Matching 37 [27] A. Hosni, M. Bleyer, M. Gelautz, and C. Rhemann, “Local Stereo Matching Using Geodesic Support Weights,” Int’l Conf. Image Processing, [6] K. Yoon and S. Kweon, “Adaptive Support-Weight Approach for Correspondence Search,” PAMI, [7] C. Richardt, D. Orr, I. Davies, A. Criminisi, and N. Dodgson, “Real-Time Spatiotemporal Stereo Matching Using the Dual-Cross-Bilateral Grid,”ECCV,2010.

Stereo Matching Million Disparity Estimations per second (MDE/s) W : width H : height D : disparity levels FPS : number of frames per second A larger MDE number means a better performing system. 38

Stereo Matching 39

Stereo Matching 40

Stereo Matching 41 Table 2. Rankings and run times for selected local stereo methods. Run times in the table are averaged over the four Middlebury test images. *The run time in [15] was reported before left-right consistency check in the corresponding paper. Hence, for fairness, we have multiplied the reported time by a factor of 2.

Stereo Matching 42

Optical flow 43

Optical flow 44 The respective AEE and AAE are given in parentheses (AEE/AAE).

Optical flow 45 Large displacement flow (b)-(e) and (l)-(o) Motion magnitude for different methods. (f), (p) Our flow vectors with the color coding as in middleburry. (h)-(k) Backward warping results using flow of different methods—the tip of the foot is correctly recovered by our method. Occluded regions cannot be handled by any method.

Interactive Image Segmentation 46

Interactive Image Segmentation 47

Interactive Image Segmentation Our methodGrabCut [26] Time2.85 ms300 ms error5.3%6.2% 48 [26] C. Rother, V. Kolmogorov, and A. Blake, “Grabcut-Interactive Foreground Extraction Using Iterated Graph Cuts,” Proc. ACM Siggraph, 2004.

Conclusion 49

Conclusion Contribution Propose a edge-preserving property Runtime independent of the filter size Achieved both accuracy and efficiency Future Work Leverage this framework to other application areas Process slanted surfaces [32] 50 [32] M. Bleyer, C. Rhemann, and C. Rother, “Patchmatch Stereo—Stereo Matching with Slanted Support Windows,” Proc. British Machine Vision Conf., 2011.