Journal of Visual Communication and Image Representation

Slides:



Advertisements
Similar presentations
SOFT SCISSORS: AN INTERACTIVE TOOL FOR REALTIME HIGH QUALITY MATTING International Conference on Computer Graphics and Interactive Techniques ACM SIGGRAPH.
Advertisements

Graph cut Chien-chi Chen.
Spatial-Temporal Consistency in Video Disparity Estimation ICASSP 2011 Ramsin Khoshabeh, Stanley H. Chan, Truong Q. Nguyen.
Leveraging Stereopsis for Saliency Analysis
Department of Electrical and Electronic Engineering, The University of Hong Kong, Pokfulam Road, Hong Kong Three-dimensional curve reconstruction from.
Real-Time Accurate Stereo Matching using Modified Two-Pass Aggregation and Winner- Take-All Guided Dynamic Programming Xuefeng Chang, Zhong Zhou, Yingjie.
GrabCut Interactive Image (and Stereo) Segmentation Carsten Rother Vladimir Kolmogorov Andrew Blake Antonio Criminisi Geoffrey Cross [based on Siggraph.
Robust Foreground Detection in Video Using Pixel Layers Kedar A. Patwardhan, Guillermoo Sapire, and Vassilios Morellas IEEE TRANSACTION ON PATTERN ANAYLSIS.
Tracking Multiple Occluding People by Localizing on Multiple Scene Planes Professor :王聖智 教授 Student :周節.
Texture Segmentation Based on Voting of Blocks, Bayesian Flooding and Region Merging C. Panagiotakis (1), I. Grinias (2) and G. Tziritas (3)
A KLT-Based Approach for Occlusion Handling in Human Tracking Chenyuan Zhang, Jiu Xu, Axel Beaugendre and Satoshi Goto 2012 Picture Coding Symposium.
Learning to Detect A Salient Object Reporter: 鄭綱 (3/2)
Modeling Pixel Process with Scale Invariant Local Patterns for Background Subtraction in Complex Scenes (CVPR’10) Shengcai Liao, Guoying Zhao, Vili Kellokumpu,
Natan Jacobson, Yen-Lin Lee, Vijay Mahadevan, Nuno Vasconcelos, Truong Q. Nguyen IEEE, ICME 2010.
Boundary matting for view synthesis Samuel W. Hasinoff Sing Bing Kang Richard Szeliski Computer Vision and Image Understanding 103 (2006) 22–32.
Efficient Moving Object Segmentation Algorithm Using Background Registration Technique Shao-Yi Chien, Shyh-Yih Ma, and Liang-Gee Chen, Fellow, IEEE Hsin-Hua.
Video Segmentation Based on Image Change Detection for Surveillance Systems Tung-Chien Chen EE 264: Image Processing and Reconstruction.
1 Static Sprite Generation Prof ︰ David, Lin Student ︰ Jang-Ta, Jiang
Region-Level Motion- Based Background Modeling and Subtraction Using MRFs Shih-Shinh Huang Li-Chen Fu Pei-Yung Hsiao 2007 IEEE.
Advanced Topics in Computer Vision Spring 2006 Video Segmentation Tal Kramer, Shai Bagon Video Segmentation April 30 th, 2006.
Multiview stereo. Volumetric stereo Scene Volume V Input Images (Calibrated) Goal: Determine occupancy, “color” of points in V.
Multi-view stereo Many slides adapted from S. Seitz.
High-Quality Video View Interpolation
A Novel 2D To 3D Image Technique Based On Object- Oriented Conversion.
IEEE TRANSACTIONS ON IMAGE PROCESSING, VOL. 20, NO. 11, NOVEMBER 2011 Qian Zhang, King Ngi Ngan Department of Electronic Engineering, the Chinese university.
Real-time Background Cut Alon Rubin Shira Kritchman Present: , Weizmann Institute of Science.
Accurate, Dense and Robust Multi-View Stereopsis Yasutaka Furukawa and Jean Ponce Presented by Rahul Garg and Ryan Kaminsky.
Introduction of Saliency Map
Manhattan-world Stereo Y. Furukawa, B. Curless, S. M. Seitz, and R. Szeliski 2009 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp.
Speaker: Chi-Yu Hsu Advisor: Prof. Jian-Jung Ding Leveraging Stereopsis for Saliency Analysis, CVPR 2012.
Webcam-synopsis: Peeking Around the World Young Ki Baik (CV Lab.) (Fri)
Stereo Matching Information Permeability For Stereo Matching – Cevahir Cigla and A.Aydın Alatan – Signal Processing: Image Communication, 2013 Radiometric.
Linked Edges as Stable Region Boundaries* Michael Donoser, Hayko Riemenschneider and Horst Bischof This work introduces an unsupervised method to detect.
3D Fingertip and Palm Tracking in Depth Image Sequences
Robust Hand Tracking with Refined CAMShift Based on Combination of Depth and Image Features Wenhuan Cui, Wenmin Wang, and Hong Liu International Conference.
Mutual Information-based Stereo Matching Combined with SIFT Descriptor in Log-chromaticity Color Space Yong Seok Heo, Kyoung Mu Lee, and Sang Uk Lee.
Prakash Chockalingam Clemson University Non-Rigid Multi-Modal Object Tracking Using Gaussian Mixture Models Committee Members Dr Stan Birchfield (chair)
KinectFusion : Real-Time Dense Surface Mapping and Tracking IEEE International Symposium on Mixed and Augmented Reality 2011 Science and Technology Proceedings.
A General Framework for Tracking Multiple People from a Moving Camera
A Local Adaptive Approach for Dense Stereo Matching in Architectural Scene Reconstruction C. Stentoumis 1, L. Grammatikopoulos 2, I. Kalisperakis 2, E.
Joint Depth Map and Color Consistency Estimation for Stereo Images with Different Illuminations and Cameras Yong Seok Heo, Kyoung Mu Lee and Sang Uk Lee.
Exploitation of 3D Video Technologies Takashi Matsuyama Graduate School of Informatics, Kyoto University 12 th International Conference on Informatics.
Graph Cut Algorithms for Binocular Stereo with Occlusions
Interactive Shortest Path Part 3 An Image Segmentation Technique Jonathan-Lee Jones.
Takuya Matsuo, Norishige Fukushima and Yutaka Ishibashi
報告人:張景舜 P.H. Wu, C.C. Chen, J.J. Ding, C.Y. Hsu, and Y.W. Huang IEEE Transactions on Image Processing, Vol. 22, No. 9, September 2013 Salient Region Detection.
Object Stereo- Joint Stereo Matching and Object Segmentation Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on Michael Bleyer Vienna.
Feature-Based Stereo Matching Using Graph Cuts Gorkem Saygili, Laurens van der Maaten, Emile A. Hendriks ASCI Conference 2011.
Online Kinect Handwritten Digit Recognition Based on Dynamic Time Warping and Support Vector Machine Journal of Information & Computational Science, 2015.
A Region Based Stereo Matching Algorithm Using Cooperative Optimization Zeng-Fu Wang, Zhi-Gang Zheng University of Science and Technology of China Computer.
Chapter 5 Multi-Cue 3D Model- Based Object Tracking Geoffrey Taylor Lindsay Kleeman Intelligent Robotics Research Centre (IRRC) Department of Electrical.
Interactive Sand Art Drawing Using RGB-D Sensor
Region-Based Saliency Detection and Its Application in Object Recognition IEEE TRANSACTIONS ON CIRCUITS AND SYSTEM FOR VIDEO TECHNOLOGY, VOL. 24 NO. 5,
Image-Based Segmentation of Indoor Corridor Floors for a Mobile Robot Yinxiao Li and Stanley T. Birchfield The Holcombe Department of Electrical and Computer.
Ning Sun, Hassan Mansour, Rabab Ward Proceedings of 2010 IEEE 17th International Conference on Image Processing September 26-29, 2010, Hong Kong HDR Image.
1 Yu Liu 1, Feng Wu 2 and King Ngi Ngan 1 1 Department of Electronic Engineering, The Chinese University of Hong Kong 2 Microsoft Research Asia, Beijing,
Stereo Video 1. Temporally Consistent Disparity Maps from Uncalibrated Stereo Videos 2. Real-time Spatiotemporal Stereo Matching Using the Dual-Cross-Bilateral.
Max-Confidence Boosting With Uncertainty for Visual tracking WEN GUO, LIANGLIANG CAO, TONY X. HAN, SHUICHENG YAN AND CHANGSHENG XU IEEE TRANSACTIONS ON.
An H.264-based Scheme for 2D to 3D Video Conversion Mahsa T. Pourazad Panos Nasiopoulos Rabab K. Ward IEEE Transactions on Consumer Electronics 2009.
Shadow Detection in Remotely Sensed Images Based on Self-Adaptive Feature Selection Jiahang Liu, Tao Fang, and Deren Li IEEE TRANSACTIONS ON GEOSCIENCE.
Robust Segmentation of Freight Containers in Train Monitoring Videos Qing-Jie Kong*, Avinash Kumar**, Narendra Ahuja**,Yuncai Liu* **Department of Electrical.
Local Stereo Matching Using Motion Cue and Modified Census in Video Disparity Estimation Zucheul Lee, Ramsin Khoshabeh, Jason Juang and Truong Q. Nguyen.
Detecting Moving Objects, Ghosts, and Shadows in Video Streams
Heechul Han and Kwanghoon Sohn
A. M. R. R. Bandara & L. Ranathunga
A New Approach to Track Multiple Vehicles With the Combination of Robust Detection and Two Classifiers Weidong Min , Mengdan Fan, Xiaoguang Guo, and Qing.
Vehicle Segmentation and Tracking in the Presence of Occlusions
PRAKASH CHOCKALINGAM, NALIN PRADEEP, AND STAN BIRCHFIELD
A Block Based MAP Segmentation for Image Compression
Presentation transcript:

Multi-view video based multiple objects segmentation using graph cut and spatiotemporal projections Journal of Visual Communication and Image Representation Volume 21, Issues 5–6, July–August 2010, Pages 453–461 Qian Zhang, King Ngi Ngan Department of Electronic Engineering, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong Speaker : Yi-Ting Chen

Outline Introduction The proposed framework Method Segmentation for key view Multi-view video segmentation Experimental results Conclusion

Outline Introduction The proposed framework Method Segmentation for key view Multi-view video segmentation Experimental results Conclusion

Introduction Most of the interest has been focused on the research of single view segmentation. Depth information in the 3D scene can be reconstructed from multi-view images, but multiple view segmentation has not attracted much attention. Most of the classical and start-of-the-art graph cut based segmentation algorithms require user’s interventions to specify the initial foreground and background regions as hard constraints. un-continuous!!!!!

Outline Introduction The proposed framework Method Segmentation for key view Multi-view video segmentation Experimental results Conclusion

Overview of the proposed framework We built a five-view camera system for views v∈{0,1,2,3,4} To reduce the projection error and avoid extensive computational load, we select view 2 as the key view to start the segmentation process.

Outline Introduction The proposed framework Method Segmentation for key view Multi-view video segmentation Experimental results Conclusion

Automatic initial interested objects (IIOs) extraction based on saliency model (SM) Inspired by the work in [33], more sophisticated cues such as motion and depth are combined into our topographical SM. (a) input image, (b) saliency map using depth and motion and (c) extracted IIOs. [33] W.X. Yang, K.N. Ngan ,Unsupervised multiple object segmentation of multiview images, Advanced Concepts for Intelligent Vision Systems Conference (2007), pp. 178–189

Multiple objects segmentation using graph cut For individual object, we construct a sub-graph for the pixels belonging to its “Object Rectangle” an enlarged rectangle to encompass the whole object and restricts the segmentation region we convert multiple objects segmentation into several sub-segmentation problems 例子

Objects segmentation by using graph cut The general formulation of energy function : fp!!!!!! Data term smoothness term

Basic energy function Data term evaluate the likelihood of a certain pixel p assigned to the label fp color (RGB) and depth information are combined fp!!!!! 𝜃 𝑐 is the color distributions modeled by the Gaussian Mixture Model (GMM) 𝜃 𝑑 is the depth modeled by the histogram mode g(·) denotes a Gaussian probability distribution h(·) is the histogram mode w(·) is the mixture weighting coefficient 𝑘 𝑝 is GMM component variable 𝑧 𝑝 ={d, r, g, b} is a four-dimensional feature vector for pixel p

Basic energy function Smoothness term Ep,q(fp,fq) measures the penalty of two neighboring pixels p and q with different labels  dist(p,q) is the coordinate distance between p and q  diff(cp,cq) is the average RGB color difference between p and q  βr=(2〈‖(rp-rq)2‖〉)-1, where 〈·〉 is the expectation operator for the red channel. fp fq!!!!!!!!

The result with basic energy function basic energy function using: (a) color, (b) depth, (c) combined color and depth 

The segmentation errors in the rectangles errors occur because their color and depth information are very similar to the foreground data

Background penalty with occlusion reasoning(1/2) Since we capture the same scene at different view points, occluded background regions often occur around the object boundary. the occluded regions have a higher probability to be the background than the visible ones.  impose a background penalty factor αbp=3.5

Background penalty with occlusion reasoning(2/2) background probability map without occlusion penalty, (b) combined occlusion map (c) background probability map with occlusion penalty.

The erroneous segmentations marked as ellipses errors are mainly caused by the strong color contrast in the background comparing to the weak contrast across the “true” object boundary

Foreground contrast enhancement(1/3) To make the color contrast representation more efficient the average color difference is computed in the perceptually uniform L *a *b color space To enhance the contrast across foreground/background boundary and attenuate the background contrast adopting the motion residual information

Foreground contrast enhancement(2/3) The motion residual is defined as the smoothness term combines the L *a *b color and motion residual contrasts 𝑅 𝑡 𝑣 is the reconstructed image from 𝐼 𝑡−1 𝑣 and the motion field 𝑀 𝑡,𝑡−1 𝑣 are the motion residual of p and q

Foreground contrast enhancement(3/3) But combining the color contrast and motion residual contrasts will not only attenuate the background contrast but also weaken the ‘‘true” foreground contrast

Foreground contrast enhancement(3/3) we define a local color contrast to enhance the discontinuity distribution in its neighborhood calculate the local mean μ and the local variance δ of contrast 𝐸 𝑝,𝑞 ∗ 𝑓 𝑝 , 𝑓 𝑝 = 𝐸 𝑝,𝑞 𝑙𝑜𝑐𝑎𝑙 𝑓 𝑝 , 𝑓 𝑞 +𝐸 𝑝,𝑞 𝑚𝑜𝑡𝑖𝑜𝑛𝑅𝑠 𝑓 𝑝 , 𝑓 𝑞

The result with modified energy function

Outline Introduction The proposed framework Method Segmentation for key view Multi-view video segmentation Experimental results Conclusion

Multi-view video segmentation In many applications, accurate object segmentation for all views of the frames of a video is required. In this section, we extend the segmentation algorithm to multi-view video

Disparity and Motion Projection projected by pixel-based disparity compensation exploits the spatial consistency projected by pixel-based motion compensation from the mask of its previous frame enforces the temporal consistency

Uncertain boundary band validation To improve the segmentation results, we construct an uncertain band along the object boundary based on an activity measure  using our graph cut algorithm to yield more accurate segmentation layers (a) (b) prediction mask of view 3 (b) uncertain band based on the post-processing

Outline Introduction The proposed framework Method Segmentation for key view Multi-view video segmentation Experimental results Conclusion

Experimental results five-view camera system resolution of 640 * 480 at frame rate of 30 frames per second (fps) we demonstrated on two types of multi-view videos simulating different scenarios similar and low depth different depths

Segmentation of IOs with similar and low depth

Segmentation of IOs with different depths

Comparison with others’ methods(1/2) compare with Kolmogorov’s bilayer segmentation algorithm [21] by using their test images (a) left view, (b) right view, (c) result by our proposed algorithm (d) result by Kolmogorov’s algorithm. [21] V. Kolmogorov, A. Criminisi, A. Blake, G. Cross, C. Rother, Probabilistic fusion of stereo with color and contrast for bi-layer segmentation, IEEE Transactions on Pattern Analysis and Machine Intelligence 28 (9) (2006) 1480–1492.

Comparison with others’ methods(2/2) compare our proposed algorithm with an existing method employing multi-way cut with α-expansion [33] using our test images [33] W.X. Yang, K.N. Ngan ,Unsupervised multiple object segmentation of multiview images, Advanced Concepts for Intelligent Vision Systems Conference (2007), pp. 178–189

Outline Introduction The proposed framework Method Segmentation for key view Multi-view video segmentation Experimental results Conclusion

Conclusion In this paper, we propose an automatic segmentation algorithm for multiple objects from multi-view video. The experiment was implemented on two representative multi-view videos. Accurate segmentation results with good visual quality and subjective comparison with others’ methods attest to the efficiency and robustness of our proposed algorithm.

Thanks for your listening!