Virtual Control of Optical Axis of the 3DTV Camera for Reducing Visual Fatigue in Stereoscopic 3DTV Presenter: Yi Shi & Saul Rodriguez March 26, 2008.

Slides:



Advertisements
Similar presentations
CSE473/573 – Stereo and Multiple View Geometry
Advertisements

For Internal Use Only. © CT T IN EM. All rights reserved. 3D Reconstruction Using Aerial Images A Dense Structure from Motion pipeline Ramakrishna Vedantam.
Robust statistical method for background extraction in image segmentation Doug Keen March 29, 2001.
Spatial-Temporal Consistency in Video Disparity Estimation ICASSP 2011 Ramsin Khoshabeh, Stanley H. Chan, Truong Q. Nguyen.
Stereo Vision Reading: Chapter 11
Public Library, Stereoscopic Looking Room, Chicago, by Phillips, 1923.
Stereo Many slides adapted from Steve Seitz. Binocular stereo Given a calibrated binocular stereo pair, fuse it to produce a depth image Where does the.
Real-Time Accurate Stereo Matching using Modified Two-Pass Aggregation and Winner- Take-All Guided Dynamic Programming Xuefeng Chang, Zhong Zhou, Yingjie.
Adviser : Ming-Yuan Shieh Student ID : M Student : Chung-Chieh Lien VIDEO OBJECT SEGMENTATION AND ITS SALIENT MOTION DETECTION USING ADAPTIVE BACKGROUND.
Stereo.
December 5, 2013Computer Vision Lecture 20: Hidden Markov Models/Depth 1 Stereo Vision Due to the limited resolution of images, increasing the baseline.
Boundary matting for view synthesis Samuel W. Hasinoff Sing Bing Kang Richard Szeliski Computer Vision and Image Understanding 103 (2006) 22–32.
Multiple View Geometry : Computational Photography Alexei Efros, CMU, Fall 2005 © Martin Quinn …with a lot of slides stolen from Steve Seitz and.
Contents Description of the big picture Theoretical background on this work The Algorithm Examples.
Direct Methods for Visual Scene Reconstruction Paper by Richard Szeliski & Sing Bing Kang Presented by Kristin Branson November 7, 2002.
Multiple Human Objects Tracking in Crowded Scenes Yao-Te Tsai, Huang-Chia Shih, and Chung-Lin Huang Dept. of EE, NTHU International Conference on Pattern.
1 Introduction to 3D Imaging: Perceiving 3D from 2D Images How can we derive 3D information from one or more 2D images? There have been 2 approaches: 1.
Stereopsis Mark Twain at Pool Table", no date, UCR Museum of Photography.
3D from multiple views : Rendering and Image Processing Alexei Efros …with a lot of slides stolen from Steve Seitz and Jianbo Shi.
Stereo Computation using Iterative Graph-Cuts
3D Computer Vision and Video Computing 3D Vision Lecture 14 Stereo Vision (I) CSC 59866CD Fall 2004 Zhigang Zhu, NAC 8/203A
A Novel 2D To 3D Image Technique Based On Object- Oriented Conversion.
CSE473/573 – Stereo Correspondence
Announcements PS3 Due Thursday PS4 Available today, due 4/17. Quiz 2 4/24.
Visualization- Determining Depth From Stereo Saurav Basu BITS Pilani 2002.
Multiple View Geometry : Computational Photography Alexei Efros, CMU, Fall 2006 © Martin Quinn …with a lot of slides stolen from Steve Seitz and.
Accurate, Dense and Robust Multi-View Stereopsis Yasutaka Furukawa and Jean Ponce Presented by Rahul Garg and Ryan Kaminsky.
1 Perceiving 3D from 2D Images How can we derive 3D information from one or more 2D images? There have been 2 approaches: 1. intrinsic images: a 2D representation.
3-D Scene u u’u’ Study the mathematical relations between corresponding image points. “Corresponding” means originated from the same 3D point. Objective.
Computer Vision Spring ,-685 Instructor: S. Narasimhan WH 5409 T-R 10:30am – 11:50am Lecture #15.
A Rapid Stereo Matching Algorithm Based on Disparity Interpolation Gang Yao Yong Liu Bangjun Lei Dong Ren Institute of Intelligent Vision and Image Information.
ICPR/WDIA-2012 High Quality Novel View Synthesis Based on Low Resolution Depth Image and High Resolution Color Image Jui-Chiu Chiang, Zheng-Feng Liu, and.
Course 12 Calibration. 1.Introduction In theoretic discussions, we have assumed: Camera is located at the origin of coordinate system of scene.
Integral University EC-024 Digital Image Processing.
Recap from Monday Image Warping – Coordinate transforms – Linear transforms expressed in matrix form – Inverse transforms useful when synthesizing images.
WIEN Stereo-based Image and Video Analysis for Multimedia Applications Application: “Computer-generated Stereoscopic Paintings“ M. Gelautz, E. Stavrakis,
Stereo Vision Reading: Chapter 11 Stereo matching computes depth from two or more images Subproblems: –Calibrating camera positions. –Finding all corresponding.
CS654: Digital Image Analysis Lecture 8: Stereo Imaging.
December 4, 2014Computer Vision Lecture 22: Depth 1 Stereo Vision Comparing the similar triangles PMC l and p l LC l, we get: Similarly, for PNC r and.
September 23, 2014Computer Vision Lecture 5: Binary Image Processing 1 Binary Images Binary images are grayscale images with only two possible levels of.
Stereo Many slides adapted from Steve Seitz.
Computer Vision, Robert Pless
: Chapter 11: Three Dimensional Image Processing 1 Montri Karnjanadecha ac.th/~montri Image.
Computer Vision Stereo Vision. Bahadir K. Gunturk2 Pinhole Camera.
Computer Vision Lecture #10 Hossam Abdelmunim 1 & Aly A. Farag 2 1 Computer & Systems Engineering Department, Ain Shams University, Cairo, Egypt 2 Electerical.
CSE 185 Introduction to Computer Vision Stereo. Taken at the same time or sequential in time stereo vision structure from motion optical flow Multiple.
Robust Object Tracking by Hierarchical Association of Detection Responses Present by fakewen.
Bahadir K. Gunturk1 Phase Correlation Bahadir K. Gunturk2 Phase Correlation Take cross correlation Take inverse Fourier transform  Location of the impulse.
Course14 Dynamic Vision. Biological vision can cope with changing world Moving and changing objects Change illumination Change View-point.
Jeong Kanghun CRV (Computer & Robot Vision) Lab..
A global approach Finding correspondence between a pair of epipolar lines for all pixels simultaneously Local method: no guarantee we will have one to.
Project 2 due today Project 3 out today Announcements TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AAAAA.
Image-Based Rendering Geometry and light interaction may be difficult and expensive to model –Think of how hard radiosity is –Imagine the complexity of.
Media Processor Lab. Media Processor Lab. Trellis-based Parallel Stereo Matching Media Processor Lab. Sejong univ.
Energy minimization Another global approach to improve quality of correspondences Assumption: disparities vary (mostly) smoothly Minimize energy function:
Correspondence and Stereopsis. Introduction Disparity – Informally: difference between two pictures – Allows us to gain a strong sense of depth Stereopsis.
Introduction To IBR Ying Wu. View Morphing Seitz & Dyer SIGGRAPH’96 Synthesize images in transition of two views based on two images No 3D shape is required.
1 2D TO 3D IMAGE AND VIDEO CONVERSION. INTRODUCTION The goal is to take already existing 2D content, and artificially produce the left and right views.
Motion tracking TEAM D, Project 11: Laura Gui - Timisoara Calin Garboni - Timisoara Peter Horvath - Szeged Peter Kovacs - Debrecen.
A Plane-Based Approach to Mondrian Stereo Matching
Summary of “Efficient Deep Learning for Stereo Matching”
제 5 장 스테레오.
: Chapter 11: Three Dimensional Image Processing
Computer Vision Lecture 5: Binary Image Processing
What have we learned so far?
Image Processing, Leture #20
Object Recognition Today we will move on to… April 12, 2018
Computer Vision Stereo Vision.
Course 6 Stereo.
Range calculations For each pixel in the left image, determine what pixel position in the right image corresponds to the same point on the object. This.
Presentation transcript:

Virtual Control of Optical Axis of the 3DTV Camera for Reducing Visual Fatigue in Stereoscopic 3DTV Presenter: Yi Shi & Saul Rodriguez March 26, 2008

Outline ► Motivation ► Stereo Matching ► Virtual View Images Synthesis ► Experimental Results ► Discussion

Motivation ► Trade-off between visual comfort and 3D impact with respect to the baseline-stretch of a 3DTV camera. ► Optimal baseline-stretch is around 65 mm but even the diameter of one commercial TV camera could be greater than 65 mm. Figure 1. Baseline-stretch: Human Vs. 3DTV Camera

Motivation (Cont.) ► Stereoscopic images taken by a 3DTV camera with wide baseline-stretch give uncomfortable feeling and cause undesirable effects such as Puppet Theater Effect: Human beings reproduced in reduced size appear as if they were animated puppets. Figure 2. Puppet Theater Effect (Chinese Pi Ying Xi)

Motivation (Cont.) ► Approach: Virtually ► Approach: Virtually control the positions of optical axis of a 3DTV camera by synthesizing the virtual views between two real cameras. Figure 3. Virtual control of the optical axis of a 3DTV camera

Stereo Matching ► Stereo matching is a process to find corresponding points between reference and target image. ► Disparity is defined as the position difference. ► Cameras are assumed to be parallel, and if not, can be made parallel by certain rectification algorithm. Figure 4. Disparity diagram

Stereo Matching (cont.) ► Block matching  For each epipolar line in two images, fixing one block associated to a certain pixel in the 1st image and linearly scan the blocks in the 2nd image.  For each pair of blocks, the mean squared error is calculated for each of the three RGB components and a block in the 2nd image with the minimum value is extracted as a matched block.  The disparity of the corresponding pixel is then calculated.

Stereo Matching (cont.) ► Drawback of conventional stereo matching: Disparity estimation around occlusion boundaries tends to be erroneous. ► Need to exclude occlusion areas adaptively in the process of stereo matching. ► Use three kinds of shifted matching window: left- shifted, centered, and right-shifted. ► Disparity calculation: First calculate the sum of the square difference for three types of shifted matching window along the epipolar line. Then regard the displacement that gives the smallest sum of the square difference as the disparity of the pixel. Figure 5. 3 kinds of matching windows

Stereo Matching (cont.) ► Edge adaptive shifted window technique  Observation: A region without vertical edges hardly includes the occlusion area in the usual stereo setup.  Thus applying only the centered window if there is no edge in a window is more preferable.  Edge detection can be done by Sobel operator which requires negligible computation.  Computational complexity of non-edge areas can be reduced up to about 1/S of those using non-adaptive shifted windows, where S is the number of shifted matching windows.

Stereo Matching (cont.) ► Coarse-to-fine hierarchical matching scheme is used which attempts to maximize the qualities of stereo matching results while maintaining the computational complexity to an acceptable level. Figure 6. Matching Parymid

Stereo Matching (cont.) ► Occlusion area detection  Two disparity maps are obtained for both left and right views.  Then compare the disparity of a pixel in one image with that of the corresponding pixel in the other image over its entirety.  If the two disparities differ more than one pixel, the pixel is considered as an occluded one, otherwise, not.

Virtual View Images Synthesis ► First determine the target position of virtual view image ► Apply forward mapping ► Guess disparities for uncovered area, which shows up due to difference of the disparities. ► Color uncovered area using the guessed disparities. ► If target point does not belong to a detected occlusion area originally, interpolate the color shown in (d) Figure 7. Color synthesis

Experimental Results Figure 8. Results on a city view (Univ. of Tsukuba)

Experimental Results (cont.) Figure 9. Detected occlusion area

Experimental Results (cont.) ► Quantified error rate measurement  Percentage of badly matched pixels is defined as:

Experimental Results (cont.) Figure10. Errors of disparity map compared with the ground truth map

Experimental Results (cont.) Table 1. Percentage of badly matched pixels ► Percentage of badly matched pixels

Experimental Results (cont.) Figure 11. Simulation results for the “ Dog and Flower ” indoor scene

Experimental Results (cont.) Figure 12. Simulation results for the outdoor scene.

Experimental Results (cont.) Chart 1. The average scores of subjective evaluation for the indoor stereoscopic scene

Experimental Results (cont.) Chart 2. The average scores of our subjective evaluation for the outdoor HD stereoscopic scene

Discussion ► ► The virtual control of the optical axis is a first step toward a versatile framework of post-processing for providing comfortable stereoscopic contents. ► ► It would be interesting as a future work to analyze various mutually affecting factors on visual comfortableness and 3D impact in stereoscopic images, such as focal length, convergence angle, resolution, and the distance to main objects ► ► Use a more rigorous subjective assessment with a sufficient number of subjects evaluating a large set of stereoscopic images.

References ► [1] J. Park, G. M. Um, C. Ahn, and C. Ahn, Virtual control of optical axis of the 3DTV camera for reducing visual fatigue in stereoscopic 3DTV, ETRI Journal, Volume 26, Number 6, December ► [2] M. Accame, F. G.B. De Natale, and D. D. Giusto, Hierarchical block matching for disparity estimation in stereo sequences, International Conference on Image Processing, Volume 2, page(s): , October ► [3] J. Park and S. Inoue, Arbitrary View Generation Using Multiple Cameras, Proc. IEEE ICIP'97, volume 1, page(s): Santa Barbara, USA, Oct

Thank you!