A Plane-Based Approach to Mondrian Stereo Matching

Slides:



Advertisements
Similar presentations
Discontinuity Preserving Stereo with Small Baseline Multi-Flash Illumination Rogerio Feris 1, Ramesh Raskar 2, Longbin Chen 1, Karhan Tan 3 and Matthew.
Advertisements

A Robust Super Resolution Method for Images of 3D Scenes Pablo L. Sala Department of Computer Science University of Toronto.
Efficient High-Resolution Stereo Matching using Local Plane Sweeps Sudipta N. Sinha, Daniel Scharstein, Richard CVPR 2014 Yongho Shin.
Real-time, low-resource corridor reconstruction using a single consumer grade RGB camera is a powerful tool for allowing a fast, inexpensive solution to.
CS 376b Introduction to Computer Vision 04 / 21 / 2008 Instructor: Michael Eckmann.
Computer Vision Detecting the existence, pose and position of known objects within an image Michael Horne, Philip Sterne (Supervisor)
Stereo Matching Segment-based Belief Propagation Iolanthe II racing in Waitemata Harbour.
Lecture 5 Template matching
A new approach for modeling and rendering existing architectural scenes from a sparse set of still photographs Combines both geometry-based and image.
Last Time Pinhole camera model, projection
Stanford CS223B Computer Vision, Winter 2005 Lecture 6: Stereo 2 Sebastian Thrun, Stanford Rick Szeliski, Microsoft Hendrik Dahlkamp and Dan Morris, Stanford.
Contents Description of the big picture Theoretical background on this work The Algorithm Examples.
Processing Digital Images. Filtering Analysis –Recognition Transmission.
Stereo & Iterative Graph-Cuts Alex Rav-Acha Vision Course Hebrew University.
Automatic 2D-3D Registration Student: Lingyun Liu Advisor: Prof. Ioannis Stamos.
Fitting a Model to Data Reading: 15.1,
Stereo Computation using Iterative Graph-Cuts
A Novel 2D To 3D Image Technique Based On Object- Oriented Conversion.
May 2004Stereo1 Introduction to Computer Vision CS / ECE 181B Tuesday, May 11, 2004  Multiple view geometry and stereo  Handout #6 available (check with.
Information that lets you recognise a region.
December 2, 2014Computer Vision Lecture 21: Image Understanding 1 Today’s topic is.. Image Understanding.
Announcements PS3 Due Thursday PS4 Available today, due 4/17. Quiz 2 4/24.
Visualization- Determining Depth From Stereo Saurav Basu BITS Pilani 2002.
3D Scene Models Object recognition and scene understanding Krista Ehinger.
Computer Vision Spring ,-685 Instructor: S. Narasimhan WH 5409 T-R 10:30am – 11:50am Lecture #15.
Tricolor Attenuation Model for Shadow Detection. INTRODUCTION Shadows may cause some undesirable problems in many computer vision and image analysis tasks,
Lecture 12 Stereo Reconstruction II Lecture 12 Stereo Reconstruction II Mata kuliah: T Computer Vision Tahun: 2010.
1. Introduction Motion Segmentation The Affine Motion Model Contour Extraction & Shape Estimation Recursive Shape Estimation & Motion Estimation Occlusion.
Object Stereo- Joint Stereo Matching and Object Segmentation Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on Michael Bleyer Vienna.
WIEN Stereo-based Image and Video Analysis for Multimedia Applications Application: “Computer-generated Stereoscopic Paintings“ M. Gelautz, E. Stavrakis,
Intelligent Vision Systems ENT 496 Object Shape Identification and Representation Hema C.R. Lecture 7.
MESA LAB Two papers in icfda14 Guimei Zhang MESA LAB MESA (Mechatronics, Embedded Systems and Automation) LAB School of Engineering, University of California,
CS654: Digital Image Analysis Lecture 8: Stereo Imaging.
Feature-Based Stereo Matching Using Graph Cuts Gorkem Saygili, Laurens van der Maaten, Emile A. Hendriks ASCI Conference 2011.
1 Howard Schultz, Edward M. Riseman, Frank R. Stolle Computer Science Department University of Massachusetts, USA Dong-Min Woo School of Electrical Engineering.
Acquiring 3D models of objects via a robotic stereo head David Virasinghe Department of Computer Science University of Adelaide Supervisors: Mike Brooks.
Computer Vision, Robert Pless
A Region Based Stereo Matching Algorithm Using Cooperative Optimization Zeng-Fu Wang, Zhi-Gang Zheng University of Science and Technology of China Computer.
1 Artificial Intelligence: Vision Stages of analysis Low level vision Surfaces and distance Object Matching.
Computer Vision Lecture #10 Hossam Abdelmunim 1 & Aly A. Farag 2 1 Computer & Systems Engineering Department, Ain Shams University, Cairo, Egypt 2 Electerical.
Senior Project Poster Day 2005, CIS Dept. University of Pennsylvania Surface Reconstruction from Feature Based Stereo Mickey Mouse, Donald Duck Faculty.
Image-Based Segmentation of Indoor Corridor Floors for a Mobile Robot Yinxiao Li and Stanley T. Birchfield The Holcombe Department of Electrical and Computer.
stereo Outline : Remind class of 3d geometry Introduction
Autonomous Robots Vision © Manfred Huber 2014.
(c) 2000, 2001 SNU CSE Biointelligence Lab Finding Region Another method for processing image  to find “regions” Finding regions  Finding outlines.
Course14 Dynamic Vision. Biological vision can cope with changing world Moving and changing objects Change illumination Change View-point.
Yizhou Yu Texture-Mapping Real Scenes from Photographs Yizhou Yu Computer Science Division University of California at Berkeley Yizhou Yu Computer Science.
Jeong Kanghun CRV (Computer & Robot Vision) Lab..
John Morris Stereo Vision (continued) Iolanthe returns to the Waitemata Harbour.
Image features and properties. Image content representation The simplest representation of an image pattern is to list image pixels, one after the other.
Photoconsistency constraint C2 q C1 p l = 2 l = 3 Depth labels If this 3D point is visible in both cameras, pixels p and q should have similar intensities.
Computer Vision Computer Vision based Hole Filling Chad Hantak COMP December 9, 2003.
Correspondence and Stereopsis. Introduction Disparity – Informally: difference between two pictures – Allows us to gain a strong sense of depth Stereopsis.
3D Perception and Environment Map Generation for Humanoid Robot Navigation A DISCUSSION OF: -BY ANGELA FILLEY.
Summary of “Efficient Deep Learning for Stereo Matching”
Semi-Global Matching with self-adjusting penalties
Introduction to Polygons
José Manuel Iñesta José Martínez Sotoca Mateo Buendía
DIGITAL SIGNAL PROCESSING
Semi-Global Stereo Matching with Surface Orientation Priors
Fast Preprocessing for Robust Face Sketch Synthesis
EECS 274 Computer Vision Stereopsis.
Common Classification Tasks
Brief Review of Recognition + Context
Object Recognition Today we will move on to… April 12, 2018
Computer Vision Stereo Vision.
CSE (c) S. Tanimoto, 2007 Image Understanding
Stereoscopic Surface Perception
Occlusion and smoothness probabilities in 3D cluttered scenes
Artistic Rendering Final Project Initial Proposal
Presentation transcript:

A Plane-Based Approach to Mondrian Stereo Matching Abstract Stereo vision is the problem of estimating a 3D depth map of a scene from two images taken from adjacent viewpoints. It has many applications, including self-driving cars, robotics, and 3D reconstruction. The goal is to find each pixel’s disparity—the distance between its location in the left and right images. The disparity corresponds directly to the point’s distance to the camera, much in the way human eyes perceive depth. Conceptually, disparities can be computed by placing the right image “on top of” the left image and shifting it one pixel at a time until image features or textured regions line up. However, this strategy does not work well in completely untextured regions, such as blank walls, blackboards, and other objects we regularly encounter. The goal of this project is to devise a stereo algorithm that can handle even pathological cases such as scenes consisting solely of solid-colored regions, resembling the abstract paintings by Dutch artist Piet Mondrian. Unlike existing algorithms that rely on matching image texture, our method matches the edges (borders) of single-colored image segments. Once matched, the 3D locations of the edges are used to fit 3D planes that correspond to the original colored segments. Our algorithm generates multiple possible plane hypotheses that represent valid 3D configurations and selects the best hypothesis according to an energy function. The final goal is to employ our novel algorithm in a robust stereo method that can handle both textured and untextured regions in noisy images of natural environments. Dylan Quenneville ’18 Daniel Scharstein Support for this project came from the National Science Foundation under the grant IIS-1320715 Our Approach Selecting Lines to Draw 3D Planes What is Stereo Vision? Stereo vision seeks to extract 3D depth information from two images taken from adjacent viewpoints. This can be done by matching points between the images and measuring their disparity—the difference between a point’s location in the left and right images. This disparity corresponds directly to a point’s distance from the camera. Stereo vision has applications in self-driving cars, robotics, 3D reconstruction, and 3D entertainment. Our approach matches region edges to find disparities, and uses the edge line disparity information to estimate a plane that they bound. The simplest form of this is finding vertical lines in an image and assuming the regions between these lines are ramps between the two disparities. With use of a plane fitting method, our algorithm now can use two edges and their endpoint disparities to draw a 3D plane that fits the (x, y, disparity) coordinates. Disparities at edge endpoints can be used as (x, y, d) coordinates to fit planes for each component. In most images, there will be edges of one plane covering another plane. In these cases, only the occluding plane should use the edge disparities for plane fitting. For this reason, we conduct a depth-first search through all possible planes using some or all edges, rejecting contradictory assignments. In some cases, regions are part of larger planes being partially occluded. We conduct another search to join regions of similar colors if geometrically possible. We also join coplanar components, such as the whiteboard in ‘Corner’ or squares on a patterned surface. Abstract Understanding of Images Motivation Our algorithm begins by segmenting both left and right images into components of like color and assigning them arbitrary labels. We perform a scanline edge detector which draws ‘edgels’ (pixels that represent a boundary between two regions) whenever there is a significant change in color, indicated by a change in component label. Then, these ‘edgels’ are accumulated and fit to line segments that describe the edge in terms of its slope and position rather than a collection of points using a polyline split method. Most passive stereo algorithms match points between left and right images in a stereo pair by placing the right image ‘on top’ of the left image and moving it pixel by pixel until points in the image line up. For images with strongly defined details and texture, these algorithms perform well. Because the matching relies on features lining up at only the correct disparity, untextured (solid-colored) regions create great difficulty. When other objects—such as fences, railings, poles—partially occlude these untextured regions, the matching becomes impossible for most algorithms. This problem has been named ‘Mondrian stereo’ because these untextured images resemble the abstract paintings by Dutch artist Piet Mondrian. Upper Right: the triangle occludes a patterned plane occluding a background. The purple edges show texture edges, owned by the components on either side. The green edges are only owned by the side labeled ‘occlusion’. Upper Left: input image and ground truth disparity map Segmentation and edge detection of a computer-generated image designed to capture the geometry of a room, inspired by the ‘Corner’ image Humans easily recognize that all the small light green regions are part of the background, but we must add an extra step to join regions of similar colors to get from the second image to the third image. Piet Mondrian’s Composition A, 1923 1 2 3 4 5 8 7 6 9 Goals Results The goal of this work is to create an algorithm that can produce accurate depth maps for untextured regions using abstract geometric understanding of the scene rather than pixel-wise matching. Our long-term goal is to use our algorithm on real images in conjunction with existing stereo algorithms. Semi-Global Matching Ours Left Input Right Input Ground Truth First Pass Join by Color Finding Edge Disparities Edges in the left image are matched to edges in the right image if they separate regions of the same color and have similar locations. Once a matching is found without any contradictions, the disparities of each line’s two endpoints are calculated by the distance between line in the left image and their matched lines in the right image. d = 17 d = 12 d = 9 ‘Playroom’: a detailed and mostly textured scene and ground truth disparity map from the Middlebury College stereo training set. ‘Corner’: a scene containing large untextured regions. The right image shows a disparity map produced with Semi-Global Matching, a popular stereo algorithm. The cross-hatched dark red regions show where the algorithm fails. Above: green and blue lines represent the left and right images’ edges, respectively. Left: original left image and ground truth disparity map