Statistical Inference and Synthesis in the Image Domain for Mobile Robot Environment Modeling L. Abril Torres-Méndez and Gregory Dudek Centre for Intelligent.

Slides:



Advertisements
Similar presentations
Applications of one-class classification
Advertisements

Bayesian Belief Propagation
Active Appearance Models
A Robust Super Resolution Method for Images of 3D Scenes Pablo L. Sala Department of Computer Science University of Toronto.
Texture. Limitation of pixel based processing Edge detection with different threshold.
電腦視覺 Computer and Robot Vision I
CSE 473/573 Computer Vision and Image Processing (CVIP)
Joydeep Biswas, Manuela Veloso
Carolina Galleguillos, Brian McFee, Serge Belongie, Gert Lanckriet Computer Science and Engineering Department Electrical and Computer Engineering Department.
Hilal Tayara ADVANCED INTELLIGENT ROBOTICS 1 Depth Camera Based Indoor Mobile Robot Localization and Navigation.
Introduction To Tracking
Texture. Edge detectors find differences in overall intensity. Average intensity is only simplest difference. many slides from David Jacobs.
Kiyoshi Irie, Tomoaki Yoshida, and Masahiro Tomono 2011 IEEE International Conference on Robotics and Automation Shanghai International Conference Center.
Intelligent Systems Lab. Extrinsic Self Calibration of a Camera and a 3D Laser Range Finder from Natural Scenes Davide Scaramuzza, Ahad Harati, and Roland.
December 5, 2013Computer Vision Lecture 20: Hidden Markov Models/Depth 1 Stereo Vision Due to the limited resolution of images, increasing the baseline.
Motion Tracking. Image Processing and Computer Vision: 82 Introduction Finding how objects have moved in an image sequence Movement in space Movement.
Prénom Nom Document Analysis: Parameter Estimation for Pattern Recognition Prof. Rolf Ingold, University of Fribourg Master course, spring semester 2008.
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
Efficient Moving Object Segmentation Algorithm Using Background Registration Technique Shao-Yi Chien, Shyh-Yih Ma, and Liang-Gee Chen, Fellow, IEEE Hsin-Hua.
1 On the Statistical Analysis of Dirty Pictures Julian Besag.
Rodent Behavior Analysis Tom Henderson Vision Based Behavior Analysis Universitaet Karlsruhe (TH) 12 November /9.
Ensemble Tracking Shai Avidan IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE February 2007.
Stereoscopic Light Stripe Scanning: Interference Rejection, Error Minimization and Calibration By: Geoffrey Taylor Lindsay Kleeman Presented by: Ali Agha.
Real-time Combined 2D+3D Active Appearance Models Jing Xiao, Simon Baker,Iain Matthew, and Takeo Kanade CVPR 2004 Presented by Pat Chan 23/11/2004.
Computing motion between images
High-Quality Video View Interpolation
The plan for today Camera matrix
1 Bayesian Restoration Using a New Nonstationary Edge-Preserving Image Prior Giannis K. Chantas, Nikolaos P. Galatsanos, and Aristidis C. Likas IEEE Transactions.
Illumination Normalization with Time-Dependent Intrinsic Images for Video Surveillance Yasuyuki Matsushita, Member, IEEE, Ko Nishino, Member, IEEE, Katsushi.
Lecture 3a: Feature detection and matching CS6670: Computer Vision Noah Snavely.
Object recognition under varying illumination. Lighting changes objects appearance.
Bootstrapping a Heteroscedastic Regression Model with Application to 3D Rigid Motion Evaluation Bogdan Matei Peter Meer Electrical and Computer Engineering.
Accurate, Dense and Robust Multi-View Stereopsis Yasutaka Furukawa and Jean Ponce Presented by Rahul Garg and Ryan Kaminsky.
Lecture II-2: Probability Review
Statistical Color Models (SCM) Kyungnam Kim. Contents Introduction Trivariate Gaussian model Chromaticity models –Fixed planar chromaticity models –Zhu.
Multi-Scale Surface Descriptors Gregory Cipriano, George N. Phillips Jr., and Michael Gleicher.
My Research Experience Cheng Qian. Outline 3D Reconstruction Based on Range Images Color Engineering Thermal Image Restoration.
Image Analysis and Markov Random Fields (MRFs) Quanren Xiong.
Extracting Places and Activities from GPS Traces Using Hierarchical Conditional Random Fields Yong-Joong Kim Dept. of Computer Science Yonsei.
Reconstruction of 3D Models from Intensity Images and Partial Depth L. Abril Torres-Méndez and Gregory Dudek Centre for Intelligent Machines School of.
Gwangju Institute of Science and Technology Intelligent Design and Graphics Laboratory Multi-scale tensor voting for feature extraction from unstructured.
Shape Recognition and Pose Estimation for Mobile Augmented Reality Author : N. Hagbi, J. El-Sana, O. Bergig, and M. Billinghurst Date : Speaker.
1/20 Obtaining Shape from Scanning Electron Microscope Using Hopfield Neural Network Yuji Iwahori 1, Haruki Kawanaka 1, Shinji Fukui 2 and Kenji Funahashi.
1. Introduction Motion Segmentation The Affine Motion Model Contour Extraction & Shape Estimation Recursive Shape Estimation & Motion Estimation Occlusion.
3D SLAM for Omni-directional Camera
Object Stereo- Joint Stereo Matching and Object Segmentation Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on Michael Bleyer Vienna.
Global Parametrization of Range Image Sets Nico Pietroni, Marco Tarini, Olga Sorkine, Denis Zorin.
CS 782 – Machine Learning Lecture 4 Linear Models for Classification  Probabilistic generative models  Probabilistic discriminative models.
Course 9 Texture. Definition: Texture is repeating patterns of local variations in image intensity, which is too fine to be distinguished. Texture evokes.
A Model for Learning the Semantics of Pictures V. Lavrenko, R. Manmatha, J. Jeon Center for Intelligent Information Retrieval Computer Science Department,
A Passive Approach to Sensor Network Localization Rahul Biswas and Sebastian Thrun International Conference on Intelligent Robots and Systems 2004 Presented.
Statistics in the Image Domain for Mobile Robot Environment Modeling L. Abril Torres-Méndez and Gregory Dudek Centre for Intelligent Machines School of.
Computer Vision Lecture #10 Hossam Abdelmunim 1 & Aly A. Farag 2 1 Computer & Systems Engineering Department, Ain Shams University, Cairo, Egypt 2 Electerical.
Generalised method of moments approach to testing the CAPM Nimesh Mistry Filipp Levin.
Raquel A. Romano 1 Scientific Computing Seminar May 12, 2004 Projective Geometry for Computer Vision Projective Geometry for Computer Vision Raquel A.
Chapter 5 Multi-Cue 3D Model- Based Object Tracking Geoffrey Taylor Lindsay Kleeman Intelligent Robotics Research Centre (IRRC) Department of Electrical.
Approximate Inference: Decomposition Methods with Applications to Computer Vision Kyomin Jung ( KAIST ) Joint work with Pushmeet Kohli (Microsoft Research)
Tony Jebara, Columbia University Advanced Machine Learning & Perception Instructor: Tony Jebara.
 Present by 陳群元.  Introduction  Previous work  Predicting motion patterns  Spatio-temporal transition distribution  Discerning pedestrians  Experimental.
1 Chapter 8: Model Inference and Averaging Presented by Hui Fang.
Statistical Models of Appearance for Computer Vision 主講人:虞台文.
Course 5 Edge Detection. Image Features: local, meaningful, detectable parts of an image. edge corner texture … Edges: Edges points, or simply edges,
Motion Estimation Multimedia Systems and Standards S2 IF Telkom University.
Edge Preserving Spatially Varying Mixtures for Image Segmentation Giorgos Sfikas, Christophoros Nikou, Nikolaos Galatsanos (CVPR 2008) Presented by Lihan.
11/25/03 3D Model Acquisition by Tracking 2D Wireframes Presenter: Jing Han Shiau M. Brown, T. Drummond and R. Cipolla Department of Engineering University.
A Plane-Based Approach to Mondrian Stereo Matching
Semi-Global Matching with self-adjusting penalties
Motion and Optical Flow
Texture.
Image Registration 박성진.
Presentation transcript:

Statistical Inference and Synthesis in the Image Domain for Mobile Robot Environment Modeling L. Abril Torres-Méndez and Gregory Dudek Centre for Intelligent Machines School of Computer Science McGill University

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems Outline Introduction Problem statement Overview of the method Related work Range Synthesis Algorithm Experimental results Conclusions and future directions

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems Our Application Automatic generation of 3D maps. Robot navigation, localization - Ex. For rescue and inspection tasks. Robots are commonly equipped with camera(s) and laser rangefinder.  Would like a full range map of the the environment.  Simple acquisition of data

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems Problem Context Pure vision-based methods –Shape-from-X remains challenging, especially in unconstrained environments. Laser line scanners are commonplace, but –Volume scanners remain exotic, costly, slow. –Incomplete range maps are far easier to obtain that complete ones. Proposed solution: Combine visual and partial depth Shape-from-(partial) Shape

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems Problem Statement From incomplete range data combined with intensity, perform scene recovery. From range scans like this infer the rest of the map

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems Overview of the Method Approximate the composite of intensity and range data at each point as a Markov process. Infer complete range maps by estimating joint statistics of observed range and intensity.

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems What knowledge does Intensity provide about Surfaces? Two examples of kind of inferences: Intensity image Range image surface smoothness variations in depth surface smoothness far close

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems What about outliers? Changes in intensity from texture-like regions Intensity Range Not critical for reconstruction since very close neighborhoods represent the same type of smooth surface.

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems What about Edges? Edges often detect depth discontinuities Very useful in the reconstruction process! Intensity Range edges

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems Range synthesis basis Range and intensity images are correlated, in complicated ways, exhibiting useful structure. - Basis of shape from shading & shape from darkness, but they are based on strong assumptions. The variations of pixels in the intensity and range images are related to the values elsewhere in the image(s). Markov Random Fields

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems Related Work Probabilistic updating has been used for –image restoration [e.g. Geman & Geman, TPAMI 1984] as well as –texture synthesis [e.g. Efros & Leung, ICCV 1999]. Problems: Pure extrapolation/interpolation: –is suitable only for textures with a stationary distribution –can converge to inappropriate dynamic equilibria

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems MRFs for Range Synthesis States are described as augmented voxels V=(I,R,E). Z m =(x,y):1≤x,y≤m Z m =(x,y):1≤x,y≤m: mxm lattice over which the image are described. I = {I x,y }, (x,y)  Z m I = {I x,y }, (x,y)  Z m : intensity (gray or color) of the input image E is a binary matrix (1 if an edge exists and 0 otherwise). R={R x,y }, (x,y)  Z m R={R x,y }, (x,y)  Z m : incomplete depth values We model V as an MRF. I and R are random variables. R I v x,y Augmented Range Map I R

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems Markov Random Field Model Definition: A stochastic process for which a voxel value is predicted by its neighborhood in range and intensity. N x,y is a square neighborhood of size n x n centered at voxel V x,y.

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems Computing the Markov Model From observed data, we can explicitly compute intensity intensity & range V x,y N x,y This can be represented parametrically or via a table. –To make it efficient, we use the sample data itself as a table.

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems ­ Further, we can do this even with partial neighborhood information. Estimation using the Markov Model From what should an unknown range value be? ¬For an unknown range value with a known neighborhood, we can select the maximum likelihood estimate for V x,y. ® Even further, if both intensity and range are missing we can marginalize out the unknown neighbors. intensity intensity & range

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems Interpolate PDF In general, we cannot uniquely solve the desired neighborhood configuration, instead assume The values in N u,v are similar to the values in N x,y, (x,y) ≠ (u,v). Similarity measure: Similarity measure: Gaussian-weighted SSD ( sum of squared differences ). Update schedule is purely causal and deterministic.

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems Order of Reconstruction Dramatically reflects the quality of the result. Our reconstruction sequence is based on the amount of reliable information surrounding each of the voxels to be synthesized. Edge information used to defer reconstruction of voxels with edges as much as possible. Info-edge-driven ordering Correct result With the spiral-ordering

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems Experimental Evaluation Obtain full range and intensity maps of the same environment. Remove most of the range data, then try and estimate what it is. Use the original ground truth data to estimate accuracy of the reconstruction.

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems Arbitrary shape of unknown range data Input Intensity Synthesized result Scharstein & Szeliski’s Data Set Middlebury College Compact case Ground truth range Input range image Synthesized result Ground truth range

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems Input IntensityGround truth range Input range image Arbitrary shape of unknown range data Scharstein & Szeliski’s Data Set Middlebury College Less compact case Synthesized result

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems Arbitrary shape of unknown range data Compact Distributed Synthesized results Ground truth range Expected quality of reconstruction degrades with distance from known boundaries Need broader distribution of range-intensity combinations in the sampling

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems Ground truth range Approximated scene size: 550cms. Type I: Stripes along the x- and y-axis Input Intensity Data courtesy of Oak Ridge National Labs & Univ. of Florida xwxw rwrw Case 1: r w =10, x w =20. 39% of missing range Synthesized Result Mean Absolute Residual (MAR) Error: 5.76 cms. Ground truth range Approximated scene size: 550cms.

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems Initial range (r w =5, x w =25). 61% of missing range Case 2 Result Input intensity Mean Absolute Residual (MAR) Error: 8.86 cms. Initial range 61% missing range Ground truth range (approx. scene size: 550 cms.)

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems Case 3 Result Initial range 76% missing range Input intensity Initial range data (r w =3, x w =28). 76% missing range Mean Absolute Residual (MAR) Error: 9.99 cms. Ground truth range (approx. scene size: 550 cms.)

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems Input intensity and range data Synthesized results Ground truth range

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems Input range data. 66% of missing range Achromatic vs. Color Results Using achromatic image Using color image Synthesized range images Greyscale image Color image Ground truth range

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems Type II: Stripes along the x-axis (like those obtained by our robot) Case 1: r w =10, x w = % of missing range xwxw rwrw xwxw rwrw Input Intensity Ground truth range Approx. scene size: 600cms. Synthesized Result MAR Error: cms. Ground truth range Approximated scene size: 600cms.

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems Case 2 Result Mean Absolute Residual (MAR) Error: cms. 75% missing rangeInput intensity

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems Case 3 Result Mean Absolute Residual (MAR) Error: cms. 78% missing rangeInput intensity

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems Mike, just in case somebody ask The picture on the top appears on the paper and the bottom one on this presentation. The only difference is in the region indicated by the red ellipse in the ground truth range image (right). This region corresponds to the glass window of the intensity image (left). Because it is a reflective surface, our laser range finder cannot measure it correctly. We decided to inter- polate all the incorrect measure- ments to get a new ground truth range image (bottom). Thus, the synthesized images also differ.

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems More Experimental Results

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems Conclusions Works very well -- is this consistent? Can be more robust than standard methods (e.g. shape from shading) due to limited dependence on a priori reflectance assumptions. Depends on adequate amount of reliable range as input. Depends on statistical consistency of region to be constructed and region that has been measured.

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems Discussion & Ongoing Work Surface normals are needed when the input range data do not capture the underlying structure Data from real robot –Issues: non-uniform scale, registration, correlation on different type of data –Integration of data from different viewpoints

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems Questions ?

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems Adding Surface Normals Not in the paper We compute the normals by fitting a plane (smooth surface) in windows of n x n pixels. Normal vector: Eigenvector with the smallest eigenvalue of the covariance matrix. Similarity is now computed between surface normals instead of range values.

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems Initial range scans Preliminary Results Not in the paper Synthesized range image Ground truth range Edge map Real intensity image Initial range data Real intensity imageEdge map

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems The Neighborhood The neighborhood N {x,y} is a square mask of size nxn centered at the augmented voxel. Observations: The neighborhood is causal The size is important to capture relevant characteristics v (x,y) I R Neighborhood of v (x,y) N {x,y}

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems

Initial range (none) Novel View Synthesis (ongoing work) Intensity image Range image Known intensity image Infer range map from intensity of current view and from range and intensity of other views. Other view Current view to estimate range map Synthesized range Ground truth range

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems Previous results Results using EDGES Ground truth range MAR Errors (cms.)

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems More examples Synthesized results Ground truth rangeThe input intensity and range data

2004 IEEE/RSJ International Conference on Intelligent Robots and Systems Adding Surface Normals Not in the paper We compute the normals by fitting a plane (smooth surface) in windows of nxn pixels. The eigenvector with the smallest eigenvalue of the covariance matrix is the normal vector. – The smallest eigenvalue measures the quality of the fit. In the range synthesis, the similarity is now computed between surface normals instead of range values.