Localization of Piled Boxes by Means of the Hough Transform Dimitrios Katsoulas Institute for Pattern Recognition and Image Processing University of Freiburg.

Slides:



Advertisements
Similar presentations
Object Recognition from Local Scale-Invariant Features David G. Lowe Presented by Ashley L. Kapron.
Advertisements

Image Registration  Mapping of Evolution. Registration Goals Assume the correspondences are known Find such f() and g() such that the images are best.
COMP 175 | COMPUTER GRAPHICS Remco Chang1/6103b – Shapes Lecture 03b: Shapes COMP 175: Computer Graphics February 3, 2015.
Road-Sign Detection and Recognition Based on Support Vector Machines Saturnino, Sergio et al. Yunjia Man ECG 782 Dr. Brendan.
Developable Surface Fitting to Point Clouds Martin Peternell Computer Aided Geometric Design 21(2004) Reporter: Xingwang Zhang June 19, 2005.
Joydeep Biswas, Manuela Veloso
Exploiting Homography in Camera-Projector Systems Tal Blum Jiazhi Ou Dec 11, 2003 [Sukthankar, Stockton & Mullin. ICCV-2001]
CHAPTER 12 Height Maps, Hidden Surface Removal, Clipping and Level of Detail Algorithms © 2008 Cengage Learning EMEA.
Two-View Geometry CS Sastry and Yang
Automatic Feature Extraction for Multi-view 3D Face Recognition
Mapping: Scaling Rotation Translation Warp
Convex Position Estimation in Wireless Sensor Networks
Intelligent Systems Lab. Extrinsic Self Calibration of a Camera and a 3D Laser Range Finder from Natural Scenes Davide Scaramuzza, Ahad Harati, and Roland.
Segmentation (2): edge detection
Uncertainty Representation. Gaussian Distribution variance Standard deviation.
Robust Extraction of Vertices in Range Images by Constraining the Hough Transform Dimitrios Katsoulas Institute for Pattern Recognition and Image Processing.
A Versatile Depalletizer of Boxes Based on Range Imagery Dimitrios Katsoulas*, Lothar Bergen*, Lambis Tassakos** *University of Freiburg **Inos Automation-software.
Virtual Dart: An Augmented Reality Game on Mobile Device Supervisor: Professor Michael R. Lyu Prepared by: Lai Chung Sum Siu Ho Tung.
Segmentation into Planar Patches for Recovery of Unmodeled Objects Kok-Lim Low COMP Computer Vision 4/26/2000.
Exchanging Faces in Images SIGGRAPH ’04 Blanz V., Scherbaum K., Vetter T., Seidel HP. Speaker: Alvin Date: 21 July 2004.
3D Mapping Robots Intelligent Robotics School of Computer Science Jeremy Wyatt James Walker.
Università La Sapienza Rome, Italy Scan matching in the Hough domain Andrea Censi, Luca Iocchi, Giorgio Grisetti dis.uniroma1.it
Stepan Obdrzalek Jirı Matas
UNC Chapel Hill M. C. Lin Overview of Last Lecture About Final Course Project –presentation, demo, write-up More geometric data structures –Binary Space.
Fitting a Model to Data Reading: 15.1,
Object recognition under varying illumination. Lighting changes objects appearance.
Information that lets you recognise a region.
CSE473/573 – Stereo Correspondence
Camera parameters Extrinisic parameters define location and orientation of camera reference frame with respect to world frame Intrinsic parameters define.
Robust estimation Problem: we want to determine the displacement (u,v) between pairs of images. We are given 100 points with a correlation score computed.
Modeling and representation 1 – comparative review and polygon mesh models 2.1 Introduction 2.2 Polygonal representation of three-dimensional objects 2.3.
Accurate Process-Hotspot Detection Using Critical Design Rule Extraction Y. Yu, Y. Chan, S. Sinha, I. H. Jiang and C. Chiang Dept. of EE, NCTU, Hsinchu,
CSE 185 Introduction to Computer Vision
AdvisorStudent Dr. Jia Li Shaojun Liu Dept. of Computer Science and Engineering, Oakland University 3D Shape Classification Using Conformal Mapping In.
Multi-resolution Arc Segmentation: Algorithms and Performance Evaluation Jiqiang Song Jan. 12 th, 2004.
CS 450: Computer Graphics REVIEW: OVERVIEW OF POLYGONS
Gwangju Institute of Science and Technology Intelligent Design and Graphics Laboratory Multi-scale tensor voting for feature extraction from unstructured.
Shape Recognition and Pose Estimation for Mobile Augmented Reality Author : N. Hagbi, J. El-Sana, O. Bergig, and M. Billinghurst Date : Speaker.
1/20 Obtaining Shape from Scanning Electron Microscope Using Hopfield Neural Network Yuji Iwahori 1, Haruki Kawanaka 1, Shinji Fukui 2 and Kenji Funahashi.
October 14, 2014Computer Vision Lecture 11: Image Segmentation I 1Contours How should we represent contours? A good contour representation should meet.
Edge Linking & Boundary Detection
Object Stereo- Joint Stereo Matching and Object Segmentation Computer Vision and Pattern Recognition (CVPR), 2011 IEEE Conference on Michael Bleyer Vienna.
HOUGH TRANSFORM Presentation by Sumit Tandon
Intelligent Vision Systems ENT 496 Object Shape Identification and Representation Hema C.R. Lecture 7.
Self-Calibration and Metric Reconstruction from Single Images Ruisheng Wang Frank P. Ferrie Centre for Intelligent Machines, McGill University.
Generalized Hough Transform
The University of Texas at Austin Vision-Based Pedestrian Detection for Driving Assistance Marco Perez.
Feature based deformable registration of neuroimages using interest point and feature selection Leonid Teverovskiy Center for Automated Learning and Discovery.
CS654: Digital Image Analysis Lecture 25: Hough Transform Slide credits: Guillermo Sapiro, Mubarak Shah, Derek Hoiem.
18 th August 2006 International Conference on Pattern Recognition 2006 Epipolar Geometry from Two Correspondences Michal Perďoch, Jiří Matas, Ondřej Chum.
1 Research Question  Can a vision-based mobile robot  with limited computation and memory,  and rapidly varying camera positions,  operate autonomously.
1 Camera calibration based on arbitrary parallelograms 授課教授:連震杰 學生:鄭光位.
Raquel A. Romano 1 Scientific Computing Seminar May 12, 2004 Projective Geometry for Computer Vision Projective Geometry for Computer Vision Raquel A.
A Flexible New Technique for Camera Calibration Zhengyou Zhang Sung Huh CSPS 643 Individual Presentation 1 February 25,
Plane-based external camera calibration with accuracy measured by relative deflection angle Chunhui Cui , KingNgiNgan Journal Image Communication Volume.
October 1, 2013Computer Vision Lecture 9: From Edges to Contours 1 Canny Edge Detector However, usually there will still be noise in the array E[i, j],
Determining 3D Structure and Motion of Man-made Objects from Corners.
776 Computer Vision Jan-Michael Frahm Spring 2012.
Coherent Scene Understanding with 3D Geometric Reasoning Jiyan Pan 12/3/2012.
Computer Graphics One of the central components of three-dimensional graphics has been a basic system that renders objects represented by a set of polygons.
A Fast Algorithm for Incremental Distance Calculation Ming C. Lin & John Canny University of California, Berkeley 1991 Presentation by Adit Koolwal.
Hough Transform CS 691 E Spring Outline Hough transform Homography Reading: FP Chapter 15.1 (text) Some slides from Lazebnik.
3D Perception and Environment Map Generation for Humanoid Robot Navigation A DISCUSSION OF: -BY ANGELA FILLEY.
Lecture 07 13/12/2011 Shai Avidan הבהרה: החומר המחייב הוא החומר הנלמד בכיתה ולא זה המופיע / לא מופיע במצגת.
CENG 477 Introduction to Computer Graphics
Estimation of 3D Bounding Box for Image Object
Dimitrios Katsoulas and Dimitrios Kosmopoulos
Vehicle Segmentation and Tracking in the Presence of Occlusions
Computer Graphics One of the central components of three-dimensional graphics has been a basic system that renders objects represented by a set of polygons.
Presented by Mohammad Rashidujjaman Rifat Ph.D Student,
Presentation transcript:

Localization of Piled Boxes by Means of the Hough Transform Dimitrios Katsoulas Institute for Pattern Recognition and Image Processing University of Freiburg

Introduction Depalletizing: automatic unloading of piled objects via a robot. How important a solution to the problem is? We deal with objects most frequently encountered: Boxes, box-like objects (e.g. sacks full of material) Applications: Post, distribution centers, airports. Here: Boxes of unknown dimensions.

Object configuration

Existing Systems Intensity cameras are utilized. Based on detection of markers on the exposed surfaces of the objects. Advantages: Computational efficiency Disadvantages: – Markers do not always exist! – Systems deal only with neatly placed configurations of objects of the same dimensions. – Performance depends on lighting conditions at installation sites.

Our approach Question: How can we recover the fully exposed surfaces (graspable surfaces) of the objects from input range images?

Recovery of graspable surfaces Fully exposed object surfaces: Planar surfaces with rectangle boundaries. Hypothesis generation (Edge based): Recover rectangle boundaries of the graspable surfaces. Hypothesis verification (Region based): Examine if the range points inside the hypothesized boundaries lie on the plane defined by the boundary.

Hypothesis generation Rectangle boundaries: Geometric parametric entities with 8 parameters: (6 pose parameters and 2 dimension parameters). Tool for recovery of multiple parametric entities from images: Hough transform (HT). Problems of HT: Computationally inefficient, memory consuming, not robust. Proposed solution: Decompose the problem into simpler sub-problems, use the HT to solve each: – Recovery of the pose. – Recovery of the dimensions.

Pose recovery [Chen.Kak:89]: A visible vertex of a convex object provides the strongest constraints for accurately determining its pose. Vertex detection technique: – Detect 3D object boundary lines via HT. – Group orthogonal pairs of lines to vertices. Vertex representation: – Two boundary lines joining at the vertex point. – The intersection point of those boundaries. Pose estimation: via alignment.

Line detection in Range Images Perform edge detection on input range image. Find 3D lines in the edge map by problem decomposition: – Recover the 4 line parameters by solving 2 2D sub problems each recovering 2 line parameters  Computational efficiency. – Constrain the Hough transform to lie on an 1D curve  Robustness, low memory consumption.

Vertex Detection: Results

Problems of vertex detection Not all the linear boundaries and as consequence not all the vertices of the exposed surfaces were recovered! Reason: Lines passing from randomly selected points (distinguished points) are recovered  computational efficiency. Disadvantages: Some of the boundary lines are not recovered. Side effect: Object dimensions cannot be derived from vertices only. Solution: Derive dimensions from boundary points.

Recovery of dimensions (1) Dimensions: Determined from the pose parameters + the edge points. Candidate edge points: – On the same plane with a detected vertex. – On the first quadrant of the coordinate system defined by the vertex.

Recovery of dimensions (2)

Verification Until now we have managed to recover rectangle boundaries from images. Question: Do those boundaries correspond to boundaries of graspable surfaces? Answer: Derived from the range points inside the boundaries. Verification: Check if the range points inside a recovered boundary belong to the plane defined by the boundary.

Experiments

Statistical tests: Avoid thresholds. Multiple decisions which have to be based on thresholds: – Group lines forming an angle of 90 degrees. – Determine if image points belong to a given plane. Thresholds: difficult to set, depends on the application and on the uncertainty in calculation of line parameters. Can we avoid multiple thresholds? Yes, by introducing statistical tests. All thresholds are replaced by a unique significance value. We adopt the framework of [Foerstner et.al:00] for its compactness and straightforwardness.

Experimental results Computational efficiency (on a Pentium MHz): – Scanning: 6.5 sec – Edge detection: 2 sec – Hypothesis generation + verification: 8 sec. – Overall: ~ 17 sec. Accuracy: – < 2.5 cm translational accuracy. – < 2 degrees rotational accuracy. Robustness: – No false alarms. – The system only occasionally fails to recover all the graspable surfaces in the pile.

OPERATION EXAMPLE

Conclusions Advantages : – Insensitivity to lighting conditions: Usage of laser sensor. – Accuracy: Accurate calculation of pose parameters. – Robustness: Decisions based both on edge and region based information, statistical tests are employed. – Computational efficiency: Parameter recovery problem decomposition into smaller sub-problems. – Versatility: Deals with both jumbled and neat object configurations. – Simplicity. Problems: – Height of objects not recovered. – System fails when no boundary information can be recovered, that is when the distance between neighbouring objects is smaller than the sensor resolution.

Do you want to know how we deal with those objects?... Come to ICCV 03!