Accelerator-based Implementation of the Harris Algorithm International Conference on Image and Signal Processing 2012 (ICISP 2012) June 28-30, Agadir,

Slides:



Advertisements
Similar presentations
Feature extraction: Corners
Advertisements

Real-Time Template Tracking
Feature Based Image Mosaicing
Implementation of 2-D FFT on the Cell Broadband Engine Architecture William Lundgren Gedae), Kerry Barnes (Gedae), James Steed (Gedae)
Computational Biology, Part 23 Biological Imaging II Robert F. Murphy Copyright  1996, 1999, All rights reserved.
電腦視覺 Computer and Robot Vision I Chapter2: Binary Machine Vision: Thresholding and Segmentation Instructor: Shih-Shinh Huang 1.
3D M otion D etermination U sing µ IMU A nd V isual T racking 14 May 2010 Centre for Micro and Nano Systems The Chinese University of Hong Kong Supervised.
Foreground-Background Separation on GPU using order based approaches Raj Gupta, Sailaja Reddy M., Swagatika Panda, Sushant Sharma and Anurag Mittal Indian.
MATHIEU GAUTHIER PIERRE POULIN LIGUM, DEPT. I.R.O. UNIVERSITÉ DE MONTRÉAL GRAPHICS INTERFACE 2009 Preserving Sharp Edges in Geometry Images.
December 5, 2013Computer Vision Lecture 20: Hidden Markov Models/Depth 1 Stereo Vision Due to the limited resolution of images, increasing the baseline.
Claude TADONKI Mines ParisTech – LAL / CNRS / INP 2 P 3 University of Oujda (Morocco) – October 7, 2011 High Performance Computing Challenges and Trends.
Feature extraction: Corners 9300 Harris Corners Pkwy, Charlotte, NC.
1 Interest Operators Find “interesting” pieces of the image –e.g. corners, salient regions –Focus attention of algorithms –Speed up computation Many possible.
1 Interest Operator Lectures lecture topics –Interest points 1 (Linda) interest points, descriptors, Harris corners, correlation matching –Interest points.
Motion Estimation Today’s Readings Trucco & Verri, 8.3 – 8.4 (skip 8.3.3, read only top half of p. 199) Numerical Recipes (Newton-Raphson), 9.4 (first.
1 Interest Operators Find “interesting” pieces of the image Multiple possible uses –image matching stereo pairs tracking in videos creating panoramas –object.
EE392J Final Project, March 20, Multiple Camera Object Tracking Helmy Eltoukhy and Khaled Salama.
Department of Electrical Engineering National Cheng Kung University
Computer vision: models, learning and inference
Harris Corner Detector on FPGA Rohit Banerjee Jared Choi : Parallel Computer Architecture and Programming.
September 10, 2012Introduction to Artificial Intelligence Lecture 2: Perception & Action 1 Boundary-following Robot Rules 1  2  3  4  5.
“Low-Power, Real-Time Object- Recognition Processors for Mobile Vision Systems”, IEEE Micro Jinwook Oh ; Gyeonghoon Kim ; Injoon Hong ; Junyoung.
Real-time Object Image Tracking Based on Block- Matching Algorithm ECE 734 Hsiang-Kuo Tang Tai-Hsuan Wu Ying-Tien Lin.
1 Interest Operators Harris Corner Detector: the first and most basic interest operator Kadir Entropy Detector and its use in object recognition SIFT interest.
OBJECT RECOGNITION. The next step in Robot Vision is the Object Recognition. This problem is accomplished using the extracted feature information. The.
Claude Tadonki Mines ParisTech – CRI – Mathématiques et Systèmes Laboratoire de l’Accélérateur Linéaire/IN2P3/CNRS France 2nd.
Automatic LQCD Code Generation Quatrièmes Rencontres de la Communauté Française de Compilation December 5-7, 2011 Saint-Hippolyte (France) Claude Tadonki.
Programming Examples that Expose Efficiency Issues for the Cell Broadband Engine Architecture William Lundgren Gedae), Rick Pancoast.
Advanced SW/HW Optimization Techniques for Application Specific MCSoC m Yumiko Kimezawa Supervised by Prof. Ben Abderazek Graduate School of Computer.
Tracking with CACTuS on Jetson Running a Bayesian multi object tracker on a low power, embedded system School of Information Technology & Mathematical.
EDGE DETECTION IN COMPUTER VISION SYSTEMS PRESENTATION BY : ATUL CHOPRA JUNE EE-6358 COMPUTER VISION UNIVERSITY OF TEXAS AT ARLINGTON.
Tracking with CACTuS on Jetson Running a Bayesian multi object tracker on an embedded system School of Information Technology & Mathematical Sciences September.
EDGE DETECTION USING MINMAX MEASURES SOUNDARARAJAN EZEKIEL Matthew Lang Department of Computer Science Indiana University of Pennsylvania Indiana, PA.
Digital Camera and Computer Vision Laboratory Department of Computer Science and Information Engineering National Taiwan University, Taipei, Taiwan, R.O.C.
Feature extraction: Corners 9300 Harris Corners Pkwy, Charlotte, NC.
Lecture 7: Features Part 2 CS4670/5670: Computer Vision Noah Snavely.
Lecture 7: Intro to Computer Graphics. Remember…… DIGITAL - Digital means discrete. DIGITAL - Digital means discrete. Digital representation is comprised.
High Performance Computing Group Feasibility Study of MPI Implementation on the Heterogeneous Multi-Core Cell BE TM Architecture Feasibility Study of MPI.
FACE DETECTION : AMIT BHAMARE. WHAT IS FACE DETECTION ? Face detection is computer based technology which detect the face in digital image. Trivial task.
By Pushpita Biswas Under the guidance of Prof. S.Mukhopadhyay and Prof. P.K.Biswas.
Optimizing Ray Tracing on the Cell Microprocessor David Oguns.
Learning Photographic Global Tonal Adjustment with a Database of Input / Output Image Pairs.
Advanced SW/HW Optimization Techniques for Application Specific MCSoC m Yumiko Kimezawa Supervised by Prof. Ben Abderazek Graduate School of Computer.
Presented by Jeremy S. Meredith Sadaf R. Alam Jeffrey S. Vetter Future Technologies Group Computer Science and Mathematics Division Research supported.
Lecture 04 Edge Detection Lecture 04 Edge Detection Mata kuliah: T Computer Vision Tahun: 2010.
Machine Vision Edge Detection Techniques ENT 273 Lecture 6 Hema C.R.
What is Digital Image processing?. An image can be defined as a two-dimensional function, f(x,y) # x and y are spatial (plane) coordinates # The function.
ANR Meeting / PetaQCD LAL / Paris-Sud University, May 10-11, 2010.
SIMD Implementation of Discrete Wavelet Transform Jake Adriaens Diana Palsetia.
Pixel Parallel Vessel Tree Extraction for a Personal Authentication System 2010/01/14 學生:羅國育.
XRD data analysis software development. Outline  Background  Reasons for change  Conversion challenges  Status 2.
Keypoint extraction: Corners 9300 Harris Corners Pkwy, Charlotte, NC.
1 Munther Abualkibash University of Bridgeport, CT.
CONTENTS:  Introduction.  Face recognition task.  Image preprocessing.  Template Extraction and Normalization.  Template Correlation with image database.
3D Vision Interest Points.
Improving the Performance of Fingerprint Classification
IMAGE SEGMENTATION USING THRESHOLDING
Implementing Boosting and Convolutional Neural Networks For Particle Identification (PID) Khalid Teli .
Array Processor.
CSE 455 – Guest Lectures 3 lectures Contact Interest points 1
a kind of filtering that leads to useful features
a kind of filtering that leads to useful features
Implementation of neural gas on Cell Broadband Engine
Sahand Kashani, Stuart Byma, James Larus 2019/02/16
Claude Tadonki Laboratoire de l’Accélérateur Linéaire/IN2P3/CNRS
Mapping the FFT Algorithm to the IBM Cell Processor
Lecture VI: Corner and Blob Detection
Chapter 11: Stereopsis Stereopsis: Fusing the pictures taken by two cameras and exploiting the difference (or disparity) between them to obtain the depth.
Corner Detection COMP 4900C Winter 2008.
Focus on Parallel Combinatorial Optimization
Presentation transcript:

Accelerator-based Implementation of the Harris Algorithm International Conference on Image and Signal Processing 2012 (ICISP 2012) June 28-30, Agadir, Morocco Claude TADONKI Mines ParisTech – CRI (Centre de Recherche en Informatique) Fontainebleau (France) Joint work with Lionel Lacassagne, Elwardani Dadi, Mostafa El Daoudi

Accelerator-based Implementation of the Harris Algorithm International Conference on Image and Signal Processing 2012 (ICISP12) June 28-30, Agadir, Morocco C. Tadonki - Mines ParisTech The Harris-Stephen algorithm its a corner (point of interest) detection algorithm it is an improved variant of the original algorithm by Moravec it is used in computer vision for feature extraction like motion detection image matching tracking 3D reconstruction object recognition

Accelerator-based Implementation of the Harris Algorithm International Conference on Image and Signal Processing 2012 (ICISP12) June 28-30, Agadir, Morocco C. Tadonki - Mines ParisTech Technically, the Harris algorithm is based on a pixelwise autocorrelation S given by where (x, y) is the location of the pixel and I(x, y) its intensity (grayscale mode). At a given point (x, y) of the image, the value of S(x, y) is compared to a suitable threshold, and the decision follows on the nature of the pixel at (x, y). Roughly speaking, the process is achieved by applying four discrete operators, namely Sobel (S), Multiplication (M), Gauss (G), and Coarsity (C). The figure below displays an overview of the global workflow.

Accelerator-based Implementation of the Harris Algorithm International Conference on Image and Signal Processing 2012 (ICISP12) June 28-30, Agadir, Morocco C. Tadonki - Mines ParisTech SobelGauss Sobel and Gauss, which aproximate the first and the second derivatives respectively, are 9->1 or 3x3 operators represented by the following 3x3 matrices Computational issued

Accelerator-based Implementation of the Harris Algorithm International Conference on Image and Signal Processing 2012 (ICISP12) June 28-30, Agadir, Morocco C. Tadonki - Mines ParisTech In order to reduce the repetitive read/write of the entire image, one could fuse or chain consecutive operators whenever possible. But, this implies redundant computations. In order to improve data locality, as this is an important point here (due to the stencil form of the computation), we consider the common technique of tiling. In practice, when it comes to special devices, where there there is a strong constraint on memory alignment, it is more simple to consider row tiles (i.e. a tile is a group of consecutive rows). This simplifies memory accesses implementation, but the performance is not optimal since the shape of the optimal tile is a square. In this work, we provide a generic routine to perform memory transfers of rectangle shapes on the IBM CELL machine and illustrate its efficiency on a tile implementation of the Harris algorithm.

Accelerator-based Implementation of the Harris Algorithm International Conference on Image and Signal Processing 2012 (ICISP12) June 28-30, Agadir, Morocco C. Tadonki - Mines ParisTech The IBM CELL machine a multi-core chip composed of 9 processing elements: o 1 master unit (POWER PC), called Power Processing Element (PPE) o 8 Synergistic Processing Elements (SPE), with SIMD capability & a local memory (256K) data transfers between the main memory and the SPE memory are done trough DMAs DMA (direct memory access) has some important constraints on both the address and the volume of the data to be transferred, and it can be done in parallel with computations

Accelerator-based Implementation of the Harris Algorithm International Conference on Image and Signal Processing 2012 (ICISP12) June 28-30, Agadir, Morocco C. Tadonki - Mines ParisTech DMA issues related to tiling Performing the transfer expressed in figure 4 raises number of problems: the region to be transferred is not contiguous on memory, thus list DMAs are considered the address of one given row is not aligned, thus the global list DMA is not possible the (address, volume) pair of a row does not match the basic DMA rules (the above two ones), thus the entire list DMA cannot be carried out misalignment could come from both sides (main memory and/or local store) the target region on the local store might be out of the container limits We have designed and implemented a routine which performs this task very efficiently

Accelerator-based Implementation of the Harris Algorithm International Conference on Image and Signal Processing 2012 (ICISP12) June 28-30, Agadir, Morocco C. Tadonki - Mines ParisTech Performance results on the Harris algorithm We can observe 50% improvment between square tiles and full row tiles.

Accelerator-based Implementation of the Harris Algorithm International Conference on Image and Signal Processing 2012 (ICISP12) June 28-30, Agadir, Morocco C. Tadonki - Mines ParisTech Conclusion and perspectives This work shows that, when using accelerators, it is important to have an efficient Implementation of the transfers between the main memory and the local memory of the accelerators. Due to the current status of the CELL, we need to explore our ideas on GPUs.

Accelerator-based Implementation of the Harris Algorithm International Conference on Image and Signal Processing 2012 (ICISP12) June 28-30, Agadir, Morocco C. Tadonki - Mines ParisTech THANKS FOR YOUR ATTENTION