Computer Science Department, Duke UniversityPhD Defense TalkMay 4, 2005 FAST PATTERN MATCHING IN 3D IMAGES ON GPUS Patrick Eibl, Dennis Healy, Nikos P.

Slides:



Advertisements
Similar presentations
Image Registration  Mapping of Evolution. Registration Goals Assume the correspondences are known Find such f() and g() such that the images are best.
Advertisements

Compressed sensing Carlos Becker, Guillaume Lemaître & Peter Rennert
DUKE CSNUFFT on Multicores HPEC-2008 Sept. 25, 2008NUFFTs on Multicores 1 Parallelization of NUFFT with Radial Data on Multicore Processors Nikos Pitsianis.
A Matlab Playground for JPEG Andy Pekarske Nikolay Kolev.
ECE 562 Computer Architecture and Design Project: Improving Feature Extraction Using SIFT on GPU Rodrigo Savage, Wo-Tak Wu.
CUDA Programming Lei Zhou, Yafeng Yin, Yanzhi Ren, Hong Man, Yingying Chen.
Acceleration on many-cores CPUs and GPUs Dinesh Manocha Lauri Savioja.
Spectral Processing of Point-sampled Geometry
A Performance and Energy Comparison of FPGAs, GPUs, and Multicores for Sliding-Window Applications From J. Fowers, G. Brown, P. Cooke, and G. Stitt, University.
REALTIME OBJECT-OF-INTEREST TRACKING BY LEARNING COMPOSITE PATCH-BASED TEMPLATES Yuanlu Xu, Hongfei Zhou, Qing Wang*, Liang Lin Sun Yat-sen University,
GPGPU platforms GP - General Purpose computation using GPU
Synergy.cs.vt.edu Power and Performance Characterization of Computational Kernels on the GPU Yang Jiao, Heshan Lin, Pavan Balaji (ANL), Wu-chun Feng.
Computer Science Department, Duke UniversityPhD Defense TalkMay 4, 2005 Fast Extraction of Feature Salience Maps for Rapid Video Data Analysis Nikos P.
BENCHMARK SUITE RADAR SIGNAL & DATA PROCESSING CERES EPC WORKSHOP
Phase Retrieval Applied to Asteroid Silhouette Characterization by Stellar Occultation Russell Trahan & David Hyland JPL Foundry Meeting – April 21, 2014.
Computationally Efficient Histopathological Image Analysis: Use of GPUs for Classification of Stromal Development Olcay Sertel 1,2, Antonio Ruiz 3, Umit.
Christopher Mitchell CDA 6938, Spring The Discrete Cosine Transform  In the same family as the Fourier Transform  Converts data to frequency domain.
Object Based Video Coding - A Multimedia Communication Perspective Muhammad Hassan Khan
By Arun Bhandari Course: HPC Date: 01/28/12. GPU (Graphics Processing Unit) High performance many core processors Only used to accelerate certain parts.
Parallel Applications Parallel Hardware Parallel Software IT industry (Silicon Valley) Users Efficient Parallel CKY Parsing on GPUs Youngmin Yi (University.
CS 6068 Parallel Computing Fall 2013 Lecture 10 – Nov 18 The Parallel FFT Prof. Fred Office Hours: MWF.
Kaihua Zhang Lei Zhang (PolyU, Hong Kong) Ming-Hsuan Yang (UC Merced, California, U.S.A. ) Real-Time Compressive Tracking.
Video Tracking Using Learned Hierarchical Features
Symmetric hash functions for fingerprint minutiae S. Tulyakov, V. Chavan and V. Govindaraju Center for Unified Biometrics and Sensors SUNY at Buffalo,
Advanced Computer Architecture, CSE 520 Generating FPGA-Accelerated DFT Libraries Chi-Li Yu Nov. 13, 2007.
Realtime 3D model construction with Microsoft Kinect and an NVIDIA Kepler laptop GPU Paul Caheny MSc in HPC 2011/2012 Project Preparation Presentation.
Programming Concepts in GPU Computing Dušan Gajić, University of Niš Programming Concepts in GPU Computing Dušan B. Gajić CIITLab, Dept. of Computer Science.
Correspondence-Free Determination of the Affine Fundamental Matrix (Tue) Young Ki Baik, Computer Vision Lab.
Performed by: Ron Amit Supervisor: Tanya Chernyakova In cooperation with: Prof. Yonina Eldar 1 Part A Final Presentation Semester: Spring 2012.
Use of Aerial Videography in Habitat Survey and Computers as Observers Leonard Pearlstine University of Florida.
Tracking with CACTuS on Jetson Running a Bayesian multi object tracker on a low power, embedded system School of Information Technology & Mathematical.
Compressive Sensing for Multimedia Communications in Wireless Sensor Networks By: Wael BarakatRabih Saliba EE381K-14 MDDSP Literary Survey Presentation.
Tracking with CACTuS on Jetson Running a Bayesian multi object tracker on an embedded system School of Information Technology & Mathematical Sciences September.
GPU DAS CSIRO ASTRONOMY AND SPACE SCIENCE Chris Phillips 23 th October 2012.
DUKEMIT-LLHPEC-2007 FFTs of Arbitrary Dimensions on GPUs Xiaobai Sun and Nikos Pitsianis Duke University September 19, 2007 At High Performance Embedded.
1 Complex Images k’k’ k”k” k0k0 -k0-k0 branch cut   k 0 pole C1C1 C0C0 from the Sommerfeld identity, the complex exponentials must be a function.
Fast BVH Construction on GPUs (Eurographics 2009) Park, Soonchan KAIST (Korea Advanced Institute of Science and Technology)
Efficient Local Statistical Analysis via Integral Histograms with Discrete Wavelet Transform Teng-Yok Lee & Han-Wei Shen IEEE SciVis ’13Uncertainty & Multivariate.
GPUs: Overview of Architecture and Programming Options Lee Barford firstname dot lastname at gmail dot com.
Reconstruction of Solid Models from Oriented Point Sets Misha Kazhdan Johns Hopkins University.
Fast Census Transform-based Stereo Algorithm using SSE2
CDVS on mobile GPUs MPEG 112 Warsaw, July Our Challenge CDVS on mobile GPUs  Compute CDVS descriptor from a stream video continuously  Make.
Department of Electronic Engineering, Tsinghua University Nano-scale Integrated Circuit and System Lab. Performance Analysis of Parallel Sparse LU Factorization.
GPU Accelerated MRI Reconstruction Professor Kevin Skadron Computer Science, School of Engineering and Applied Science University of Virginia, Charlottesville,
Frequency Domain Adaptive Filtering Project Supervisor Dr. Edward Jones Myles Ó Fríl.
Visual Odometry David Nister, CVPR 2004
Debunking the 100X GPU vs. CPU Myth An Evaluation of Throughput Computing on CPU and GPU Present by Chunyi Victor W Lee, Changkyu Kim, Jatin Chhugani,
Multimedia Systems and Communication Research Multimedia Systems and Communication Research Department of Electrical and Computer Engineering Multimedia.
Fourier and Wavelet Transformations Michael J. Watts
A New Class of High Performance FFTs Dr. J. Greg Nash Centar ( High Performance Embedded Computing (HPEC) Workshop.
© David Kirk/NVIDIA and Wen-mei W. Hwu, ECE408/CS483, University of Illinois, Urbana-Champaign 1 Graphic Processing Processors (GPUs) Parallel.
Comparison of Image Registration Methods David Grimm Joseph Handfield Mahnaz Mohammadi Yushan Zhu March 18, 2004.
Glossary of Technical Terms Correlation filter: a set of carefully designed correlation templates with regard to shift invariance as well as distortion-
FFTC: Fastest Fourier Transform on the IBM Cell Broadband Engine David A. Bader, Virat Agarwal.
Nawanol Theera-Ampornpunt, Seong Gon Kim, Asish Ghoshal, Saurabh Bagchi, Ananth Grama, and Somali Chaterji Fast Training on Large Genomics Data using Distributed.
SIMD Implementation of Discrete Wavelet Transform Jake Adriaens Diana Palsetia.
WLD: A Robust Local Image Descriptor Jie Chen, Shiguang Shan, Chu He, Guoying Zhao, Matti Pietikäinen, Xilin Chen, Wen Gao 报告人:蒲薇榄.
Face Detection 蔡宇軒.
Visual Tracking of Surgical Tools in Retinal Surgery using Particle Filtering Group 14 William Yang and David Li Presenter: William Yang Mentor: Dr. Rogerio.
Fuzzy type Image Fusion using hybrid DCT-FFT based Laplacian Pyramid Transform Authors: Rajesh Kumar Kakerda, Mahendra Kumar, Garima Mathur, R P Yadav,
Stencil-based Discrete Gradient Transform Using
Yuanrui Zhang, Mahmut Kandemir
Fourier and Wavelet Transformations
Recovery from Occlusion in Deep Feature Space for Face Recognition
Object Recognition in the Dynamic Link Architecture
Fast and Robust Ellipse Detection
Fast and Robust Ellipse Detection
Object tracking in video scenes Object tracking in video scenes
Compressive Sensing Imaging
Digital Systems: Hardware Organization and Design
Presentation transcript:

Computer Science Department, Duke UniversityPhD Defense TalkMay 4, 2005 FAST PATTERN MATCHING IN 3D IMAGES ON GPUS Patrick Eibl, Dennis Healy, Nikos P. Pitsianis and Xiaobai Sun HPEC 2009 MIT Lincoln Lab

FAST PATTERN MATCHING IN 3D IMAGES ON GPUS In Memory of Dennis M. Healy, Jr. Sept HPEC July 2007 Contributed by Mary Healy Hambric

FAST PATTERN MATCHING IN 3D IMAGES ON GPUS LCCs : local correlation coefficients –application requirements –computational challenges –recent progress Fast LCC –Algorithmic acceleration –Architectural acceleration (on GPUs) Building blocks –In image processing and information processing Conclusion Outline Sept HPEC

FAST PATTERN MATCHING IN 3D IMAGES ON GPUS Fast Convolution or Correlation Sept HPEC Discrete Version Equally spaced sampling Periodic convolution : padding otherwise FFT : O( N log(N) ) Convolution (Correlation) Theorem

FAST PATTERN MATCHING IN 3D IMAGES ON GPUS Calculating the LCCs of a prototype-pattern with all patches in a large image  A basic operation in image registration, object classification and recognition, target identification and tracking  Statistically robust to local changes and noise Fast calculation of LCCs without compromising integrity  in many situations, rapid computation of LCCs was based on relaxed linearized conditions, to avoid local translation in mean and normalization in variance  “Fast LCC with local zero-mean translation and unit-variance normalization”, Sun & Pitsianis 2008 Local Correlation Coefficients (LCCs) Sept HPEC

FAST PATTERN MATCHING IN 3D IMAGES ON GPUS Applications: MONTAGE Sept HPEC A. Portnoy, et al., Design and characterization of thin multiple aperture infrared cameras Applied Optics, Vol. 48 Issue 11, pp (2009)Design and characterization of thin multiple aperture infrared cameras A. Wagadarikar, et al., Video rate spectral imaging using a coded aperture snapshot spectral imager Optics Express, Vol. 17 Issue 8, pp (2009)Video rate spectral imaging using a coded aperture snapshot spectral imager M. Shankar, et al., Thin infrared imaging systems through multichannel sampling Applied Optics, 47(10):B1-B10 (January 2008)Thin infrared imaging systems through multichannel sampling N. P. Pitsianis, et al, Compressive imaging sensors, In Proc. SPIE Vol. 6232, Intelligent Integrated Microsystems; Ravindra A. Athale, John C. Zolper; Eds. pp 43-51, May 2006Compressive imaging sensors

FAST PATTERN MATCHING IN 3D IMAGES ON GPUS Sub-pixel Image Registration Sept HPEC 2009

FAST PATTERN MATCHING IN 3D IMAGES ON GPUS Template Matching Sept HPEC

FAST PATTERN MATCHING IN 3D IMAGES ON GPUS Without Local Normalization Sept HPEC

FAST PATTERN MATCHING IN 3D IMAGES ON GPUS With Local Normalization Sept HPEC

FAST PATTERN MATCHING IN 3D IMAGES ON GPUS Local Correlation Coefficients Sept HPEC PT S T P

FAST PATTERN MATCHING IN 3D IMAGES ON GPUS Normalized Local Correlation Coefficients Sept HPEC X. Sun, N. P. Pitsianis, and P. Bientinesi, Fast computation of local correlation coefficients Proc. SPIE 7074, (2008)Fast computation of local correlation coefficients PT S T P

FAST PATTERN MATCHING IN 3D IMAGES ON GPUS Sept HPEC Algorithmic Acceleration 1.Reformulation in term of convolutions 2.Use of Fourier transform identities

FAST PATTERN MATCHING IN 3D IMAGES ON GPUS Reformulation in Terms of Convolutions Sept HPEC

FAST PATTERN MATCHING IN 3D IMAGES ON GPUS Fast LCC in the Fourier Domain Sept HPEC FFTs per image

FAST PATTERN MATCHING IN 3D IMAGES ON GPUS DFT Identity with real data : Two by one Sept HPEC

FAST PATTERN MATCHING IN 3D IMAGES ON GPUS Sept HPEC Illustration of Two-by-One identity

FAST PATTERN MATCHING IN 3D IMAGES ON GPUS DFT Identity : One-by-Half Sept HPEC

FAST PATTERN MATCHING IN 3D IMAGES ON GPUS Sept HPEC Illustration of One-by-Half + -

FAST PATTERN MATCHING IN 3D IMAGES ON GPUS Sept HPEC Architectural Acceleration 1.Necessary complement of fast algorithms in real-time image processing or massive data processing 2.The use of GPUs in particular  omnipresent in desktops, laptops  a prototype of parallel architecture  many-core  memory hierarchy  bandwidth  programming model  3D image processing

FAST PATTERN MATCHING IN 3D IMAGES ON GPUS 2D Convolution on CPU and GPU Sept HPEC FFTW on Xeon CUDA SDK 2.3 CUFFT on GPU Execution time (msec)

FAST PATTERN MATCHING IN 3D IMAGES ON GPUS Algorithmic Acceleration of 2D Convolution Sept HPEC

FAST PATTERN MATCHING IN 3D IMAGES ON GPUS Algorithmic Acceleration of 3D Convolution Sept HPEC

FAST PATTERN MATCHING IN 3D IMAGES ON GPUS GPU Acceleration of 2D LCCs Sept HPEC Streamed FLCC 4Kx4K FLCC 2Kx2K Streamed FLCC 2Kx2K FLCC 4Kx4K

FAST PATTERN MATCHING IN 3D IMAGES ON GPUS GPU Acceleration of 3D LCCs Sept HPEC Streamed FLCC FLCC 128x128x128 Streamed FLCC FLCC 256x256x256

FAST PATTERN MATCHING IN 3D IMAGES ON GPUS Sept HPEC Building Blocks 1.In image processing 2.In data modeling & information processing

FAST PATTERN MATCHING IN 3D IMAGES ON GPUS Construction of spectral images from coded intensity data Fast Deconvolution with Incomplete Data Sept HPEC X. Sun and N. P. Pitsianis (2008)

FAST PATTERN MATCHING IN 3D IMAGES ON GPUS Object Feature Extraction in Multiple Layers  Simple and densely populated features at lower layers  Complex but sparse features at higher layers Hierarchical Data Indexing & Fast Feature Matching Sept HPEC S. Fidler, G. Berginc, A. Leonardis, Proc IEEE CVPR (2006)

FAST PATTERN MATCHING IN 3D IMAGES ON GPUS Convolution/correlation, LCCs important in new and conventional applications Reformulation critical in designing fast algorithms without compromising quality Joint algorithmic and architectural accelerations Efficient implementation of fast algorithms on modern parallel architectures Conclusions Sept HPEC

FAST PATTERN MATCHING IN 3D IMAGES ON GPUS FANTOM : Algorithm-Architecture Co-design –Supported in part by DARPA MTO AutoGPU –Supported in part by MC-FP7 –Donation of GPUs by NVIDIA CCF-FMM –Supported in part by NSF G. Papamakarios and G. Aristotle Univ. Acknowledgments Sept HPEC