Ultrasonic Imaging using Resolution Enhancement Compression and GPU- Accelerated Synthetic Aperture Techniques Presenter: Anthony Podkowa May 2, 2013 Advisor:

Slides:



Advertisements
Similar presentations
Chapter3 Pulse-Echo Ultrasound Instrumentation
Advertisements

Foundations of Medical Ultrasonic Imaging
Ultrasound Microscopy and High Frequency Coded Signals Antti Meriläinen, Edward Hæggström.
Image processing (spatial &frequency domain) Image processing (spatial &frequency domain) College of Science Computer Science Department
Medical Image Reconstruction Topic 5: Synthetic Aperture Ultrasound Imaging Professor Yasser Mostafa Kadah –
HPCC Mid-Morning Break High Performance Computing on a GPU cluster Dirk Colbry, Ph.D. Research Specialist Institute for Cyber Enabled Discovery.
OpenFOAM on a GPU-based Heterogeneous Cluster
ECE 501 Introduction to BME
2009/04/07 Yun-Yang Ma.  Overview  What is CUDA ◦ Architecture ◦ Programming Model ◦ Memory Model  H.264 Motion Estimation on CUDA ◦ Method ◦ Experimental.
Emma Muir, Sam Muir, Jacob Sandlund, & David Smith Advisor: Dr. José Sánchez Co-Advisor: Dr. James Irwin.
CUDA Programming Lei Zhou, Yafeng Yin, Yanzhi Ren, Hong Man, Yingying Chen.
Development and Implementation of a FPGA-based digital beamformer Supervisors: Nandita Bhattacharjee Dr. Andrew Paplinski Dr. Andrew Paplinski Dale Harders.
Acoustic Fossil Imaging by Matt Kaiser & John Lewis Advised by: Dr. James H. Irwin & Mr. José Sánchez.
HPEC_GPU_DECODE-1 ADC 8/6/2015 MIT Lincoln Laboratory GPU Accelerated Decoding of High Performance Error Correcting Codes Andrew D. Copeland, Nicholas.
Despeckle Filtering in Medical Ultrasound Imaging
Characterization Presentation Performed by: Ron Amit Supervisor: Tanya Chernyakova Semester: Spring Sub-Nyquist Sampling in Ultrasound Imaging.
1 1 © 2011 The MathWorks, Inc. Accelerating Bit Error Rate Simulation in MATLAB using Graphics Processors James Lebak Brian Fanous Nick Moore High-Performance.
1 Motivation Video Communication over Heterogeneous Networks –Diverse client devices –Various network connection bandwidths Limitations of Scalable Video.
HIAPER Cloud Radar Transceiver Exciter Receiver Oscillators High-Powered Amplifier Calibration Exciter Receiver Oscillators High-Powered Amplifier Calibration.
Inputs to Signal Generation.vi: -Initial Distance (m) -Velocity (m/s) -Chirp Duration (s) -Sampling Info (Sampling Frequency, Window Size) -Original Signal.
HPCC Mid-Morning Break Dirk Colbry, Ph.D. Research Specialist Institute for Cyber Enabled Discovery Introduction to the new GPU (GFX) cluster.
Advisor: Dr. Chandra Christopher Picard Michael Neuberg.
Sub-Nyquist Sampling DSP & SCD Modules Presented by: Omer Kiselov, Daniel Primor Supervised by: Ina Rivkin, Moshe Mishali Winter 2010High Speed Digital.
Tutorial I: Radar Introduction and basic concepts
Motivation “Every three minutes a woman is diagnosed with Breast cancer” (American Cancer Society, “Detailed Guide: Breast Cancer,” 2006) Explore the use.
GPU-accelerated Evaluation Platform for High Fidelity Networking Modeling 11 December 2007 Alex Donkers Joost Schutte.
Introduction to Adaptive Digital Filters Algorithms
1 Resolution Enhancement Compression- Synthetic Aperture Focusing Student: Hans Bethe Advisor: Dr. Jose R. Sanchez Bradley University Department of Electrical.
Comprehensive Ultrasound Research Platform Emma Muir Sam Muir Jacob Sandlund David Smith Advisor: Dr. Sánchez Co-advisor: Dr. Irwin.
By Arun Bhandari Course: HPC Date: 01/28/12. GPU (Graphics Processing Unit) High performance many core processors Only used to accelerate certain parts.
1 of 23 Fouts MAPLD 2005/C117 Synthesis of False Target Radar Images Using a Reconfigurable Computer Dr. Douglas J. Fouts LT Kendrick R. Macklin Daniel.
Building Three-Dimensional Images Using a Time-Reversal Chaotic Cavity
Implementing a Speech Recognition System on a GPU using CUDA
Under Supervision of Dr. Kamel A. Arram Eng. Lamiaa Said Wed
Diane Marinkas CDA 6938 April 30, Outline Motivation Algorithm CPU Implementation GPU Implementation Performance Lessons Learned Future Work.
Comprehensive Ultrasound Research Platform Emma Muir Sam Muir Jacob Sandlund David Smith Advisor: Dr. José Sánchez.
Performed by: Ron Amit Supervisor: Tanya Chernyakova In cooperation with: Prof. Yonina Eldar 1 Part A Final Presentation Semester: Spring 2012.
Review of Ultrasonic Imaging
GPU Architecture and Programming
NTUEE Confidential Toward MIMO MC-CDMA Speaker : Pei-Yun Tsai Advisor : Tzi-Dar Chiueh 2004/10/25.
Space-Time and Space-Frequency Coded Orthogonal Frequency Division Multiplexing Transmitter Diversity Techniques King F. Lee.
Experiences Accelerating MATLAB Systems Biology Applications Heart Wall Tracking Lukasz Szafaryn, Kevin Skadron University of Virginia.
BY Siyandiswa Juanitta Bangani Supervisor: Dr R.Van Zyl
Digital Beamforming. Beamforming Manipulation of transmit and receive apertures. Trade-off performance/cost to achieve: –Steer and focus the transmit.
Ultrasound Simulations using REC and SAFT Presenter: Tony Podkowa November 13, 2012 Advisor: Dr José R. Sánchez Department of Electrical and Computer Engineering.
VOCODERS. Vocoders Speech Coding Systems Implemented in the transmitter for analysis of the voice signal Complex than waveform coders High economy in.
Ultrasonic İmaging.
Midterm Presentation Performed by: Ron Amit Supervisor: Tanya Chernyakova Semester: Spring Sub-Nyquist Sampling in Ultrasound Imaging.
Study of Broadband Postbeamformer Interference Canceler Antenna Array Processor using Orthogonal Interference Beamformer Lal C. Godara and Presila Israt.
Chapter 6: Real-Time Image Formation. display B, M, Doppler image processing Doppler processing digital receive beamformer system control keyboard beamformer.
by Arjun Radhakrishnan supervised by Prof. Michael Inggs
Ultrasonic Imaging Parameters Student: Mei-Ru Yang Wei-Ning Lee Advisor: Pai-Chi Li.
GPU Accelerated MRI Reconstruction Professor Kevin Skadron Computer Science, School of Engineering and Applied Science University of Virginia, Charlottesville,
CUDA Basics. Overview What is CUDA? Data Parallelism Host-Device model Thread execution Matrix-multiplication.
A 5-Pulse Sequence for Harmonic and Sub-Harmonic Imaging
Debunking the 100X GPU vs. CPU Myth An Evaluation of Throughput Computing on CPU and GPU Present by Chunyi Victor W Lee, Changkyu Kim, Jatin Chhugani,
Igor Djurović, LJubiša Stanković, Miloš Daković
Piezoelectric Effect  Sound waves striking a PZ material produce an electrical signal  Can be used to detect sound (and echoes)!
1 Resolution Enhancement Compression- Synthetic Aperture Focusing Techniques Student: Hans Bethe Advisor: Dr. Jose R. Sanchez Bradley University Department.
Sunpyo Hong, Hyesoon Kim
Canny Edge Detection Using an NVIDIA GPU and CUDA Alex Wade CAP6938 Final Project.
PSF What is PSF 1way/2way PSFs PSFs for continious & discrete apertures.
RT 2009, 15 th May 2009 Pulse Pile-up Recovery Using Model-Based Signal Processing. Paul. A.B. Scoullar, Southern Innovation, Australia. Prof. Rob J. Evans.
Chapter 10 Digital Signal and Image Processing
Atindra Mitra Joe Germann John Nehrbass
Generalized and Hybrid Fast-ICA Implementation using GPU
Stencil-based Discrete Gradient Transform Using
Spatially Varying Frequency Compounding of Ultrasound Images
Review of Ultrasonic Imaging
Digital Beamforming.
Presentation transcript:

Ultrasonic Imaging using Resolution Enhancement Compression and GPU- Accelerated Synthetic Aperture Techniques Presenter: Anthony Podkowa May 2, 2013 Advisor: Dr José R. Sánchez Department of Electrical and Computer Engineering

Outline I. Motivation & project summary II.Block diagram A.REC B.GSAU III.Results IV.Areas of Expansion 2

Outline I.Motivation & project summary II.Block diagram A.REC B.GSAU III.Results IV.Areas of Expansion 3

Motivation Key medical imaging technique Key medical imaging technique Tumor detection Tumor detection Seek to improve Seek to improve Spatial resolution Spatial resolution Signal-to-noise ratio (SNR) Signal-to-noise ratio (SNR) 4

Project Summary Resolution enhancement compression (REC) Resolution enhancement compression (REC) Coded excitation and pulse compression technique Coded excitation and pulse compression technique Improved axial resolution Improved axial resolution Improved SNR Improved SNR Generic synthetic aperture ultrasound (GSAU) Generic synthetic aperture ultrasound (GSAU) Synthetic aperture technique Synthetic aperture technique Improves lateral resolution Improves lateral resolution Improves SNR Improves SNR Computationally expensive, but parallelizable Computationally expensive, but parallelizable 5

Goals: 1. To investigate the combination of both REC and GSAU in an ultrasound system using MATLAB and Field II. 2. To accelerate the GSAU algorithm using a graphics processing unit (GPU) to achieve real-time processing of the images. 6

Outline I. Motivation & project summary II.Block diagram A.REC B.GSAU III.Results IV.Areas of Expansion 7

System Block Diagram 8 Encoder TransducerGSAU V in (t) V pc (t) Image Recon. Image Output V lc (t) Received Echo Signals Beamformed Signals 256 Wiener Filter Compressed Signals 256

Outline I. Motivation & project summary II.Block diagram A.REC B.GSAU III.Results IV.Areas of Expansion 9

Resolution Enhancement Compression Based on the convolution equivalence principle Based on the convolution equivalence principle Encoder shapes excitation signal Encoder shapes excitation signal Wiener Filter: Wiener Filter: Compresses the received signals Compresses the received signals Removes corrupting noise Removes corrupting noise 10 Encoder Transducer V in (t) V pc (t) V lc (t) Received Echo Signals 256 Wiener Filter Compressed Signals 256

Convolution Equivalence Principle Make h t (t) act like h d (t) by shaping v 1 (t) Make h t (t) act like h d (t) by shaping v 1 (t) Wiener deconvolution. Wiener deconvolution. 11 Desired Response Desired system Transducer Some other input Some input

Encoder Subsystem V ulc (f) V pc (f) Tukey Window V lc (f) Wiener Deconvolution Filter Inverse Filter V upc (f) 12

Encoder Subsystem V ulc (f) V pc (f) Tukey Window V lc (f) Wiener Deconvolution Filter Inverse Filter V upc (f) 13

Encoder Subsystem V ulc (f) V pc (f) Tukey Window V lc (f) Wiener Deconvolution Filter Inverse Filter V upc (f) 14

Encoder Subsystem V ulc (f) V pc (f) Tukey Window V lc (f) Wiener Deconvolution Filter Inverse Filter V upc (f) 15

System Block Diagram 16 Encoder TransducerGSAU V in (t) V pc (t) Image Recon. Image Output V lc (t) Received Echo Signals Beamformed Signals 256 Wiener Filter Compressed Signals 256

Transducer Specifications 256 elements 256 elements 8 MHz center frequency 8 MHz center frequency 200 MHz sampling frequency 200 MHz sampling frequency 4 mm element height 4 mm element height 0.26 mm element width 0.26 mm element width 0.04 mm element kerf 0.04 mm element kerf 20 mm focus 20 mm focus Height Width Kerf 17

System Block Diagram 18 Encoder TransducerGSAU V in (t) V pc (t) Image Recon. Image Output V lc (t) Received Echo Signals Beamformed Signals 256 Wiener Filter Compressed Signals 256

Outline I. Motivation & project summary II.Block diagram A.REC B.GSAU III.Results IV.Areas of Expansion 19

GSAU Technique Transmit and receive with one element at a time. Transmit and receive with one element at a time. Calculate delays associated with the distances from element to each pixel: Calculate delays associated with the distances from element to each pixel: 256 x pixels 256 x pixels Parallel processing Parallel processing 20

GPU Programming (CUDA) 21 HostDevice Up to 8 coresHundreds of cores Memory Transfer

CUDA C 22 Allocate data memory on device Allocate data memory on device Copy data from the host memory to the device Copy data from the host memory to the device Spawn several threads to process the data Spawn several threads to process the data Each thread runs the same chunk of code (kernel) Each thread runs the same chunk of code (kernel) Each thread processes the pixel corresponding to its thread index. Each thread processes the pixel corresponding to its thread index. Copy data back from device memory Copy data back from device memory Free device memory Free device memory

Test Hardware Specifications CPU:Intel Core i7-2600K CPU:Intel Core i7-2600K 4 Cores 4 Cores Processor Clock: 3.4 GHz Processor Clock: 3.4 GHz RAM:16 GB RAM:16 GB GPU:NVIDIA Quadro 5000 GPU:NVIDIA Quadro CUDA cores 352 CUDA cores Processor Clock:1026MHz Processor Clock:1026MHz RAM:2560 MB GDDR5 RAM:2560 MB GDDR5 Memory Bandwidth: 120 GB/s Memory Bandwidth: 120 GB/s 23

System Block Diagram 24 Encoder TransducerGSAU V in (t) V pc (t) Image Recon. Image Output V lc (t) Received Echo Signals Beamformed Signals 256 Wiener Filter Compressed Signals 256

Image Reconstruction Subsystem Envelope Detection Logarithmic Compression Limiter Beamformed Signal Image Scan Line 25

Outline I. Motivation & project summary II.Block diagram A.REC B.GSAU III.Results IV.Areas of Expansion 26

Simulation Settings Point imaged at 20mm Point imaged at 20mm Tukey window taper: α = 0.08 Tukey window taper: α = 0.08 γ = 1 (Wiener filter) γ = 1 (Wiener filter) Additive noise injected (σ n = 0.1 σ s ) Additive noise injected (σ n = 0.1 σ s ) Excitation schemes studied: Excitation schemes studied: REC REC Conventional pulsing (Delta function) Conventional pulsing (Delta function) 27

Encoding 28 Linear chirp: Linear chirp: 0 – MHz 0 – MHz 12.5 μs 12.5 μs Desired Response: Desired Response: 200% BW 200% BW Transducer Response: Transducer Response: 100% BW 100% BW MSE: 4.46x10 -7 MSE: 4.46x10 -7

GPU Acceleration 29 GPUs perform faster using single precision GPUs perform faster using single precision 4.5% round off error 4.5% round off error Computation time decreased from s to 0.25 s Computation time decreased from s to 0.25 s

Wiener Filter 30 Received signals compressed axially Received signals compressed axially 3 dB gain in SNR 3 dB gain in SNR

REC + GSAU 31 Received signals compressed laterally Received signals compressed laterally 5 dB gain in SNR 5 dB gain in SNR

CP + GSAU 32 Received signals compressed laterally Received signals compressed laterally SNR loss of 0.3 dB SNR loss of 0.3 dB 10 dB less SNR than REC + GSAU, and 5 dB less than REC alone 10 dB less SNR than REC + GSAU, and 5 dB less than REC alone

Resolution Analysis 33 Resolution computed from the modulation transfer function (MTF) Resolution computed from the modulation transfer function (MTF) MTF is the spatial Fourier transform of the point spread function (PSF). MTF is the spatial Fourier transform of the point spread function (PSF). Critical wavenumber k 0 computed by determining the point where normalized MTF crosses 0.1 Critical wavenumber k 0 computed by determining the point where normalized MTF crosses 0.1 Resolution given by: Resolution given by:

Axial Resolution 34 CP: mm CP: mm REC: mm REC: mm CP+GSAU: mm CP+GSAU: mm REC+GSAU: mm REC+GSAU: mm

Lateral Resolution 35 CP: mm CP: mm REC: mm REC: mm CP+GSAU: mm CP+GSAU: mm REC+GSAU: mm REC+GSAU: mm

Outline I. Motivation & project summary II.Block diagram A.REC B.GSAU III.Results IV.Areas of Expansion 36

Potential Areas of Expansion GSAU GSAU Improved interpolation (linear, polynomial) Improved interpolation (linear, polynomial) Alternative reweighting schemes Alternative reweighting schemes Other SA techniques: Other SA techniques: Synthetic transmit aperture ultrasound (STAU) Synthetic transmit aperture ultrasound (STAU) Synthetic receive aperture ultrasound (SRAU) Synthetic receive aperture ultrasound (SRAU) GPU speedup GPU speedup Use of optimized libraries (CUBLAS, MAGMA) Use of optimized libraries (CUBLAS, MAGMA) Reduce thread overhead Reduce thread overhead 37

Conclusions 38 REC + GSAU exhibit the best performance in SNR. REC + GSAU exhibit the best performance in SNR. CP + GSAU exhibit the best performance in spatial resolution. CP + GSAU exhibit the best performance in spatial resolution. GPU acceleration results in a speedup by a factor of 116. GPU acceleration results in a speedup by a factor of 116.

References 39 [1] M. Oelze, “Bandwidth and resolution enhancement through pulse compression,” IEEE Trans. Ultrason., Ferroelec., and Freq. Contr., vol. 54, no. 4, pp , Apr [2] J. Sanchez and M. Oelze, “An ultrasonic imaging speckle-suppression and contrast-enhancement technique by means of frequency compounding and coded excitation,” IEEE Trans. Ultrason., Ferroelec., and Freq. Contr., vol. 56, no. 7, pp , Jul [3] S. Nikolov, “Synthetic aperture tissue and flow ultrasound imaging,” Ph.D. dissertation, Technical University of Denmark, [Online]. Available: [4]J. Jensen, “Field: A program for simulating ultrasound systems,” in Medical & Biological Engineering & Computing, vol. 34, 1996, pp [5]J. Jensen, and N. Svendsen, “Calculation of pressure fields from arbitrary shaped, apodized, and excited ultrasound transducers,” IEEE Trans. Ultrason., Ferroelec. and Freq. Contr.

Ultrasonic Imaging using Resolution Enhancement Compression and GPU- Accelerated Synthetic Aperture Techniques Presenter: Anthony Podkowa May 2, 2013 Advisor: Dr José R. Sánchez Department of Electrical and Computer Engineering

Importing into MATLAB 41 Generate PTX file from CUDA code Generate PTX file from CUDA code Initialize kernel object using PTX file Initialize kernel object using PTX file Convert input data to a gpuArray Convert input data to a gpuArray Evaluate kernel Evaluate kernel Bring the output data back using the gather() function Bring the output data back using the gather() function

Derivation of Envelope Detection 42

Apodization Spatial Windowing Spatial Windowing Used to shape the beam profile Used to shape the beam profile Reweighting by apodization coefficients Reweighting by apodization coefficients a1a1 a2a2 aNaN 43

Generic Synthetic Aperture Ultrasound Electrically focus signals to create an artificial aperture. Electrically focus signals to create an artificial aperture. Pros: Pros: Improved lateral resolution. Improved lateral resolution. Improved SNR. Improved SNR. Cons: Cons: Computationally expensive. Computationally expensive. 44