Institute of Medical Engineering 1 20th Annual International Conference on Magnetic Resonance Angiography Graz, 15-18.10.2008 Real Time Elimination of.

Slides:



Advertisements
Similar presentations
ENV 2006 CS4.1 Envisioning Information: Case Study 4 Focus and Context for Volume Visualization.
Advertisements

Professor Horst Cerjak, Thomas Pock A Duality Based Approach for Realtime TV-L 1 Optical Flow ICG A Duality Based Approach for Realtime TV-L.
Fast and Accurate Optical Flow Estimation
DTAM: Dense Tracking and Mapping in Real-Time
Evaluation of Reconstruction Techniques
GPU Virtualization Support in Cloud System Ching-Chi Lin Institute of Information Science, Academia Sinica Department of Computer Science and Information.
Intro to GPU’s for Parallel Computing. Goals for Rest of Course Learn how to program massively parallel processors and achieve – high performance – functionality.
A 4-year $2.6 million grant from the National Institute of Biomedical Imaging and Bioengineering (NIBIB), to perform “real-time” CT imaging dose calculations.
An Effective GPU Implementation of Breadth-First Search Lijuan Luo, Martin Wong and Wen-mei Hwu Department of Electrical and Computer Engineering, UIUC.
Shape From Light Field meets Robust PCA
ECE 562 Computer Architecture and Design Project: Improving Feature Extraction Using SIFT on GPU Rodrigo Savage, Wo-Tak Wu.
Paper Presentation - An Efficient GPU-based Approach for Interactive Global Illumination- Rui Wang, Rui Wang, Kun Zhou, Minghao Pan, Hujun Bao Presenter.
ICG Professor Horst Cerjak, Thomas Pock Variational Methods for 3D Reconstruction Thomas Pock 1, Chrisopher Zach 2 and Horst Bischof 1 1 Institute.
GPU Computing with CUDA as a focus Christie Donovan.
2009/04/07 Yun-Yang Ma.  Overview  What is CUDA ◦ Architecture ◦ Programming Model ◦ Memory Model  H.264 Motion Estimation on CUDA ◦ Method ◦ Experimental.
Optimizing and Auto-Tuning Belief Propagation on the GPU Scott Grauer-Gray and Dr. John Cavazos Computer and Information Sciences, University of Delaware.
L13: Review for Midterm. Administrative Project proposals due Friday at 5PM (hard deadline) No makeup class Friday! March 23, Guest Lecture Austin Robison,
DCABES 2009 China University Of Geosciences 1 The Parallel Models of Coronal Polarization Brightness Calculation Jiang Wenqian.
Back-Projection on GPU: Improving the Performance Wenlay “Esther” Wei Advisor: Jeff Fessler Mentor: Yong Long April 29, 2010.
ALICE HLT High Speed Tracking and Vertexing Real-Time 2010 Conference Lisboa, May 25, 2010 Sergey Gorbunov 1,2 1 Frankfurt Institute for Advanced Studies,
A Performance and Energy Comparison of FPGAs, GPUs, and Multicores for Sliding-Window Applications From J. Fowers, G. Brown, P. Cooke, and G. Stitt, University.
To GPU Synchronize or Not GPU Synchronize? Wu-chun Feng and Shucai Xiao Department of Computer Science, Department of Electrical and Computer Engineering,
CSE 6367 Computer Vision Stereo Reconstruction Camera Coordinate Transformations “Everything should be made as simple as possible, but not simpler.” Albert.
Jared Barnes Chris Jackson.  Originally created to calculate pixel values  Each core executes the same set of instructions Mario projected onto several.
Introduction Belief propagation: known to produce accurate results for stereo processing/ motion estimation High storage requirements limit the use of.
SAGE: Self-Tuning Approximation for Graphics Engines
Motivation “Every three minutes a woman is diagnosed with Breast cancer” (American Cancer Society, “Detailed Guide: Breast Cancer,” 2006) Explore the use.
1/24 Exploring the Design Space of a Parallel Object Recognition System Bor-Yiing Su, Kurt Keutzer,
Jason P. Stockmann 1 and R. Todd Constable 1,2 Yale University, Department of Biomedical Engineering 1, Department of Diagnostic Radiology 2, New Haven,
Challenges Bit-vector approach Conclusion & Future Work A subsequence of a string of symbols is derived from the original string by deleting some elements.
BY: ALI AJORIAN ISFAHAN UNIVERSITY OF TECHNOLOGY 2012 GPU Architecture 1.
By Arun Bhandari Course: HPC Date: 01/28/12. GPU (Graphics Processing Unit) High performance many core processors Only used to accelerate certain parts.
MS Thesis Defense “IMPROVING GPU PERFORMANCE BY REGROUPING CPU-MEMORY DATA” by Deepthi Gummadi CoE EECS Department April 21, 2014.
General Purpose Computing on Graphics Processing Units: Optimization Strategy Henry Au Space and Naval Warfare Center Pacific 09/12/12.
EE369C Final Project: Accelerated Flip Angle Sequences Jan 9, 2012 Jason Su.
Gregory Fotiades.  Global illumination techniques are highly desirable for realistic interaction due to their high level of accuracy and photorealism.
1 © 2012 The MathWorks, Inc. Parallel computing with MATLAB.
Programming Concepts in GPU Computing Dušan Gajić, University of Niš Programming Concepts in GPU Computing Dušan B. Gajić CIITLab, Dept. of Computer Science.
NVIDIA Tesla GPU Zhuting Xue EE126. GPU Graphics Processing Unit The "brain" of graphics, which determines the quality of performance of the graphics.
HighlY Constrained Back PRojection (HYPR) Thank you to Oliver Wieben!!
Fast Support Vector Machine Training and Classification on Graphics Processors Bryan Catanzaro Narayanan Sundaram Kurt Keutzer Parallel Computing Laboratory,
GPU Architecture and Programming
SIGGRAPH 2011 ASIA Preview Seminar Rendering: Accuracy and Efficiency Shinichi Yamashita Triaxis Co.,Ltd.
Introduction: Lattice Boltzmann Method for Non-fluid Applications Ye Zhao.
ICAL GPU 架構中所提供分散式運算 之功能與限制. 11/17/09ICAL2 Outline Parallel computing with GPU NVIDIA CUDA SVD matrix computation Conclusion.
1 Workshop 9: General purpose computing using GPUs: Developing a hands-on undergraduate course on CUDA programming SIGCSE The 42 nd ACM Technical.
GPU Programming Shirley Moore CPS 5401 Fall 2013
GPU Accelerated MRI Reconstruction Professor Kevin Skadron Computer Science, School of Engineering and Applied Science University of Virginia, Charlottesville,
Conclusions Simulated fMRI phantoms with real motion and realistic susceptibility artifacts have been generated and tested using SPM2. Image distortion.
Sudhanshu Khemka.  Treats each document as a vector with one component corresponding to each term in the dictionary  Weight of a component is calculated.
Weekly Report- Reduction Ph.D. Student: Leo Lee date: Oct. 30, 2009.
1 GPU programming Dr. Bernhard Kainz. 2 Dr Bernhard Kainz Overview About myself Motivation GPU hardware and system architecture GPU programming languages.
Computer Architecture Lecture 24 Parallel Processing Ralph Grishman November 2015 NYU.
GFlow: Towards GPU-based High- Performance Table Matching in OpenFlow Switches Author : Kun Qiu, Zhe Chen, Yang Chen, Jin Zhao, Xin Wang Publisher : Information.
Canny Edge Detection Using an NVIDIA GPU and CUDA Alex Wade CAP6938 Final Project.
GPU Accelerated Vessel Segmentation Using Laplacian Eigenmaps Lin Cheng, Hyunsu Cho and Peter A. Yoon Trinity College.
Fast and parallel implementation of Image Processing Algorithm using CUDA Technology On GPU Hardware Neha Patil Badrinath Roysam Department of Electrical.
Accelerating K-Means Clustering with Parallel Implementations and GPU Computing Janki Bhimani Miriam Leeser Ningfang Mi
Visual Tracking of Surgical Tools in Retinal Surgery using Particle Filtering Group 14 William Yang and David Li Presenter: William Yang Mentor: Dr. Rogerio.
Exploiting Graphics Processors for High-performance IP Lookup in Software Routers Jin Zhao, Xinya Zhang, Xin Wang, Yangdong Deng, Xiaoming Fu IEEE INFOCOM.
A Globally Optimal Algorithm for Robust TV-L 1 Range Image Integration Christopher Zach VRVis Research Center Thomas Pock, Horst Bischof.
Two-Dimensional Phase Unwrapping On FPGAs And GPUs
Stencil-based Discrete Gradient Transform Using
GPU-based iterative CT reconstruction
Parallel Plasma Equilibrium Reconstruction Using GPU
I. E. Venetis1, N. Nikoloutsakos1, E. Gallopoulos1, John Ekaterinaris2
PowerGrid for harnessing high performance computing to reconstruct large sets of high-resolution non-Cartesian MRI data Brad Sutton Assoc Prof, Bioengineering.
Nathan Grabaskas: Batched LA and Parallel Communication Optimization
East China Normal University Fang Li
Gradient Domain Salience-preserving Color-to-gray Conversion
Presentation transcript:

Institute of Medical Engineering 1 20th Annual International Conference on Magnetic Resonance Angiography Graz, Real Time Elimination of Undersampling Artifacts in CE MRA using Variational Denoising on Graphics Hardware F. Knoll 1, M. Unger 2, F. Ebner 3, R. Stollberger 1 1 Institute of Medical Engineering, TU Graz, Austria 2 Institute for Computer Graphics and Vision, TU Graz, Austria, 3 Department of Radiology, Medical University Graz

2 MRA - Club 08 Real Time Elimination of Undersampling Artifacts in CE MRA using Variational Denoising on Graphics Hardware Introduction Total Variation constrained reconstruction methods effectively reduce streaking artifacts from undersampled radial data sets [1]. Iteratively solve a constrained optimization problem. Main Drawback: Computationally expensive → not suitable for daily clinical practice. [1] Block et al., MRM 57: (2007)

3 MRA - Club 08 Real Time Elimination of Undersampling Artifacts in CE MRA using Variational Denoising on Graphics Hardware GPU Computing Make use of the massive parallelism of GPU architectures! Nvidia GForce GTX Processor Cores CUDA (Compute Unified Device Architecture): C type programming access to GPU → Cheap, pocket sized supercomputer How to solve TV optimization problems?

4 MRA - Club 08 Real Time Elimination of Undersampling Artifacts in CE MRA using Variational Denoising on Graphics Hardware Total Variation [2] u…Reconstructed Images f…Original images with artifacts λ...Regularization parameter Ω…Image Domain [2] Rudin et al. Phys. D, 60(1-4) , 1992 → A parallelized formulation is needed for efficient GPU implementation.

5 MRA - Club 08 Real Time Elimination of Undersampling Artifacts in CE MRA using Variational Denoising on Graphics Hardware GPU Implementation Solve the associated dual Euler Lagrange Equations for each pixel in the 3D data set (Chambolle’s Algorithm) [3]. Parallization: A PDE for each pixel. Start an individual thread on the GPU for each calculation. [3] Pock et al., CVGPU Workshop CVPR 2008

6 MRA - Club 08 Real Time Elimination of Undersampling Artifacts in CE MRA using Variational Denoising on Graphics Hardware Results: 80 projections TR=3.74ms TE=1.48ms FA=20° Matrix size (x,y,z) = 448x352x40 Retrospective subsampling in x-y- plane 80 radial projections Δt = 12s CE dataset of the carotid arteries: a)Conventional Regridding Reconstruction b)Reconstruction with TV constraint

7 MRA - Club 08 Real Time Elimination of Undersampling Artifacts in CE MRA using Variational Denoising on Graphics Hardware Results: 40 projections TR=3.74ms TE=1.48ms FA=20° Matrix size (x,y,z) = 448x352x40 Retrospective subsampling in x-y- plane 40 radial projections Δt = 6s CE dataset of the carotid arteries: a)Conventional Regridding Reconstruction b)Reconstruction with TV constraint

8 MRA - Club 08 Real Time Elimination of Undersampling Artifacts in CE MRA using Variational Denoising on Graphics Hardware Reconstruction Time Implementation typeReconstruction time: 448x352x40 Dataset Performance (Iterations/s) CPU version9 min0.18 CUDA, Nvidia GTX s414 Speedup of 2300! GPU reconstruction times allow real time imaging.

9 MRA - Club 08 Real Time Elimination of Undersampling Artifacts in CE MRA using Variational Denoising on Graphics Hardware Conclusion Radial undersampling provides data sets with high temporal resolution. 3D ROF Total Variation efficiently removes streaking artifacts, while preserving blood vessels in MRA. The GPU implementation facilitates image reconstruction times that are far below the corresponding acquisition times. We believe that this may pave the way for these reconstruction strategies, currently promising research topics, to become powerful tools in daily clinical practice. Data fidelity term in image space limits the algorithm to special applications. k-space data fidelity term is WIP.

10 MRA - Club 08 Real Time Elimination of Undersampling Artifacts in CE MRA using Variational Denoising on Graphics Hardware Demonstration Acknowledgements: This work was funded by Austrian Science Fund (FWF) project F32 SFB : „Mathematical Optimization and Applications in Biomedical Sciences”