Computationally Efficient Histopathological Image Analysis: Use of GPUs for Classification of Stromal Development Olcay Sertel 1,2, Antonio Ruiz 3, Umit.

Slides:



Advertisements
Similar presentations
+ Accelerating Fully Homomorphic Encryption on GPUs Wei Wang, Yin Hu, Lianmu Chen, Xinming Huang, Berk Sunar ECE Dept., Worcester Polytechnic Institute.
Advertisements

GPGPU Introduction Alan Gray EPCC The University of Edinburgh.
HPCC Mid-Morning Break High Performance Computing on a GPU cluster Dirk Colbry, Ph.D. Research Specialist Institute for Cyber Enabled Discovery.
IMGD 4000: Computer Graphics in Games Emmanuel Agu.
Computer graphics & visualization Jens Schneider Martin Kraus Rüdiger Westermann.
A many-core GPU architecture.. Price, performance, and evolution.
Acceleration of the Smith– Waterman algorithm using single and multiple graphics processors Author : Ali Khajeh-Saeed, Stephen Poole, J. Blair Perot. Publisher:
Rasterization and Ray Tracing in Real-Time Applications (Games) Andrew Graff.
Multi Agent Simulation and its optimization over parallel architecture using CUDA™ Abdur Rahman and Bilal Khan NEDUET(Department Of Computer and Information.
Introduction CS 524 – High-Performance Computing.
Control Flow Virtualization for General-Purpose Computation on Graphics Hardware Ghulam Lashari Ondrej Lhotak University of Waterloo.
1 ITCS 6/8010 CUDA Programming, UNC-Charlotte, B. Wilkinson, Jan 19, 2011 Emergence of GPU systems and clusters for general purpose High Performance Computing.
3D Graphics Processor Architecture Victor Moya. PhD Project Research on architecture improvements for future Graphic Processor Units (GPUs). Research.
IN4151 Introduction 3D graphics 1 Introduction to 3D computer graphics part 2 Viewing pipeline Multi-processor implementation GPU architecture GPU algorithms.
Real-World GPGPU Mark Harris NVIDIA Developer Technology.
ATI GPUs and Graphics APIs Mark Segal. ATI Hardware X1K series 8 SIMD vertex engines, 16 SIMD fragment (pixel) engines 3-component vector + scalar ALUs.
Evolutions of GPU Architectures Andrew Coile CMPE220 3/2007.
Accelerating Machine Learning Applications on Graphics Processors Narayanan Sundaram and Bryan Catanzaro Presented by Narayanan Sundaram.
GPU Graphics Processing Unit. Graphics Pipeline Scene Transformations Lighting & Shading ViewingTransformations Rasterization GPUs evolved as hardware.
GPGPU overview. Graphics Processing Unit (GPU) GPU is the chip in computer video cards, PS3, Xbox, etc – Designed to realize the 3D graphics pipeline.
To GPU Synchronize or Not GPU Synchronize? Wu-chun Feng and Shucai Xiao Department of Computer Science, Department of Electrical and Computer Engineering,
GPGPU platforms GP - General Purpose computation using GPU
HPCC Mid-Morning Break Dirk Colbry, Ph.D. Research Specialist Institute for Cyber Enabled Discovery Introduction to the new GPU (GFX) cluster.
Accelerating SQL Database Operations on a GPU with CUDA Peter Bakkum & Kevin Skadron The University of Virginia GPGPU-3 Presentation March 14, 2010.
Interactive Visualization of Volumetric Data on Consumer PC Hardware: Introduction Daniel Weiskopf Graphics Hardware Trends Faster development than Moore’s.
CSE 690 General-Purpose Computation on Graphics Hardware (GPGPU) Courtesy David Luebke, University of Virginia.
General-Purpose Computation on Graphics Hardware.
Motivation “Every three minutes a woman is diagnosed with Breast cancer” (American Cancer Society, “Detailed Guide: Breast Cancer,” 2006) Explore the use.
GPU-accelerated Evaluation Platform for High Fidelity Networking Modeling 11 December 2007 Alex Donkers Joost Schutte.
Training Program on GPU Programming with CUDA 31 st July, 7 th Aug, 14 th Aug 2011 CUDA Teaching UoM.
CuMAPz: A Tool to Analyze Memory Access Patterns in CUDA
Database and Stream Mining using GPUs Naga K. Govindaraju UNC Chapel Hill.
Codeplay CEO © Copyright 2012 Codeplay Software Ltd 45 York Place Edinburgh EH1 3HP United Kingdom Visit us at The unique challenges of.
Computer Graphics Graphics Hardware
BY: ALI AJORIAN ISFAHAN UNIVERSITY OF TECHNOLOGY 2012 GPU Architecture 1.
By Arun Bhandari Course: HPC Date: 01/28/12. GPU (Graphics Processing Unit) High performance many core processors Only used to accelerate certain parts.
Advisor: Dr. Aamir Shafi Co-Advisor: Mr. Ali Sajjad Member: Dr. Hafiz Farooq Member: Mr. Tahir Azim Optimizing N-body Simulations for Multi-core Compute.
Massively Parallel Mapping of Next Generation Sequence Reads Using GPUs Azita Nouri, Reha Oğuz Selvitopi, Özcan Öztürk, Onur Mutlu, Can Alkan Bilkent University,
Neuroblastoma Stroma Classification on the Sony Playstation 3 Tim Hartley, Olcay Sertel, Mansoor Khan, Umit Catalyurek, Joel Saltz, Metin Gurcan Department.
Programming Concepts in GPU Computing Dušan Gajić, University of Niš Programming Concepts in GPU Computing Dušan B. Gajić CIITLab, Dept. of Computer Science.
General-Purpose Computation on Graphics Hardware.
Accelerating image recognition on mobile devices using GPGPU
Stream Processing Main References: “Comparing Reyes and OpenGL on a Stream Architecture”, 2002 “Polygon Rendering on a Stream Architecture”, 2000 Department.
1)Leverage raw computational power of GPU  Magnitude performance gains possible.
May 8, 2007Farid Harhad and Alaa Shams CS7080 Overview of the GPU Architecture CS7080 Final Class Project Supervised by: Dr. Elias Khalaf By: Farid Harhad.
A SEMINAR ON 1 CONTENT 2  The Stream Programming Model  The Stream Programming Model-II  Advantage of Stream Processor  Imagine’s.
GPUs – Graphics Processing Units Applications in Graphics Processing and Beyond COSC 3P93 – Parallel ComputingMatt Peskett.
Debunking the 100X GPU vs. CPU Myth An Evaluation of Throughput Computing on CPU and GPU Present by Chunyi Victor W Lee, Changkyu Kim, Jatin Chhugani,
Shangkar Mayanglambam, Allen D. Malony, Matthew J. Sottile Computer and Information Science Department Performance.
From Turing Machine to Global Illumination Chun-Fa Chang National Taiwan Normal University.
© David Kirk/NVIDIA and Wen-mei W. Hwu, ECE408/CS483, University of Illinois, Urbana-Champaign 1 Graphic Processing Processors (GPUs) Parallel.
Sunpyo Hong, Hyesoon Kim
Ray Tracing by GPU Ming Ouhyoung. Outline Introduction Graphics Hardware Streaming Ray Tracing Discussion.
GPGPU introduction. Why is GPU in the picture Seeking exa-scale computing platform Minimize power per operation. – Power is directly correlated to the.
Scientific Computing Goals Past progress Future. Goals Numerical algorithms & computational strategies Solve specific set of problems associated with.
3/12/2013Computer Engg, IIT(BHU)1 CUDA-3. GPGPU ● General Purpose computation using GPU in applications other than 3D graphics – GPU accelerates critical.
Fast and parallel implementation of Image Processing Algorithm using CUDA Technology On GPU Hardware Neha Patil Badrinath Roysam Department of Electrical.
Graphic Processing Units Presentation by John Manning.
S. Pardi Frascati, 2012 March GPGPU Evaluation – First experiences in Napoli Silvio Pardi.
Accelerating K-Means Clustering with Parallel Implementations and GPU Computing Janki Bhimani Miriam Leeser Ningfang Mi
Exploiting Graphics Processors for High-performance IP Lookup in Software Routers Jin Zhao, Xinya Zhang, Xin Wang, Yangdong Deng, Xiaoming Fu IEEE INFOCOM.
General Purpose computing on Graphics Processing Units
Computer Graphics Graphics Hardware
GPU Architecture and Its Application
Graphics Processing Unit
GP2: General Purpose Computation using Graphics Processors
Static Image Filtering on Commodity Graphics Processors
Computer Graphics Graphics Hardware
Ray Tracing on Programmable Graphics Hardware
CSE 502: Computer Architecture
Presentation transcript:

Computationally Efficient Histopathological Image Analysis: Use of GPUs for Classification of Stromal Development Olcay Sertel 1,2, Antonio Ruiz 3, Umit Catayurek 1,2, Manuel Ujaldon 3, Joel Saltz 1, Metin Gurcan 1 Dept. of Computer Architecture, 1 Dept. of Biomedical Informatics, 2 Dept. of Electrical & Computer Engineering, 3 Dept. of Pathology, The Ohio State University, 3 Dept. of Computer Architecture, The University of Malaga

2 Why do we need high-performance tools?  The size of a single whole-slide image is extremely large!  Typically an uncompressed whole-slide image digitized at 40x is more than 40GB.  A spatial resolution of 120K x 120K  120K x 120K x 3 Bytes(RGB) per pixel ≈ 43.2 GB  Complicated and time-consuming image analysis algorithms.

3 Parallel processing infrastructure ` Whole-slide image Label 1 Label 2 Background Label 3 Assign classification labels Classification map Image tiles (40X magnification) Processor 1Processor N ……… Parallel Classification

4 What is GPGPU?  GPGPU stands for General Purpose Graphics Processing Units  Initially designed for gaming applications  Fast GPUs are used to implement complex shader and rendering operations for real-time effects. Doom 3, © id Software Call of Duty, © Infinity Ward

5 Applications Physically-based Simulation Particle Systems Molecular Dynamics Fluid models Signal and Image Processing Segmentation Volume Rendering Visualization Photon Mapping Ray Tracing Medical Image Analysis Databases & Data Mining Database queries Stream Mining

6 GPU resources CPUGPU Processor clock2.13 GHz575 MHz Raw computational power10 GFLOPS520 GFLOPS Memory bus width64 bits384 bits Memory clock2x333 MHz2x900 MHz Memory bandwidth10.8 GB/s86.4 GB/s Memory size and type2 Gb DDR2768 Mb GDDR3  GPUs:  Speed increasing at cubed- Moore’s law!  Ubiquitous and inexpensive  Functional units for specific graphics-based operations (vertex & pixel shaders)  Small memory but raw computational power  Memory bandwidth & clock provides superior performance

7 GPU implementation  The implementation is crucial  Programming model is unusual  Programming idioms tied to computer graphics  Programming environment tightly constrained  Can’t simply port CPU code:  Poorly suited to sequential, “pointer-chasing” code  Missing support for some basic functionality (e.g., integers, bitwise operations)  Underlying architectures are:  Inherently parallel  Rapidly evolving (even in basic feature set!)  Largely secret

8 Computational savings on GPUs Execution times (in msec.) for a 1Kx1K image tile. CPU (Matlab)CPU (C++)GPU LA*B* conversion Statistical features LBP Total Processing of a relatively small whole-slide image of 50Kx50K size is: 47 sec. on GPU 35 min. on CPU Task to performC++ vs. MatlabGPU vs. C++GPU vs. Matlab RGB to LA*B* conv. 5.9x - 5.2x69.2x x406.1x x Statistical features 122.2x x0.2x - 2.1x21.8x x LBP operator 8.3x - 3.9x4.2x x34.6x x TOTAL 13.3x - 7.6x2.6x x33.4x x Performance gain depends on image resolution, varying from 128x128 to 1024x1024

9 Verification of the out values MeanStandard deviation CPU(Matlab) / CPU(C++) 1.4     CPU(C++) / GPU 6.5     CPU(Matlab) / GPU 1.5     Verification of the output values across hardware platforms obtained from 500 training images. There is no variation in the classification accuracy when using the feature values computed on GPU

10 Future directions & Conclusions  Processing of the whole-slide images is essential to overcome the sampling bias problem.  We need HPC tools that are available due to the huge sizes of whole-slide images and sophisticated image analysis algorithms  The processing time can be reduced drastically using different infrastructures  We are investigating novel ways of whole-slide images over various computational infrastructures  Cluster of GPUs  One drawback of GPUs is the low-level programmability  Requires good knowledge of architecture  Rapid changes in the architecture  However, higher level development tools (CUDA by NVidia)

11 Thanks for your attention Any questions?