IM.Grid: A Grid Computing Solution for image processing

Slides:



Advertisements
Similar presentations
Structure of Computer Systems
Advertisements

Chang, Chia-Hao (Howard) Comp 1631 Winter Semester Multi-Core/Processor.
Take An Internal Look at Hadoop Hairong Kuang Grid Team, Yahoo! Inc
Juan Mendivelso.  Serial Algorithms: Suitable for running on an uniprocessor computer in which only one instruction executes at a time.  Parallel Algorithms:
COLLEGE FOR PROFESSIONAL STUDIES TOPIC OF PRESENTATION PROCESSOR IN COMPUTER.
 Introduction to Operating System Introduction to Operating System  Types Of An Operating System Types Of An Operating System  Single User Single User.
Lappeenranta University of Technology / JP CT30A7001 Concurrent and Parallel Computing Introduction to concurrent and parallel computing.
BY: ALI AJORIAN ISFAHAN UNIVERSITY OF TECHNOLOGY 2012 GPU Architecture 1.
Introduction, background, jargon Jakub Yaghob. Literature T.G.Mattson, B.A.Sanders, B.L.Massingill: Patterns for Parallel Programming, Addison- Wesley,
Example: Sorting on Distributed Computing Environment Apr 20,
Parallel Algorithms & Distributed Computing Matt Stimmel Matt White.
O PTIMAL SERVICE TASK PARTITION AND DISTRIBUTION IN GRID SYSTEM WITH STAR TOPOLOGY G REGORY L EVITIN, Y UAN -S HUN D AI Adviser: Frank, Yeong-Sung Lin.
CS- 492 : Distributed system & Parallel Processing Lecture 7: Sun: 15/5/1435 Foundations of designing parallel algorithms and shared memory models Lecturer/
DynamicMR: A Dynamic Slot Allocation Optimization Framework for MapReduce Clusters Nanyang Technological University Shanjiang Tang, Bu-Sung Lee, Bingsheng.
Big data Usman Roshan CS 675. Big data Typically refers to datasets with very large number of instances (rows) as opposed to attributes (columns). Data.
Big traffic data processing framework for intelligent monitoring and recording systems 學生 : 賴弘偉 教授 : 許毅然 作者 : Yingjie Xia a, JinlongChen a,b,n, XindaiLu.
Parallel processing
3/12/2013Computer Engg, IIT(BHU)1 INTRODUCTION-1.
Computer Structure 2015 – Intel ® Core TM μArch 1 Computer Structure Multi-Threading Lihu Rappoport and Adi Yoaz.
Barnes Hut – A Broad Review Abhinav S Bhatele The 27th day of April, 2006.
Background Computer System Architectures Computer System Software.
SCHOOL OF ENGINEERING AND ADVANCED TECHNOLOGY Engineering Project Routing in Small-World Networks.
Large-scale geophysical electromagnetic imaging and modeling on graphical processing units Michael Commer (LBNL) Filipe R. N. C. Maia (LBNL-NERSC) Gregory.
S. Pardi Frascati, 2012 March GPGPU Evaluation – First experiences in Napoli Silvio Pardi.
StoRM+Lustre Performance Test with 10Gbps Network YAN Tian for Distributed Computing Group Meeting Nov. 4th, 2014.
Dynamic Mobile Cloud Computing: Ad Hoc and Opportunistic Job Sharing.
BAHIR DAR UNIVERSITY Institute of technology Faculty of Computing Department of information technology Msc program Distributed Database Article Review.
Using the VTune Analyzer on Multithreaded Applications
Optimizing Distributed Actor Systems for Dynamic Interactive Services
Modeling Big Data Execution speed limited by: Model complexity
Introduction to Parallel Computing: MPI, OpenMP and Hybrid Programming
Chapter 1: Introduction
Parallel Plasma Equilibrium Reconstruction Using GPU
CS427 Multicore Architecture and Parallel Computing
Chilimbi, et al. (2014) Microsoft Research
VirtualGL.
Multi-core processors
Multi-core processors
High Performance Computing on an IBM Cell Processor --- Bioinformatics
Computer Structure Multi-Threading
Parallel Density-based Hybrid Clustering
Spatial Analysis With Big Data
Steven Ge, Xinmin Tian, and Yen-Kuang Chen
Multi-core processors
Core i7 micro-processor
Introduction to Parallelism.
Real-Time Ray Tracing Stefan Popov.
Algorithm Analysis CSE 2011 Winter September 2018.
Challenging Cloning Related Problems with GPU-Based Algorithms
Lifecycle Suppose we have two processes that require the CPU. The first one had the CPU and you would like to let the second process run, ie context switch.
Hyperthreading Technology
April 30th – Scheduling / parallel
Yu Su, Yi Wang, Gagan Agrawal The Ohio State University
Project BNB-Grid: solving large scale optimization problems in a distributed environment Good afternoon, I’m Mikhail Posypkin from Institute for System.
Kai Bu 13 Multiprocessors So today, we’ll finish the last part of our lecture sessions, multiprocessors.
Chapter 4: Threads.
CLUSTER COMPUTING.
Fast Communication and User Level Parallelism
Hybrid Programming with OpenMP and MPI
Fine-grained vs Coarse-grained multithreading
Multithreaded Programming
Subject Name: Operating System Concepts Subject Number:
PROCESSES & THREADS ADINA-CLAUDIA STOICA.
Operating System Introduction.
LO2 – Understand Computer Software
CS 286 Computer Organization and Architecture
Vrije Universiteit Amsterdam
Operating System Overview
Chapter 1: Introduction
Hwajung Lee ITEC452 Distributed Computing Lecture 1 Introduction to Distributed Systems.
Presentation transcript:

IM.Grid: A Grid Computing Solution for image processing Image Mining Group @ Institut Pasteur Korea HongKee Moon IM.Grid: A Grid Computing Solution for image processing

Introduction Goals How? Increase speed of computation Make it much faster! How? Super powerful computer Distributed computing Multithreading Thread is a light-weight processing unit A program can have multiple threads running simultaneously Grid computing A program exploits multiple remote computers Difficult to implement GPGPU-Grid computing A program exploits multiple remote computers powered by GPGPU Very difficult to implement

Index Goals & Structure Comparisons Demo Multithreads & Grids Results Possible applications

Goals Reliability High performance

Image Processing Pipeline

Structure

Structure

Batch Processing Load an image Process the image Results Results

Multithreading Load two images Process the images Results

Grid Computing Network Results Load ten images Process the images

GPGPU-Grid Computing Network Results Load ten images Process the images

IM 1.0 IM 1.0 Batch process mode Costs less memory but slow Process screening well by well Takes 384 seconds (if it takes 1 second for processing a well of 384 well-plate) Costs less memory but slow As you have seen the batch process diagram already, It processes screening well by well.

IM 2.0 + Multithreads IM 2.0 + Multithreads Multithreaded process Process screening multiple wells simultaneously Takes 384/N seconds (N: number of threads) High performance Needs large amounts of memory for loading N images Out-of-memory exception In the multithreads, It processes screening multiple wells simultaneously. If you have 10 threads, it just takes 38 seconds. However, it might be out-of-memory exception if it loads multiple big pictures in parallel.

IM 2.0 + Grids IM 2.0 + Grids Same as Multithreaded process Process screening multiple wells simultaneously Takes 384/M seconds (M: number of grids) Communication to grids Guarantees high performance with less memory Suitable for HT-HCS The performance of grids can be more or less same as the multithreading approach.

IM 2.0 + GPGPU + Grids

Demo Compare batch-mode and multithreads IM2 multithread demo.avi Compare multithreads and grid computing Multithreads Video.avi Dell 690 workstation 2 1.8GHz Quad-core with 4GB memory Grid Video.avi 10 Dell PowerEdge Blade Servers Each grid has 2 3GHz processors with hyper-threading and 2GB memory Now I’m showing you several movies in which we tested our approaches.

Multithreads & Grids Multithreads Grids Multiple instances of plugin Allocates (N x plugin instance) + (N x images) in the memory of one computer Image data is loaded in PC  high network bandwidth Better performance if PC has multi cores Grids Multithreads only for communication with multiple computers  sends parameters and receives results Less memory usage Image data is loaded in each grid  low network bandwidth Performance is guaranteed regardless of PC performance

Result 1 We wanted to check how different they are with respect to the speed of computations. we gave them the hard summation problems. The trends of Batch and multithread are quite steep compared to grid computing having a flat line. The performance gaps become big as the number of iterations increase.

Result 2 We also wanted to test how the network bandwidth affects image processing. In the grid mode, processing times are almost same in 1GB line and Wireless. Because the PC didn’t have to load images when it used multiple grids. However, there are huge differences in the batch mode.

Possible applications High Throughput–High Content Screening Search for optimal parameters during new algorithm development Solve complex problems with divide-conquer strategy Can be used for another general purpose Let us know if you want to use Grids!

Reference “IM.Grid, a Grid Computing Approach in High Throughput-High Content Screening”, The 9th IEEE/ACM International Conference on Grid Computing, 2008