"Parallel MATLAB in production supercomputing with applications in signal and image processing" Ashok Krishnamurthy David Hudak John Nehrbass Siddharth.

Slides:



Advertisements
Similar presentations
Building a CFD Grid Over ThaiGrid Infrastructure Putchong Uthayopas, Ph.D Department of Computer Engineering, Faculty of Engineering, Kasetsart University,
Advertisements

Distributed Processing, Client/Server and Clusters
High Performance Computing Course Notes Grid Computing.
Page 1 Dorado 400 Series Server Club Page 2 First member of the Dorado family based on the Next Generation architecture Employs Intel 64 Xeon Dual.
Introduction CSCI 444/544 Operating Systems Fall 2008.
WEST VIRGINIA UNIVERSITY HPC and Scientific Computing AN OVERVIEW OF HIGH PERFORMANCE COMPUTING RESOURCES AT WVU.
Ohio Bioinformatics Consortium Research Infrastructure Ashok KrishnamurthyOhio Supercomputer Center Michael Raymer Wright State University Zhong Hui DuanUniversity.
Summary Role of Software (1 slide) ARCS Software Architecture (4 slides) SNS -- Caltech Interactions (3 slides)
1 Week #1 Objectives Review clients, servers, and Windows network models Differentiate among the editions of Server 2008 Discuss the new Windows Server.
Distributed Processing, Client/Server, and Clusters
1 Week #1 Objectives Review clients, servers, and Windows network models Differentiate among the editions of Server 2008 Discuss the new Windows Server.
Chapter 16 Client/Server Computing Patricia Roy Manatee Community College, Venice, FL ©2008, Prentice Hall Operating Systems: Internals and Design Principles,
IBM RS6000/SP Overview Advanced IBM Unix computers series Multiple different configurations Available from entry level to high-end machines. POWER (1,2,3,4)
Silicon Graphics, Inc. Poster Presented by: SGI Proprietary Technologies for Breakthrough Research Rosario Caltabiano North East Higher Education & Research.
Office of Science U.S. Department of Energy Grids and Portals at NERSC Presented by Steve Chan.
DANSE Central Services Michael Aivazis Caltech NSF Review May 23, 2008.
High Performance Computing (HPC) at Center for Information Communication and Technology in UTM.
Distributed Systems: Client/Server Computing
VMware vCenter Server Module 4.
Hands-On Microsoft Windows Server 2008 Chapter 1 Introduction to Windows Server 2008.
Operating Systems Operating System
Parallelization with the Matlab® Distributed Computing Server CBI cluster December 3, Matlab Parallelization with the Matlab Distributed.
STRATEGIES INVOLVED IN REMOTE COMPUTATION
High Performance Computing G Burton – ICG – Oct12 – v1.1 1.

 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
1 Advanced Storage Technologies for High Performance Computing Sorin, Faibish EMC NAS Senior Technologist IDC HPC User Forum, April 14-16, Norfolk, VA.
© 2008 The MathWorks, Inc. ® ® Parallel Computing with MATLAB ® Silvina Grad-Freilich Manager, Parallel Computing Marketing
Technology Overview. Agenda What’s New and Better in Windows Server 2003? Why Upgrade to Windows Server 2003 ?  From Windows NT 4.0  From Windows 2000.
LLNL-PRES-XXXXXX This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under Contract DE-AC52-07NA27344.
Operating Systems CS3502 Fall 2014 Dr. Jose M. Garrido
Bright Cluster Manager Advanced cluster management made easy Dr Matthijs van Leeuwen CEO Bright Computing Mark Corcoran Director of Sales Bright Computing.
PARMON A Comprehensive Cluster Monitoring System A Single System Image Case Study Developer: PARMON Team Centre for Development of Advanced Computing,
Unit – I CLIENT / SERVER ARCHITECTURE. Unit Structure  Evolution of Client/Server Architecture  Client/Server Model  Characteristics of Client/Server.
Debugging and Profiling GMAO Models with Allinea’s DDT/MAP Georgios Britzolakis April 30, 2015.
17-April-2007 High Performance Computing Basics April 17, 2007 Dr. David J. Haglin.
DANSE Central Services Michael Aivazis Caltech NSF Review May 31, 2007.
© 2008 Open Grid Forum Independent Software Vendor (ISV) Remote Computing Primer Steven Newhouse.
Taking the Complexity out of Cluster Computing Vendor Update HPC User Forum Arend Dittmer Director Product Management HPC April,
Ch 1. A Python Q&A Session Spring Why do people use Python? Software quality Developer productivity Program portability Support libraries Component.
Computer Emergency Notification System (CENS)
ATCA based LLRF system design review DESY Control servers for ATCA based LLRF system Piotr Pucyk - DESY, Warsaw University of Technology Jaroslaw.
The PROGRESS Grid Service Provider Maciej Bogdański Portals & Portlets 2003 Edinburgh, July 14th-17th.
Looking Ahead: A New PSU Research Cloud Architecture Chuck Gilbert - Systems Architect and Systems Team Lead Research CI Coordinating Committee Meeting.
Issues Autonomic operation (fault tolerance) Minimize interference to applications Hardware support for new operating systems Resource management (global.
Operating Systems David Goldschmidt, Ph.D. Computer Science The College of Saint Rose CIS 432.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Scalable Systems Software for Terascale Computer Centers Coordinator: Al Geist Participating Organizations ORNL ANL LBNL.
Cluster Software Overview
System Center Lesson 4: Overview of System Center 2012 Components System Center 2012 Private Cloud Components VMM Overview App Controller Overview.
1 Grid Activity Summary » Grid Testbed » CFD Application » Virtualization » Information Grid » Grid CA.
Chapter 9: Networking with Unix and Linux. Objectives: Describe the origins and history of the UNIX operating system Identify similarities and differences.
Comprehensive Scientific Support Of Large Scale Parallel Computation David Skinner, NERSC.
NA-MIC National Alliance for Medical Image Computing Core 1b – Engineering Computational Platform Jim Miller GE Research.
Understanding Parallel Computers Parallel Processing EE 613.
PROGRESS: GEW'2003 Using Resources of Multiple Grids with the Grid Service Provider Michał Kosiedowski.
Tackling I/O Issues 1 David Race 16 March 2010.
INTRODUCTION TO GRID & CLOUD COMPUTING U. Jhashuva 1 Asst. Professor Dept. of CSE.
Architecture of a platform for innovation and research Erik Deumens – University of Florida SC15 – Austin – Nov 17, 2015.
Chapter 16 Client/Server Computing Dave Bremer Otago Polytechnic, N.Z. ©2008, Prentice Hall Operating Systems: Internals and Design Principles, 6/E William.
ENEA GRID & JPNM WEB PORTAL to create a collaborative development environment Dr. Simonetta Pagnutti JPNM – SP4 Meeting Edinburgh – June 3rd, 2013 Italian.
Advanced Computing Facility Introduction
VisIt Project Overview
Working With Azure Batch AI
Tools and Services Workshop Overview of Atmosphere
University of Technology
GAMMA: An Efficient Distributed Shared Memory Toolbox for MATLAB
Advanced Computing Facility Introduction
GAMMA: An Efficient Distributed Shared Memory Toolbox for MATLAB
Building and running HPC apps in Windows Azure
Presentation transcript:

"Parallel MATLAB in production supercomputing with applications in signal and image processing" Ashok Krishnamurthy David Hudak John Nehrbass Siddharth Samsi Vijay Gadepally

2 Functions - Scope of Activity Supercomputing. Computation, software, storage, and support services empower Ohio’s scientists, engineers, faculty, students, businesses and other clients. Networking. Ohio’s universities, colleges, K-12 and state government connect to the network. OSC also provides engineering services, video conferencing, and support through a 24x7 service desk. Research. Lead science and engineering projects, assist researchers with custom needs, partner with regional, national, and international researchers in groundbreaking initiatives, and develop new tools. Education. The Ralph Regula School of Computational Science delivers computational science training to students and companies across Ohio.

3 OSC provides a stable computational infrastructure to support scientific research computing IBM 1350 Opteron dual-core w/IBM Cell 4,000+ cores 8 TBytes memory 22+ teraflops Blend of 4 core and 8 core nodes –Large processor count –Large memory SMP jobs Intel P4 Cluster 2.46 TF 512 processors 1 TBytes memory Itanium2 Cluster 2.7 TF 596 processors 3x16 Alt 1 TBytes memory PRODUCTION COMPUTING Mass Storage 470 TBytes disk 80 TBytes tape NFS, PVFS, iSCSI Infiniband or Myrinet Interconnect Gateway to User Science

4 Visualization Cluster AMD Opteron 72 processors 144 GBytes memory nVIDIA Quadro 5600 graphics card(330 GF) Providing agile computational infrastructure to support the research and innovation process Apple G5 Cluster PowerPC G5 64 processors 128 GBytes mem MATLAB/GRI Cluster AMD Opteron 164 processors 328 GBytes memory RESEARCH COMPUTING Mass Storage 470 TBytes disk 80 TBytes tape NFS, PVFS, iSCSI Gateway to User Science BALE Cluster AMD Athlon processors 220 GBytes memory nVIDIA GeForce 6150 GPU

5 OSC Instrumentation and Analytics Services Remote instrumentation uses OSC’s state-wide resources –Networking, Storage, HPC, Analytics (web service) Parallel MATLAB

Parallel MATLAB Choices MathWorks Distributed Computing Engine + Toolbox Interactive Supercomputing’s Star-P MIT Lincoln Labs pMATLAB + OSC bcMPI EMPOWER. PARTNER. LEAD.

MATLAB® Distributed Computing Toolbox Architecture Image source:

EMPOWER. PARTNER. LEAD. Star-P Architecture Image source:

Dedicated Cluster: Desktop client can directly address cluster compute nodes

Typical Shared Production Cluster: Desktop client cannot directly address cluster compute nodes

Parallel MATLAB Computing Configurations Have to map client+engines to a number of computer configurations Local or Remote –Local - single administrative domain Uniform set of user accounts Implicit trust relationship –Remote - multiple administrative domains Authentication required Dedicated or shared –Dedicated - preallocated resources available on demand –Shared - allocation request must complete prior to interactive session

EMPOWER. PARTNER. LEAD. MATLAB Distributed Computing Engine & Toolbox Mathworks implementation of parallel MATLAB Consists of two products : –MATLAB® Distributed Computing Engine (MDCE) –Distributed Computing Toolbox (DCT) MDCE enables users to run MATLAB applications on a cluster DCT provides toolbox migration : Client’s toolbox licenses are available when the parallel job runs on the cluster Can be used in interactive mode as well as in non- interactive batch jobs

EMPOWER. PARTNER. LEAD. MATLAB® Distributed Computing Engine Customization of scripts to run under shared, remote resources which includes –System specific batch scripts must be generated at run time –Authentication and remote connection setup when a job is submitted to the cluster

EMPOWER. PARTNER. LEAD. MATLAB® Distributed Computing Engine Customization Client side : –Need a custom job submission function as defined in the MDCE documentation for job submission –Need to provide a mechanism to copy job data over to cluster –Client must also resolve any file dependencies for the algorithm being run Image source: Solution number : 1-2KEGMH

EMPOWER. PARTNER. LEAD. MATLAB® Distributed Computing Engine Customization Cluster side customization: –Need “decode” function on MATLAB path on the cluster, as specified in the documentation –Some customization of MPI launch script is required Image source: Solution number : 1-2KEGMH

EMPOWER. PARTNER. LEAD. Star-P Star-P is a client-server parallel computing platform available from Interactive Supercomputing Designed to work with high level languages such as MATLAB and Python Designed for interactive usage

EMPOWER. PARTNER. LEAD. Star-P Customization required to run under the Torque workload manager –Custom parameters to be used include command-line options that control job submission to the cluster –Custom MPI launch mechanism developed by Interactive Supercomputing for use on OSC clusters Does not support toolbox migration Monitoring and debugging server backend processes is still a challenge

EMPOWER. PARTNER. LEAD. pMATLAB + bcMPI Runs on UNIX: tested on Linux, NetBSD, MacOS X API with MatlabMPI –If you can use MatlabMPI, you can probably use bcMPI –bcMPI tags are numeric, MatlabMPI alphanumeric Broadcast, barrier, reduce operations bcMPI supports synchronous or asynchronous sends –MPI_Buffer_attach, MPI_Buffer_detatch, MPI_Probe MPI communicator support (new in v1.1) –Supports many MATLAB data types, but no sparse support

EMPOWER. PARTNER. LEAD. bcMPI Advantages : –Extensible - core library makes it easy to add additional MPI functions and interpreter data types –Portable - no dependencies on any machine, or specific MPI library implementations –Scalable - use efficient algorithms; take advantage of native MPI library and communications hardware –Open source software developed at OSC Can be used only in non-interactive batch jobs

20 Example of remote instrumentation application: access to electron microscope Parallel MATLAB Example: Image analysis of Scanning Electron Microscope Images Scanning Electron Microscope at OSU Center for Accelerated Maturation of Materials Demonstrates real-time user control Adding analytics and collaboration services for image analysis and computational modeling

CAMM Portal Interface 21

EMPOWER. PARTNER. LEAD. MATLAB GUI to set image processing parameters

SEGMENTS ADDED TOGETHER

Parallel Processing : MATLAB GUI

Parallel MATLAB : Results 396 images processed on 2, 4, 8, 12 and 16 processors

EMPOWER. PARTNER. LEAD. Parallel MATLAB Application : Acoustics Signal Processing MATLAB used to analyze acoustic signatures used for self-localization of sensors Comparative analysis using multiple algorithms on multiple data sets – embarrassingly parallel

EMPOWER. PARTNER. LEAD. Parallel MATLAB application : Synthetic Aperture Radar Model [1] Develop synthetic aperture radar model for forest clusters MATLAB used to study invertibility of forest model and to fit model parameters to SAR data from several forests Parallel MATLAB used to perform above studies [1] A Model for Generating Synthetic VHF SAR Forest Clutter Images – Julie Ann Jackson, Randolph L. Moses. Paper submitted to IEEE Aerospace and Electronic Systems Journal

Parallel MATLAB Application: Pathology slide image processing

Summary Parallel MATLAB is of interest to many in OSC’s user community Parallel MATLAB is bringing in new users to High Performance Computing Users would like the same experience as desktop MATLAB, but are willing to give up some of it for quicker turn-around time Providing a complete solution is a lot of work – even for embarrassingly parallel cases EMPOWER. PARTNER. LEAD.