High Performance Computing Kyaw Zwa Soe (Director) Ministry of Science & Technology Centre of Advanced Science & Technology.

Slides:



Advertisements
Similar presentations
Commodity Computing Clusters - next generation supercomputers? Paweł Pisarczyk, ATM S. A.
Advertisements

♦ Commodity processor with commodity inter- processor connection Clusters Pentium, Itanium, Opteron, Alpha GigE, Infiniband, Myrinet, Quadrics, SCI NEC.
Today’s topics Single processors and the Memory Hierarchy
Beowulf Supercomputer System Lee, Jung won CS843.
ENIAC, first US Electronic Computer, ENIAC “circuit board”
Zhao Lixing.  A supercomputer is a computer that is at the frontline of current processing capacity, particularly speed of calculation.  Supercomputers.
ICS 556 Parallel Algorithms Ebrahim Malalla Office: Bldg 22, Room
Types of Parallel Computers
Information Technology Center Introduction to High Performance Computing at KFUPM.
Presented by: Yash Gurung, ICFAI UNIVERSITY.Sikkim BUILDING of 3 R'sCLUSTER PARALLEL COMPUTER.
ASU/TGen Computational Facility.
History of Distributed Systems Joseph Cordina
Silicon Graphics, Inc. Poster Presented by: SGI Proprietary Technologies for Breakthrough Research Rosario Caltabiano North East Higher Education & Research.
Supercomputers Daniel Shin CS 147, Section 1 April 29, 2010.
A Comparative Study of Network Protocols & Interconnect for Cluster Computing Performance Evaluation of Fast Ethernet, Gigabit Ethernet and Myrinet.
An Introduction to Princeton’s New Computing Resources: IBM Blue Gene, SGI Altix, and Dell Beowulf Cluster PICASso Mini-Course October 18, 2006 Curt Hillegas.
Beowulf Cluster Computing Each Computer in the cluster is equipped with: – Intel Core 2 Duo 6400 Processor(Master: Core 2 Duo 6700) – 2 Gigabytes of DDR.
Arquitectura de Sistemas Paralelos e Distribuídos Paulo Marques Dep. Eng. Informática – Universidade de Coimbra Ago/ Machine.
Linux clustering Morris Law, IT Coordinator, Science Faculty, Hong Kong Baptist University.
1 Computer Science, University of Warwick Architecture Classifications A taxonomy of parallel architectures: in 1972, Flynn categorised HPC architectures.
Universidad Politécnica de Baja California. Juan P. Navarro Sanchez 9th level English Teacher: Alejandra Acosta The Beowulf Project.
1b.1 Types of Parallel Computers Two principal approaches: Shared memory multiprocessor Distributed memory multicomputer ITCS 4/5145 Parallel Programming,
Chapter 2 Computer Clusters Lecture 2.1 Overview.
1 Parallel computing and its recent topics. 2 Outline 1. Introduction of parallel processing (1)What is parallel processing (2)Classification of parallel.
Benchmarks for Parallel Systems Sources/Credits:  “Performance of Various Computers Using Standard Linear Equations Software”, Jack Dongarra, University.
Bob Thome, Senior Director of Product Management, Oracle SIMPLIFYING YOUR HIGH AVAILABILITY DATABASE.
Tools and Utilities for parallel and serial codes in ENEA-GRID environment CRESCO Project: Salvatore Raia SubProject I.2 C.R. ENEA-Portici. 11/12/2007.
Lappeenranta University of Technology / JP CT30A7001 Concurrent and Parallel Computing Introduction to concurrent and parallel computing.
Principles of Scalable HPC System Design March 6, 2012 Sue Kelly Sandia National Laboratories Abstract: Sandia National.
QCD Project Overview Ying Zhang September 26, 2005.
CLUSTER COMPUTING STIMI K.O. ROLL NO:53 MCA B-5. INTRODUCTION  A computer cluster is a group of tightly coupled computers that work together closely.
1b.1 Types of Parallel Computers Two principal approaches: Shared memory multiprocessor Distributed memory multicomputer ITCS 4/5145 Parallel Programming,
March 3rd, 2006 Chen Peng, Lilly System Biology1 Cluster and SGE.
The Red Storm High Performance Computer March 19, 2008 Sue Kelly Sandia National Laboratories Abstract: Sandia National.
Taking the Complexity out of Cluster Computing Vendor Update HPC User Forum Arend Dittmer Director Product Management HPC April,
Presented by Reliability, Availability, and Serviceability (RAS) for High-Performance Computing Stephen L. Scott and Christian Engelmann Computer Science.
Loosely Coupled Parallelism: Clusters. Context We have studied older archictures for loosely coupled parallelism, such as mesh’s, hypercubes etc, which.
O AK R IDGE N ATIONAL L ABORATORY U. S. D EPARTMENT OF E NERGY 1 On-line Automated Performance Diagnosis on Thousands of Processors Philip C. Roth Future.
A High-Performance Scalable Graphics Architecture Daniel R. McLachlan Director, Advanced Graphics Engineering SGI.
Center for Computational Sciences O AK R IDGE N ATIONAL L ABORATORY U. S. D EPARTMENT OF E NERGY Vision for OSC Computing and Computational Sciences
Parallel Computing Department Of Computer Engineering Ferdowsi University Hossain Deldari.
- Rohan Dhamnaskar. Overview  What is a Supercomputer  Some Concepts  Couple of examples.
NIH Resource for Biomolecular Modeling and Bioinformatics Beckman Institute, UIUC NAMD Development Goals L.V. (Sanjay) Kale Professor.
NIH Resource for Biomolecular Modeling and Bioinformatics Beckman Institute, UIUC NAMD Development Goals L.V. (Sanjay) Kale Professor.
CPU Benyovszky Máté. Bevezetés Szubjektív témák Horizontális kitekintés – Aktualitások, újdonságok – Nem sok történelem Itt képek - doksiban linkek főleg.
CLUSTER COMPUTING TECHNOLOGY BY-1.SACHIN YADAV 2.MADHAV SHINDE SECTION-3.
Computing Resources at Vilnius Gediminas Technical University Dalius Mažeika Parallel Computing Laboratory Vilnius Gediminas Technical University
Computing Environment The computing environment rapidly evolving ‑ you need to know not only the methods, but also How and when to apply them, Which computers.
By Chi-Chang Chen.  Cluster computing is a technique of linking two or more computers into a network (usually through a local area network) in order.
Outline Why this subject? What is High Performance Computing?
COMP381 by M. Hamdi 1 Clusters: Networks of WS/PC.
1 High Performance Computing: A Look Behind and Ahead Jack Dongarra Computer Science Department University of Tennessee.
Lecture 11: Unix Clusters Asoc. Prof. Guntis Barzdins Asist. Girts Folkmanis University of Latvia Dec 10, 2004.
3/12/2013Computer Engg, IIT(BHU)1 INTRODUCTION-2.
Computer System Evolution. Yesterday’s Computers filled Rooms IBM Selective Sequence Electroinic Calculator, 1948.
The types of computers and their functionalities.
Constructing a system with multiple computers or processors 1 ITCS 4/5145 Parallel Programming, UNC-Charlotte, B. Wilkinson. Jan 13, 2016.
VGrADS and GridSolve Asim YarKhan Jack Dongarra, Zhiao Shi, Fengguang Song Innovative Computing Laboratory University of Tennessee VGrADS Workshop – September.
Advanced Network Administration Computer Clusters.
Super Computing By RIsaj t r S3 ece, roll 50.
Constructing a system with multiple computers or processors
CRESCO Project: Salvatore Raia
32nd TOP500 List SC08, Austin, TX.
Is System X for Me? Cal Ribbens Computer Science Department
CLUSTER COMPUTING.
Constructing a system with multiple computers or processors
Constructing a system with multiple computers or processors
Constructing a system with multiple computers or processors
Types of Parallel Computers
Presentation transcript:

High Performance Computing Kyaw Zwa Soe (Director) Ministry of Science & Technology Centre of Advanced Science & Technology

Why we need high performance computing? Today, computing means not only to crunch the numbers Today, computing means not only to crunch the numbers Add too many graphical add-ons over the computing ( e.g. 3D games, video editing programs) Add too many graphical add-ons over the computing ( e.g. 3D games, video editing programs) Have to execute too many programs at the same time Have to execute too many programs at the same time Most of the programs have characteristic of parallel computing (such as VHDL ) Most of the programs have characteristic of parallel computing (such as VHDL ) Very popular in Nural- Network and Genetic Algorithms computing Very popular in Nural- Network and Genetic Algorithms computing

Ways of Improving Increasing the Clock Speed (has limitation on silicon fabrication technology) Increasing the Clock Speed (has limitation on silicon fabrication technology) Improvements by Architecture Improvements by Architecture Pipe line Architecture Pipe line Architecture Vector or Array Processor Architecture Vector or Array Processor Architecture Cluster and Grid Computing Cluster and Grid Computing CISC (Complex Instruction Set Computers) vs. RISC (Reduced Instruction Set Computers) CISC (Complex Instruction Set Computers) vs. RISC (Reduced Instruction Set Computers)

Pipe Line Like an assembly line in the factory Like an assembly line in the factory Need particular attention in dividing the task of each step in the pipe line Need particular attention in dividing the task of each step in the pipe line Not a problem oriented solution, so each instruction of the predecessor processor can be implemented Not a problem oriented solution, so each instruction of the predecessor processor can be implemented Minimum cycle time of the pipe line is limited by the longest time required to complete the most time consuming task that can not be divided Minimum cycle time of the pipe line is limited by the longest time required to complete the most time consuming task that can not be divided Faster, but in the scale of 10 or more is very difficult Faster, but in the scale of 10 or more is very difficult

Array or Vector Processors Single Control Unit Processor Memory.....

Why we choose Cluster Easy to implement Easy to implement Can be constructed by commodity equipments Can be constructed by commodity equipments Scalable and reliable ( because failure occurs, repair and maintenance is very easy) Scalable and reliable ( because failure occurs, repair and maintenance is very easy) Very cost effective Very cost effective Application sensitive but not by HW. Application sensitive but not by HW.

What is Cluster Computing All About? First, what is a cluster? First, what is a cluster? A cluster is a large group of computers networked together, all controlled by a “head node”.

What is Cluster Computing All About? Why do we want a cluster? Why do we want a cluster? To do complex calculations To do complex calculations The other option is a single super computer with a small number of extremely fast processors. The other option is a single super computer with a small number of extremely fast processors. Expensive! Expensive! Vs.

What is Cluster Computing All About? Limitations of cluster computing: Limitations of cluster computing: Work needs to be easily divisible: Work needs to be easily divisible: Work can be divided up into computationally independent segments Work can be divided up into computationally independent segments Work to be done Finished work Divided up Each computer does its own piece of the whole

Closed Cluster Configuration compute node compute node compute node compute node High Speed Network Service Network gateway node External Network compute node compute node compute node compute node High Speed Network gateway node External Network File Server node Front-end

High Performance Computing Cluster High Performance Computing Cluster Load Balancing Load Balancing High Availability Cluster High Availability Cluster Different kinds of PC cluster

High Performance Computing Cluster (HPCC) Start from 1994 Start from 1994 Donald Becker of NASA assemble the world ’ s first cluster with 16 sets of DX4 PCs and 10 Mb/s ethernet Donald Becker of NASA assemble the world ’ s first cluster with 16 sets of DX4 PCs and 10 Mb/s ethernet Also called Beowulf cluster Also called Beowulf cluster Built from commodity off-the-shelf hardware Built from commodity off-the-shelf hardware Applications like data mining, simulations, parallel processing, weather modelling, computer graphical rendering, etc. Applications like data mining, simulations, parallel processing, weather modelling, computer graphical rendering, etc.

Load Balancing Cluster PC cluster deliver load balancing performance PC cluster deliver load balancing performance Commonly used with busy ftp and web servers with large client base Commonly used with busy ftp and web servers with large client base Large number of nodes to share load Large number of nodes to share load

High Availability Cluster Avoid downtime of services Avoid downtime of services Avoid single point of failure Avoid single point of failure Always with redundancy Always with redundancy Almost all load balancing cluster are with HA capability Almost all load balancing cluster are with HA capability

Beowulf cluster Beowulf cluster

Message Passing Interface Message Passing Interface MPICH ( MPICH ( LAM/MPI ( LAM/MPI ( Mathematical Mathematical fftw (fast fourier transform) fftw (fast fourier transform) pblas (parallel basic linear algebra software) pblas (parallel basic linear algebra software) atlas (a collections of mathematical library) atlas (a collections of mathematical library) sprng (scalable parallel random number generator) sprng (scalable parallel random number generator) sprng MPITB -- MPI toolbox for MATLAB MPITB -- MPI toolbox for MATLAB MPITB Quantum Chemistry software Quantum Chemistry software gaussian, qchem gaussian, qchemqchem Molecular Dynamic solver Molecular Dynamic solver NAMD, gromacs, gamess NAMD, gromacs, gamess NAMDgromacsgamess NAMDgromacsgamess Weather modelling Weather modelling MM5 ( MM5 ( Cluster computing applications

Relative Interconnect Performance LANSustainedMb/sTheoreticalMb/sLatencyµSec Fast Ethernet Gigabit Ethernet Myrinet Dolphin ScaLI

Necessary Software Modules for Building & Running a Cluster System Clustering Clustering Installation Installation Administration Administration Monitoring Monitoring Tools/Libraries Tools/Libraries File system File system Job Management Job Management Scheduler Scheduler

RankSite Country/Year Computer / Processors Manufacturer R max R peak 1DOE/NNSA/LLNL DOE/NNSA/LLNL United States/2005 BlueGene/L eServer Blue Gene Solution / IBM eServer Blue Gene Solution IBM Thomas J. Watson Research Center IBM Thomas J. Watson Research Center United States/2005 BGW eServer Blue Gene Solution / IBM eServer Blue Gene Solution NASA/Ames Research Center/NAS NASA/Ames Research Center/NAS United States/2004 Columbia SGI Altix 1.5 GHz, Voltaire Infiniband / SGI SGI Altix 1.5 GHz, Voltaire Infiniband The Earth Simulator Center The Earth Simulator Center Japan/2002 Earth-SimulatorEarth-Simulator / 5120 NEC Barcelona Supercomputer Center Barcelona Supercomputer Center Spain/2005 MareNostrum JS20 Cluster, PPC 970, 2.2 GHz, Myrinet / 4800 IBM JS20 Cluster, PPC 970, 2.2 GHz, Myrinet ASTRON/University Groningen ASTRON/University Groningen Netherlands/2005 eServer Blue Gene SolutioneServer Blue Gene Solution / IBM Lawrence Livermore National Laboratory Lawrence Livermore National Laboratory United States/2004 Thunder Intel Itanium2 Tiger4 1.4GHz - Quadrics / 4096 California Digital Corporation Intel Itanium2 Tiger4 1.4GHz - Quadrics Computational Biology Research Center, AIST Computational Biology Research Center, AIST Japan/2005 Blue Protein eServer Blue Gene Solution / 8192 IBM eServer Blue Gene Solution

9Ecole Polytechnique Federale de Lausanne Ecole Polytechnique Federale de Lausanne Switzerland/2005 eServer Blue Gene SolutioneServer Blue Gene Solution / 8192 IBM Sandia National Laboratories Sandia National Laboratories United States/2005 Red Storm, Cray XT3, 2.0 GHzRed Storm, Cray XT3, 2.0 GHz / 5000 Cray Inc Oak Ridge National Laboratory Oak Ridge National Laboratory United States/2005 Cray XT3, 2.4 GHzCray XT3, 2.4 GHz / 3748 Cray Inc Los Alamos National Laboratory Los Alamos National Laboratory United States/2002 ASCI Q ASCI Q - AlphaServer SC45, 1.25 GHz / 8192 Hewlett-Packard ASCI Q - AlphaServer SC45, 1.25 GHz Lawrence Livermore National Laboratory Lawrence Livermore National Laboratory United States/2005 eServer pSeries p GHzeServer pSeries p GHz / 2048 IBM Virginia Tech Virginia Tech United States/2004 System X 1100 Dual 2.3 GHz Apple XServe/Mellanox Infiniband 4X/Cisco GigE / 2200 Self-made 1100 Dual 2.3 GHz Apple XServe/Mellanox Infiniband 4X/Cisco GigE Japan Atomic Energy Research Institute Japan Atomic Energy Research Institute Japan/2005 SGI Altix 3700 Bx2, 1.6 GHz, NUMALinkSGI Altix 3700 Bx2, 1.6 GHz, NUMALink / 2048 SGI