Introduction to HPC Workshop March 1st, 2016

Introduction George Garrett & The HPC Support Team Research Computing Services CUIT

Introduction HPC Basics

Introduction What is HPC?

Introduction What can you do with HPC?

Yeti: 2 head nodes, 167 execute nodes, 200 TB storage

Yeti

HP S6500 Chassis

HP SL230 Server

Yeti Configuration   1st Round     2nd Round
CPU                  E5-2650L      E5-2650v2
GPU                  Nvidia K20    Nvidia K40
Memory               64, 128, and 256 GB nodes
Infiniband           16            48
GPU                  4             5
Total Systems        101           66

Yeti Configuration   1st Round     2nd Round
CPU                  E5-2650L      E5-2650v2
Cores                8             8
Speed (GHz)
FLOPS

Job Scheduler
Manages the cluster
Decides when a job will run
Decides where a job will run
We use Torque/Moab
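A few scheduler-inspection commands that are generally available on Torque/Moab clusters (a sketch rather than Yeti-specific documentation; the job ID shown is hypothetical and options can vary by site):
$ showq                # Moab: list running, idle, and blocked jobs cluster-wide
$ showq -u $USER       # limit the listing to your own jobs
$ checkjob 739369      # detailed scheduler information for one job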

Job Queues
Jobs are submitted to a queue
Jobs sorted in priority order
Not a FIFO

Access Mac Instructions
1. Run terminal

Access Windows Instructions
1. Search for putty on Columbia home page
2. Select first result
3. Follow link to Putty download page
4. Download putty.exe
5. Run putty.exe

Access
Mac (Terminal): $ ssh UNI@yetisubmit.cc.columbia.edu
Windows (Putty): Host Name: yetisubmit.cc.columbia.edu

Work Directory
$ cd /vega/free/users/UNI
Replace "UNI" with your UNI:
$ cd /vega/free/users/hpc2108

Copy Workshop Files
Files are in /tmp/workshop
$ cp /tmp/workshop/* .
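To confirm the copy worked, list the directory; you should see the files used in the rest of this workshop (file names as they appear in the examples below):
$ ls
hellosubmit  interactive  mpihello.c  mpisubmit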

Editing
No single obvious choice for editor
vi – simple but difficult at first
emacs – powerful but complex
nano – simple but not really standard

nano
$ nano hellosubmit
"^" means "hold down control"
^a : go to beginning of line
^e : go to end of line
^k : delete line
^o : save file
^x : exit

hellosubmit
#!/bin/sh
# Directives
#PBS -N HelloWorld
#PBS -W group_list=yetifree
#PBS -l nodes=1:ppn=1,walltime=00:01:00,mem=20mb
#PBS -M UNI@columbia.edu
#PBS -m abe
#PBS -V
# Set output and error directories
#PBS -o localhost:/vega/free/users/UNI/
#PBS -e localhost:/vega/free/users/UNI/
# Print "Hello World"
echo "Hello World"
# Sleep for 20 seconds
sleep 20
# Print date and time
date

hellosubmit
The "#PBS -m abe" directive requests e-mail when the job aborts, begins, and ends. To turn off job e-mail, use "#PBS -m n" instead:
#!/bin/sh
# Directives
#PBS -N HelloWorld
#PBS -W group_list=yetifree
#PBS -l nodes=1:ppn=1,walltime=00:01:00,mem=20mb
#PBS -M UNI@columbia.edu
#PBS -m n
#PBS -V

qsub $ qsub hellosubmit

hellosubmit
$ qsub hellosubmit
moose.cc.columbia.edu
$

qstat
$ qsub hellosubmit
moose.cc.columbia.edu
$ qstat
Job ID   Name         User   Time Use   S   Queue
moo      HelloWorld   hpc               Q   batch0

hellosubmit
Once the job has finished, qstat no longer lists it:
$ qstat
qstat: Unknown Job Id Error moose.cc.columbia.edu

hellosubmit
$ ls -l
total 4
-rw hpc2108 yetifree 398 Oct 8 22:13 hellosubmit
-rw hpc2108 yetifree   0 Oct 8 22:44 HelloWorld.e739369
-rw hpc2108 yetifree  41 Oct 8 22:44 HelloWorld.o739369

hellosubmit
$ cat HelloWorld.o739369
Hello World
Thu Oct 9 12:44:05 EDT 2014

Any Questions?

Interactive
Most jobs run as "batch"
Can also run interactive jobs
Get a shell on an execute node
Useful for development, testing, troubleshooting

Interactive
$ cat interactive
qsub -I -W group_list=yetifree -l walltime=5:00,mem=100mb

Interactive
$ qsub -I -W group_list=yetifree -l walltime=5:00,mem=100mb
qsub: waiting for job moose.cc.columbia.edu to start

Interactive
qsub: job moose.cc.columbia.edu ready.
[ASCII-art banner]
You are in an interactive job.
Your walltime is 00:05:00
dallas$

Interactive
$ hostname
dallas.cc.columbia.edu

Interactive
$ exit
logout
qsub: job moose.cc.columbia.edu completed
$

GUI
Can run GUIs in interactive jobs
Need an X server on your local system
See user documentation for more information
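A minimal sketch of one common approach, assuming an X server (such as XQuartz on Mac or Xming on Windows) is already running on your machine; the exact method supported on Yeti is described in the user documentation:
$ ssh -X UNI@yetisubmit.cc.columbia.edu
$ qsub -I -X -W group_list=yetifree -l walltime=5:00,mem=100mb
$ xterm &
The -X option to ssh forwards X11 to your desktop; the -X option to qsub (available in recent Torque versions) carries that forwarding into the interactive job, and xterm is just a simple client for testing the display.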

User Documentation hpc.cc.columbia.edu Go to “HPC Support” Click on Yeti user documentation

Job Queues
Scheduler puts all jobs into a queue
Queue selected automatically
Queues have different settings

Batch Job Queues
Queue     Time Limit   Memory Limit   Max. User Run
Route     n/a
Batch 0   2 hours      8 GB           512
Batch 1   12 hours     8 GB           512
Batch 2   12 hours     16 GB          128
Batch 3   5 days       16 GB          64
Batch 4   3 days       None           8

Infiniband Job Queues
Queue        Time Limit   Memory Limit   Max. User Run
Infiniband   n/a
IB2          2 hours      None           10
IB12         12 hours     None           10
IB48         48 hours     None           10

GPU Job Queues
Queue    Time Limit   Memory Limit   Max. User Run
GPU      n/a
GPU 2    2 hours      None           4
GPU 12   12 hours     None           4
GPU 72   3 days       None           4

Other Job Queues
Queue             Time Limit   Memory Limit   Max. User Run
Interactive       4 hours      None           4
Special Request   Varies

qstat -q $ qstat -q server: moose.cc.columbia.edu Queue Memory CPU Time Walltime Node Run Que Lm State batch0 8gb -- 02:00: E R batch1 8gb -- 12:00: E R batch2 16gb -- 12:00: E R batch3 16gb :00: E R batch :00: E R interactive :00: E R interlong :00: E R route E R

from:
to:
date: Mon, Mar 2, 2015 at 10:38 PM
subject: PBS JOB moose.cc.columbia.edu

PBS Job Id: moose.cc.columbia.edu
Job Name: HelloWorld
Exec host: dallas.cc.columbia.edu/2
Execution terminated
Exit_status=0
resources_used.cput=00:00:02
resources_used.mem=8288kb
resources_used.vmem=304780kb
resources_used.walltime=00:02:02
Error_Path: localhost:/vega/free/users/hpc2108/HelloWorld.e739386
Output_Path: localhost:/vega/free/users/hpc2108/HelloWorld.o739386

MPI Message Passing Interface Allows applications to run across multiple computers

MPI Edit MPI submit file Compile sample program

MPI (mpisubmit)
#!/bin/sh
# Directives
#PBS -N MpiHello
#PBS -W group_list=yetifree
#PBS -l nodes=3:ppn=1,walltime=00:01:00,mem=20mb
#PBS -M UNI@columbia.edu
#PBS -m abe
#PBS -V
# Set output and error directories
#PBS -o localhost:/vega/free/users/UNI/
#PBS -e localhost:/vega/free/users/UNI/
# Run mpi program.
module load openmpi/1.6.5-no-ib
mpirun mpihello

MPI
$ module avail
$ module load openmpi/1.6.5-no-ib
$ module list
$ which mpicc
/usr/local/openmpi-1.6.5/bin/mpicc

MPI
$ mpicc -o mpihello mpihello.c
$ ls mpihello
mpihello
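Before submitting, the program can also be launched by hand inside an interactive job as a quick sanity check (a sketch; the node count matches the submit script above, and the output order is illustrative rather than guaranteed):
$ qsub -I -W group_list=yetifree -l nodes=3:ppn=1,walltime=5:00,mem=100mb
$ module load openmpi/1.6.5-no-ib
$ mpirun -np 3 ./mpihello
Hello from the master!
Hello from worker 1!
Hello from worker 2!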

MPI
$ qsub mpisubmit
moose.cc.columbia.edu

MPI $ qstat

MPI
$ cat MpiHello.o
Hello from worker 1!
Hello from the master!
Hello from worker 2!

MPI – mpihello.c
#include <stdio.h>
#include <mpi.h>

void master(void);
void worker(int rank);

int main(int argc, char *argv[])
{
    int rank;

    MPI_Init(&argc, &argv);

MPI – mpihello.c
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);

    if (rank == 0) {
        master();
    } else {
        worker(rank);
    }

    MPI_Finalize();

    return 0;
}

MPI – mpihello.c
void master(void)
{
    printf("Hello from the master!\n");
}

void worker(int rank)
{
    printf("Hello from worker %d!\n", rank);
}

Yeti Free Tier
E-mail your request to the HPC support team
Request must come from a faculty member or researcher

Questions? Any questions?

Workshop
Copy any files you wish to keep to your home directory
Please fill out feedback forms
Thanks!