Presentation is loading. Please wait.

Presentation is loading. Please wait.

Running clusters on a Shoestring US Lattice QCD Fermilab SC 2007.

Similar presentations


Presentation on theme: "Running clusters on a Shoestring US Lattice QCD Fermilab SC 2007."— Presentation transcript:

1 Running clusters on a Shoestring US Lattice QCD Fermilab SC 2007

2 Poor man’s cluster tool-kit rgang – run commands on worker nodes IPMI – Intelligent Platform Management PXE boot – cluster installation

3 rgang rgang allows one to execute commands on or distribute files to many nodes. rgang forks separate rsh or ssh children, which execute in parallel. After successfully waiting on returns from each child or after timing out, rgang displays the sorted node responses. To allow scaling to kilo clusters, rgang can utilize a tree-structure, via an nway switch. When so invoked, rgang uses rsh or ssh to spawn copies of itself on multiple nodes. These copies in turn spawn additional copies.

4 rgang rgang took 78 seconds to transfer a 50MB file from the head-node to 600 worker nodes using nway=2 option. head-node 50MB file nway = 2

5 IPMI IPMI is an open standard for monitoring, logging, recovery, inventory, and control of hardware that is implemented independent of the main CPU, BIOS, and OS. The Baseboard Management Controller (BMC) is the brain behind platform management and its primary purpose is to handle the autonomous sensor monitoring and event logging features. ipmitool provides a simple command-line in- band and out-of-band interface to the BMC.

6 IPMI OS Running OS not loaded, OS booting, OS unresponsive In-band Server Management Out-of-band Server Management BMC Ethernet Local interface SSIF, KCS, SMBus Power cycle/on/off CPU temperature System temperature CPU/Chassis Fan speed Sensor Event Logs We power on, off or reset a thousand worker nodes in parallel in seconds.

7 PXE boot PXE booting is one of the many ways you can boot a computer entirely from the network without any form of storage on the destination computer (RAM aside). PXE booting allows us to mass install an OS on the worker nodes. The worker node OS images are stored on the local hard disk on a PXE-TFTP-BOOTP server.

8 PXE boot head-node root image boot image DOS image Node IP Response PXE-BOOTP Request TFTP Request tomsrtbt and kernel Response clusterInstallScript partitions local disk, remote copies images and unzips them into disk partitions, configures network and reboots the node. It takes 14 minutes to install 80 AMD Opteron 2.4 GHz nodes, using an Intel Xeon 3.2 GHz PXE-BOOTP-TFTP server, over a fast Ethernet network.

9


Download ppt "Running clusters on a Shoestring US Lattice QCD Fermilab SC 2007."

Similar presentations


Ads by Google