Computer Systems Lab The University of Wisconsin - Madison Department of Computer Sciences Linux Clusters David Thompson

Slides:



Advertisements
Similar presentations
Monday 24 May 2004DAPNIA/Pierre-Francois Honore1 DAPNIA site report.
Advertisements

Setting up of condor scheduler on computing cluster Raman Sehgal NPD-BARC.
What You Will Learn Components of a computer’s system software The importance of an operating system Functions of an operating system Types of user interfaces.
Presented by: Yash Gurung, ICFAI UNIVERSITY.Sikkim BUILDING of 3 R'sCLUSTER PARALLEL COMPUTER.
Module 8: Concepts of a Network Load Balancing Cluster
Jefferson Lab Site Report Sandy Philpott Thomas Jefferson National Accelerator Facility Newport News, Virginia USA
1 Web Server Administration Chapter 3 Installing the Server.
Understanding Networks I. Objectives Compare client and network operating systems Learn about local area network technologies, including Ethernet, Token.
Automating Linux Installations at CERN G. Cancio, L. Cons, P. Defert, M. Olive, I. Reguero, C. Rossi IT/PDP, CERN presented by G. Cancio.
Introduction to Computer Administration System Administration
12/04/98HEPNT - Windows NT Days1 NT Cluster & MS Dfs Gunter Trowitzsch & DESY WindowsNT Group.
CLUSTER COMPUTING Prepared by: Kalpesh Sindha (ITSNS)
Fundamentals of Networking Discovery 1, Chapter 2 Operating Systems.
1 Web Server Administration Chapter 3 Installing the Server.
Stuart Cunningham - Computer Platforms COMPUTER PLATFORMS Network Operating Systems Week 9.
1 The Solaris Distributed Computing Solution The operating system is a set of programs that manages all computer operations and provides an interface between.
Module 3: Preparing for Cluster Service Installation.
Online Systems Status Review of requirements System configuration Current acquisitions Next steps... Upgrade Meeting 4-Sep-1997 Stu Fuess.
Version 4.0. Objectives Describe how networks impact our daily lives. Describe the role of data networking in the human network. Identify the key components.
27/04/05Sabah Salih Particle Physics Group The School of Physics and Astronomy The University of Manchester
Installing and Managing a Large Condor Pool Derek Wright Computer Sciences Department University of Wisconsin-Madison
CASPUR Site Report Andrei Maslennikov Sector Leader - Systems Catania, April 2001.
October, Scientific Linux INFN/Trieste B.Gobbo – Compass R.Gomezel - T.Macorini - L.Strizzolo INFN - Trieste.
การติดตั้งและทดสอบการทำคลัสเต อร์เสมือนบน Xen, ROCKS, และไท ยกริด Roll Implementation of Virtualization Clusters based on Xen, ROCKS, and ThaiGrid Roll.
The SLAC Cluster Chuck Boeheim Assistant Director, SLAC Computing Services.
Ohio Supercomputer Center Cluster Computing Overview Summer Institute for Advanced Computing August 22, 2000 Doug Johnson, OSC.
Paul Scherrer Institut 5232 Villigen PSI HEPIX_AMST / / BJ95 PAUL SCHERRER INSTITUT THE PAUL SCHERRER INSTITUTE Swiss Light Source (SLS) Particle accelerator.
Beowulf Cluster Jon Green Jay Hutchinson Scott Hussey Mentor: Hongchi Shi.
A study of introduction of the virtualization technology into operator consoles T.Ohata, M.Ishii / SPring-8 ICALEPCS 2005, October 10-14, 2005 Geneva,
Introduction to U.S. ATLAS Facilities Rich Baker Brookhaven National Lab.
November 2, 2000HEPiX/HEPNT FERMI SAN Effort Lisa Giacchetti Ray Pasetes GFS information contributed by Jim Annis.
Module 11: Implementing ISA Server 2004 Enterprise Edition.
Loosely Coupled Parallelism: Clusters. Context We have studied older archictures for loosely coupled parallelism, such as mesh’s, hypercubes etc, which.
20-22 September 1999 HPSS User Forum, Santa Fe CERN IT/PDP 1 History  Test system HPSS 3.2 installation in Oct 1997 IBM AIX machines with IBM 3590 drives.
6/26/01High Throughput Linux Clustering at Fermilab--S. Timm 1 High Throughput Linux Clustering at Fermilab Steven C. Timm--Fermilab.
Manchester HEP Desktop/ Laptop 30 Desktop running RH Laptop Windows XP & RH OS X Home server AFS using openafs 3 DB servers Kerberos 4 we will move.
The Roadmap to New Releases Derek Wright Computer Sciences Department University of Wisconsin-Madison
Jefferson Lab Site Report Sandy Philpott Thomas Jefferson National Accelerator Facility (formerly CEBAF - The Continuous Electron Beam Accelerator Facility)
CASPUR Site Report Andrei Maslennikov Lead - Systems Amsterdam, May 2003.
Cluster Software Overview
26/4/2001LAL Site Report - HEPix - LAL 2001 LAL Site Report HEPix – LAL Apr Michel Jouvin
RAL Site report John Gordon ITD October 1999
Cyber Security Review, April 23-24, 2002, 0 Operated by the Southeastern Universities Research Association for the U.S. Depart. Of Energy Thomas Jefferson.
Queensland University of Technology CRICOS No J VMware as implemented by the ITS department, QUT Scott Brewster 7 December 2006.
December 26, 2015 RHIC/USATLAS Grid Computing Facility Overview Dantong Yu Brookhaven National Lab.
CERN - IT Department CH-1211 Genève 23 Switzerland t High Availability Databases based on Oracle 10g RAC on Linux WLCG Tier2 Tutorials, CERN,
COMP381 by M. Hamdi 1 Clusters: Networks of WS/PC.
Randy MelenApril 14, Stanford Linear Accelerator Center Site Report April 1999 Randy Melen SLAC Computing Services/Systems HPC Team Leader.
RHIC/US ATLAS Tier 1 Computing Facility Site Report Christopher Hollowell Physics Department Brookhaven National Laboratory HEPiX Upton,
The 2001 Tier-1 prototype for LHCb-Italy Vincenzo Vagnoni Genève, November 2000.
15-Feb-02Steve Traylen, RAL WP6 Test Bed Report1 RAL/UK WP6 Test Bed Report Steve Traylen, WP6 PPGRID/RAL, UK
Windows NT at DESY Status report HEP NT 4 th -8 th October 1999 SLAC.
Batch Software at JLAB Ian Bird Jefferson Lab CHEP February, 2000.
R. Krempaska, October, 2013 Wir schaffen Wissen – heute für morgen Controls Security at PSI Current Status R. Krempaska, A. Bertrand, C. Higgs, R. Kapeller,
2: Operating Systems Networking for Home & Small Business.
CEG 2400 FALL 2012 Linux/UNIX Network Operating Systems.
Class Meeting 11 ITI-481 – UNIX ADMIN Chris Uriarte, Instructor ITI-481: Unix Administration Rutgers University Internet Institute Instructor: Chris Uriarte.
EGEE is a project funded by the European Union under contract IST Test di GPFS a Catania IV Workshop INFN Grid – Bari Ottobre
Managing Large Linux Farms at CERN OpenLab: Fabric Management Workshop Tim Smith CERN/IT.
Thousands of Linux Installations (and only one administrator) A Linux cluster client for the University of Manchester A V Le Blanc I T Services University.
Linux Systems Administration 101 National Computer Institute Sep
Network Attached Storage Overview
Guide to Linux Installation and Administration, 2e
CCNA Routing and Switching Routing and Switching Essentials v6.0
PC Farms & Central Data Recording
Chapter 10: Device Discovery, Management, and Maintenance
CCNA Routing and Switching Routing and Switching Essentials v6.0
Chapter 10: Device Discovery, Management, and Maintenance
NCSA Supercluster Administration
Web Server Administration
Presentation transcript:

Computer Systems Lab The University of Wisconsin - Madison Department of Computer Sciences Linux Clusters David Thompson \ ~thomas/madlug

Computer Systems Lab The University of Wisconsin Madison Department of Computer Sciences Overview The Computer Systems Lab (CSL) Clusters The condor/db cluster Scalable Linux Administration

Computer Systems Lab The University of Wisconsin Madison Department of Computer Sciences Overview The Computer Systems Lab (CSL) Clusters The condor/db cluster Scalable Linux Administration

Computer Systems Lab The University of Wisconsin Madison Department of Computer Sciences Computer Systems Lab Purpose Staff Resources

Computer Systems Lab The University of Wisconsin - Madison Department of Computer Sciences Purpose “To support the research and teaching missions of the Department of Computer Sciences”

Computer Systems Lab The University of Wisconsin Madison Department of Computer Sciences Staff 8 Full Time Part Time

Computer Systems Lab The University of Wisconsin Madison Department of Computer Sciences Responsibilities Networks –Gigabit, 100BaseT, ATM, FDDI –Cisco, Foundry routers –3com, HP, Cisco switches

Computer Systems Lab The University of Wisconsin Madison Department of Computer Sciences Responsibilities (cont.) Operating Systems –Solaris, Linux, Digital Unix, AIX, IRIX, NT Applications –compilers, dbs, simulators, , image processing....

Computer Systems Lab The University of Wisconsin Madison Department of Computer Sciences Responsibilities (cont.) 641 software packages installed –69 Gbytes –multiple version –each package installed for several architectures –several thousand builds

Computer Systems Lab The University of Wisconsin Madison Department of Computer Sciences Responsibilities - (cont.) Workstations –600 PCs (including cluster) –200 Sparcs –15 Alphas –others 5600 User home directories –69 Gbytes

Computer Systems Lab The University of Wisconsin Madison Department of Computer Sciences Responsibilities (more) AFS –1 Tbyte of ubiquitous file space –14 File Servers, 3 db Servers –95% client cache hit rates Backups –2 week epoch cycle (1 Tb) –Daily incs

Computer Systems Lab The University of Wisconsin Madison Department of Computer Sciences Overview The Computer Systems Lab (CSL) Clusters The condor/db cluster Scalable Linux Administration

Computer Systems Lab The University of Wisconsin Madison Department of Computer Sciences Clusters Definitions Architectures Example Applications

Computer Systems Lab The University of Wisconsin Madison Department of Computer Sciences Definitions NOW - Network of workstations COW - Cluster of workstations –“Some degree of network isolation” –“Dedicated function”

Computer Systems Lab The University of Wisconsin Madison Department of Computer Sciences Architectures N-dimensional arrays –“previous & next” neighbor –hypercube Simple Network

Computer Systems Lab The University of Wisconsin Madison Department of Computer Sciences Architectures Distributed –MPI –PVM –condor

Computer Systems Lab The University of Wisconsin Madison Department of Computer Sciences Examples The Hive

Computer Systems Lab The University of Wisconsin Madison Department of Computer Sciences Examples - The Hive

Computer Systems Lab The University of Wisconsin Madison Department of Computer Sciences Examples - The Hive (cont.)

Computer Systems Lab The University of Wisconsin Madison Department of Computer Sciences Redundant Networks

Computer Systems Lab The University of Wisconsin Madison Department of Computer Sciences

Computer Systems Lab The University of Wisconsin Madison Department of Computer Sciences Cluster Applications Image Analysis – tilton.html Parallel Virtual File System (PVFS) – Speech Recognition –

Computer Systems Lab The University of Wisconsin Madison Department of Computer Sciences Cluster Applications (cont.) Physics –Viscoelasticity –Seismology –Big Bang html

Computer Systems Lab The University of Wisconsin Madison Department of Computer Sciences Cluster Applications (cont.) Physics (cont.) –Laser Interferometer Gravitational-Wave Observatory (LIGO) –NA49 (??) –Large Acceptance Hadron Detector for an Investigation of Pb-induced Reactions at the CERN SPS

Computer Systems Lab The University of Wisconsin Madison Department of Computer Sciences Overview The Computer Systems Lab (CSL) Clusters The condor/db cluster Scalable Linux Administration

Computer Systems Lab The University of Wisconsin Madison Department of Computer Sciences Computer Science Cluster Two connected clusters –Dual Xeon 550mHz, 512k cache, 1 Gig RAM, Ultra 2 SCSI 9 Gig boot disk, tulip network –64 node compute cluster –36 node db cluster with 4 extra 9 Gig disks and GNIC-II Gigabit ethernet –Red Hat Linux 6.1, kernel

Computer Systems Lab The University of Wisconsin Madison Department of Computer Sciences Cluster Architecture

Computer Systems Lab The University of Wisconsin Madison Department of Computer Sciences Cluster Picture

Computer Systems Lab The University of Wisconsin Madison Department of Computer Sciences Overview The Computer Systems Lab (CSL) Clusters The condor/db cluster Scalable Linux Administration

Computer Systems Lab The University of Wisconsin Madison Department of Computer Sciences Scalable Linux Administration What Why Installation Maintenance

Computer Systems Lab The University of Wisconsin Madison Department of Computer Sciences Scalable Admin - What Leverage Control systems Remote monitoring Operating system upgrades Centralized Services –kerberos, afs, logging

Computer Systems Lab The University of Wisconsin Madison Department of Computer Sciences Scalable Admin - Why Consistent user view –Available applications –Stability Predictable Admin Environment Security

Computer Systems Lab The University of Wisconsin Madison Department of Computer Sciences Scalable Admin - Installation Red Hat Kickstart –Configuration file network config, nfs locations, disk layout, RPMs to install –Boot disk, nfs, or bootp/dhcp –Post-install script –redhat-6.1/i386/doc/HOWTO/KickStart-HOWTO

Computer Systems Lab The University of Wisconsin Madison Department of Computer Sciences Sample Kickstart Script # $Id: ks.cfg,v /10/07 18:57:24 thomas Exp $ lang en_US network --bootproto bootp nfs --server pinstall.cs.wisc.edu --dir /install/redhat- 6.0/i386 keyboard us zerombr yes clearpart --all part / --size 100 #part /tmp --size 300 part /var --size 75 part /usr --size 570 part swap --size 127 part /var/vice/cache --size 120 part /local --size 2 --grow --maxsize 4000

Computer Systems Lab The University of Wisconsin Madison Department of Computer Sciences Scalable Admin - Maintenance Update RPMS –Create list of RPMs, versions, and files to install –Each computer updates based on list Special files –package (afs) –cfengine (gnu) –config files (filedist)

Computer Systems Lab The University of Wisconsin - Madison Department of Computer Sciences Linux Clusters David Thompson