Grid'5000*
DAS-3 - Grid'5000 workshop, December 4th (04/12/06)
* 5000 CPUs
Pierre NEYRON - INRIA
Head of Grid'5000 Technical Committee
Outline
Grid'5000 hardware: Overview, Hardware summary
Grid'5000 Network: Backbone network, Security architecture, Routing tables
Grid'5000 Software stack: Accounts management, Grid tools
Grid'5000 People: Committees, Technical staff, Collaborative work
Grid'5000 platform overview
Nation-wide grid: 9 sites, 500 CPUs/site (1000 for Orsay)
Currently: 2600 CPUs
Interlink network: 10 Gb/s dark fibre
[Map of sites; per-site labels, read as target CPUs (currently installed): 1000 (684), 518 (518), 500 (198), 500 (116), 500 (252), 500 (270), 500 (322), 500 (146), 500 (94)]
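The per-site labels on the overview slide can be cross-checked against the stated totals. The sketch below assumes each label is a pair of target CPUs and currently installed CPUs, which is a reading of the slide rather than something it states. Site names are left out because the slide text does not pair them with the numbers, and `SITE_FIGURES` and `totals` are names chosen for this illustration.

```python
# Per-site labels from the overview slide, read as
# (target CPUs, CPUs currently installed). This pairing is an assumption.
SITE_FIGURES = [
    (1000, 684), (518, 518), (500, 198), (500, 116), (500, 252),
    (500, 270), (500, 322), (500, 146), (500, 94),
]


def totals(figures):
    """Return (total target CPUs, total installed CPUs)."""
    target = sum(t for t, _ in figures)
    installed = sum(i for _, i in figures)
    return target, installed


target, installed = totals(SITE_FIGURES)
print(f"sites: {len(SITE_FIGURES)}")
print(f"target CPUs: {target}")
print(f"installed CPUs: {installed}")
```

Under that reading, the installed figures add up to the 2600 CPUs quoted on the slide, and the targets add up to 5018, in line with the "5000 CPUs" of the project name.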
[Map of Grid'5000 sites; labels shown: Grenoble, Rennes, Lyon, Toulouse, Sophia, Orsay, Bordeaux]
Grid'5000 platform hardware
Grid'5000 hardware summary
Manufacturers: HP, Dell, IBM, Sun, Apple
CPU types: AMD Opteron [80%], Intel Xeon64, Intel Itanium2, Intel Xeon32, PowerPC G5
Network: Gigabit Ethernet, Myrinet, Infiniband
Grid'5000 backbone network
Grid'5000 dedicated backbone network
Renater4: sites connected to a single layer-2 VLAN (Renater4 VLAN: /24)
Dark fibre, dedicated lambda, 10 Gbps. Fully isolated traffic!
1 Gbps through Renater3 (MPLS)
DAS-3 via Geant and Surfnet
[Network diagram; RENATER nodes shown: Rennes, Lille, Nancy, Bordeaux, Paris, Lyon, Grenoble, Sophia, Toulouse, Orsay]
Grid'5000 security architecture
The Grid'5000 network is isolated from the Renater general-purpose network. Access is granted through secure gateways.
2 fibres: 1 dedicated to Grid'5000
[Architecture diagram; labels shown: Internet, Renater general-purpose national network backbone, Grid'5000 dedicated isolated network backbone, regional nodes, lab network, lab's router, workstation, Grid'5000 router, site's Grid'5000 access machine (secure access, SSH), Grid'5000 sites, site servers (DNS, LDAP, NFS, DHCP/PXE, Console, Reboot), cluster nodes]
Grid'5000 routing tables
Site-to-site routing tables:
Bordeaux: /24 ( )
Grenoble: (idpot) /24 ( ), (icluster2) /24 ( ), (icare) /24 ( )
Lille: /24 ( )
Lyon: /17 ( )
Nancy: /22 ( )
Orsay: /16 ( )
Rennes: /23 ( )
Sophia: /22 ( )
Toulouse: /24 ( )
On-site local networks, not routed: several private networks are used but not routed from site to site (admin network, IPMI/RSA network, Myrinet/Infiniband network...)
Routing for experiments using virtualization: address space used for virtual nodes (work in progress)
=> DAS-3 connection: need to avoid IP conflicts
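The "avoid IP conflicts" point comes down to checking that no prefix routed on one grid overlaps a prefix routed on the other. A minimal sketch using Python's standard `ipaddress` module follows. The prefixes in `example` are hypothetical placeholders, not the real Grid'5000 or DAS-3 address plans, and `find_conflicts` is a name chosen for this illustration.

```python
import ipaddress
from itertools import combinations


def find_conflicts(prefixes):
    """Return pairs of named prefixes whose address ranges overlap."""
    nets = {name: ipaddress.ip_network(cidr) for name, cidr in prefixes.items()}
    return [
        (a, b)
        for (a, net_a), (b, net_b) in combinations(nets.items(), 2)
        if net_a.overlaps(net_b)
    ]


# Hypothetical prefixes, for illustration only.
example = {
    "site-a": "10.1.0.0/24",
    "site-b": "10.2.0.0/22",
    "remote-grid": "10.2.2.0/24",
}
print(find_conflicts(example))  # [('site-b', 'remote-grid')]
```

Here `10.2.2.0/24` falls inside `10.2.0.0/22`, so the pair is reported. The same check would also apply to the address space reserved for virtual nodes.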
Grid'5000 account management
Unique user accounts on Grid'5000
A single account for every user on every site:
Every site runs an LDAP server
Every site manages its own LDAP branch of user accounts
Every site replicates the LDAP branches of the other sites
Home directory structure is created automatically
All sites share the same global Grid'5000 account directory
[Diagram labels: Grid'5000 sites (SSH), cluster, home directories /home/ /, LDAP accounts (site local branch + replicated remote branches), remote accounts replication + home directory creation & on-demand synchronization]
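The branch-per-site scheme can be illustrated with a toy in-memory model. This is not LDAP code and not the actual Grid'5000 tooling. The `Site` class, its methods, and the site and user names are hypothetical, and home directories are tracked only as a set of logins because the slide does not give the directory layout.

```python
class Site:
    """Toy model of one site's directory: its own branch plus replicas."""

    def __init__(self, name):
        self.name = name
        self.branches = {name: set()}  # branch name -> user logins
        self.homes = set()             # logins with a local home directory

    def add_user(self, login):
        # A site only ever writes to its own branch.
        self.branches[self.name].add(login)
        self.homes.add(login)

    def replicate_from(self, other):
        # Copy the other site's own branch as a replica, then create
        # the home directories for the accounts it contains.
        replica = set(other.branches[other.name])
        self.branches[other.name] = replica
        self.homes.update(replica)

    def known_users(self):
        return set().union(*self.branches.values())


# Hypothetical sites and users.
sites = [Site(n) for n in ("site-a", "site-b", "site-c")]
sites[0].add_user("alice")
sites[1].add_user("bob")

for site in sites:
    for other in sites:
        if other is not site:
            site.replicate_from(other)

for site in sites:
    print(site.name, sorted(site.known_users()))
```

After one round of replication every site knows both accounts, while each account is still owned by the single branch it was created in.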
Grid'5000 grid tools
Resource management & deployment
Every cluster is managed by an OAR server
On user demand, any cluster can be redeployed with a user-provided environment
OAR Grid allows job submission across the whole grid
[Diagram labels: Grid'5000 sites (SSH), cluster, homes, LDAP, OAR, Kadeploy, OAR Grid]
Grid'5000 committees
Steering Committee (CP): Pr. Franck Cappello, Pr. Thierry Priol, Pr. Brigitte Plateau et al.
Technical Committee (CT): 20 engineers + researchers, about 50 people
Users: more than 200 active users, located on Grid'5000 sites, nearby, or abroad (Netherlands, Brazil, Africa, Japan...)
Grid'5000 technical staff
Grid'5000 technical staff are spread across all 9 sites, France-wide.
National: coordination + development (2 engineers)
Sites: 1 or 2 engineers per site for administration
People are hosted in INRIA research labs or universities.
Bordeaux - Grenoble - Lille - Lyon - Nancy - Orsay (Paris) - Rennes - Sophia-Antipolis (Nice) - Toulouse
Grid'5000 as a computer grid, and as a grid of people
Grid'5000 collaborative work
Communication means:
Mailing lists (CP, CT, users, site-users, site-staff, network-staff)
Instant messaging (Grid'5000 dedicated Jabber server)
Phone + audio-conf (monthly CT audio-meeting + task-dedicated meetings...)
Physical meeting every 4 months
Tools aiding collaborative work:
Wiki: public web site + user's portal + committees portal
Bugzilla: ticket tracking, task assignment
SVN repository / GForge
Incident tracking tools / cross-admin logs / monitoring
Questions?