Quattor Usage at Nikhef

Slides:



Advertisements
Similar presentations
Geoff Quigley, Stephen Childs and Brian Coghlan Trinity College Dublin
Advertisements

S.Chechelnitskiy / SFU Simon Fraser Running CE and SE in a XEN virtualized environment S.Chechelnitskiy Simon Fraser University CHEP 2007 September 6 th.
S. Gadomski, "ATLAS computing in Geneva", journee de reflexion, 14 Sept ATLAS computing in Geneva Szymon Gadomski description of the hardware the.
WP4-install task report WP4 workshop Barcelona project conference 5/03 German Cancio.
EGEE is a project funded by the European Union under contract IST Quattor Installation of Grid Software C. Loomis (LAL-Orsay) GDB (CERN) Sept.
October, Scientific Linux INFN/Trieste B.Gobbo – Compass R.Gomezel - T.Macorini - L.Strizzolo INFN - Trieste.
PROOF Cluster Management in ALICE Jan Fiete Grosse-Oetringhaus, CERN PH/ALICE CAF / PROOF Workshop,
1 The new Fabric Management Tools in Production at CERN Thorsten Kleinwort for CERN IT/FIO HEPiX Autumn 2003 Triumf Vancouver Monday, October 20, 2003.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Stephen Childs Trinity College Dublin &
INFSO-RI Enabling Grids for E-sciencE SA1 and gLite: Test, Certification and Pre-production Nick Thackray SA1, CERN.
Large Farm 'Real Life Problems' and their Solutions Thorsten Kleinwort CERN IT/FIO HEPiX II/2004 BNL.
20-May-2003HEPiX Amsterdam EDG Fabric Management on Solaris G. Cancio Melia, L. Cons, Ph. Defert, I. Reguero, J. Pelegrin, P. Poznanski, C. Ungil Presented.
INFSO-RI Enabling Grids for E-sciencE SCDB C. Loomis / Michel Jouvin (LAL-Orsay) Quattor Tutorial LCG T2 Workshop June 16, 2006.
The Scaling and Validation Programme PoC David Groep & vle-pfour-team VL-e Workshop NIKHEF SARA LogicaCMG IBM.
An Agile Service Deployment Framework and its Application Quattor System Management Tool and HyperV Virtualisation applied to CASTOR Hierarchical Storage.
Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Usage of virtualization in gLite certification Andreas Unterkircher.
Installing, running, and maintaining large Linux Clusters at CERN Thorsten Kleinwort CERN-IT/FIO CHEP
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Stuart Kenny and Stephen Childs Trinity.
Software Management with Quattor German Cancio CERN/IT.
Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Tools and techniques for managing virtual machine images Andreas.
Maite Barroso - 10/05/01 - n° 1 WP4 PM9 Deliverable Presentation: Interim Installation System Configuration Management Prototype
Tier3 monitoring. Initial issues. Danila Oleynik. Artem Petrosyan. JINR.
Final Implementation of a High Performance Computing Cluster at Florida Tech P. FORD, X. FAVE, K. GNANVO, R. HOCH, M. HOHLMANN, D. MITRA Physics and Space.
23 January 2007WLCG workshop, CERN System Management Working Group Alessandra Forti WLCG workshop CERN, 23 January 2007.
Linux Configuration using April 12 th 2010 L. Brarda / CERN (some slides & pictures taken from the Quattor website) ‏
1 Update at RAL and in the Quattor community Ian Collier - RAL Tier1 HEPiX FAll 2010, Cornell.
Evangelos Markatos and Charalampos Gkikas FORTH-ICS Athens, th Mar Institute of Computer Science - FORTH Christos.
INRNE's participation in LCG Elena Puncheva Preslav Konstantinov IT Department.
II EGEE conference Den Haag November, ROC-CIC status in Italy
Performance analysis comparison Andrea Chierici Virtualization tutorial Catania 1-3 dicember 2010.
Overview of cluster management tools Marco Mambelli – August OSG Summer Workshop TTU - Lubbock, TX THE UNIVERSITY OF CHICAGO.
INFSO-RI Enabling Grids for E-sciencE Quattor Workshop Summary C. Loomis (LAL-Orsay) GDB Meeting (Rome) April 5, 2006.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks CYFRONET site report Marcin Radecki CYFRONET.
SCDB Update Michel Jouvin LAL, Orsay March 17, 2010 Quattor Workshop, Thessaloniki.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Towards an Information System Product Team.
Status of the NL-T1. BiG Grid – the dutch e-science grid Realising an operational ICT infrastructure at the national level for scientific research (e.g.
A comparison between xen and kvm Andrea Chierici Riccardo Veraldi INFN-CNAF CCR 2009.
CNAF - 24 September 2004 EGEE SA-1 SPACI Activity Italo Epicoco.
CVMFS Alessandro De Salvo Outline  CVMFS architecture  CVMFS usage in the.
Quattor installation and use feedback from CNAF/T1 LCG Operation Workshop 25 may 2005 Andrea Chierici – INFN CNAF
Quattor: An administration toolkit for optimizing resources Marco Emilio Poleggi - CERN/INFN-CNAF German Cancio - CERN
1 Policy Based Systems Management with Puppet Sean Dague
INFN-T1 migration to scdb Andrea Chierici 8 th Quattor Workshop Bruxelles.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Nagios Grid Monitor E. Imamagic, SRCE OAT.
Cumulus - dynamic cluster available under Clusterix
LCG/EGEE Installation J. A. Templon Undecided (NIKHEF)
AII v2 Ronald Starink Luis Fernando Muñoz Mejías
IBCP - CNRS STATUS Christelle Eloto Lyon - France
Virtualisation for NA49/NA61
UAM status report Luis Fernando Muñoz Mejías
Dag Toppe Larsen UiB/CERN CERN,
Dag Toppe Larsen UiB/CERN CERN,
Status of Fabric Management at CERN
Quattor in Amazon Cloud
Database Services at CERN Status Update
Moving from CREAM CE to ARC CE
Virtualisation for NA49/NA61
German Cancio CERN IT .quattro architecture German Cancio CERN IT.
NIKHEF Data Processing Fclty
Future Test Activities SA3 All Hands Meeting Dublin
Quality Control in the dCache team.
Leanne Guy EGEE JRA1 Test Team Manager
PES Lessons learned from large scale LSF scalability tests
Spacewalk and Koji at Fermilab
Virtualization in the gLite Grid Middleware software process
Quattor : Installation and Configuration Management
SUSE Linux Enterprise Desktop Administration
Quattor Advanced Tutorial, LAL
Grid Management Challenge - M. Jouvin
A Virtual Machine Monitor for Utilizing Non-dedicated Clusters
Presentation transcript:

Quattor Usage at Nikhef Ronald Starink QWG workshop Bologna – March 2008

NIKHEF grid Projects: EGEE grid Tier-1 for LHC (with SARA) National projects (VL-e, BIG GRID) Sites: NIKHEF-ELPROD: ~150 hosts (~400 cores) Including main (LCG) services April 2008: 250-300 hosts?! Installation Test Bed: ~15 nodes Similar setup as production Staf: 2.5 FTE (4 people) / 1.25 (2) Quattor- aware/friendly 0.5 FTE hardware support

Quattor Usage Install with Quattor: Nearly all grid machines (CentOS 3, 4, 5 i386) Not: core server (LDAP, NFS user homes), Quattor server Generic x86-64 servers (trivial) Configure with Quattor (ncm-components): Basic Linux services Grid Middleware using Yaim via ncm-yaim Local modifications to standard Yaim Requires frequent patching Torque + Maui

Templates Still using home-cooked namespace layout :-( Lacking time to investigate the required changes Intention to switch to QWG Benefit from & contribute to community effort ... interest from another site Namespace organization: 3 facilities (clusters): PRD, ITB, test Will this scale? Straightforward implementation Xen guests

Monitoring Result of ncm-ncd /var/log/ncm/ncd.log Result of last ncm-ncd run Time stamp of last run Check ncm-cdispd still running Ganglia: overview (“did all nodes execute ncm- ncd?”) Nagios: notification when non-zero exit Is ncm-cdispd running (NRPE)? How? Install rpm(s) at Quattor clients Nagios server: (still) manual configuration Ganglia server: nothing to do!

Tools Pan compiler v7 AII version 2 See dedicated talk on upgrade SPMA “SCDB--”: SCDB-based (compile, deploy, update repos) no Subversion shell script hiding ant calls: makexprof -f prd pushxprof -f itb tbn14 tbn16 some patches to build.xml

Quattor Setup – 1 Scaling issues Currently one Quattor server: DHCP TFTP Software repositories NFS server for /osinstall and SCDB root Additional Quattor VM build host Compile profiles (faster because more CPUs Questions How to distribute repositories? How to distribute load for TFTP? Preferably no master-slave setup Already resolved at other sites?

Quattor Setup - 2

Summary Not many changes since last workshop Setup works pretty well Monitoring AII v2 Issues: Pan compiler performance: remains concern Scaling problems Occasionally reconfiguration does not occur Future: Closer SCDB integration QWG templates? Scaling...