Quattor-for-Castor Jan van Eldik Sept 7, 2005

Quattor-for-Castor Jan van Eldik Sept 7, 2005

2 Outline
Overview of CERN
–Central bits
    CDB template structure
    SWREP
–Local bits
    Updating profiles
    SPMA
    NCM
Exercises
–Browsing CDB templates
–Changing RPMs with the SPMA
–Updating Lemon monitoring configuration
Later: giving root access, adding new RPMs to the repository, configuring castor

3 Disclaimer(s)
This is very incomplete!
Focus is on “standard operations”
I am Just Another Quattor user
Tip of the day: Monkey see, monkey do

4 CDB configuration database
“Global schema” to describe a node
Written in the homegrown PAN language (declarative and procedural bits)
Host templates profile_<hostname>.tpl, including more general templates pro_*.tpl
Compiled into XML files
Every node has a local copy of its configuration information
Very node-centric (by design)

5 CDB - 2
CDB updates can be (very) slow…
Most information is also available in CDBSQL
–i.e. in an Oracle database
–asynchronous updates, can be very slow too…

6 Example: profile_tpsrv901.tpl
From node specific to site specific
Overwriting of certain values
If-then-else, functions
Include-files make it hard to find where information comes from
–Software packages
–Configuration info
–Hardware description
–Administrative information
–serial console, info derived from landb, …

7 Modify templates with cdbop
Useful commands:
–help
–list profile_tpsrv*
–get profile_tpsrv901.tpl pro_system_tapeserver.tpl
–!sh    # drops you to a shell
–!emacs profile_tpsrv901.tpl pro_system_tapeserver.tpl
–update *.tpl
–commit
Versioning available
Run on lxplus, NICE authentication

8 Software repository SWREP
If you want to provide new RPMs…
    swrep-client put i386_slc3 myfile.rpm /cern/cc
Separate repositories
–per architecture: {i386,ia64,x86_64}_slc3
–per OS: i386_{slc3,rhes3}
On lxplus, uses ssh authentication
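The repository naming above can be sketched as a small helper. This is only an illustration: `swrep_put_cmd` and `KNOWN_PLATFORMS` are hypothetical names, the platform list is just what the slide's brace notation expands to, and the last argument (`/cern/cc` on the slide) is passed through unchanged. The helper prints the `swrep-client put` command rather than running it.

```shell
#!/bin/sh
# Platforms named on the slide:
#   {i386,ia64,x86_64}_slc3 and i386_{slc3,rhes3}
KNOWN_PLATFORMS="i386_slc3 ia64_slc3 x86_64_slc3 i386_rhes3"

# Hypothetical helper: print the upload command for a known platform,
# or complain on stderr and return 1 for anything else.
swrep_put_cmd() {
    platform="$1"; rpm_file="$2"; dest="$3"
    for p in $KNOWN_PLATFORMS; do
        if [ "$p" = "$platform" ]; then
            echo "swrep-client put $platform $rpm_file $dest"
            return 0
        fi
    done
    echo "unknown platform: $platform" >&2
    return 1
}

# The example from the slide
swrep_put_cmd i386_slc3 myfile.rpm /cern/cc
```

Checking the platform name first catches typos before anything is uploaded; the printed command can then be run by hand on lxplus.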

9 On the nodes…
Synchronization between CDB and local profile is crucial!!!
But automagic:
–Hosts are notified of profile changes
–Hourly cron job, just in case…
–You can run /usr/bin/ccm-fetch by hand
List local cache, as root:
    ncm-query --dump / | less
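A minimal sketch of the "fetch by hand, then look at the cache" sequence. The `run` and `refresh_and_dump` helpers and the `DRY_RUN` variable are hypothetical; only `/usr/bin/ccm-fetch` and `ncm-query --dump` come from the slide. By default the script only prints what it would do, so it is safe to try anywhere.

```shell
#!/bin/sh
# DRY_RUN=1 (the default here) prints commands instead of running them.
DRY_RUN="${DRY_RUN:-1}"

# Hypothetical helper: run a command, or just show it in dry-run mode.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "would run: $*"
    else
        "$@"
    fi
}

# Fetch the latest profile, then dump the given subtree of the local cache.
# The path defaults to "/" as on the slide.
refresh_and_dump() {
    path="${1:-/}"
    run /usr/bin/ccm-fetch && run ncm-query --dump "$path"
}

refresh_and_dump /software/packages
```

On a real node, run it as root with `DRY_RUN=0` and pipe the result through `less`.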

10 Invoking SPMA to change RPM-set
PAN functions pkg_add(), pkg_del(), pkg_repl()
SPMA can be configured to not touch packages it does not know about
    "/software/components/spma/userpkgs" = "yes";
    "/software/components/spma/userprio" = "yes";
SPMA can be forcefully disabled
    echo 'disabling SPMA - JvE' > /etc/nospma
Run (as root): spma_wrapper.sh [--noaction]
    [INFO] The following package operations are required:
    replace - SINDES noarch with SINDES noarch
    install stk-ssi-devel cern i386
    [INFO] Please be patient... 2 operation(s) to verify/execute.
    [OK] SPMA finished successfully.
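Before running the wrapper it can help to check whether somebody has disabled SPMA on the node. A rough sketch, assuming only what the slide says, namely that the presence of /etc/nospma disables SPMA and that people write their reason into it. `spma_status` and the `NOSPMA_FILE` override are hypothetical names added for illustration.

```shell
#!/bin/sh
# Marker file from the slide; overridable so the check can be tried safely.
NOSPMA_FILE="${NOSPMA_FILE:-/etc/nospma}"

# Hypothetical helper: report whether the marker file exists.
# Returns 1 and shows the recorded reason if SPMA is disabled.
spma_status() {
    if [ -e "$NOSPMA_FILE" ]; then
        echo "SPMA disabled: $(cat "$NOSPMA_FILE")"
        return 1
    fi
    echo "SPMA enabled"
}

if spma_status; then
    echo "next step (as root): spma_wrapper.sh --noaction"
fi
```

Running with `--noaction` first shows the planned package operations without applying them.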

11 Configuring the node
NCM components configure services
–afs, sendmail, tapeserver, fmonagent, spma, …
All components on the node:
    ncm-ncd --list
Configure access control and grub:
    ncm-ncd --configure access_control grub
All-in-one:
    spma_ncm_wrapper.sh

12 Some hints ‘n tips…
Use wassh to run commands on multiple hosts
    wassh -s slc3 uptime
    wassh shutdown -r now
Quick check to see if Lemon is happy
    check-this-host
Disable alarm sending
    echo 'JvE did this' > /home/operator/nomorealarms
Use serial console
    connect2console.sh l3006d    # and pray :)

13 #1 – use cdbop
Log on to lxplus, start cdbop, get the node profile of a tapeserver of my choice
Log on as root to that tapeserver, and run
    ncm-query --dump /hardware | less
Locate the serial number of the machine in both sessions
Locate the serial numbers of the hard disks in both sessions

14 #2 – adding an RPM
Add to your tapeserver profile
    "/software/packages" = pkg_add("CERN-CC-PrepareInstall","2.3-0","noarch");
Commit the change
Use ncm-query --dump /software/packages | less and try to find the new package
Run spma_wrapper.sh to install the package
Run rpm -e CERN-CC-PrepareInstall to remove it
Re-run spma_wrapper.sh
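The two checks in this exercise (is the package in the profile, and is it installed?) can be sketched as follows. `in_profile` and `is_installed` are hypothetical helpers. The dump format is not assumed: the name may well appear in an escaped form in the dump, so the sketch searches for the hyphen-free fragment "PrepareInstall" rather than the full package name.

```shell
#!/bin/sh
PKG="CERN-CC-PrepareInstall"
FRAGMENT="PrepareInstall"   # hyphen-free part of the name (assumption, see above)

# Hypothetical helper: read a profile dump on stdin and succeed if the
# given fixed string occurs anywhere in it.
in_profile() {
    grep -q -F -- "$1"
}

# Ask the local RPM database whether the package is installed.
is_installed() {
    rpm -q "$1" >/dev/null 2>&1
}

if command -v ncm-query >/dev/null 2>&1; then
    if ncm-query --dump /software/packages | in_profile "$FRAGMENT"; then
        echo "$PKG found in the local profile"
    else
        echo "$PKG not found in the local profile"
    fi
    if is_installed "$PKG"; then
        echo "$PKG is installed"
    else
        echo "$PKG is not installed"
    fi
else
    echo "ncm-query not found; run this as root on a Quattor-managed node"
fi
```

Run it after each step of the exercise (after the commit, after spma_wrapper.sh, after rpm -e, after re-running spma_wrapper.sh) to see how the profile and the installed set relate.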

15 #3 – change Lemon alarm
Disable alarm sending on your tapeserver
Check the Sure alarms
Stop the atd daemon with service atd stop
Check the Sure alarms again
Locate the “atd” monitoring configuration, starting at the Lemon website
De-activate it in the tapeserver profile
    "/system/monitoring/….…./active" = false;
Reconfigure fmonagent
The Sure alarm should be gone…
Reactivate atd monitoring, start the daemon, re-enable alarm sending

16 Follow-up
Adding new RPMs to SWREP, and deploying them on a cluster
Giving (root) access