1 Linux in the Computer Center at CERN Zeuthen 21.10.2002 Thorsten Kleinwort CERN-IT.

Slides:



Advertisements
Similar presentations
GridPP7 – June 30 – July 2, 2003 – Fabric monitoring– n° 1 Fabric monitoring for LCG-1 in the CERN Computer Center Jan van Eldik CERN-IT/FIO/SM 7 th GridPP.
Advertisements

Fabric Management at CERN BT July 16 th 2002 CERN.ch.
German Cancio – WP4 developments Partner Logo WP4-install plans WP6 meeting, Paris project conference
DataGrid is a project funded by the European Union 22 September 2003 – n° 1 EDG WP4 Fabric Management: Fabric Monitoring and Fault Tolerance
ASIS et le projet EU DataGrid (EDG) Germán Cancio IT/FIO.
12. March 2003Bernd Panzer-Steindel, CERN/IT1 LCG Fabric status
Status of Globus activities within INFN (update) Massimo Sgaravatto INFN Padova for the INFN Globus group
Automating Linux Installations at CERN G. Cancio, L. Cons, P. Defert, M. Olive, I. Reguero, C. Rossi IT/PDP, CERN presented by G. Cancio.
Chapter 14 Network Management Business Aspects Architectures Technology.
Linux Operations and Administration
1 TAC2000/ IP Telephony Lab Advanced Linux Administration Language: Offered in English Instructor: Dr. Quincy Wu (
Framework for Automated Builds Natalia Ratnikova CHEP’03.
WP4-install task report WP4 workshop Barcelona project conference 5/03 German Cancio.
Managing Mature White Box Clusters at CERN LCW: Practical Experience Tim Smith CERN/IT.
EDG LCFGng: concepts Fabric Management Tutorial - n° 2 LCFG (Local ConFiGuration system)  LCFG is originally developed by the.
October, Scientific Linux INFN/Trieste B.Gobbo – Compass R.Gomezel - T.Macorini - L.Strizzolo INFN - Trieste.
Yannick Patois – CVS and Autobuild tools at CCIN2P3 – hepix - October, n° 1 CVS setup at CC-IN2P3 and Datagrid edg- build tools CVS management,
03/27/2003CHEP20031 Remote Operation of a Monte Carlo Production Farm Using Globus Dirk Hufnagel, Teela Pulliam, Thomas Allmendinger, Klaus Honscheid (Ohio.
EDG WP4: installation task LSCCW/HEPiX hands-on, NIKHEF 5/03 German Cancio CERN IT/FIO
Ramiro Voicu December Design Considerations  Act as a true dynamic service and provide the necessary functionally to be used by any other services.
SMS 2003 Deployment and Managing Windows Security Rafal Otto Internet Services Group Department of Information Technology CERN 26 May 2016.
13 th May 2004LINUX, which LINUX?1 Presentation to the AB/CO Technical Committee – Linux as the Future Console O/S Alastair Bland, 13 th May 2004.
Nov 1, 2000Site report DESY1 DESY Site Report Wolfgang Friebel DESY Nov 1, 2000 HEPiX Fall
Partner Logo DataGRID WP4 - Fabric Management Status HEPiX 2002, Catania / IT, , Jan Iven Role and.
Olof Bärring – WP4 summary- 4/9/ n° 1 Partner Logo WP4 report Plans for testbed 2
Configuration Management with Cobbler and Puppet Kashif Mohammad University of Oxford.
May PEM status report. O.Bärring 1 PEM status report Large-Scale Cluster Computing Workshop FNAL, May Olof Bärring, CERN.
SLAC Site Report Chuck Boeheim Assistant Director, SLAC Computing Services.
F. Rademakers - CERN/EPLinux Certification - FOCUS Linux Certification Fons Rademakers.
PROOF Cluster Management in ALICE Jan Fiete Grosse-Oetringhaus, CERN PH/ALICE CAF / PROOF Workshop,
1 The new Fabric Management Tools in Production at CERN Thorsten Kleinwort for CERN IT/FIO HEPiX Autumn 2003 Triumf Vancouver Monday, October 20, 2003.
Quattor-for-Castor Jan van Eldik Sept 7, Outline Overview of CERN –Central bits CDB template structure SWREP –Local bits Updating profiles.
German Cancio – WP4 developments Partner Logo System Management: Node Configuration & Software Package Management
Large Farm 'Real Life Problems' and their Solutions Thorsten Kleinwort CERN IT/FIO HEPiX II/2004 BNL.
Deployment work at CERN: installation and configuration tasks WP4 workshop Barcelona project conference 5/03 German Cancio CERN IT/FIO.
20-May-2003HEPiX Amsterdam EDG Fabric Management on Solaris G. Cancio Melia, L. Cons, Ph. Defert, I. Reguero, J. Pelegrin, P. Poznanski, C. Ungil Presented.
G. Cancio, L. Cons, Ph. Defert - n°1 October 2002 Software Packages Management System for the EU DataGrid G. Cancio Melia, L. Cons, Ph. Defert. CERN/IT.
Lemon Monitoring Miroslav Siket, German Cancio, David Front, Maciej Stepniewski CERN-IT/FIO-FS LCG Operations Workshop Bologna, May 2005.
Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Usage of virtualization in gLite certification Andreas Unterkircher.
Installing, running, and maintaining large Linux Clusters at CERN Thorsten Kleinwort CERN-IT/FIO CHEP
May http://cern.ch/hep-proj-grid-fabric1 EU DataGrid WP4 Large-Scale Cluster Computing Workshop FNAL, May Olof Bärring, CERN.
Olof Bärring – WP4 summary- 4/9/ n° 1 Partner Logo WP4 report Plans for testbed 2 [Including slides prepared by Lex Holt.]
Cluster Configuration Update Including LSF Status Thorsten Kleinwort for CERN IT/PDP-IS HEPiX I/2001 LAL Orsay Tuesday, December 08, 2015.
23.March 2004Bernd Panzer-Steindel, CERN/IT1 LCG Workshop Computing Fabric.
German Cancio – WP4 developments Partner Logo WP4-install progress CERN, 19/6/2002 for WP4-install.
Maite Barroso - 10/05/01 - n° 1 WP4 PM9 Deliverable Presentation: Interim Installation System Configuration Management Prototype
ASIS + RPM: ASISwsmp German Cancio, Lionel Cons, Philippe Defert, Andras Nagy CERN/IT Presented by Alan Lovell.
Randy MelenApril 14, Stanford Linear Accelerator Center Site Report April 1999 Randy Melen SLAC Computing Services/Systems HPC Team Leader.
David Foster LCG Project 12-March-02 Fabric Automation The Challenge of LHC Scale Fabrics LHC Computing Grid Workshop David Foster 12 th March 2002.
CNAF Database Service Barbara Martelli CNAF-INFN Elisabetta Vilucchi CNAF-INFN Simone Dalla Fina INFN-Padua.
15-Feb-02Steve Traylen, RAL WP6 Test Bed Report1 RAL/UK WP6 Test Bed Report Steve Traylen, WP6 PPGRID/RAL, UK
Status of Globus activities Massimo Sgaravatto INFN Padova for the INFN Globus group
CERN 19/06/2002 Kickstart file generator Andrea Chierici (INFN-CNAF) Enrico Ferro (INFN-LNL) Marco Serra (INFN-Roma)
Quattor tutorial Introduction German Cancio, Rafael Garcia, Cal Loomis.
Claudio Grandi INFN Bologna Virtual Pools for Interactive Analysis and Software Development through an Integrated Cloud Environment Claudio Grandi (INFN.
Lemon Computer Monitoring at CERN Miroslav Siket, German Cancio, David Front, Maciej Stepniewski Presented by Harry Renshall CERN-IT/FIO-FS.
Managing Large Linux Farms at CERN OpenLab: Fabric Management Workshop Tim Smith CERN/IT.
CNAF - 24 September 2004 EGEE SA-1 SPACI Activity Italo Epicoco.
CERN IT Department CH-1211 Genève 23 Switzerland M.Schröder, Hepix Vancouver 2011 OCS Inventory at CERN Matthias Schröder (IT-OIS)
Virtualisation for NA49/NA61
Dag Toppe Larsen UiB/CERN CERN,
High Availability Linux (HA Linux)
Progress on NA61/NA49 software virtualisation Dag Toppe Larsen Wrocław
Monitoring and Fault Tolerance
Dag Toppe Larsen UiB/CERN CERN,
WP4-install status update
Virtualisation for NA49/NA61
Status and plans of central CERN Linux facilities
German Cancio CERN IT .quattro architecture German Cancio CERN IT.
Module 01 ETICS Overview ETICS Online Tutorials
Presentation transcript:

1 Linux in the Computer Center at CERN Zeuthen Thorsten Kleinwort CERN-IT

1 October 2015Thorsten Kleinwort IT/FIO/IS 2 Overview Linux at CERN: Past Present Future Linux at CERN: Some details: “Legacy stuff” Configuration Installation AOB Outlook

1 October 2015Thorsten Kleinwort IT/FIO/IS 3 (Pre-) Linux at CERN: Past Several private Clusters Few machines in each Cluster All types of hardware: HP, AIX, SGI, Sun,… Proprietary base installation (CD, tape…) OS independent post installation (SUE) OS independent software distribution (ASIS)

1 October 2015Thorsten Kleinwort IT/FIO/IS 4 Linux at CERN: present I Decommissioning of RISC hardware: AIX, HP, DEC, and SGI have all gone and are not supported in the Computer Center any more Focus on Linux (Intel) & Solaris (Sun) Linux for login and batch, as desktop machines, and for disk and tape servers Solaris for servers & cross check platform but no general login and batch any more Now: LXPLUS ~ 75 nodes, LXBATCH ~600 nodes

1 October 2015Thorsten Kleinwort IT/FIO/IS 5 Linux at CERN: present II Installation & Maintenance outsourced: Done by at company (Serco) Using our (old) tools Big problems describing the “Service” they have to provide Installation became incomprehensible Administration far from being automated For the new Linux RedHat 7, we took back the responsibilities

1 October 2015Thorsten Kleinwort IT/FIO/IS 6 Linux at CERN: present III Current installed version still RedHat 6.1 Certification for RedHat 7.3 is ongoing, we (CC) are ready We (CC) have a complete new, automated installation for Linux 7.3 We have started to use a configuration database (CCConfig) We have redone the monitoring

1 October 2015Thorsten Kleinwort IT/FIO/IS 7 Linux at CERN: Node Monitoring System Configuration System Installation System Fault Tolerance System

1 October 2015Thorsten Kleinwort IT/FIO/IS 8 Linux at CERN: future LCG: ~10000 Linux nodes Computing Grid EDG: Split up in Tasks: WP4, Fabric Mgmt: Installation Configuration Monitoring Fault Tolerance

1 October 2015Thorsten Kleinwort IT/FIO/IS 9 Linux: Details ASIS: Was a tool for a platform independent software distribution Now (on Linux): RPM based Uses now system data base SUE: Was a common tool for all platforms Still in use, but deficiencies are apparent Now only used for configuration Kickstart: The RedHat tool for automatic installation

1 October 2015Thorsten Kleinwort IT/FIO/IS 10 Linux: Details II “BIS” installation: Reflected Cluster & OS dependencies Was ‘junked’ for a better and cleaner installation Configuration: New Configuration interface, CCConfig() Conform with WP4 Configuration Task Already available now as a PERL module on the node: Can be used, e.g. within SUE to provide node information

1 October 2015Thorsten Kleinwort IT/FIO/IS 11 Linux Details: III Linux 7.3 in the Computer Center: Automatic generation of kickstart files Boot machines with netboot or floppy Install base installation with Kickstart Install CERN and CC stuff afterwards with RPM Configure with SUE, configuration from CCConfig SUE may be replaced by the WP4 installation tool Maintenance of the machine: rpmupdate and SUE run on demand, triggered by notification (No regular run) Monitoring done with a prototype of WP4 monitoring

1 October 2015Thorsten Kleinwort IT/FIO/IS 12 Linux Details: IV Next steps: Upgrade our whole farms (~700) and install some new arrivals (~300) until mid next year to RedHat 7.3 Make installation AFS independent Collaborate with WP4: Provide replacement for SUE Use Configuration Management for hardware and software database Enhance Monitoring (Correlation engine, …

1 October 2015Thorsten Kleinwort IT/FIO/IS 13 Linux Details: V The batch System LSF Some Security issues Handling of /etc/passwd, /etc/group Configuration

1 October 2015Thorsten Kleinwort IT/FIO/IS 14 The batch system LSF: Introduced in 1997, Version 3.2: Multicluster: several submission & execution cluster, due to the large number of Clusters Using fixed partitions per group/experiment Current Version 4.2: Back to one cluster (submission & execution) Using “fairshare”: Better utilisation But slow reconfiguration times: around 15 min Good cooperation with Platform (Canada, UK, Germany {Munich})

1 October 2015Thorsten Kleinwort IT/FIO/IS 15 Secure host information Problem: How to get a private key on a new installed host: Floppy boot: Put a key on the floppy Network boot: Trust your network (Bootp) Put a private/public key (gpg) on the host in an early stage of the installation Use this key to encrypt secure information: SSH host keys The header of /etc/passwd (contains crypted pwds) Framework to generate and manage keys and secure information

1 October 2015Thorsten Kleinwort IT/FIO/IS 16 User Account Management Problem: Big amount of data (~1MB), changes irregularly: /etc/passwd, /etc/group We keep this information local Using “client poll”, together with a notification mechanism, for updates

1 October 2015Thorsten Kleinwort IT/FIO/IS 17 ServerClient CCDB Create new files & Publish them Server HTTP (LDAP) passwd.users group.users accounts Get files & Put them in place Forced pull Cron Boot Notify clients Notify daemon Subscription Database Subscribe Subscription daemon LAN

1 October 2015Thorsten Kleinwort IT/FIO/IS 18 Configuration Problem: SUE does not have a configuration information interface Invented CCConfig(): High level API for getting host information In collaboration with WP4 configuration task plans: Use a HLDL (High Level Description Language) for describing a host Use a compiler to create XML Download the XML file into local cache Use low level API for config info and for CCConfig

1 October 2015Thorsten Kleinwort IT/FIO/IS 19 Configuration: plans

1 October 2015Thorsten Kleinwort IT/FIO/IS 20 Summary Due to increasing number of Linux hosts and out-of-time tools, we have redone our Linux installation We are a little bit ahead of the EDG project: Needed our own solutions But in collaboration with them We are now preparing for LHC (~10000) All tools have to be re-evaluated