1 CERN IT Department CH-1211 Genève 23 Switzerland www.cern.ch/i t Puppet in the CERN CC Tomas Karasek Steve Traylen Oct 24 2012.

Slides:



Advertisements
Similar presentations
CERN - IT Department CH-1211 Genève 23 Switzerland t SVN Pilot: CVS Replacement Manuel Guijarro Jonatan Hugo Hugosson Artur Wiecek David.
Advertisements

CERN IT Department CH-1211 Genève 23 Switzerland t Next generation of virtual infrastructure with Hyper-V Michal Kwiatek, Juraj Sucik, Rafal.
CERN IT Department CH-1211 Genève 23 Switzerland t Integrating Lemon Monitoring and Alarming System with the new CERN Agile Infrastructure.
CERN IT Department CH-1211 Genève 23 Switzerland t Some Hints for “Best Practice” Regarding VO Boxes Running Critical Services and Real Use-cases.
Version Control with git. Version Control Version control is a system that records changes to a file or set of files over time so that you can recall.
CERN Data Centre Evolution Gavin SDCD12: Supporting Science with Cloud Computing Bern 19 th November 2012.
Tools and software process for the FLP prototype B. von Haller 9. June 2015 CERN.
Status of WLCG Tier-0 Maite Barroso, CERN-IT With input from T0 service managers Grid Deployment Board 9 April Apr-2014 Maite Barroso Lopez (at)
AI project components: Facter and Hiera
CERN IT Department CH-1211 Genève 23 Switzerland t The CERN Agile Infrastructure Project: Configuration and Operations Tools Helge Meinhard.
Puppet with vSphere Workshop Install, configure and use Puppet on your laptop for vSphere DevOps Billy Lieberman August 1, 2015.
The Art and Zen of Managing Nagios with Puppet Michael Merideth - VictorOps
CERN IT Department CH-1211 Genève 23 Switzerland t ITIL at CERN Tony Cass HEPiX LBL, 29 th October 2009.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES PhEDEx Monitoring Nicolò Magini CERN IT-ES-VOS For the PhEDEx.
CERN IT Department CH-1211 Genève 23 Switzerland t Experiences running a production Puppet Ben Jones HEPiX Bologna Spring.
Configuration Management Evolution at CERN Gavin
CERN IT Department CH-1211 Geneva 23 Switzerland t Storageware Flavia Donno CERN WLCG Collaboration Workshop CERN, November 2008.
Configuration Management with Cobbler and Puppet Kashif Mohammad University of Oxford.
CERN IT Department CH-1211 Genève 23 Switzerland t Evolution of virtual infrastructure with Hyper-V Juraj Sucik, Slavomir Kubacka Internet.
CERN IT Department CH-1211 Geneva 23 Switzerland t Daniel Gomez Ruben Gaspar Ignacio Coterillo * Dawid Wojcik *CERN/CSIC funded by Spanish.
Continuous Integration and Code Review: how IT can help Alex Lossent – IT/PES – Version Control Systems 29-Sep st Forum1.
Platform & Engineering Services CERN IT Department CH-1211 Geneva 23 Switzerland t PES AI’s user access, OpenStack security groups and firewall.
Jose Castro Leon CERN – IT/OIS CERN Agile Infrastructure Infrastructure as a Service.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES GGUS Overview ROC_LA CERN
An Agile Service Deployment Framework and its Application Quattor System Management Tool and HyperV Virtualisation applied to CASTOR Hierarchical Storage.
Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Usage of virtualization in gLite certification Andreas Unterkircher.
CERN IT Department CH-1211 Genève 23 Switzerland t The Agile Infrastructure Project Part 1: Configuration Management Tim Bell Gavin McCance.
CERN IT Department CH-1211 Geneva 23 Switzerland t GDB CERN, 4 th March 2008 James Casey A Strategy for WLCG Monitoring.
Agile Infrastructure IaaS Compute Jan van Eldik CERN IT Department Status Update 6 July 2012.
NA61/NA49 virtualisation: status and plans Dag Toppe Larsen CERN
CERN IT Department CH-1211 Geneva 23 Switzerland t CF Computing Facilities Agile Infrastructure Monitoring CERN IT/CF.
CERN IT Department CH-1211 Genève 23 Switzerland t Load Testing Dennis Waldron, CERN IT/DM/DA CASTOR Face-to-Face Meeting, Feb 19 th 2009.
CERN IT Department CH-1211 Genève 23 Switzerland t Towards agile software development Marwan Khelif IT-CS-CT IT Technical Forum – 31th May.
CERN IT Department CH-1211 Genève 23 Switzerland PES 1 Ermis service for DNS Load Balancer configuration HEPiX Fall 2014 Aris Angelogiannopoulos,
CERN IT Department CH-1211 Genève 23 Switzerland t IT Configuration Activities Gavin McCance Online Cross-experiment Meeting, 14 June 2012.
Agile Infrastructure: an updated overview of IaaS at CERN
Configuration Report 12/02/2015
Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Tools and techniques for managing virtual machine images Andreas.
2012 Objectives for CernVM. PH/SFT Technical Group Meeting CernVM/Subprojects The R&D phase of the project has finished and we continue to work as part.
Tim Bell 04/07/2013 Intel Openlab Briefing2.
Computing Facilities CERN IT Department CH-1211 Geneva 23 Switzerland t CF Agile Infrastructure Monitoring HEPiX Spring th April.
Scaling the CERN OpenStack cloud Stefano Zilli On behalf of CERN Cloud Infrastructure Team 2.
Configuration Report 2/5/2016 Document reference2 21/05/2015 – 28/05/2015.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES Andrea Sciabà Hammercloud and Nagios Dan Van Der Ster Nicolò Magini.
Platform & Engineering Services CERN IT Department CH-1211 Geneva 23 Switzerland t PES Development Workflow of the Configuration Management.
Computing Facilities CERN IT Department CH-1211 Geneva 23 Switzerland t CF CF Monitoring: Lemon, LAS, SLS I.Fedorko(IT/CF) IT-Monitoring.
OpenStack overview of the project Belmiro Daniel Rodrigues Moreira CERN IT-PES-PS January 2011 Disclaimer: This presentation reflects the experience and.
CERN IT Department CH-1211 Genève 23 Switzerland t CERN IT Monitoring and Data Analytics Pedro Andrade (IT-GT) Openlab Workshop on Data Analytics.
Computing Facilities CERN IT Department CH-1211 Geneva 23 Switzerland t CF Alarming with GNI VOC WG meeting 12 th September.
CERN IT Department CH-1211 Genève 23 Switzerland t Migration from ELFMs to Agile Infrastructure CERN, IT Department.
15-Feb-02Steve Traylen, RAL WP6 Test Bed Report1 RAL/UK WP6 Test Bed Report Steve Traylen, WP6 PPGRID/RAL, UK
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES CVMFS deployment status Ian Collier – STFC Stefan Roiser – CERN.
CERN - IT Department CH-1211 Genève 23 Switzerland t Operating systems and Information Services OIS Proposed Drupal Service Definition IT-OIS.
CERN IT Department CH-1211 Genève 23 Switzerland t Next generation of virtual infrastructure with Hyper-V Juraj Sucik, Michal Kwiatek, Rafal.
CERN IT Department CH-1211 Genève 23 Switzerland t SL(C) 5 Migration at CERN CHEP 2009, Prague Ulrich SCHWICKERATH Ricardo SILVA CERN, IT-FIO-FS.
Platform & Engineering Services CERN IT Department CH-1211 Geneva 23 Switzerland t PES AI Images, flavours and partitions Vítor Gouveia,
Computing Facilities CERN IT Department CH-1211 Geneva 23 Switzerland t CF CC Monitoring I.Fedorko on behalf of CF/ASI 18/02/2011 Overview.
CERN IT Department CH-1211 Genève 23 Switzerland t CERN Agile Infrastructure Monitoring Pedro Andrade CERN – IT/GT HEPiX Spring 2012.
Computing Facilities CERN IT Department CH-1211 Geneva 23 Switzerland t CF SINDES Secure INformation DElivery System CERN IT/CF-ASI.
CERN AI Config Management 16/07/15 AI for INFN visit2 Overview for INFN visit.
Platform & Engineering Services CERN IT Department CH-1211 Geneva 23 Switzerland t PES Improving resilience of T0 grid services Manuel Guijarro.
Computing Facilities CERN IT Department CH-1211 Geneva 23 Switzerland t CF Cluman: Advanced Cluster Management for Large-scale Infrastructures.
Cloud Installation & Configuration Management. Outline  Definitions  Tools, “Comparison”  References.
Platform & Engineering Services CERN IT Department CH-1211 Geneva 23 Switzerland t PES Agile Infrastructure Project Overview : Status and.
Configuration Services at CERN HEPiX fall Ben Jones, HEPiX Fall 2014.
Automating operational procedures with Daniel Fernández Rodríguez - Akos Hencz -
CERN IT Department CH-1211 Genève 23 Switzerland M.Schröder, Hepix Vancouver 2011 OCS Inventory at CERN Matthias Schröder (IT-OIS)
Platform & Engineering Services CERN IT Department CH-1211 Geneva 23 Switzerland t PES GIT Service in the Agile Infrastructure Project Vítor.
Scaling Puppet and Foreman for HPC
Presentation transcript:

1 CERN IT Department CH-1211 Genève 23 Switzerland t Puppet in the CERN CC Tomas Karasek Steve Traylen Oct

2 CERN IT Department CH-1211 Genève 23 Switzerland t CERN Agile Infrastructure Motivation Component Releases –Configuration Puppet, Foreman and Hiera Punch -> Judy –Provision OpenStack –Other Services koji, git, jira Community Interactions. AI as a Production Service –Expanding user base

3 CERN IT Department CH-1211 Genève 23 Switzerland t Motivation for CERN AI. CERN IT is changing strategy for machine provision and configuration. Rationale –Need to manage twice as many servers as today –No increase in staff numbers –Our deployment of configuration tools becoming increasingly brittle. –New services take far to long to deploy. Approach –We are no longer a special case for compute. –Adopt open source tool chain model –Contribute new function back to community. 3

4 CERN IT Department CH-1211 Genève 23 Switzerland t Configuration Components Puppet (2.7) –Responsible for configuration, an industry standard. Foreman (1.0) –Groups hosts into hostgroups of similar configuration. –Generates kickstart files from where puppet can take over. Hiera (1.0) –A data store used by puppet. Mcollective (2.2) –pub sub messaging to control and query hosts. CDB legacy (old) –Still some items in CDB... e.g warranty information. 4

5 CERN IT Department CH-1211 Genève 23 Switzerland t Configuration - Punch Service First puppet infrastructure known as “Punch” –One 4 core node, set up “by hand”. –puppet, foreman running behind passenger (mod_ruby) –In built own puppetca (cert authority) –All project members with root access. Secret files uploaded by hand. Secret files being distributed by puppet –Node started to struggle once 400 puppet agents attached - CPU limitation on server. This was with reconfigurations every 15 minutes which is excessive. Punch ran for 6 months. –Punch was never a scalable solution. 5

6 CERN IT Department CH-1211 Genève 23 Switzerland t Configuration - Judy Service Punch replaced by Judy in August –All components are deployed with puppet. –2 backend puppetmasters, 2 backend foreman. –mod_loadbalence redirecting requests. Using CERN CA. CertBaby Service Hooks up users kerberos identity, machine ownership and certificate requests. 6

7 CERN IT Department CH-1211 Genève 23 Switzerland t Judy Service Scale Currently 1200 puppet agents. –500 node added in the last week. –100 a day being added right now. –Agents are running on Hardware CVI Service (hyper-v) OpenStack Nova (kvm) (all new ones) –Organized in 37 hostgroups with 60 subgroups. Adding more puppetmasters or foreman backends is easy. –Same problem as scaling web pages, e.g Number of active connections at redirector. Consistency across back end servers. 7

8 CERN IT Department CH-1211 Genève 23 Switzerland t Puppet Manifests. Puppet manifests are very (too?) quick to develop. –Takes little longer than configuring the service. –e.g an apollo module written in two days. while apollo configuration was being learnt. –later paramatization of hardcoded values easy. Puppet code to be executed on nodes is distributed by puppet first. –i.e no need to package any puppet modules. –Makes new feature development, deployment very fast. We and others will get better at sharing puppet manifests as hiera becomes normal. 8

9 CERN IT Department CH-1211 Genève 23 Switzerland t Puppet Git and Environments Git used for puppet modules & manifests. Git branches map to dynamic environments –local development can be ‘puppet apply’d. –admins push changes to a (gitolite) repository –puppet masters pull branches and translate to environments –Production, Testing & Devel branches –Topic branches for major changes –Some services live in their own branches risk of divergence... Atlassian Crucible & Fisheye for module review process... not really started. 9

10 CERN IT Department CH-1211 Genève 23 Switzerland t Foreman Groups hosts of similar configuration. Top group -> service. e.g lxbatch, cernfts,... Subgroups may be very different e.g –cvmfs/stratum0 vs cvmfs/lxcvmfs.

11 CERN IT Department CH-1211 Genève 23 Switzerland t Separate Code and Data Quattor separated code and data well: –It was one motivation to write Quattor and drop LCFGng in the first place. hiera takes the separation to a new level: –puppet asks for a value from hiera? $myNTP = hiera(‘ntpservers’) –result can be string, array, hash,.... –The lookup is based on a nodes properties, e.g Since I am at CERN answer is ntp1.cern.ch Since I am in Budapest answer is ntp2.cern.ch –The schema of results for CERN nodes, Budapest nodes, SLC5 nodes, debian nodes can be arranged and changed as we please. 11

12 CERN IT Department CH-1211 Genève 23 Switzerland t Hiera and Hostgroups We arrange nodes in to (sub)hostgroups in foreman. A tree of YAML files stored in git maps on to these. e.g for castor hostgroups –hostgroup/castor/diskserver/atlas.yaml –hostgroup/castor/diskserver.yaml –hostgroup/castor.yaml –os/slc5.yaml –common.yaml The files above contain increasingly general keyvalues for look up in hiera. Schema and can be fully customized to CERN space with no fear of polluting the code. # A YAML file. --- castorns: ns.cern.ch 12

13 CERN IT Department CH-1211 Genève 23 Switzerland t Configuration Next Steps Deploy puppetdb –Performance improvements - community raving. –Repository for configuration data mining. Deploy mcollective –Pub and Sub system for sending action commands to hosts. –Message broker needs ACLs on queues corresponding to full diversity of CERN hosts and actions. –Data mine puppetdb. Workflow –Move to git pull request process for central configuration. 13

14 CERN IT Department CH-1211 Genève 23 Switzerland t Community Interactions CERN presenting to community/vendors. –PuppetConf, San Francisco, Sep 2012 –Openstack Summit, San Francisco Apr 2012 –Openstack Summit, San Diego, Oct 2012 (now) –PuppetCamp, Geneva, July 2012 CERN has code contributions to: –facter, the foreman, puppet, various puppet modules, mcollective, openstack nova, keystone and swift. –This is increasing as new students/fellows are employed for their puppet, ruby,.. skills. CERN puppet-users meeting, IT, ATLAS pit,.. Share our own 14

15 CERN IT Department CH-1211 Genève 23 Switzerland t Other AI Services Agile is not just Puppet and Openstack. AI created a gitolite ACL’ed GIT service. –CERN IT is now provisioning a public GIT service based on this. –AI will migrate its projects ASAP. AI created a Koji service for RPMs. –Creates RPMS and publishes to yum. –The service is now being used by others with in IT. e.g castor builds, data management, lemon,... AI ran jira early before a central service was created. –AI already migrated to central service. 15

16 CERN IT Department CH-1211 Genève 23 Switzerland t AI Service in Production Several Services running now on AI. –Some CVMFS components. –SLC6 batch services –SLC build machines –GIT gateways. –CASTOR (compass VO) –Test systems, glusterfs, swift,.. –New top level hostgroups every week now. From November AI opening up more. –Experiment services (voboxes) will start to use AI service. –Documentation to be updated/consolidated. 16

17 CERN IT Department CH-1211 Genève 23 Switzerland t Conclusions Agile Infrastructure Project We are ready for hardware arriving in Budapest in –Puppet configured VMs on Puppet configured OpenStack. Documentation: –More user facing documentation needed. Configuration with Puppet: –Services needing knowledge of everything –Inter sysadmin trust. –Test facility for AI. OpenStack deployment –Increase scale. 17

18 CERN IT Department CH-1211 Genève 23 Switzerland t URLs AI Project Pages: CERN modules CERN agile tickets AI Presentations : 18