2nd ASPERA Workshop, 30-31 May 2011, Barcelona, Spain. P. Mato / CERN.


2nd ASPERA Workshop, 30-31 May 2011, Barcelona, Spain. P. Mato / CERN

• Aims to provide a complete, portable and easy-to-configure user environment for developing and running LHC data analysis, locally and on the Grid, independent of the physical software and hardware platform (Linux, Windows, MacOS)
  ◦ Code check-out, editing, compilation, small local tests, debugging, …
  ◦ Grid submission, data access, …
  ◦ Event displays, interactive data analysis, …
  ◦ Suspend, resume, …
• Decouple the application lifecycle from the evolution of the system infrastructure
• Reduce the effort to install, maintain and keep the experiment software up to date

The CernVM image consists of three elements:
1. Minimal Linux OS (SL5)
2. CernVM-FS: an HTTP network file system optimized for just-in-time delivery of experiment software
3. Flexible configuration and contextualization mechanism based on a public Cloud API

Checkout → Compile → Test → Debug → Submit to Grid → Display results → Suspend → Resume

[Figure: ~1000 different IP addresses; ~2000 different IP addresses]

Is CernVM suitable for deployment on Grid and Cloud infrastructure? What are the benefits of going the CernVM way compared to the more traditional approach (1) to batch-node virtualization?

(1) Traditional approach: take a "standard" batch node image [2 GB], add the experiment software [10 GB] and generate a VM image. Have the experiment and the security team certify the image, then deploy it to all sites and worker nodes. Repeat this procedure 1-2 times per week, per experiment.

• Just enough OS to run LHC applications
• Built using a commercial tool (rBuilder by rPath)
  ◦ Top-down approach: starting from the application and automatically discovering its dependencies
• Small images (250 MB), easy to move around

Starting from the experiment software… …ending with a custom Linux specialised for a given task, delivered in a variety of formats:
• Installable CD/DVD
• Stub Image
• Raw Filesystem Image
• Netboot Image
• Compressed Tar File
• Demo CD/DVD (Live CD/DVD)
• Raw Hard Disk Image
• VMware® Virtual Appliance
• VMware® ESX Server Virtual Appliance
• Microsoft® VHD Virtual Appliance
• Xen Enterprise Virtual Appliance
• Virtual Iron Virtual Appliance
• Parallels Virtual Appliance
• Amazon Machine Image
• Update CD/DVD
• Appliance Installable ISO

Every build and every file installed on the system is automatically versioned and accounted for in a database.

• Experiment software changes frequently and we want to avoid the need to update, certify and redistribute VM images with every release
• Only a small fraction of a software release is really used
• Demonstrated scalability and reliability
• Now being deployed across all Grid sites as the channel for software distribution

• CernVM comes with a read-only file system (CernVM-FS) optimized for software distribution
  ◦ Only a small fraction of the experiment software is actually used (~10%)
  ◦ Very aggressive local caching, plus web proxy caches (Squids)
  ◦ Transparent file compression
  ◦ Integrity checks using checksums, signed file catalog
  ◦ Operational in off-line mode
• No need to install any experiment software
  ◦ 'Virtually' all versions of all applications are already installed
  ◦ The user just needs to start using the software to trigger the download
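To make the client side concrete, a minimal CernVM-FS client configuration might look like the sketch below; the repository names, proxy host and cache size are illustrative assumptions, not values taken from the slides.

# /etc/cvmfs/default.local -- minimal client configuration (sketch)
CVMFS_REPOSITORIES=atlas.cern.ch,lhcb.cern.ch              # repositories to mount under /cvmfs
CVMFS_QUOTA_LIMIT=10000                                    # local LRU-managed cache size in MB
CVMFS_HTTP_PROXY="http://squid.example.org:3128;DIRECT"    # site Squid, fall back to direct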

• Principle: digitally signed repositories with a certificate white-list
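To make the trust chain concrete, the sketch below shows where the trust material typically lives in a CernVM-FS setup; the exact file and key names are assumptions based on common CVMFS layouts, not details given on the slide.

# Client: public keys of the repository signers, shipped with the CernVM-FS client
ls /etc/cvmfs/keys/          # e.g. cern.ch.pub
# Repository (served over HTTP):
#   .cvmfspublished  - signed manifest referencing the current file catalog
#   .cvmfswhitelist  - time-limited white-list of valid repository certificates
# The client checks the catalog signature against the repository certificate,
# and the certificate fingerprint against the white-list, before trusting data.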

• Stratum model
  + Fast and scalable
  + No single point of failure
  - Complex hierarchy

• Proxies
  ◦ SL5 Squid, load-balancing + fail-over, e.g. CVMFS_HTTP_PROXY="A|B|C"
• Mirrors
  ◦ Fail-over mirrors at CERN, RAL, BNL; for roaming users, automatic ordering based on RTT
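Spelling out the "A|B|C" shorthand: proxies joined with "|" form a load-balanced group, while ";" separates fail-over alternatives. The host names below are placeholders for illustration.

# two site Squids load-balanced, with a direct connection as last resort
CVMFS_HTTP_PROXY="http://squid-a.example.org:3128|http://squid-b.example.org:3128;DIRECT"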

• CernVM-FS is set to replace the current software installation methods
• Current method:
  ◦ Run an sgm job via the Grid
  ◦ Write files from within the job to some shared storage
  ◦ Validate the software
  ◦ Publish a tag in the BDII
  ◦ The process has to be repeated (and debugged) at every site
• Being deployed at Tier-1s, Tier-2s and Tier-3s
  ◦ ATLAS and LHCb are running productions with it

• Install once (on stratum zero)
  ◦ Files appear everywhere across WLCG
• This can be many days faster
• Hopefully less variation across sites
  ◦ Common path /cvmfs/…
  ◦ Very few variables: cache size, Squid QoS
  ◦ Same install bugs everywhere, fixed once
• Some sites are struggling to provide scalable NFS/AFS
  ◦ One shared area read by every batch worker
  ◦ CernVM-FS scales better than NFS/AFS

There are several ways to contextualize CernVM:
• Web UI (for the individual user)
• CernVM Contextualization Agent
• HEPiX CDROM method
☞ Amazon EC2 API user_data method

Contextualization via the Web UI:
1. Log in to the Web interface
2. Create a user account
3. Select the experiment, appliance flavor and preferences

• Basic principles:
  ◦ The owner of a CernVM instance can contextualize and configure it to run an arbitrary service as an unprivileged user
  ◦ A site can use the HEPiX method to inject monitoring and accounting hooks without functionally modifying the image
• The contextualization is based on the rPath amiconfig package, extended with a CernVM plugin
  ◦ The tool executes at boot time (before network services are available), parses the user data and looks for Python-style configuration blocks
  ◦ If a match is found, the corresponding plugin processes the options and executes the configuration steps if needed
• For more information on CernVM contextualization using the EC2 API, see: https://cernvm.cern.ch/project/trac/cernvm/wiki/EC2Contextualization

[cernvm]
# list of ',' separated organizations/experiments (lowercase)
organisations = cms
# list of ',' separated repositories (lowercase)
repositories = cms,grid
# list of ',' separated user accounts to create
users = cms:cms:
# CVMFS HTTP proxy
proxy = : ;DIRECT
# install extra conary group
group_profile = group-cms
# script to be executed as given user: :/path/to/script.sh
contextualization_command = cms:/path/to/script.sh
# list of ',' separated services to start
services =
# extra environment variables to define
environment = CMS_SITECONFIG=EC2,CMS_ROOT=/opt/cms
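As an illustration of the EC2 user_data method, the sketch below passes such a block when launching an instance; the AMI ID, key pair name and euca2ools-style flags are assumptions for illustration and may differ for other EC2 clients.

# save the [cernvm] block above as cernvm-context.ini, then pass it as user data
# (euca2ools-style invocation; -t instance type, -k key pair, -f user-data file)
euca-run-instances ami-XXXXXXXX -t m1.small -k mykey -f cernvm-context.ini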

Now we have the CernVM OS, CernVM-FS, contextualization and a Cloud API… what's next?

[Diagram: PanDA Server, EC2 API, Private/Academic Clouds, Public/Commercial Clouds, Store results?]

EOS (Andreas Peters, IT/DM), implemented as plugins in xrootd:
• MGM (Management Server): pluggable namespace, quota, ACLs, HA; strong authentication; capability engine; file placement; file location
• MQ (Message Queue): service state messages; file transaction reports
• FST (File Storage): file and file metadata store; capability authorization; file and block disk error detection (scrubbing); layout plugins [currently Raid-1(n)]
• Clients: XROOT client & FUSE (XrdClient); KRB5 + X509 authenticated; NS failover

bash# df /eos
eosatlas.cern.ch  3.8P  39G  3.8P  1%  /eos
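The following is a hedged illustration of the two client access paths named on the slide (XROOT protocol and FUSE mount); the host name appears on the slide, while the file paths are placeholders.

# access via the XROOT protocol (xrootd client tools)
xrdcp root://eosatlas.cern.ch//eos/atlas/some/path/file.root /tmp/file.root
# access via the FUSE mount
df /eos
ls /eos/atlas/some/path/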

Listing available images:

~]$ cvm --region CERN -H ls -i
AMI  LOCATION                            STATE      VISIBILITY  ARCH  TYPE
ami  hepix_sl55_x86_64_kvm               available  Public      i386  machine
ami  cernvm232_head_slc5_x86_64_kvm      available  Public      i386  machine
ami  cernvm231_slc5_x86_64_kvm           available  Public      i386  machine
ami  cernvm232_batch_slc5_x86_64_kvm     available  Public      i386  machine
ami  lxdev_slc6_quattor_slc6_x86_64_kvm  available  Public      i386  machine

Support for multiple regions:

~]$ cvm --region EC2 -H ls -i ami-5c3ec235
AMI           LOCATION                                                                      STATE      VISIBILITY  ARCH    TYPE
ami-5c3ec235  download.cernvm.cern.ch.s3.amazonaws.com/cernvm x86_64_1899.img.manifest.xml  available  Public      x86_64  machine

Proxy certificate:

~]$ lcg grid-proxy-init
Your identity: /DC=ch/DC=cern/OU=computers/CN=pilot/copilot.cern.ch
Creating proxy ... Done
Your proxy is valid until: Thu May 19 06:43:

~]$ lcg grid-proxy-info
subject  : /DC=ch/DC=cern/OU=computers/CN=pilot/copilot.cern.ch/CN=
issuer   : /DC=ch/DC=cern/OU=computers/CN=pilot/copilot.cern.ch
identity : /DC=ch/DC=cern/OU=computers/CN=pilot/copilot.cern.ch
type     : Proxy draft (pre-RFC) compliant impersonation proxy
strength : 512 bits
path     : /tmp/x509up_u500
timeleft : 11:59:56

This proxy certificate is time limited and authorized only to:
• Request jobs from a dedicated PanDA queue
• Write (but not delete) files in a given EOS directory

Starting contextualized CernVM images on lxCloud:

~]$ cvm --region CERN run ami proxy --template panda-wn:1
r-47a5402e predrag default i-195 ami pending default 0 m1.small T01:00:00+01:00 default

~]$ cvm --region CERN -H ls
ID RID OWNER GROUP DNS STATE KEY TYPE
i-195 default predrag default running default m1.small

~]$ cvm --region CERN -H ls
ID RID OWNER GROUP DNS STATE KEY TYPE
i-195 default predrag default running default m1.small
i-196 default predrag default running default m1.small
i-197 default predrag default running default m1.small
i-198 default predrag default running default m1.small
i-199 default predrag default pending default m1.small
i-200 default predrag default pending default m1.small
i-201 default predrag default pending default m1.small
i-202 default predrag default pending default m1.small
i-203 default predrag default pending default m1.small
i-204 default predrag default pending default m1.small
i-205 default predrag default pending default m1.small
i-206 default predrag default pending default m1.small

Starting more contextualized CernVM images on EC2:

~]$ cvm run ami-5c3ec235 -g default -t m1.large --kernel aki-9800e5f1 --key ami --proxy --template panda-wn:10
r-ad962dc default i-f3b04a9d ami-5c3ec235 pending ami 0 m1.large
r-ad962dc default i-f1b04a9f ami-5c3ec235 pending ami 1 m1.large
r-ad962dc default i-cfb04aa1 ami-5c3ec235 pending ami 2 m1.large
r-ad962dc default i-cdb04aa3 ami-5c3ec235 pending ami 3 m1.large
....

~]$ cvm --region EC2 -H ls
ID RID OWNER GROUP DNS STATE KEY TYPE
i-f3b04a9d r-ad962dc default ec running ami m1.large
i-f1b04a9f r-ad962dc default ec running ami m1.large
i-cfb04aa1 r-ad962dc default ec running ami m1.large
i-cdb04aa3 r-ad962dc default ec running ami m1.large
i-cbb04aa5 r-ad962dc default ec running ami m1.large
i-c9b04aa7 r-ad962dc default ec running ami m1.large
i-c7b04aa9 r-ad962dc default ec running ami m1.large
i-c5b04aab r-ad962dc default ec running ami m1.large
i-c3b04aad r-ad962dc default ec running ami m1.large
i-c1b04aaf r-ad962dc default ec running ami m1.large


• Described the three elements that constitute the CernVM image:
  ◦ Minimal Linux OS (SL5)
  ◦ CernVM-FS: an HTTP network file system optimized for just-in-time delivery of experiment software; it can also be used independently of CernVM
  ◦ Flexible configuration and contextualization mechanism based on a public Cloud API
• The user environment is pretty well understood, evolving towards a job hosting environment (Grid, cloud, volunteer computing)
• Testing CernVM on Amazon EC2 and lxCloud
  ◦ A way to simplify software deployment and jump on the cloud wagon