Predrag Buncic (CERN/PH-SFT) CernVM - a virtual software appliance for LHC applications C. Aguado-Sanchez 1), P. Buncic 1), L. Franco 1), A. Harutyunyan.

Slides:



Advertisements
Similar presentations
Virtual Machine Technology Dr. Gregor von Laszewski Dr. Lizhe Wang.
Advertisements

INFSO-RI An On-Demand Dynamic Virtualization Manager Øyvind Valen-Sendstad CERN – IT/GD, ETICS Virtual Node bootstrapper.
Virtualization and the Cloud
1 Bridging Clouds with CernVM: ATLAS/PanDA example Wenjing Wu
CVMFS: Software Access Anywhere Dan Bradley Any data, Any time, Anywhere Project.
Tanenbaum 8.3 See references
Open Science Grid Software Stack, Virtual Data Toolkit and Interoperability Activities D. Olson, LBNL for the OSG International.
Testing Virtual Machine Performance Running ATLAS Software Yushu Yao Paolo Calafiura LBNL April 15,
1 port BOSS on Wenjing Wu (IHEP-CC)
Connecting OurGrid & GridSAM A Short Overview. Content Goals OurGrid: architecture overview OurGrid: short overview GridSAM: short overview GridSAM: example.
INTRODUCTION TO CLOUD COMPUTING CS 595 LECTURE 2.
Introduction to CVMFS A way to distribute HEP software on cloud Tian Yan (IHEP Computing Center, BESIIICGEM Cloud Computing Summer School.
Wenjing Wu Andrej Filipčič David Cameron Eric Lancon Claire Adam Bourdarios & others.
David Cameron Riccardo Bianchi Claire Adam Bourdarios Andrej Filipcic Eric Lançon Efrat Tal Hod Wenjing Wu on behalf of the ATLAS Collaboration CHEP 15,
Predrag Buncic (CERN/PH-SFT) CernVM - a virtual software appliance for LHC applications C. Aguado-Sanchez 1), P. Buncic 1), L. Franco 1), S. Klemer 1),
Predrag Buncic (CERN/PH-SFT) WP9 - Workshop Summary
ALICE Offline Week | CERN | November 7, 2013 | Predrag Buncic AliEn, Clouds and Supercomputers Predrag Buncic With minor adjustments by Maarten Litmaath.
1 BOINC-VM and Volunteer Cloud Computing Ben Segal / CERN and: Predrag Buncic, Jakob Blomer, Pere Mato / CERN Carlos Aguado Sanchez, Artem Harutyunyan.
Predrag Buncic (CERN/PH-SFT) Introduction to WP9 Portable Analysis Environment Using Virtualization Technology IBM-VM 360, CERNVM,
Changes to CernVM-FS repository are staged on an “installation box" using a read/write file system interface. There is a dedicated installation box for.
DPHEP Workshop CERN, December Predrag Buncic (CERN/PH-SFT) CernVM R&D Project Portable Analysis Environments using Virtualization.
NA61/NA49 virtualisation: status and plans Dag Toppe Larsen CERN
WLCG Overview Board, September 3 rd 2010 P. Mato, P.Buncic Use of multi-core and virtualization technologies.
Virtual Machines Created within the Virtualization layer, such as a hypervisor Shares the physical computer's CPU, hard disk, memory, and network interfaces.
Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Tools and techniques for managing virtual machine images Andreas.
Full and Para Virtualization
2012 Objectives for CernVM. PH/SFT Technical Group Meeting CernVM/Subprojects The R&D phase of the project has finished and we continue to work as part.
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
G.Govi CERN/IT-DB 1 September 26, 2003 POOL Integration, Testing and Release Procedure Integration  Packages structure  External dependencies  Configuration.
Predrag Buncic (CERN/PH-SFT) Virtualizing LHC Applications.
1 BOINC Case Study: Ben Segal / CERN with: Predrag Buncic / CERN Daniel Lombrana Gonzalez / Univ. Extremadura David Weir / Imperial.
SPI NIGHTLIES Alex Hodgkins. SPI nightlies  Build and test various software projects each night  Provide a nightlies summary page that displays all.
NA61/NA49 virtualisation: status and plans Dag Toppe Larsen Budapest
MND review. Main directions of work  Development and support of the Experiment Dashboard Applications - Data management monitoring - Job processing monitoring.
The CernVM Infrastructure Insights of a paradigmatic project Carlos Aguado Sanchez Jakob Blomer Predrag Buncic.
Predrag Buncic (CERN/PH-SFT) Software Packaging: Can Virtualization help?
2nd ASPERA Workshop May 2011, Barcelona, Spain P. Mato /CERN.
Feedback from CMS Andrew Lahiff STFC Rutherford Appleton Laboratory Contributions from Christoph Wissing, Bockjoo Kim, Alessandro Degano CernVM Users Workshop.
Predrag Buncic (CERN/PH-SFT) Virtualization – the road ahead.
The CernVM Project A new approach to software distribution Carlos Aguado Jakob Predrag
36 th LHCb Software Week Pere Mato/CERN.  Provide a complete, portable and easy to configure user environment for developing and running LHC data analysis.
Virtual Machines Module 2. Objectives Define virtual machine Define common terminology Identify advantages and disadvantages Determine what software is.
NA61 Collaboration Meeting CERN, December Predrag Buncic, Mihajlo Mudrinic CERN/PH-SFT Enabling long term data preservation.
Predrag Buncic (CERN/PH-SFT) CernVM Status. CERN, 24/10/ Virtualization R&D (WP9)  The aim of WP9 is to provide a complete, portable and easy.
EGI-InSPIRE RI EGI Webinar EGI-InSPIRE RI Porting your application to the EGI Federated Cloud 17 Feb
Cloud Computing Application in High Energy Physics Yaodong Cheng IHEP, CAS
Journées Informatiques de l'IN2P May 2010, Aussois, France P. Mato /CERN.
Advancing CernVM-FS and its Development Infrastructure José Molina Colmenero CERN EP-SFT.
Volunteer Clouds and Citizen Cyberscience for LHC Physics Artem Harutyunyan / CERN Carlos Aguado Sanchez / CERN, Jakob Blomer / CERN, Predrag Buncic /
Predrag Buncic (CERN/PH-SFT) Virtualization R&D (WP9) Status Report.
Fermilab Scientific Computing Division Fermi National Accelerator Laboratory, Batavia, Illinois, USA. Off-the-Shelf Hardware and Software DAQ Performance.
Predrag Buncic, CERN/PH-SFT The Future of CernVM.
CHEP 2010 Taipei, 19 October Predrag Buncic Jakob Blomer, Carlos Aguado Sanchez, Pere Mato, Artem Harutyunyan CERN/PH-SFT.
Installation of the ALICE Software
Use of HLT farm and Clouds in ALICE
Virtualisation for NA49/NA61
NA61/NA49 virtualisation:
Blueprint of Persistent Infrastructure as a Service
Dag Toppe Larsen UiB/CERN CERN,
Progress on NA61/NA49 software virtualisation Dag Toppe Larsen Wrocław
Dag Toppe Larsen UiB/CERN CERN,
From Application to Appliance
David Cameron ATLAS Site Jamboree, 20 Jan 2017
Virtualisation for NA49/NA61
CernVM Status Report Predrag Buncic (CERN/PH-SFT).
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
Ivan Reid (Brunel University London/CMS)
Backfilling the Grid with Containerized BOINC in the ATLAS computing
Presentation transcript:

Predrag Buncic (CERN/PH-SFT) CernVM - a virtual software appliance for LHC applications C. Aguado-Sanchez 1), P. Buncic 1), L. Franco 1), A. Harutyunyan 2), P. Mato 1), Y. Yao 3) 1) CERN, Geneva, 2) Yerevan Physics Institute, Yerevan, 3) LBNL, Berkeley

CHEP 2009 CernVM – A Virtial Machine for LHC Experiments Prague, 26/03/ CernVM Project Talk Outline  CernVM Project Building blocks User Interface and API Scalability and performance  Future plans & directions  Release status  Conclusions  Links

CHEP 2009 CernVM – A Virtial Machine for LHC Experiments Prague, 26/03/ CernVM Project Portable Analysis Environment using Virtualization Technology (WP9)  Approved in 2007 (2+2 years) as R&D activity in CERN/PH Department  Started January 2008  Sister project to Multicore R&D Project goals:  Provide a complete, portable and easy to configure user environment for developing and running LHC data analysis locally and on the Grid independent of physical software and hardware platform (Linux, Windows, MacOS)  Decouple application lifecycle from evolution of system infrastructure  Reduce effort to install, maintain and keep up to date the experiment software  Lower the cost of software development by reducing the number of compiler-platform combinations

CHEP 2009 CernVM – A Virtial Machine for LHC Experiments Prague, 26/03/ A complete Data Analysis environment available for each experiment  Code check-out, edition, compilation, local small test, debugging, …  Grid submission, data access…  Event displays, interactive data analysis, … No software installation required Suspend/resume capability

CHEP 2009 CernVM – A Virtial Machine for LHC Experiments Prague, 26/03/ Key Building Blocks rBulder from rPath (  A tool to build VM images for various virtualization platforms rPath Linux 1  Slim Linux OS binary compatible with RH/SLC4 rAA - rPath Linux Appliance Agent  Web user interface  XMLRPC API Can be fully customized and extended by means of plugins (  401) CVMFS - CernVM file system  Read only file system optimized for software distribution Aggressive caching  Operational in offline mode For as long as you stay within the cache Build types  Installable CD/DVD  Stub Image  Raw Filesystem Image  Netboot Image  Compressed Tar File  Demo CD/DVD (Live CD/DVD)  Raw Hard Disk Image  Vmware ® Virtual Appliance  Vmware ® ESX Server Virtual Appliance  Microsoft ® VHD Virtual Apliance  Xen Enterprise Virtual Appliance  Virtual Iron Virtual Appliance  Parallels Virtual Appliance  Amazon Machine Image  Update CD/DVD  Appliance Installable ISO

CHEP 2009 CernVM – A Virtial Machine for LHC Experiments Prague, 26/03/ class Root(CPackageRecipe): name='root' version=' ' buildRequires = ['libpng:devel', 'libpng:devellib','krb5:devel', 'libstdc++:devel’,'libxml2:devel', 'openssl:devel','python:devel', 'xorg-x11:devel', 'zlib:devel', 'perl:devel', 'perl:runtime'] def setup(r): r.addArchive('ftp://root.cern.ch/root/%(name)s_v%(version)s.source.tar.gz’) r.Environment('ROOTSYS',%(builddir)s') r.ManualConfigure('--prefix=/opt/root ') r.Make() r.MakeInstall() Conary Package Manager

CHEP 2009 CernVM – A Virtial Machine for LHC Experiments Prague, 26/03/ CernVM Variations group-cernvm (core packages) group-cernvm (core packages) group-cernvm-devel (development tools) group-cernvm-devel (development tools) group-slc4 (SLC4 compatibility libs) group-slc4 (SLC4 compatibility libs) group- (groups and extra packages required by experiment) group- (groups and extra packages required by experiment) 100 MB compat-db4 compat-openssl compat-libstdc++slc3 compat-libxml2 compat-readline compat-tcl compat-tk group- -desktop (lightweight X environment) group- -desktop (lightweight X environment) group-cernvm-desktop X11 group-cernvm-desktop X11

CHEP 2009 CernVM – A Virtial Machine for LHC Experiments Prague, 26/03/ import xmlrpclib import os server = xmlrpclib.ServerProxy(url) r = server.cernvm.Config.configGridUIVersion(” ") As easy as 1,2,3 1. Login to Web interface 2. Create user account 3. Select experiment, appliance flavor and preferences

CHEP 2009 CernVM – A Virtial Machine for LHC Experiments Prague, 26/03/ CVMFS CernVM File System (CVMFS) is derived from Parrot ( and its GROW-FS plugin code base and adapted to run as a FUSE kernel module adding extra features like:  possibility to use multiple file catalogues on the server side  transparent file compression under given size threshold  dynamical expansion of environment variables embedded in symbolic links

CHEP 2009 CernVM – A Virtial Machine for LHC Experiments Prague, 26/03/ Scalable Infrastructure

Publishing Releases 1. Each experiment is given a VM to install and test their software using own installation tools 2. Publishing is an atomic operation

CHEP 2009 CernVM – A Virtial Machine for LHC Experiments Prague, 26/03/ Primary (master) software repository Secondary software repository Secondary software repository Reverse Proxy Load Balancing Reverse Proxy Load Balancing Reverse Proxy CernVM (thin client) Site Reverse Proxy Load Balancing Reverse Proxy Regional Reverse Proxy The aim is to reduce latency which is the most important issue for distributed network file systems 3/9/0912 Current deployment model

Benchmark setup ROOT (version 5.22, 100Mb) The benchmark runs in Xen VM and includes compilation and execution of ROOT stressHepix suite  The system has to find and fetch binaries, libraries and headers from the network file system but, once loaded, the libraries remain in a local cache We compare the performance of CVMFS to AFS by artificially introducing network latency and limiting bandwith (using tc tool)

CVMFS vs AFS These plots show the time penalty  t=t AFS,CVMFS - t local  resulting from having the application binaries, search paths, and include paths reside on a network file system,  Additional performance loss due to virtualization (not shown here) is consistently <5% CVMFS shows consistently better performance than AFS in case of ‘cold cache’  ‘Worm cache’ performance is in case of AFS and CVMFS equal to local file access

Proxy Server Proxy Server CernVM HTTP server HTTP server Proxy Server HTTP server HTTP server Proxy Server HTTP server HTTP server Proxy Server Removing single point of failure (1) Slave servers could be deployed on strategic locations to reduce latency and provide redundancy

HTTP server HTTP server Proxy Server CernVM Content Distributio n Network WAN Use existing Content Delivery Networks to remove single point of failure  Amazon CloudFront (  Coral CDN ( LAN Use P2P like mechanism for discovery of nearby CernVMs and cache sharing between them. No need to manually setup proxy servers (but they could still be used where exist) Removing single point of failure (2)

CHEP 2009 CernVM – A Virtial Machine for LHC Experiments Prague, 26/03/ Bridging Grids & Clouds BOINC  Open-source software for volunteer computing and grid computing  CernVM CoPilot development  Based on BOINC, experience and CernVM image  Image size is of outmost importance to motivate volonteers  Can be easily adapted to Pilot Job frameworks (AliEn,Dirac, Panda) … or Condor Worker, or proofd..  Aims to demonstrate running of ATLAS simulation using BOINC infrastructure and PanDa BOINC E PanDA Pilot

CHEP 2009 CernVM – A Virtial Machine for LHC Experiments Prague, 26/03/ Adapter CernVM CoPilot 0. Send host JDL (free disk space, free memory, available packages) 1. Append framework specific information and request a job 3. Send input files and commands for execution (packages are already there) 5. Register output files 4. When the job is done send back the output files (and the result of validation) 2. Send user job JDL from Task Queue AliEn/DIRAC/PanDA

CHEP 2009 CernVM – A Virtial Machine for LHC Experiments Prague, 26/03/ Simplifying Grid deployment Nimbus (former Globus Workspace Service)  Nimbus is a set of open source tools that together provide an "Infrastructure-as-a-Service" (IaaS) cloud computing solution  Google Summer School (hosted at ANL) project to deploy a one-click, auto-configuring virtual Grid overlay for Alice/AliEn Successfully created virtual AliEn site for ALICE with one command See CHEP’09 contribution by Artem Harutyunyan (poster #214)

CHEP 2009 CernVM – A Virtial Machine for LHC Experiments Prague, 26/03/ Release now available Available now for download from  Can be run on  Linux (KVM, Xen,VMware Player, VirtualBox)  Windows(WMware Player, VirtualBox)  Mac (Fusion, Parallels, VirtualBox) Release Notes Appliance can be configured and used with ALICE, LHCb, ATLAS (and CMS) software frameworks  ALICE (AliEn 2.15,2.16, AliRoot v4-14-Rev-04, v4-15-Rev-03, v4-16-Rev-01)  ATLAS ( , , , , , )  CMS (CMSSW_2_2_5, CMSSW_2_2_6 )  LHCB (GAUDI_v19r8, GAUDI_v19r9, GAUDI_v20r0, GAUDI_v20r2, GAUDI_v20r3, GAUDI_v10r11, DAVINCI_v20r3, DAVINCI_v19r12, DAVINCI_v20r3,DAVINCI_v22r1)

CHEP 2009 CernVM – A Virtial Machine for LHC Experiments Prague, 26/03/ Summary If you work for one of the LHC experiments and you have one or a few problems listed below then CernVM might be what you are looking for:  Your experiment software is not compatible with your hardware or s/w platform running on your laptop/desktop  You do not want to spend time to manually install and keep software up to date  Your want to profit from the latest developments in CPU technology and use your multi/many core CPU to its maximal potential without modifying your application  You want to run your software on voluntary resources beyond the current Grid CernVM is a ‘thin’ virtual machines capable of running software of all four LHC experiments  Based on rPath Linux, binary compatible with SL4  Extensible user interface (plugins)  Package groups can be tailored to individual user group Experiment software is injected in VM by means of a file system (CVMFS) specially optimized for efficient software distribution  This allows us to develop, deploy and maintain only ONE image  Easy for end user but also great potential for deployment on cloud & grid  Using HTTP protocol allows efficient caching using standard technology Version available at

CHEP 2009 CernVM – A Virtial Machine for LHC Experiments Prague, 26/03/ Mailing lists  (open list for announcements and discussion) (end-user support for the CernVM project) Savannah Portal  Please submit bugs and feature requests to Savannah at CernVM Home Page:  rBuilder & Download Page:  ATLAS Wiki  CernVM Links…