The CernVM Project A new approach to software distribution Carlos Aguado Jakob Predrag

Slides:



Advertisements
Similar presentations
Virtual Machine Technology Dr. Gregor von Laszewski Dr. Lizhe Wang.
Advertisements

“It’s going to take a month to get a proof of concept going.” “I know VMM, but don’t know how it works with SPF and the Portal” “I know Azure, but.
1 Bridging Clouds with CernVM: ATLAS/PanDA example Wenjing Wu
Understanding and Managing WebSphere V5
Yury Kissin Infrastructure Consultant Storage improvements Dynamic Memory Hyper-V Replica VM Mobility New and Improved Networking Capabilities.
Cloud Computing for the Enterprise November 18th, This work is licensed under a Creative Commons.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
Components of Windows Azure - more detail. Windows Azure Components Windows Azure PaaS ApplicationsWindows Azure Service Model Runtimes.NET 3.5/4, ASP.NET,
Building service testbeds on FIRE D5.2.5 Virtual Cluster on Federated Cloud Demonstration Kit August 2012 Version 1.0 Copyright © 2012 CESGA. All rights.
1 port BOSS on Wenjing Wu (IHEP-CC)
Connecting OurGrid & GridSAM A Short Overview. Content Goals OurGrid: architecture overview OurGrid: short overview GridSAM: short overview GridSAM: example.
Windows Azure Conference 2014 Deploy your Java workloads on Windows Azure.
Introduction to CVMFS A way to distribute HEP software on cloud Tian Yan (IHEP Computing Center, BESIIICGEM Cloud Computing Summer School.
Predrag Buncic (CERN/PH-SFT) CernVM - a virtual software appliance for LHC applications C. Aguado-Sanchez 1), P. Buncic 1), L. Franco 1), A. Harutyunyan.
Predrag Buncic (CERN/PH-SFT) CernVM - a virtual software appliance for LHC applications C. Aguado-Sanchez 1), P. Buncic 1), L. Franco 1), S. Klemer 1),
Support in setting up a non-grid Atlas Tier 3 Doug Benjamin Duke University.
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
Predrag Buncic (CERN/PH-SFT) WP9 - Workshop Summary
Operating Systems & Information Services CERN IT Department CH-1211 Geneva 23 Switzerland t OIS Update on Windows 7 at CERN & Remote Desktop.
1 BOINC-VM and Volunteer Cloud Computing Ben Segal / CERN and: Predrag Buncic, Jakob Blomer, Pere Mato / CERN Carlos Aguado Sanchez, Artem Harutyunyan.
Predrag Buncic (CERN/PH-SFT) Introduction to WP9 Portable Analysis Environment Using Virtualization Technology IBM-VM 360, CERNVM,
VMware vSphere Configuration and Management v6
DPHEP Workshop CERN, December Predrag Buncic (CERN/PH-SFT) CernVM R&D Project Portable Analysis Environments using Virtualization.
NA61/NA49 virtualisation: status and plans Dag Toppe Larsen CERN
WLCG Overview Board, September 3 rd 2010 P. Mato, P.Buncic Use of multi-core and virtualization technologies.
Virtual Machines Created within the Virtualization layer, such as a hypervisor Shares the physical computer's CPU, hard disk, memory, and network interfaces.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
Copyright © 2005 VMware, Inc. All rights reserved. How virtualization can enable your business Richard Allen, IBM Alliance, VMware
2012 Objectives for CernVM. PH/SFT Technical Group Meeting CernVM/Subprojects The R&D phase of the project has finished and we continue to work as part.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
Predrag Buncic (CERN/PH-SFT) Virtualizing LHC Applications.
1 BOINC Case Study: Ben Segal / CERN with: Predrag Buncic / CERN Daniel Lombrana Gonzalez / Univ. Extremadura David Weir / Imperial.
Tier3 monitoring. Initial issues. Danila Oleynik. Artem Petrosyan. JINR.
NA61/NA49 virtualisation: status and plans Dag Toppe Larsen Budapest
The CernVM Infrastructure Insights of a paradigmatic project Carlos Aguado Sanchez Jakob Blomer Predrag Buncic.
Atlas Software Structure Complicated system maintained at CERN – Framework for Monte Carlo and real data (Athena) MC data generation, simulation and reconstruction.
Cloud Computing – UNIT - II. VIRTUALIZATION Virtualization Hiding the reality The mantra of smart computing is to intelligently hide the reality Binary->
Predrag Buncic (CERN/PH-SFT) Software Packaging: Can Virtualization help?
T3g software services Outline of the T3g Components R. Yoshida (ANL)
2nd ASPERA Workshop May 2011, Barcelona, Spain P. Mato /CERN.
CernVM-FS Infrastructure for EGI VOs Catalin Condurache - STFC RAL Tier1 EGI Webinar, 5 September 2013.
Feedback from CMS Andrew Lahiff STFC Rutherford Appleton Laboratory Contributions from Christoph Wissing, Bockjoo Kim, Alessandro Degano CernVM Users Workshop.
Predrag Buncic (CERN/PH-SFT) Virtualization – the road ahead.
OpenNebula: Experience at SZTAKI Peter Kacsuk, Sandor Acs, Mark Gergely, Jozsef Kovacs MTA SZTAKI EGI CF Helsinki.
36 th LHCb Software Week Pere Mato/CERN.  Provide a complete, portable and easy to configure user environment for developing and running LHC data analysis.
NA61 Collaboration Meeting CERN, December Predrag Buncic, Mihajlo Mudrinic CERN/PH-SFT Enabling long term data preservation.
Predrag Buncic (CERN/PH-SFT) CernVM Status. CERN, 24/10/ Virtualization R&D (WP9)  The aim of WP9 is to provide a complete, portable and easy.
EGI-InSPIRE RI EGI Webinar EGI-InSPIRE RI Porting your application to the EGI Federated Cloud 17 Feb
Cloud Computing Application in High Energy Physics Yaodong Cheng IHEP, CAS
Predrag Buncic (CERN/PH-SFT) Virtualization R&D (WP9) Status Report.
Predrag Buncic, CERN/PH-SFT The Future of CernVM.
CHEP 2010 Taipei, 19 October Predrag Buncic Jakob Blomer, Carlos Aguado Sanchez, Pere Mato, Artem Harutyunyan CERN/PH-SFT.
Use of HLT farm and Clouds in ALICE
Virtualisation for NA49/NA61
NA61/NA49 virtualisation:
Blueprint of Persistent Infrastructure as a Service
Dag Toppe Larsen UiB/CERN CERN,
Progress on NA61/NA49 software virtualisation Dag Toppe Larsen Wrocław
Dag Toppe Larsen UiB/CERN CERN,
StratusLab Final Periodic Review
StratusLab Final Periodic Review
Provisioning 160,000 cores with HEPCloud at SC17
From Application to Appliance
Virtualisation for NA49/NA61
Virtualization in the gLite Grid Middleware software process
CernVM Status Report Predrag Buncic (CERN/PH-SFT).
20409A 7: Installing and Configuring System Center 2012 R2 Virtual Machine Manager Module 7 Installing and Configuring System Center 2012 R2 Virtual.
Outline Virtualization Cloud Computing Microsoft Azure Platform
Cloud Web Filtering Platform
Internet and Web Simple client-server model
Presentation transcript:

The CernVM Project A new approach to software distribution Carlos Aguado Jakob Predrag

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments Outline Motivation Mission Requirements Service architecture Conclusions

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments Outline Motivation Mission Requirements Service architecture Conclusions

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments Motivation LHC experiment frameworks are a complex world – Heavy, multiplatform (arch+OS+compiler) – Grid and development environments Industry trend to Multi/Many-Core CPU architectures Demand for scalable, parallel applications (simulation, reconstruction, analysis) – Large architectural impact, different programming model

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments Outline Motivation Mission Requirements Service architecture Conclusions

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments Project mission Portable Analysis Environment using Virtualization Technology (WP9) – Approved in 2007 (2+2 years) as R&D activity in CERN/PH Department Project goals: – Provide a complete, portable and easy to configure user environment for developing and running LHC data analysis – Reduce effort to install, maintain and keep up to date the experiment software – Decouple application lifecycle from evolution of system infrastructure – Lower the cost of software development by reducing the number of platforms

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments Constraints This is a proof of concept: preproduction environment Oriented to create community Not a final production service, but worldwide Economic Modest budget for external support (tools) and HW (during all project long) – Netapps, SAN clusters, Barracuda, Inifiband, etc. – RightScale, VSA, etc.

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments Objectives CernVM: versatile virtual platform – Baseline virtual machine: CernVM Virtual Software Appliance Management of the baseline image for all experiments with possible further customization – Software Delivery Network: the CernVM File System (CVMFS) HTTP compliant

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments Outline Motivation Mission Requirements Service architecture Conclusions

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments CernVM App Store

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments CernVM App Store  Scalability  Delegated management  Availability  Performance

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments Requirements Ability to provide: – Virtual Appliance Configuration Management – Uniform and persistent URL namespace With a large HTTP-repository behind – All services sandboxed as appliances Deployment and management interface

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments Outline Motivation Mission Requirements Service architecture Conclusions

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments Application model

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments As easy as 1,2,3 import xmlrpclib import os server = xmlrpclib.ServerProxy(url) r = server.cernvm.Config.configGridUIVersion(” ") 1. Login to Web interface 2. Create user account 3. Select experiment, appliance flavor and preferences

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments Service architecture I

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments Primary (master) software repository Secondary software repository Secondary software repository Reverse Proxy Load Balancing Reverse Proxy Load Balancing Reverse Proxy CernVM (thin client) Site Reverse Proxy Load Balancing Reverse Proxy Regional Reverse Proxy Service architecture II The aim is to reduce latency which is the most important issue for distributed network file systems

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments rPath philosophy: – JeOS – Transactional model for software deployment (Conary) – Simple user interface (rAA) service

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments rPath philosophy: – JeOS – Transactional model for software deployment (Conary) – Simple user interface (rAA) service Build types  Installable CD/DVD  Stub Image  Raw Filesystem Image  Netboot Image  Compressed Tar File  Demo CD/DVD (Live CD/DVD)  Raw Hard Disk Image  Vmware ® Virtual Appliance  Vmware ® ESX Server Virtual Appliance  Microsoft ® VHD Virtual Apliance  Xen Enterprise Virtual Appliance  Virtual Iron Virtual Appliance  Parallels Virtual Appliance  Amazon Machine Image  Update CD/DVD  Appliance Installable ISO

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments class Root(CPackageRecipe): name='root' version=' ' buildRequires = ['libpng:devel', 'libpng:devellib','krb5:devel', 'libstdc++:devel’,'libxml2:devel', 'openssl:devel','python:devel', 'xorg-x11:devel', 'zlib:devel', 'perl:devel', 'perl:runtime'] def setup(r): r.addArchive('ftp://root.cern.ch/root/%(name)s_v%(version)s.source.tar.gz’) r.Environment('ROOTSYS',%(builddir)s') r.ManualConfigure('--prefix=/opt/root ') r.Make() r.MakeInstall() Conary Package Manager

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments CDN: the URL scheme://hostname:port/path?query#fragment  scheme: http | https  hostname: cernvm.cern.ch cernvm-webfs.cern.ch rbuilder.cern.ch …

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments Service architecture

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments 1: Front end Gateway to back end services DNS Load-balanced cluster hosting public services: – Switch L7: combination of Squid/Apache ‘Router’ + webcache SSL engine URL mapper – LDAP directory (multimaster replica)

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments Virtualization platforms – Management tools are key: provision (deployment and life cycle), resource allocation, integration VMWare Infrastructure VMotion HA: High Availability DRS: Distributed Resource Scheduling 2: CPU provisioning

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments Several ways – Shared disk file systems Fibre Channel Protocol, iSCSI … On top: GFS, OCFS,GPFS… – Network file systems NFS, AFS, SMB, Lustre … Thin provisioning – Virtual SAN Appliance – Openfiler 3: Storage provisioning I

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments NexentaStor – 2-node cluster with active replication – LDAP: users, groups, netgroups – ZFS CVS-like semantics SW Raid NFS – System disk (vmdk) – Data vols. (NFS mounts) 3: Storage provisioning II

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments 4: Networking Two different physical networks – Public: dedicated CERN IP service (/24) Only the front end is opened in the central firewall – Private: access to storage and VM operations Aggregation of interfaces is used between CPU servers and the storage

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments Network emulated within ESX: – VMs can easily move Connection to NexentaStor via 2xGb 4:Networking

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments This is it! Front end rBuilder VMWare Nexenta

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments And some statistics With those 8 machines we currently run: – CVMFS (clients and web servers), rBuilder, LDAP, Zenoss, Boinc, djabberd, SFT-Web Traffic (April 2009): – +15GB from experiment repositories – +600 downloads of CernVM – +800 different IPs – +50GB total HTTP traffic – ~30% hits in the cache

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments Outline Motivation Mission Requirements Service architecture Conclusions

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments Conclusions Complete solution for LHC community – Run your release in your HW without management issues Versatile and modular infrastructure – Easy deployment of new services and resources – Good performance for commodity HW – Easy replicable by definition (DNS-related techniques)

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments Thanks! Mailing lists: (open list for announcements and discussion) (end-user support for the CernVM project) Savannah Portal: Please submit bugs and feature requests to Savannah at CernVM Home Page: rBuilder & Download Page: CernVM Wiki: ATLAS Wiki:

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments Bridging Grids & Clouds BOINC – Open-source software for volunteer computing and grid computing – CernVM CoPilot development – Based on BOINC, experience and CernVM image – Image size is of outmost importance to motivate volunteers – Can be easily adapted to Pilot Job frameworks (AliEn,Dirac, Panda) … or Condor Worker, or proofd.. – Aims to demonstrate running of ATLAS simulation using BOINC infrastructure and PanDa BOINC PanDA Pilot

HEPiX Umeå May 27, 2009CernVM – A virtual machine for LHC experiments CernVM CoPilot Adapter 0. Send host JDL (free disk space, free memory, available packages) 1. Append framework specific information and request a job 3. Send input files and commands for execution (packages are already there) 5. Register output files 4. When the job is done send back the output files (and the result of validation) 2. Send user job JDL from Task Queue AliEn/DIRAC/PanDA