Volunteer Clouds for the LHC experiments. H. Riahi – 12/11/15, EGI User Forum 2015. Laurence Field, Hassen Riahi, CERN IT-SDC.

Presentation transcript:

Volunteer Clouds for the LHC experiments H. Riahi – 12/11/15 EGI User Forum Laurence Field Hassen Riahi CERN IT-SDC

Volunteer Computing
- A type of distributed computing
- Origins in the mid-1990s
- Computer owners donate computing capacity
  - To a cause or project
- Not necessarily only spare cycles on desktops
  - Idle machines in data centers
  - Home clusters
- Berkeley Open Infrastructure for Network Computing (BOINC)
- Started in 2002
- Funded by the National Science Foundation (NSF)
- 12K Active Users, 16K Active Hosts
- In 2007, IBM World Community Grid switched to BOINC
- 67K Active Users, 1.8M Active Hosts

Volunteer Perspective
- Download and run BOINC software
- Choose a project
- Enter an address and password
  - Or silent connection with a key
- Earn credit

Challenges For HEP
- The cost of using the free resources (>100K hosts)
  - Initial integration requires investment
  - Operations and maintenance
- Attracting and retaining volunteers
  - Advertisement
  - Engagement
- Low Level of Assurance
  - Anyone can register as a volunteer
  - Not the same level of trust as with Grid authentication

BOINC with Virtualization
- Pioneered with Test4Theory and CernVM
- Included in the mainstream BOINC code
- BOINC projects currently deploying virtualization:
  - CERN (Theory, ATLAS, CMS, LHCb)
  - Climateprediction.net
  - …
- Vboxwrapper:

BOINC (CERN IT-PES)
- BOINC server cluster
  - Servers: Sixtrack, Theory, ATLAS
  - Test servers (CMS, LHCb, project with EPFL, dev environments)
- BOINC server application support
  - MySQL database server back-end
  - BOINC server application configuration, updates and monitoring
- Handled by the project teams:
  - Porting of applications to BOINC
  - Application-specific job management framework
  - Communication with users about scientific projects
  - Management of forums and portal content

BOINC Service Monitor (CERN IT-PES)
- Sixtrack: 100k parallel jobs

SixTrack (LHC machine)
- Original classic BOINC project for beam simulations
- Calculates stability of proton orbits in the LHC accelerator
- Based on experience from the Compact Physics Screensaver (CPSS)
  - Ran SixTrack on desktop computers at CERN
- Outreach project for CERN’s 50th anniversary, 2004
  - Also Year of Physics (Einstein Year), 2005
- Renewed effort for LHC upgrade studies (HL-LHC)
- 12K Active Users, 19K Active Hosts, 35 TeraFLOPS

Test4Theory
- Theoretical fitting of all past experimental data
  - Including from the LHC
  - Using Monte Carlo simulation based on the Standard Model
- Launched 2011
  - In partnership with the Citizen Cyberscience Centre (CCC)
- Pioneered use of virtualization with BOINC
  - Uses recent developments from CERN’s PH-SFT Group: CernVM, CernVMFS
- Wide range of potential (physics) applications
- In 2014 changed name to Virtual
- Total of 1.7 trillion events simulated since

BOINC For LHC Experiments
- Use CernVM and virtualization
- Belong to
- Follow the Vacuum model
- (LHCb) In development since 2012
- Communicates directly with DIRAC
- Started as pilot beginning of 2014
- ARC CE used to interface with BOINC
- PanDA for job management
- Supports simulations

- Started development in summer 2014
- The integration with the Analysis framework (CRAB) is achieved
  - Available from the production Analysis infrastructure
  - Supports users’ private MC generation
- The integration with the Production framework (WMAgent) is ongoing

The Vacuum Model
- In line with the cloud approach
- Common approaches
  - Reduce costs
  - Both development and operation
- Untrusted resource
  - Authentication
  - Validation
[Diagram labels: CernVM, Agent, Experiment Framework]
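
The pull-based pattern behind the vacuum model can be sketched in a few lines. This is only an illustration under stated assumptions, not the experiments' code: `TaskQueue`, `run_vm_agent` and `validate` are hypothetical names, the in-memory queue stands in for the experiment framework, and the rule inside `validate` is a placeholder for whatever checks a project applies to results coming from untrusted hosts.

```python
from collections import deque


class TaskQueue:
    """Stand-in for the experiment framework's central task queue (hypothetical)."""

    def __init__(self, tasks):
        self.pending = deque(tasks)   # (task_id, n_events) pairs
        self.accepted = {}            # task_id -> validated result
        self.rejected = []            # task_ids whose results failed validation

    def fetch(self):
        # The agent asks for work; the framework never pushes to the resource.
        return self.pending.popleft() if self.pending else None

    def report(self, task_id, result):
        # Results from untrusted resources are checked before being accepted.
        if validate(result):
            self.accepted[task_id] = result
        else:
            self.rejected.append(task_id)


def validate(result):
    # Placeholder rule (assumption): a valid result is a non-negative event count.
    return isinstance(result, int) and result >= 0


def run_vm_agent(queue, execute, max_tasks=None):
    """Agent loop inside a VM: pull and run tasks until none are left
    or the optional max_tasks limit is reached. Returns the number run."""
    done = 0
    while max_tasks is None or done < max_tasks:
        task = queue.fetch()
        if task is None:
            break
        task_id, n_events = task
        queue.report(task_id, execute(n_events))
        done += 1
    return done


queue = TaskQueue([("t1", 100), ("t2", 100), ("t3", 100)])
run_vm_agent(queue, execute=lambda n: n, max_tasks=2)   # well-behaved VM
run_vm_agent(queue, execute=lambda n: -1)               # faulty VM
print(queue.accepted)  # {'t1': 100, 't2': 100}
print(queue.rejected)  # ['t3']
```

The point of the sketch is that the resource only starts VMs; each VM's agent decides when to ask for work, and the framework decides which results to trust.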

[Figure: architecture diagram. Labels as extracted: Async stage-out, CRAB3, DataBridge, VM, Volunteer, Agent, Wrapper, VBoxwrapper, Grid, GET, Proxy, DynaFed, Ceph, VCCS, Volunteer CA, CRT, Happy User, Task, Glidein WMS, Jobs, GET Glidein, glidein, Join Condor Pool, PUT Output, FTS, WebFTS]

Usage

EGIFedCloud integration in WLCG
- Andrew McNab et al., HEPiX Spring, “The Vacuum Model for running jobs in VMs”

CMS Analysis workflow

Cloud accounting for WLCG
- Only some WLCG sites joined the EGIFedCloud
- Need to work with sites/VOs to deliver pledged resources
- Laurence Field, GDB Nov, “WLCG Resource Reporting”

EGIFedCloud resources for CMS
- 4 CMS sites have joined so far
- Pilot site: INFN-PRISMA-BARI
  - Great infrastructure support by Marica Antonacci (Thanks!)
- Only dozens of cores
  - Resources are mainly used for testing
- Pilot infrastructure for the vacuum model in CMS
  - Used now for CERN Commercial
- Consumer side accounting: cms-ganglia
- Daily log: DailyLogCMS
- CMS SAM tests: RAL-CMS-SAM

EGIFedCloud integration status for Analysis
- Vcycle instances at CERN
  - Supports multiple providers (tenants) and VOs (contextualization)
- CRAB3 submission
  - Submission to
  - Job type: Analysis/PrivateMC
- Registration in CMS information system
- Job execution
  - The job reads the input data, executes, and stages out the outputs
  - Data is read via XrootD and the stage-out is done directly or via ASO
- CMS Job Monitoring
  - Dashboard reporting
Status marks on the slide (order as extracted): ✔ ✗ ✔ ✔ ✔

Results
- Test of the execution of a privateMC workflow submitted with CRAB3
  - Generate minimum bias events with the Pythia MC generator
  - 1000 jobs (100 evt/job)
  - Direct stage-out in a CMS Tier-2
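
The job count above corresponds to 1000 × 100 = 100,000 generated events in total. As a rough sketch of how a generation request of that size maps onto fixed-size jobs, the helper below splits a total event count into per-job counts. `split_into_jobs` is a hypothetical function written for this illustration; it is not part of CRAB3 and makes no claim about how CRAB3 splits work internally.

```python
def split_into_jobs(total_events, events_per_job):
    """Return the list of per-job event counts for a generation request."""
    if total_events < 0 or events_per_job <= 0:
        raise ValueError("total_events must be >= 0 and events_per_job > 0")
    full_jobs, remainder = divmod(total_events, events_per_job)
    jobs = [events_per_job] * full_jobs
    if remainder:
        # A final, smaller job picks up the leftover events.
        jobs.append(remainder)
    return jobs


jobs = split_into_jobs(100_000, 100)
print(len(jobs), sum(jobs))  # 1000 100000
```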

Conclusions
- Volunteer Computing can provide, and is providing, significant additional computing resources
  - Potentially O(100K) machines
- The experiments are trying to exploit this opportunistic resource
  - Projects exist or are in development
- The vacuum model has been adopted with success for exploiting both EGIFedCloud and volunteer cloud resources for LHC experiments
- Need to attract WLCG cloud sites to integrate their private clouds in EGIFedCloud
  - Work with them to update their usage reports to join the WLCG production environment
