EGEE-II INFSO-RI-031688 Enabling Grids for E-sciencE www.eu-egee.org EGEE and gLite are registered trademarks CRAB: the CMS tool to allow data analysis.

Slides:



Advertisements
Similar presentations
1 14 Feb 2007 CMS Italia – Napoli A. Fanfani Univ. Bologna A. Fanfani University of Bologna MC Production System & DM catalogue.
Advertisements

1 CRAB Tutorial 19/02/2009 CERN F.Fanzago CRAB tutorial 19/02/2009 Marco Calloni CERN – Milano Bicocca Federica Fanzago INFN Padova.
Grid and CDB Janusz Martyniak, Imperial College London MICE CM37 Analysis, Software and Reconstruction.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Infrastructure overview Arnold Meijster &
Workload Management Workpackage Massimo Sgaravatto INFN Padova.
Workload Management Massimo Sgaravatto INFN Padova.
A tool to enable CMS Distributed Analysis
Ian M. Fisk Fermilab February 23, Global Schedule External Items ➨ gLite 3.0 is released for pre-production in mid-April ➨ gLite 3.0 is rolled onto.
Client/Server Grid applications to manage complex workflows Filippo Spiga* on behalf of CRAB development team * INFN Milano Bicocca (IT)
LHC Experiment Dashboard Main areas covered by the Experiment Dashboard: Data processing monitoring (job monitoring) Data transfer monitoring Site/service.
Ian Fisk and Maria Girone Improvements in the CMS Computing System from Run2 CHEP 2015 Ian Fisk and Maria Girone For CMS Collaboration.
Physicists's experience of the EGEE/LCG infrastructure usage for CMS jobs submission Natalia Ilina (ITEP Moscow) NEC’2007.
CERN - IT Department CH-1211 Genève 23 Switzerland t Monitoring the ATLAS Distributed Data Management System Ricardo Rocha (CERN) on behalf.
Zhiling Chen (IPP-ETHZ) Doktorandenseminar June, 4 th, 2009.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE II - Network Service Level Agreement (SLA) Establishment EGEE’07 Mary Grammatikou.
José M. Hernández CIEMAT Grid Computing in the Experiment at LHC Jornada de usuarios de Infraestructuras Grid January 2012, CIEMAT, Madrid.
Computing Infrastructure Status. LHCb Computing Status LHCb LHCC mini-review, February The LHCb Computing Model: a reminder m Simulation is using.
F.Fanzago – INFN Padova ; S.Lacaprara – LNL; D.Spiga – Universita’ Perugia M.Corvo - CERN; N.DeFilippis - Universita' Bari; A.Fanfani – Universita’ Bologna;
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Job Monitoring for the LHC experiments Irina Sidorova (CERN, JINR) on.
INFSO-RI Enabling Grids for E-sciencE Workload Management System Mike Mineter
Finnish DataGrid meeting, CSC, Otaniemi, V. Karimäki (HIP) DataGrid meeting, CSC V. Karimäki (HIP) V. Karimäki (HIP) Otaniemi, 28 August, 2000.
DOSAR Workshop, Sao Paulo, Brazil, September 16-17, 2005 LCG Tier 2 and DOSAR Pat Skubic OU.
F. Fassi, S. Cabrera, R. Vives, S. González de la Hoz, Á. Fernández, J. Sánchez, L. March, J. Salt, A. Lamas IFIC-CSIC-UV, Valencia, Spain Third EELA conference,
November SC06 Tampa F.Fanzago CRAB a user-friendly tool for CMS distributed analysis Federica Fanzago INFN-PADOVA for CRAB team.
Tier-2  Data Analysis  MC simulation  Import data from Tier-1 and export MC data CMS GRID COMPUTING AT THE SPANISH TIER-1 AND TIER-2 SITES P. Garcia-Abia.
David Adams ATLAS ADA, ARDA and PPDG David Adams BNL June 28, 2004 PPDG Collaboration Meeting Williams Bay, Wisconsin.
Enabling Grids for E-sciencE EGEE-III INFSO-RI Using DIANE for astrophysics applications Ladislav Hluchy, Viet Tran Institute of Informatics Slovak.
CERN IT Department CH-1211 Genève 23 Switzerland t Monitoring: Tracking your tasks with Task Monitoring PAT eLearning – Module 11 Edward.
And Tier 3 monitoring Tier 3 Ivan Kadochnikov LIT JINR
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks WMSMonitor: a tool to monitor gLite WMS/LB.
Enabling Grids for E-sciencE System Analysis Working Group and Experiment Dashboard Julia Andreeva CERN Grid Operations Workshop – June, Stockholm.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Overview of STEP09 monitoring issues Julia Andreeva, IT/GS STEP09 Postmortem.
Stefano Belforte INFN Trieste 1 Middleware February 14, 2007 Resource Broker, gLite etc. CMS vs. middleware.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks IPv6 test methodology Mathieu Goutelle (CNRS.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The usage of the gLite Workload Management.
1 Andrea Sciabà CERN Critical Services and Monitoring - CMS Andrea Sciabà WLCG Service Reliability Workshop 26 – 30 November, 2007.
INFSO-RI Enabling Grids for E-sciencE EGEE is a project funded by the European Union under contract INFSO-RI Grid Accounting.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Antonio Retico CERN, Geneva 19 Jan 2009 PPS in EGEEIII: Some Points.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks MSG - A messaging system for efficient and.
Integration of the ATLAS Tag Database with Data Management and Analysis Components Caitriana Nicholson University of Glasgow 3 rd September 2007 CHEP,
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
Use of the gLite-WMS in CMS for production and analysis Giuseppe Codispoti On behalf of the CMS Offline and Computing.
Online Monitoring System at KLOE Alessandra Doria INFN - Napoli for the KLOE collaboration CHEP 2000 Padova, 7-11 February 2000 NAPOLI.
Testing and integrating the WLCG/EGEE middleware in the LHC computing Simone Campana, Alessandro Di Girolamo, Elisa Lanciotti, Nicolò Magini, Patricia.
INFSO-RI Enabling Grids for E-sciencE CRAB: a tool for CMS distributed analysis in grid environment Federica Fanzago INFN PADOVA.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Update Authorization Service Christoph Witzig,
MND review. Main directions of work  Development and support of the Experiment Dashboard Applications - Data management monitoring - Job processing monitoring.
Daniele Spiga PerugiaCMS Italia 14 Feb ’07 Napoli1 CRAB status and next evolution Daniele Spiga University & INFN Perugia On behalf of CRAB Team.
1 Andrea Sciabà CERN The commissioning of CMS computing centres in the WLCG Grid ACAT November 2008 Erice, Italy Andrea Sciabà S. Belforte, A.
INFSO-RI Enabling Grids for E-sciencE Using of GANGA interface for Athena applications A. Zalite / PNPI.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Regional Nagios Emir Imamagic /SRCE EGEE’09,
1 A Scalable Distributed Data Management System for ATLAS David Cameron CERN CHEP 2006 Mumbai, India.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Practical using WMProxy advanced job submission.
Distributed Physics Analysis Past, Present, and Future Kaushik De University of Texas at Arlington (ATLAS & D0 Collaborations) ICHEP’06, Moscow July 29,
Enabling Grids for E-sciencE CMS/ARDA activity within the CMS distributed system Julia Andreeva, CERN On behalf of ARDA group CHEP06.
D.Spiga, L.Servoli, L.Faina INFN & University of Perugia CRAB WorkFlow : CRAB: CMS Remote Analysis Builder A CMS specific tool written in python and developed.
INFSO-RI Enabling Grids for E-sciencE File Transfer Software and Service SC3 Gavin McCance – JRA1 Data Management Cluster Service.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Mining Job Monitoring Data Automatic Error.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Job Management Claudio Grandi.
SAM architecture EGEE 07 Service Availability Monitor for the LHC experiments Simone Campana, Alessandro Di Girolamo, Nicolò Magini, Patricia Mendez Lorenzo,
Claudio Grandi INFN Bologna Workshop congiunto CCR e INFNGrid 13 maggio 2009 Le strategie per l’analisi nell’esperimento CMS Claudio Grandi (INFN Bologna)
ATLAS Distributed Analysis S. González de la Hoz 1, D. Liko 2, L. March 1 1 IFIC – Valencia 2 CERN.
Workload Management Workpackage
CRAB Server CRAB (CMS Remote Analysis Builder)
Simulation use cases for T2 in ALICE
Monitoring of the infrastructure from the VO perspective
N. De Filippis - LLR-Ecole Polytechnique
Grid Computing in CMS: Remote Analysis & MC Production
Presentation transcript:

EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks CRAB: the CMS tool to allow data analysis in a distributed environment Federica Fanzago for the CRAB team CERN - CNAF

Federica Fanzago CERN – CNAF EGEE USER FORUM 2008 Clermont-Ferrand 2 Enabling Grids for E-sciencE EGEE-II INFSO-RI Outline CMS introduction –CMS computing model Distributed analysis CRAB “standalone” –the flow –CRAB usage The evolution of CRAB  the client – server architecture –the flow –the implementation –CRAB client – server usage Conclusion

Federica Fanzago CERN – CNAF EGEE USER FORUM 2008 Clermont-Ferrand 3 Enabling Grids for E-sciencE EGEE-II INFSO-RI Introduction CMS “Compact Muon Solenoid” is one of the four particle physics experiment that will collect data at LHC “Large Hadron Collider” starting in 2008 at CERN The large amount of produced data (events) should be available for analysis to world-wide distributed physicists “bunch crossing” every 25 nsecs. 100 “triggers” per second Each triggered event ~1 MB in size CMS will produce –~2 PB events/year (assumes startup luminosity 2x1033 cm-2 s-1) All events will be stored into files –O(10^6) files/year Files will be grouped in Fileblocks –O(10^3) Fileblocks/year Fileblocks will be grouped in Datasets –O(10^3) Datasets (total after 10 years of CMS) – TB

Federica Fanzago CERN – CNAF EGEE USER FORUM 2008 Clermont-Ferrand 4 Enabling Grids for E-sciencE EGEE-II INFSO-RI In order to –store and manage this huge quantity of data –assure data access to physicists of CMS collaboration –assure enough computing power for data analysis and simulation –guarantee resources and data availability to allowed users CMS uses a distributed architecture based on Grid infrastructure Tools for accessing distributed data and resources are provided by WLCG (World LHC Computing Grid) with two main different flavours –LCG/gLite in Europe, OSG in the US Issues and help

Federica Fanzago CERN – CNAF EGEE USER FORUM 2008 Clermont-Ferrand 5 Enabling Grids for E-sciencE EGEE-II INFSO-RI Online system Tier 0 Tier 1 Tier 2 Tier 3 Offline farm CERN Computer center. Tier2 Center InstituteB InstituteA... workstation Italy Regional Center Fermilab Regional Center France Regional Center recorded data CMS computing model The CMS offline computing system is arranged in hierarchical Tiers geographically distributed. Remote data accessible via grid T1s take care about calibration, skimming and reconstruction. They sent data to T2s Data from DAQ are sent, stored and first step reconstructed at Tier-0, then spread over T1s T2s provide power for analysis and simulation

Federica Fanzago CERN – CNAF EGEE USER FORUM 2008 Clermont-Ferrand 6 Enabling Grids for E-sciencE EGEE-II INFSO-RI Distributed analysis The analysis in a distributed environment is a complex computing task because it assume to know: –which data are available –where data are stored and how to access them –which resources are available and are able to comply with analysis requirements –Grid and CMS infrastructure details Users point of view: –Want to analyze distributed data as they were in the local farm –Don’t want to became Grid experts

Federica Fanzago CERN – CNAF EGEE USER FORUM 2008 Clermont-Ferrand 7 Enabling Grids for E-sciencE EGEE-II INFSO-RI Analysis chain The CMS collaboration is developing some tools interfaced with Grid services to simplify the analysis task, including –Agents for automatic data distribution among tiers –Catalogs for data location –CMS software distribution and installation –CRAB user-friendly tool to help users to run their analysis code on data available at remote sites, hiding Grid and CMS infrastructure details

Federica Fanzago CERN – CNAF EGEE USER FORUM 2008 Clermont-Ferrand 8 Enabling Grids for E-sciencE EGEE-II INFSO-RI CRAB The aim of CRAB (Cms Remote Analysis Builder) is to simplify the work of users to create, submit and manage their analysis job in Grid environments. Users have to develop their analysis code in a interactive environment and decide which data to analyse. CRAB handles data discovery, resources availability, job creation and submission, status monitoring and output retrieval.

Federica Fanzago CERN – CNAF EGEE USER FORUM 2008 Clermont-Ferrand 9 Enabling Grids for E-sciencE EGEE-II INFSO-RI The user-friendly tool CRAB is a user-friendly tool –Simple to install (distributed as tar archive) –Only a configuration file where to provide analysis information –Command line for actions (under user control)  >>crab -create  >>crab -submit (…. -status, -getOutput, -kill….) –Requirements  To interact with the Grid services and resources: CRAB must be installed in a UI  To interact with CMS environment: the CMS software has to be available in the UI

Federica Fanzago CERN – CNAF EGEE USER FORUM 2008 Clermont-Ferrand 10 Enabling Grids for E-sciencE EGEE-II INFSO-RI CRAB “standalone” flow

Federica Fanzago CERN – CNAF EGEE USER FORUM 2008 Clermont-Ferrand 11 Enabling Grids for E-sciencE EGEE-II INFSO-RI Other info CRAB was born in April 2005 It is written in python Users appreciated the tool, providing feedback and asking for improvements CRAB was used by many CMS collaborators to analyze remote data for CMS Physics TDR, otherwise not accessible Involved in most of the CMS Challenges –the Magnet Test Cosmic Challenge –the Computing, Software, and Analysis (CSA06) Challenges over millions of simulated events

Federica Fanzago CERN – CNAF EGEE USER FORUM 2008 Clermont-Ferrand 12 Enabling Grids for E-sciencE EGEE-II INFSO-RI CRAB usage during CSA06 CSA 2006 (last 2006 quarter): –jobs/day peak: 18 kjobs –Average rate: 9 kjobs/day Top 20 CE where CRAB-Jobs run Top 20 dataset

Federica Fanzago CERN – CNAF EGEE USER FORUM 2008 Clermont-Ferrand 13 Enabling Grids for E-sciencE EGEE-II INFSO-RI The CRAB client – server idea Current users community –~600 users in 2007 –The average daily number ~70 Their number is increasing The submission jobs too… When real data will be available the foreseen number of analysis job is 100 kjobs/day We need to improve the scalability of whole system. CRAB client – server architecture

Federica Fanzago CERN – CNAF EGEE USER FORUM 2008 Clermont-Ferrand 14 Enabling Grids for E-sciencE EGEE-II INFSO-RI Why a CRAB server? To increase the scalability of the whole system to comply with the CMS requirements (100 kjobs/day) To reduce the user load automating most of the action done via command line, as submission, error handling, output retrieval, resubmission To improve the reliability of the system delegating to the server jobs handling Constraints: –direct and Server submission modes must be transparent –same configuration file, same commands interface –simple switch from standalone to server mode and vice versa

Federica Fanzago CERN – CNAF EGEE USER FORUM 2008 Clermont-Ferrand 15 Enabling Grids for E-sciencE EGEE-II INFSO-RI CRAB Client - Server flow The server: –Get tasks/credentials –Multi-threaded submit –Monitor automatically the jobs life-cycle –Retrieve automatically the output –Handle errors automatically –Decide whether to resubmit jobs –Notify the users whenever done CRAB Client User interface

Federica Fanzago CERN – CNAF EGEE USER FORUM 2008 Clermont-Ferrand 16 Enabling Grids for E-sciencE EGEE-II INFSO-RI Server architecture The server components are independent and implemented as agents communicating through an asynchronous and persistent message service. Submission Related Monitoring Related

Federica Fanzago CERN – CNAF EGEE USER FORUM 2008 Clermont-Ferrand 17 Enabling Grids for E-sciencE EGEE-II INFSO-RI Server usage server test reached up to 19 kjobs in 1 day CRAB job submitted in January 2008 ~ 600 users CRAB standalone ~ 40 users CRAB client server

Federica Fanzago CERN – CNAF EGEE USER FORUM 2008 Clermont-Ferrand 18 Enabling Grids for E-sciencE EGEE-II INFSO-RI Some statistics ~ jobs terminated in January 2008

Federica Fanzago CERN – CNAF EGEE USER FORUM 2008 Clermont-Ferrand 19 Enabling Grids for E-sciencE EGEE-II INFSO-RI Job failures Grid and CMS infrastructure congestion of RB/WMS. problem with the copy of output file to Storage Element wrong site installation CMS infrastructure CMSSW software not available CMSSW misleading error code problems with published dataset (some input file not found) User code not correct

Federica Fanzago CERN – CNAF EGEE USER FORUM 2008 Clermont-Ferrand 20 Enabling Grids for E-sciencE EGEE-II INFSO-RI Failure statistics %Error Code Description Failed to copy an output file to the SE Output file(s) not found killed, unblockable (POSIX) 7.61Hangup Output_sandbox too big for WMS CMSSW exception Abort(ANSI) 1.490Application exception 1.0Error without spec Required application version is not found at the site Error while loading shared library End of job from user application VO_CMS_SW_DIR not defined Also due to input file not found Also due to problem with copy to SE

Federica Fanzago CERN – CNAF EGEE USER FORUM 2008 Clermont-Ferrand 21 Enabling Grids for E-sciencE EGEE-II INFSO-RI Conclusion CRAB was born in April ’05 A big effort to understand user needs and how to use in the best way services provided by Grid Lot of work to make it robust, flexible and reliable  from CRAB “Standalone” to CRAB Client-Server The support is an important task: need effort by us, remote site manager and Grid service in order to improve the stability of the system Users appreciate the tool and are asking for further improvements The use of CRAB proves the distributed analysis works for a generic CMS user !

Federica Fanzago CERN – CNAF EGEE USER FORUM 2008 Clermont-Ferrand 22 Enabling Grids for E-sciencE EGEE-II INFSO-RI Documentation CRAB homepage – CRAB twiki – – CRAB mailing list for users –