Survey on User’s Computing Experience

Slides:



Advertisements
Similar presentations
Andrew McNab - Manchester HEP - 17 September 2002 Putting Existing Farms on the Testbed Manchester DZero/Atlas and BaBar farms are available via the Testbed.
Advertisements

Current methods for negotiating firewalls for the Condor ® system Bruce Beckles (University of Cambridge Computing Service) Se-Chang Son (University of.
LNL CMS M.Biasotto, Bologna, 29 aprile LNL Analysis Farm Massimo Biasotto - LNL.
Parasol Architecture A mild case of scary asynchronous system stuff.
Grid and CDB Janusz Martyniak, Imperial College London MICE CM37 Analysis, Software and Reconstruction.
K.Harrison CERN, 23rd October 2002 HOW TO COMMISSION A NEW CENTRE FOR LHCb PRODUCTION - Overview of LHCb distributed production system - Configuration.
DIRAC API DIRAC Project. Overview  DIRAC API  Why APIs are important?  Why advanced users prefer APIs?  How it is done?  What is local mode what.
JetWeb on the Grid Ben Waugh (UCL), GridPP6, What is JetWeb? How can JetWeb use the Grid? Progress report The Future Conclusions.
Computing and LHCb Raja Nandakumar. The LHCb experiment  Universe is made of matter  Still not clear why  Andrei Sakharov’s theory of cp-violation.
03/27/2003CHEP20031 Remote Operation of a Monte Carlo Production Farm Using Globus Dirk Hufnagel, Teela Pulliam, Thomas Allmendinger, Klaus Honscheid (Ohio.
:: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: :: GridKA School 2009 MPI on Grids 1 MPI On Grids September 3 rd, GridKA School 2009.
CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Job Monitoring for the LHC experiments Irina Sidorova (CERN, JINR) on.
- Iain Bertram R-GMA and DØ Iain Bertram RAL 13 May 2004 Thanks to Jeff Templon at Nikhef.
Wenjing Wu Andrej Filipčič David Cameron Eric Lancon Claire Adam Bourdarios & others.
1 DIRAC – LHCb MC production system A.Tsaregorodtsev, CPPM, Marseille For the LHCb Data Management team CHEP, La Jolla 25 March 2003.
Grid job submission using HTCondor Andrew Lahiff.
November SC06 Tampa F.Fanzago CRAB a user-friendly tool for CMS distributed analysis Federica Fanzago INFN-PADOVA for CRAB team.
Enabling Grids for E-sciencE EGEE-III INFSO-RI Using DIANE for astrophysics applications Ladislav Hluchy, Viet Tran Institute of Informatics Slovak.
CERN IT Department CH-1211 Genève 23 Switzerland t Monitoring: Tracking your tasks with Task Monitoring PAT eLearning – Module 11 Edward.
Dzero MC production on LCG How to live in two worlds (SAM and LCG)
Enabling Grids for E-sciencE System Analysis Working Group and Experiment Dashboard Julia Andreeva CERN Grid Operations Workshop – June, Stockholm.
CERN Using the SAM framework for the CMS specific tests Andrea Sciabà System Analysis WG Meeting 15 November, 2007.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Overview of STEP09 monitoring issues Julia Andreeva, IT/GS STEP09 Postmortem.
Karsten Köneke October 22 nd 2007 Ganga User Experience 1/9 Outline: Introduction What are we trying to do? Problems What are the problems? Conclusions.
1 Andrea Sciabà CERN Critical Services and Monitoring - CMS Andrea Sciabà WLCG Service Reliability Workshop 26 – 30 November, 2007.
8 th CIC on Duty meeting Krakow /2006 Enabling Grids for E-sciencE Feedback from SEE first COD shift Emanoil Atanassov Todor Gurov.
Structural Biology on the GRID Dr. Tsjerk A. Wassenaar Biomolecular NMR - Utrecht University (NL)
Overview Background: the user’s skills and knowledge Purpose: what the user wanted to do Work: what the user did Impression: what the user think of Ganga.
AliRoot survey: Analysis P.Hristov 11/06/2013. Are you involved in analysis activities?(85.1% Yes, 14.9% No) 2 Involved since 4.5±2.4 years Dedicated.
Data Analysis w ith PROOF, PQ2, Condor Data Analysis w ith PROOF, PQ2, Condor Neng Xu, Wen Guan, Sau Lan Wu University of Wisconsin-Madison 30-October-09.
D.Spiga, L.Servoli, L.Faina INFN & University of Perugia CRAB WorkFlow : CRAB: CMS Remote Analysis Builder A CMS specific tool written in python and developed.
Alien and GSI Marian Ivanov. Outlook GSI experience Alien experience Proposals for further improvement.
WMS baseline issues in Atlas Miguel Branco Alessandro De Salvo Outline  The Atlas Production System  WMS baseline issues in Atlas.
A Data Handling System for Modern and Future Fermilab Experiments Robert Illingworth Fermilab Scientific Computing Division.
First test of the PoC. Caveats I am not a developer ;) I was also beta tester of Crab3+WMA in 2011; I restarted testing it ~2 weeks ago to have a 1 to.
Gestion des jobs grille CMS and Alice Artem Trunov CMS and Alice support.
VO Box discussion ATLAS NIKHEF January, 2006 Miguel Branco -
ANALYSIS TRAIN ON THE GRID Mihaela Gheata. AOD production train ◦ AOD production will be organized in a ‘train’ of tasks ◦ To maximize efficiency of full.
Vendredi 27 avril 2007 Management of ATLAS CC-IN2P3 Specificities, issues and advice.
Grid Computing: An Overview and Tutorial Kenny Daily BIT Presentation 22/09/2016.
Intro to Data Structures Concepts ● We've been working with classes and structures to form linked lists – a linked list is an example of something known.
Condor Week May 2012No user requirements1 Condor Week 2012 An argument for moving the requirements out of user hands - The CMS experience presented.
Claudio Grandi INFN Bologna Workshop congiunto CCR e INFNGrid 13 maggio 2009 Le strategie per l’analisi nell’esperimento CMS Claudio Grandi (INFN Bologna)
Service Availability Monitoring
Kevin Thaddeus Flood University of Wisconsin
Grid Computing: Running your Jobs around the World
ALICE & Clouds GDB Meeting 15/01/2013
Xiaomei Zhang CMS IHEP Group Meeting December
Overview of the Belle II computing
Practical: The Information Systems
IW2D migration to HTCondor
Akiya Miyamoto KEK 1 June 2016
David Adams Brookhaven National Laboratory September 28, 2006
CREAM Status and Plans Massimo Sgaravatto – INFN Padova
INFN-GRID Workshop Bari, October, 26, 2004
EGEE VO Management.
A full demonstration based on a “real” analysis scenario
Model (CMS) T2 setup for end users
Nicolas Jacq LPC, IN2P3/CNRS, France
Scalability Tests With CMS, Boss and R-GMA
Artem Trunov and EKP team EPK – Uni Karlsruhe
Artem Trunov, Günter Quast EKP – Uni Karlsruhe
Analysis Operations Monitoring Requirements Stefano Belforte
N. De Filippis - LLR-Ecole Polytechnique
Exploring the Power of EPDM Tasks - Working with and Developing Tasks in EPDM By: Marc Young XLM Solutions
UM D0RACE STATION Status Report Chunhui Han June 20, 2002
Agile testing for web API with Postman
DØ MC and Data Processing on the Grid
The CMS Beijing Site: Status and Application
Grid Computing Software Interface
Presentation transcript:

Survey on User’s Computing Experience Artem Trunov EKP

Consideration Motivation: to improve user's experience with regard to access to data, running jobs… Questions Where do you run jobs? How do you submit them? How do ou manage grid submission to different sites? Do you use CRAB? Is it convinient for you? Where do you keep logs from the jobs? If your jobs produce data, how to do you get it back? Where do you store it? How often do you have problems with grid jobs? What kind of problems? Where do you seek help for grid problems? Do you think you'd do a better/faster debugging of your problems if you had access to corresponding sites, etc? Would you really try to debug your jobs at this level? Your ideal model of your work environment. 9 full answers, 2 general (from new entrants)

Where and how people run jobs Grid is used by majority. One person so far managed to only use local batch exclusively. Clear preference to limit a number of sites where jobs run CERN, FNAL are the choice for high avalability DESY, Aachen, GridKa – as home or “friendly” sites, where the environment is familiar and people are willing to help Black/white list and “edg-job-submit –r” is used to steer jobs to prefered sites. CRAB is used by almost all, but some people also or exclusively use direct edg-job-submit through own automation scripts. CRAB is not (and can not be) used by people running on unofficial datasets or producing own MC. People use their local (institute) UI to submit jobs.

CRAB experience 6 – use it 3 - don’t use it 3 – like it 2 – report problems 3 - don’t use it 1 never used Grid 1 avoids using Grid and Crab for problems 1 need to run on official data

Logs and Data Personal logs – in the home dir CRAB users keep them mostly where CRAB brings them – in a job dir. Production logs – in dCache Small data output – back to home Big data output – copy to an SE Local or friendly SE is preferable

Notes on storage Individual opinions Copying from a job to an SE presents a point of failure Noticed that handling data grid way is not convenient and could be improved. Prefer to analyze data locally at institute cluster to avoid grid problems. Wi

Grid Problems 70 – 80 – 90% efficiency Trouble points CRAB 4 Data or SW published but not available Environment not correctly set 2 Problem with output to an SE 2 “Black holes” 3 RB problems updating job status, not matching sites, SEs, aborts jobs

Grid problem addressing 6 directly to site experts 1 CMS forums 4 GGUS 1 fix him/herself the system 1 fix him/herself a problem with own setup 2 could not resolve at all, don’t believe that problems could be solved 

Notes on Grid problems Simples grid job submission takes 5-10 minutes Individual opinions Simples grid job submission takes 5-10 minutes Reporting properly (GGUS, Savannah) is too much work Many problem are specific and time consuming Many problems are transient, no one adresses those “usual” problems. It’s accepted as “natural” that 70% of jobs fail. Impossible to report every single problem – too many.

Opinions on access to site for debugging 5 believe in advantage debugging at a site 2 want access to at least one grid site 1 want access to worker nodes 1 want access to a portal machine(s) 4 doesn’t believe in advantage of debugging at a site

Ideal work environment 2 Prefered all grid 3 Prefered all non-grid 3 Prefered both, split tasks 1 workgroup servers (incl. UI) 4 local batch farm 3 local access to storage Also 1 Proof 1 Better support 1 Better debugging options

My humble conclusions Grid is difficult, and in some cases preventing people from successful work However, most anyway believe in it and would use it if not completely, then partially. Most would prefer to do analysis w/o the Grid Generally, people would be happier with local access to our T2, T3 resources, where they would find: Easy access to storage Access to local batch farm Debugging options