CERN IT Department CH-1211 Genève 23 Switzerland www.cern.ch/i t Monitoring: Tracking your tasks with Task Monitoring PAT eLearning – Module 11 Edward.

Slides:



Advertisements
Similar presentations
IEEE NSS 2003 Performance of the Relational Grid Monitoring Architecture (R-GMA) CMS data challenges. The nature of the problem. What is GMA ? And what.
Advertisements

Metadata Progress GridPP18 20 March 2007 Mike Kenyon.
CRAB Tutorial Federica Fanzago – Cern/Cnaf 13/02/2007 CRAB Tutorial (Cms Remote Analysis Builder)
DataGrid Kimmo Soikkeli Ilkka Sormunen. What is DataGrid? DataGrid is a project that aims to enable access to geographically distributed computing power.
CERN - IT Department CH-1211 Genève 23 Switzerland t Oracle and Streams Diagnostics and Monitoring Eva Dafonte Pérez Florbela Tique Aires.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES News on monitoring for CMS distributed computing operations Andrea.
LHC Experiment Dashboard Main areas covered by the Experiment Dashboard: Data processing monitoring (job monitoring) Data transfer monitoring Site/service.
CERN - IT Department CH-1211 Genève 23 Switzerland t Monitoring the ATLAS Distributed Data Management System Ricardo Rocha (CERN) on behalf.
Test Of Distributed Data Quality Monitoring Of CMS Tracker Dataset H->ZZ->2e2mu with PileUp - 10,000 events ( ~ 50,000 hits for events) The monitoring.
CERN IT Department CH-1211 Geneva 23 Switzerland t The Experiment Dashboard ISGC th April 2008 Pablo Saiz, Julia Andreeva, Benjamin.
Copyright © 2007, Oracle. All rights reserved. Managing Concurrent Requests.
CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services GS group meeting Monitoring and Dashboards section Activity.
F.Fanzago – INFN Padova ; S.Lacaprara – LNL; D.Spiga – Universita’ Perugia M.Corvo - CERN; N.DeFilippis - Universita' Bari; A.Fanfani – Universita’ Bologna;
Enabling Grids for E-sciencE Overview of System Analysis Working Group Julia Andreeva CERN, WLCG Collaboration Workshop, Monitoring BOF session 23 January.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Julia Andreeva CERN (IT/GS) CHEP 2009, March 2009, Prague New job monitoring strategy.
CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Job Monitoring for the LHC experiments Irina Sidorova (CERN, JINR) on.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES PhEDEx Monitoring Nicolò Magini CERN IT-ES-VOS For the PhEDEx.
Monitoring in EGEE EGEE/SEEGRID Summer School 2006, Budapest Judit Novak, CERN Piotr Nyczyk, CERN Valentin Vidic, CERN/RBI.
Grid infrastructure analysis with a simple flow model Andrey Demichev, Alexander Kryukov, Lev Shamardin, Grigory Shpiz Scobeltsyn Institute of Nuclear.
The huge amount of resources available in the Grids, and the necessity to have the most up-to-date experimental software deployed in all the sites within.
Stuart Wakefield Imperial College London Evolution of BOSS, a tool for job submission and tracking W. Bacchi, G. Codispoti, C. Grandi, INFN Bologna D.
November SC06 Tampa F.Fanzago CRAB a user-friendly tool for CMS distributed analysis Federica Fanzago INFN-PADOVA for CRAB team.
CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Overlook of Messaging.
Cracow Grid Workshop October 2009 Dipl.-Ing. (M.Sc.) Marcus Hilbrich Center for Information Services and High Performance.
Giuseppe Codispoti INFN - Bologna Egee User ForumMarch 2th BOSS: the CMS interface for job summission, monitoring and bookkeeping W. Bacchi, P.
Enabling Grids for E-sciencE System Analysis Working Group and Experiment Dashboard Julia Andreeva CERN Grid Operations Workshop – June, Stockholm.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Overview of STEP09 monitoring issues Julia Andreeva, IT/GS STEP09 Postmortem.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
Operating System Principles And Multitasking
1 Andrea Sciabà CERN Critical Services and Monitoring - CMS Andrea Sciabà WLCG Service Reliability Workshop 26 – 30 November, 2007.
CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Job Priorities update Andrea Sciabà IT/GS Ulrich Schwickerath IT/FIO.
CERN IT Department CH-1211 Genève 23 Switzerland t DM Database Monitoring Tools Database Developers' Workshop CERN, July 8 th, 2008 Dawid.
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
 CMS data challenges. The nature of the problem.  What is GMA ?  And what is R-GMA ?  Performance test description  Performance test results  Conclusions.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Update on Network Performance Monitoring.
Korea Workshop May GAE CMS Analysis (Example) Michael Thomas (on behalf of the GAE group)
CERN IT Department CH-1211 Geneva 23 Switzerland t A proposal for improving Job Reliability Monitoring GDB 2 nd April 2008.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES Andrea Sciabà Hammercloud and Nagios Dan Van Der Ster Nicolò Magini.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI How to integrate portals with the EGI monitoring system Dusan Vudragovic.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI User-centric monitoring of the analysis and production activities within.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Monitoring of the LHC Computing Activities Key Results from the Services.
Development of test suites for the certification of EGEE-II Grid middleware Task 2: The development of testing procedures focused on special details of.
CERN IT Department CH-1211 Genève 23 Switzerland t CERN IT Monitoring and Data Analytics Pedro Andrade (IT-GT) Openlab Workshop on Data Analytics.
MND review. Main directions of work  Development and support of the Experiment Dashboard Applications - Data management monitoring - Job processing monitoring.
Global ADC Job Monitoring Laura Sargsyan (YerPhI).
CERN IT Department CH-1211 Genève 23 Switzerland t Migration from ELFMs to Agile Infrastructure CERN, IT Department.
Daniele Spiga PerugiaCMS Italia 14 Feb ’07 Napoli1 CRAB status and next evolution Daniele Spiga University & INFN Perugia On behalf of CRAB Team.
Enabling Grids for E-sciencE CMS/ARDA activity within the CMS distributed system Julia Andreeva, CERN On behalf of ARDA group CHEP06.
CERN - IT Department CH-1211 Genève 23 Switzerland CASTOR F2F Monitoring at CERN Miguel Coelho dos Santos.
CERN IT Department CH-1211 Genève 23 Switzerland t Future Needs of User Support (in ATLAS) Dan van der Ster, CERN IT-GS & ATLAS WLCG Workshop.
D.Spiga, L.Servoli, L.Faina INFN & University of Perugia CRAB WorkFlow : CRAB: CMS Remote Analysis Builder A CMS specific tool written in python and developed.
CERN - IT Department CH-1211 Genève 23 Switzerland t Grid Reliability Pablo Saiz On behalf of the Dashboard team: J. Andreeva, C. Cirstoiu,
ATLAS Distributed Analysis DISTRIBUTED ANALYSIS JOBS WITH THE ATLAS PRODUCTION SYSTEM S. González D. Liko
CERN IT Department CH-1211 Genève 23 Switzerland t Bamboo users meeting IT-CS-CT.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES The Common Solutions Strategy of the Experiment Support group.
Geant4 GRID production Sangwan Kim, Vu Trong Hieu, AD At KISTI.
TIFR, Mumbai, India, Feb 13-17, GridView - A Grid Monitoring and Visualization Tool Rajesh Kalmady, Digamber Sonvane, Kislay Bhatt, Phool Chand,
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES Author etc Alarm framework requirements Andrea Sciabà Tony Wildish.
Seven things you should know about Ganga K. Harrison (University of Cambridge) Distributed Analysis Tutorial ATLAS Software & Computing Workshop, CERN,
CERN IT Department CH-1211 Genève 23 Switzerland t Load testing & benchmarks on Oracle RAC Romain Basset – IT PSS DP.
CERN IT Department CH-1211 Genève 23 Switzerland t DPM status and plans David Smith CERN, IT-DM-SGT Pre-GDB, Grid Storage Services 11 November.
CERN IT Department CH-1211 Genève 23 Switzerland t EGEE09 Barcelona ATLAS Distributed Data Management Fernando H. Barreiro Megino on behalf.
Daniele Bonacorsi Andrea Sciabà
L’analisi in LHCb Angelo Carbone INFN Bologna
ALICE Monitoring
New monitoring applications in the dashboard
Monitoring of the infrastructure from the VO perspective
Initial job submission and monitoring efforts with JClarens
Presentation transcript:

CERN IT Department CH-1211 Genève 23 Switzerland t Monitoring: Tracking your tasks with Task Monitoring PAT eLearning – Module 11 Edward Karavakis On behalf of the Dashboard Team

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Introduction You should be able to monitor your analysis tasks – without any hassle! Task Monitoring collects and exposes a user-centric set of information to the user regarding submitted tasks. Part of the Dashboard Framework: – Uses the job monitoring information from the Dashboard database. Available at: – Monitoring: Tracking your tasks with Task Monitoring

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Good reasons to use it Focused on the user's perspective. Easy to use and to navigate. Intuitive in layout. Fast with very low latency. Updates in 'real time'. Bookmark your favourite tasks. Offers a wide selection of graphical plots. User-driven development. – Monitoring: Tracking your tasks with Task Monitoring

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Job Status Collection We don't have your grid certificate – cannot directly query the Grid Logging and Bookkeeping System. We rely on the job status sent to the Dashboard either from the jobs themselves from the CRAB UI via MonALISA or on the job status information on RGMA and ICRTM. Provides monitoring functionalities regardless of the submission method or the middleware. – Monitoring: Tracking your tasks with Task Monitoring

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Inconsistent results? What happens if you see an inconsistency between Dashboard Task Monitoring and crab -status ? As soon as the job is finished at the worker node it is reported to Dashboard as finished. CRAB reports a finished job only when it is considered DONE by the Grid and normally, a small delay is introduced by the Grid Services. But, if you notice that some data is missing regarding your jobs, you should believe crab - status. – Monitoring: Tracking your tasks with Task Monitoring

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Recent Development Many UI improvements. URLs to every task, user and particular views of a given task. Consumed Time information: Task-specific: Average Efficiency, total & average CPU and Wall Clock time usage, average CPU time per event. Job-specific: Efficiency for a specific job. Extended the selection of the plots. – Monitoring: Tracking your tasks with Task Monitoring

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Graphical Plots Various Plots available, including: Distribution by Site (successful, failed, running and pending, processed events). Terminated jobs (in terms of Success/ Failures and over time). Application-failed and Grid-aborted jobs by Reason of Failure. Timing plots: Average Efficiency distributed by Site, CPU & Wall Clock time spent on Successful and Failed jobs. – Monitoring: Tracking your tasks with Task Monitoring

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Future Development Fully configurable time interval. Filters allowing you to search for detailed information about a specific task or job. Automatic generation of commands for: resubmissions / killings / getting logging info, retrieving output,.. – Monitoring: Tracking your tasks with Task Monitoring

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Required improvements The weakest point is failure diagnostics for both Grid and Application failures. It would be extremely useful to get not only the exit-code, which sometimes can be misleading, but a detailed reason of failure as well. i.e.: ‘Could not save output file A on the storage element B’. Requires modifications of the CRAB Wrapper. A user shouldn’t have to search the log files to understand what went wrong. – Monitoring: Tracking your tasks with Task Monitoring

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Feedback The Monitoring tool is available at: You can make it better! Please send us your suggestions and feedback at: – Monitoring: Tracking your tasks with Task Monitoring

CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Demo Short demonstration of the application. – Monitoring: Tracking your tasks with Task Monitoring