Input from CMS Nicolò Magini Andrea Sciabà IT/SDC 5 July 2013.

Slides:



Advertisements
Similar presentations
WLCG Operations and Tools TEG Monitoring – Experiment Perspective Simone Campana and Pepe Flix Operations TEG Workshop, 23 January 2012.
Advertisements

Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES News on monitoring for CMS distributed computing operations Andrea.
LHC Experiment Dashboard Main areas covered by the Experiment Dashboard: Data processing monitoring (job monitoring) Data transfer monitoring Site/service.
CERN - IT Department CH-1211 Genève 23 Switzerland t Monitoring the ATLAS Distributed Data Management System Ricardo Rocha (CERN) on behalf.
OSG Operations and Interoperations Rob Quick Open Science Grid Operations Center - Indiana University EGEE Operations Meeting Stockholm, Sweden - 14 June.
CERN IT Department CH-1211 Genève 23 Switzerland t EIS section review of recent activities Harry Renshall Andrea Sciabà IT-GS group meeting.
Enabling Grids for E-sciencE Overview of System Analysis Working Group Julia Andreeva CERN, WLCG Collaboration Workshop, Monitoring BOF session 23 January.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks VO-specific systems for the monitoring of.
Monitoring the Grid at local, national, and Global levels Pete Gronbech GridPP Project Manager ACAT - Brunel Sept 2011.
CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Job Monitoring for the LHC experiments Irina Sidorova (CERN, JINR) on.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES PhEDEx Monitoring Nicolò Magini CERN IT-ES-VOS For the PhEDEx.
1 Andrea Sciabà CERN Towards a global monitoring system for CMS computing Lothar A. T. Bauerdick Andrea P. Sciabà Computing in High Energy and Nuclear.
1 1 Service Composition for LHC Computing Grid Monitoring Beob Kyun Kim e-Science Division, KISTI
Enabling Grids for E-sciencE System Analysis Working Group and Experiment Dashboard Julia Andreeva CERN Grid Operations Workshop – June, Stockholm.
Experiment Support ANALYSIS FUNCTIONAL AND STRESS TESTING Dan van der Ster, CERN IT-ES-DAS for the HC team: Johannes Elmsheuser, Federica Legger, Mario.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Overview of STEP09 monitoring issues Julia Andreeva, IT/GS STEP09 Postmortem.
Dashboard program of work Julia Andreeva GS Group meeting
CERN IT Department CH-1211 Geneva 23 Switzerland t GDB CERN, 4 th March 2008 James Casey A Strategy for WLCG Monitoring.
22 February 2008GS Group Meeting - EIS section GS-EIS: Experiment Integration Support section Five staff: Harry Renshall Section Leader Simone Campana.
Julia Andreeva, CERN IT-ES GDB Every experiment does evaluation of the site status and experiment activities at the site As a rule the state.
1 Andrea Sciabà CERN Critical Services and Monitoring - CMS Andrea Sciabà WLCG Service Reliability Workshop 26 – 30 November, 2007.
WLCG infrastructure monitoring proposal Pablo Saiz IT/SDC/MI 16 th August 2013.
Information System Status and Evolution Maria Alandes Pradillo, CERN CERN IT Department, Grid Technology Group GDB 13 th June 2012.
WLCG Monitoring Roadmap Julia Andreeva, CERN , WLCG workshop, CERN.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Ricardo Rocha CERN (IT/GS) EGEE’08, September 2008, Istanbul, TURKEY Experiment.
Report from the WLCG Operations and Tools TEG Maria Girone / CERN & Jeff Templon / NIKHEF WLCG Workshop, 19 th May 2012.
LCG Introduction John Gordon, STFC GDB June 8 th 2011.
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
Site Manageability & Monitoring Issues for LCG Ian Bird IT Department, CERN LCG MB 24 th October 2006.
The CMS Top 5 Issues/Concerns wrt. WLCG services WLCG-MB April 3, 2007 Matthias Kasemann CERN/DESY.
Testing and integrating the WLCG/EGEE middleware in the LHC computing Simone Campana, Alessandro Di Girolamo, Elisa Lanciotti, Nicolò Magini, Patricia.
ATP Future Directions Availability of historical information for grid resources: It is necessary to store the history of grid resources as these resources.
Julia Andreeva on behalf of the MND section MND review.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES Andrea Sciabà Hammercloud and Nagios Dan Van Der Ster Nicolò Magini.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI How to integrate portals with the EGI monitoring system Dusan Vudragovic.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Monitoring of the LHC Computing Activities Key Results from the Services.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Ops Portal New Requirements.
CERN IT Department CH-1211 Genève 23 Switzerland t CERN IT Monitoring and Data Analytics Pedro Andrade (IT-GT) Openlab Workshop on Data Analytics.
MND review. Main directions of work  Development and support of the Experiment Dashboard Applications - Data management monitoring - Job processing monitoring.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Ian Bird All Activity Meeting, Sofia
New solutions for large scale functional tests in the WLCG infrastructure with SAM/Nagios: The experiments experience ES IT Department CERN J. Andreeva.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES Andrea Sciabà Ideal information system - CMS Andrea Sciabà IS.
WLCG Operations Coordination Andrea Sciabà IT/SDC 10 th July 2013.
CMS: T1 Disk/Tape separation Nicolò Magini, CERN IT/SDC Oliver Gutsche, FNAL November 11 th 2013.
WLCG Operations Coordination report Maria Alandes, Andrea Sciabà IT-SDC On behalf of the WLCG Operations Coordination team GDB 9 th April 2014.
ATLAS Distributed Computing ATLAS session WLCG pre-CHEP Workshop New York May 19-20, 2012 Alexei Klimentov Stephane Jezequel Ikuo Ueda For ATLAS Distributed.
CERN - IT Department CH-1211 Genève 23 Switzerland t IT-GD-OPS attendance to EGEE’09 IT/GD Group Meeting, 09 October 2009.
MND section. Summary of activities Job monitoring In collaboration with GridView and LB teams enabled full chain from LB harvester via MSG to Dashboard.
Monitoring the Readiness and Utilization of the Distributed CMS Computing Facilities XVIII International Conference on Computing in High Energy and Nuclear.
SAM architecture EGEE 07 Service Availability Monitor for the LHC experiments Simone Campana, Alessandro Di Girolamo, Nicolò Magini, Patricia Mendez Lorenzo,
WLCG Accounting Task Force Update Julia Andreeva CERN GDB, 8 th of June,
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES Author etc Alarm framework requirements Andrea Sciabà Tony Wildish.
CERN IT Department CH-1211 Genève 23 Switzerland t Monitoring: Present and Future Pedro Andrade (CERN IT) 31 st August.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES Monitoring Overview: status, issues and outlook Simone Campana.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Regional tools use cases overview Peter Solagna – EGI.eu On behalf of the.
Site notifications with SAM and Dashboards Marian Babik SDC/MI Team IT/SDC/MI 12 th June 2013 GDB.
Efi.uchicago.edu ci.uchicago.edu Sharing Network Resources Ilija Vukotic Computation and Enrico Fermi Institutes University of Chicago Federated Storage.
Monitoring Working Group Update Grid Deployment Board 5 th December, CERN Ian Neilson.
WLCG Transfers monitoring EGI Technical Forum Madrid, 17 September 2013 Pablo Saiz on behalf of the Dashboard Team CERN IT/SDC.
Monitoring Evolution 1 Alberto AIMAR, IT-CM-MM. Outline Mandate Data Centres Monitoring Experiments Dashboards Architecture Plans Status Demo 2.
IT Monitoring Service Status and Progress 1 Alberto AIMAR, IT-CM-MM.
Claudio Grandi INFN Bologna Workshop congiunto CCR e INFNGrid 13 maggio 2009 Le strategie per l’analisi nell’esperimento CMS Claudio Grandi (INFN Bologna)
Daniele Bonacorsi Andrea Sciabà
WLCG IPv6 deployment strategy
Monitoring Evolution and IPv6
James Casey, CERN IT-GD WLCG Workshop 1st September, 2007
POW MND section.
Experiment Dashboard overviw of the applications
Maite Barroso, SA1 activity leader CERN 27th January 2009
Monitoring of the infrastructure from the VO perspective
Presentation transcript:

Input from CMS Nicolò Magini Andrea Sciabà IT/SDC 5 July 2013

Introduction  Short assessment of each application in   If not otherwise noted, no significant functionality is missing  Communication between CMS and previous Dashboard and SAM teams was already very close, so CMS needs should be well known 21 May 2013 Monitoring of Grid Operations - S. Roiser 2

Job Monitoring  Interactive view  Essential for job tracking and troubleshooting  Should be continuously adapted to changes in CMS workload management  Task monitoring  Very much used by analysis users  Monitoring on Android  Convenient but not a priority  Historical view  Totally essential for monitoring and accounting, both for central operations and for sites!  Must be accurate  MyWLCG job trends  Not used 21 May 2013 Monitoring of Grid Operations - S. Roiser 3

Data Management  WLCG transfer dashboard  Useful for transfer monitoring and troubleshooting, complements PhEDEx  AAA monitoring  Very useful as detailed xrootd monitoring  Still under development, not yet widely used  Needs more validation  Missing features:  Client-server matrix (now it’s only source-destination)  Separation between local and remote traffic  CMS Datasets, MyWLCG Transfers  Not used 21 May 2013 Monitoring of Grid Operations - S. Roiser 4

Site/Service Monitoring (1)  CMS VO feed  Essential for SAM  CMS SSB  Essential for sites, computing shifts, central operations and site support  CMS SUM  Essential for sites and computing shifts; might be merged with MyWLCG but all features must be retained  CMS Nagios  Essential  MIDMON, OPS-MONITOR  Outside of CMS scope  SAM GridMon  Potentially very useful to CMS to make direct queries to SAM  Monthly reports, A/R Trends, T0/1SiteView, GridMap  Not used by CMS 21 May 2013 Monitoring of Grid Operations - S. Roiser 5

Site/Service Monitoring (2)  SAM Nagios installation, Probe development documentation  Very important (as all documentation)  CMS Critical Services  Essential to CMS computing shifts  Personalized dashboard  Not used (it might in the future)  Google Earth interface  Not for operational use but “nice to have”  SiteView  Possibly useful for sites but actual usage unknown; not so useful for CMS central operations  SAM validation  Very useful for proper probe development and testing 21 May 2013 Monitoring of Grid Operations - S. Roiser 6

Conclusions  Most of the products of IT-SDC-MI are used by one or more of management, central operations, site contacts in CMS 21 May 2013 Monitoring of Grid Operations - S. Roiser 7

Reminder  The scope of this presenation was limited to the services provided by IT-SDC-MI. Many other monitoring tools provided by other parties are used by CMS, including but not limited to  IT-SDC-OL tools  Popularity, Victor, HammerCloud (also for commissioning/stress testing)  IT-SDC-ID tools  e.g. FTS3 Monitor  Tools from other IT groups  e.g. Lemon, SLS, …  Tools from other institutes in WLCG  e.g. HappyFace, FTS2 Monitor, …  Experiment-specific tools  PhEDEx monitoring, job monitoring tools, CMS Site Readiness, … 21 May 2013 Monitoring of Grid Operations - S. Roiser 8