Julia Andreeva on behalf of the MND section MND review.

Slides:



Advertisements
Similar presentations
WLCG Monitoring Consolidation NEC`2013, Varna Julia Andreeva CERN IT-SDC.
Advertisements

1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
CERN IT Department CH-1211 Genève 23 Switzerland t Messaging System for the Grid as a core component of the monitoring infrastructure for.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES News on monitoring for CMS distributed computing operations Andrea.
LHC Experiment Dashboard Main areas covered by the Experiment Dashboard: Data processing monitoring (job monitoring) Data transfer monitoring Site/service.
CERN - IT Department CH-1211 Genève 23 Switzerland t Monitoring the ATLAS Distributed Data Management System Ricardo Rocha (CERN) on behalf.
Input from CMS Nicolò Magini Andrea Sciabà IT/SDC 5 July 2013.
ATLAS Off-Grid sites (Tier-3) monitoring A. Petrosyan on behalf of the ATLAS collaboration GRID’2012, , JINR, Dubna.
CERN IT Department CH-1211 Geneva 23 Switzerland t The Experiment Dashboard ISGC th April 2008 Pablo Saiz, Julia Andreeva, Benjamin.
CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services GS group meeting Monitoring and Dashboards section Activity.
CERN IT Department CH-1211 Genève 23 Switzerland t EIS section review of recent activities Harry Renshall Andrea Sciabà IT-GS group meeting.
Enabling Grids for E-sciencE Overview of System Analysis Working Group Julia Andreeva CERN, WLCG Collaboration Workshop, Monitoring BOF session 23 January.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks VO-specific systems for the monitoring of.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Julia Andreeva CERN (IT/GS) CHEP 2009, March 2009, Prague New job monitoring strategy.
CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Job Monitoring for the LHC experiments Irina Sidorova (CERN, JINR) on.
1 Andrea Sciabà CERN Towards a global monitoring system for CMS computing Lothar A. T. Bauerdick Andrea P. Sciabà Computing in High Energy and Nuclear.
GridPP3 Project Management GridPP20 Sarah Pearce 11 March 2008.
And Tier 3 monitoring Tier 3 Ivan Kadochnikov LIT JINR
Enabling Grids for E-sciencE System Analysis Working Group and Experiment Dashboard Julia Andreeva CERN Grid Operations Workshop – June, Stockholm.
Towards a Global Service Registry for the World-Wide LHC Computing Grid Maria ALANDES, Laurence FIELD, Alessandro DI GIROLAMO CERN IT Department CHEP 2013.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Overview of STEP09 monitoring issues Julia Andreeva, IT/GS STEP09 Postmortem.
Dashboard program of work Julia Andreeva GS Group meeting
Julia Andreeva, CERN IT-ES GDB Every experiment does evaluation of the site status and experiment activities at the site As a rule the state.
1 Andrea Sciabà CERN Critical Services and Monitoring - CMS Andrea Sciabà WLCG Service Reliability Workshop 26 – 30 November, 2007.
WLCG infrastructure monitoring proposal Pablo Saiz IT/SDC/MI 16 th August 2013.
WLCG Monitoring Roadmap Julia Andreeva, CERN , WLCG workshop, CERN.
Monitoring for CCRC08, status and plans Julia Andreeva, CERN , F2F meeting, CERN.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Ricardo Rocha CERN (IT/GS) EGEE’08, September 2008, Istanbul, TURKEY Experiment.
Status Organization Overview of Program of Work Education, Training It’s the People who make it happen & make it Work.
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
XROOTD AND FEDERATED STORAGE MONITORING CURRENT STATUS AND ISSUES A.Petrosyan, D.Oleynik, J.Andreeva Creating federated data stores for the LHC CC-IN2P3,
Testing and integrating the WLCG/EGEE middleware in the LHC computing Simone Campana, Alessandro Di Girolamo, Elisa Lanciotti, Nicolò Magini, Patricia.
CERN IT Department CH-1211 Geneva 23 Switzerland t A proposal for improving Job Reliability Monitoring GDB 2 nd April 2008.
ATP Future Directions Availability of historical information for grid resources: It is necessary to store the history of grid resources as these resources.
The GridPP DIRAC project DIRAC for non-LHC communities.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI How to integrate portals with the EGI monitoring system Dusan Vudragovic.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI User-centric monitoring of the analysis and production activities within.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Monitoring of the LHC Computing Activities Key Results from the Services.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid Monitoring Tools E. Imamagic, SRCE CE.
MND review. Main directions of work  Development and support of the Experiment Dashboard Applications - Data management monitoring - Job processing monitoring.
Global ADC Job Monitoring Laura Sargsyan (YerPhI).
GridView - A Monitoring & Visualization tool for LCG Rajesh Kalmady, Phool Chand, Kislay Bhatt, D. D. Sonvane, Kumar Vaibhav B.A.R.C. BARC-CERN/LCG Meeting.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Regional Nagios Emir Imamagic /SRCE EGEE’09,
Enabling Grids for E-sciencE Grid monitoring from the VO/User perspective. Dashboard for the LHC experiments Julia Andreeva CERN, IT/PSS.
Enabling Grids for E-sciencE CMS/ARDA activity within the CMS distributed system Julia Andreeva, CERN On behalf of ARDA group CHEP06.
New solutions for large scale functional tests in the WLCG infrastructure with SAM/Nagios: The experiments experience ES IT Department CERN J. Andreeva.
CERN - IT Department CH-1211 Genève 23 Switzerland t Grid Reliability Pablo Saiz On behalf of the Dashboard team: J. Andreeva, C. Cirstoiu,
The GridPP DIRAC project DIRAC for non-LHC communities.
WLCG Operations Coordination report Maria Alandes, Andrea Sciabà IT-SDC On behalf of the WLCG Operations Coordination team GDB 9 th April 2014.
ATLAS Off-Grid sites (Tier-3) monitoring A. Petrosyan on behalf of the ATLAS collaboration GRID’2012, , JINR, Dubna.
CERN - IT Department CH-1211 Genève 23 Switzerland t IT-GD-OPS attendance to EGEE’09 IT/GD Group Meeting, 09 October 2009.
MND section. Summary of activities Job monitoring In collaboration with GridView and LB teams enabled full chain from LB harvester via MSG to Dashboard.
Ian Bird LCG Project Leader Status of EGEE  EGI transition WLCG LHCC Referees’ meeting 21 st September 2009.
Accounting Review Summary from the pre-GDB related to CPU (wallclock) accounting Julia Andreeva CERN-IT GDB 13th April
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES The Common Solutions Strategy of the Experiment Support group.
CMS Experience with the Common Analysis Framework I. Fisk & M. Girone Experience in CMS with the Common Analysis Framework Ian Fisk & Maria Girone 1.
SAM architecture EGEE 07 Service Availability Monitor for the LHC experiments Simone Campana, Alessandro Di Girolamo, Nicolò Magini, Patricia Mendez Lorenzo,
WLCG Accounting Task Force Update Julia Andreeva CERN GDB, 8 th of June,
Campana (CERN-IT/SDC), McKee (Michigan) 16 October 2013 Deployment of a WLCG network monitoring infrastructure based on the perfSONAR-PS technology.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES Monitoring Overview: status, issues and outlook Simone Campana.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Services for Distributed e-Infrastructure Access Tiziana Ferrari on behalf.
Site notifications with SAM and Dashboards Marian Babik SDC/MI Team IT/SDC/MI 12 th June 2013 GDB.
WLCG Transfers monitoring EGI Technical Forum Madrid, 17 September 2013 Pablo Saiz on behalf of the Dashboard Team CERN IT/SDC.
Accounting Review Summary and action list from the (pre)GDB Julia Andreeva CERN-IT WLCG MB 19th April
Daniele Bonacorsi Andrea Sciabà
Key Activities. MND sections
POW MND section.
FTS Monitoring Ricardo Rocha
Experiment Dashboard overviw of the applications
Monitoring of the infrastructure from the VO perspective
Presentation transcript:

Julia Andreeva on behalf of the MND section MND review

Major Achievements in 2009 GS Group meeting,  Dashboard applications are widely used by the LHC experiments, in particular by ATLAS and CMS. Example : CMS Dashboard Almost 4K unique visitors in October, more than 1.5 million pages were accessed Multiple new features added Applications are used for ATLAS and CMS computing shifts  Get on board analysis users (CMS Task monitoring) CMS analysis physicists daily use Task monitoring October analysis excersise

Major achievements in 2009  Considerable progress in development and deployment of the applications providing high level view, both at the global WLCG scope and at the level of a single site (Siteview, Google Earth for the LHC experiments)  Applications for site administrators and VO support teams at the sites are becoming more and more popular (Dashboard SAM portal, Site Status Board, Dataset monitoring for CMS)  First prototype of the Dashboard FTS monitor was developed.  Ricardo’s proposal for integration of multiple ATLAS monitoring systems was well accepted by ATLAS community. Work is ongoing, Ricardo is coordinating this activity  Further development of the Dashboard framework. Use of Google Web Toolkit inside the framework  First application for improvement of the failure diagnostics based on the association rule mining deployed for CMS  Providing service management for elog services, which became very popular (20 logbooks, 480 users, 22K entries)  Providing service management for message brokers which are used by many teams GS Group meeting,

Main directions of work for the next year  Development, deployment, maintenance and support of the Dashboard monitoring applications for the LHC experiments  Main effort will be concentrated on the generic applications (Generic Dashboard job monitoring, FTS monitoring, SSB, Dashboard SAM portal)  Use generic solutions for technical implementation, for example use MSG as a message bus.  Collaboration with the middleware development teams, FTS developers, GridView in order to enable generic monitoring for data transfer, job processing and site availability on the WLCG scope  Integration of the VO-specific monitoring systems aimed to provide high-level view of the LHC computing activities on the WLCG infrastructure GS Group meeting,

Job Monitoring Should follow Grid and application status of user jobs Tasks for the next year: - adapt for all VOs recent improvements done for CMS version of job monitoring - migrate to ActiveMQ messaging (following work started by Kuba’s student ) - provide common way for instrumentation of the VO workload management systems for publishing of the job monitoring information - in collaboration with LB, CEMon, condor_g and GridView development teams enable publishing of the Grid job status information to ActiveMQ messaging system GS Group meeting,

Data transfer monitoring Dashboard for FTS monitoring The first prototype uses GWT inside the framework. Is available at CERN for T0 export and T2 instances. In the beginning of 2010 should be available for general use, should include data from all FTS instances Further plans o Integrate VO specific information  data subscriptions, file grouping (dataset) GS Group meeting,

Site monitoring  Site Status Board In production for CMS, LHCb and Alice. Currently is actively used by CMS (computing shifts, site commissioning and space monitoring). For LHCb is recently configured for monitoring of the space tokens. Further development is driven by the requests of the suer community  VO-specific site availability based on the results of SAM tests Uses information from SAM DB. Includes interface for introducing new availabilities, stored Oracle procedures for availability calculations, some extensions to SAM schema, UI for exposing results of SAM tests and service/site availabilities. Will need to migrate to new SAM schema. Foresee support only for the UI, availability calculations will be provided by GridView. GS Group meeting,

Integration of VO-specific monitoring systems  Coordinate an effort in order to provide high level view of the LHC computing activities on the WLCG infrastructure  VO-specific monitoring systems are used as an information source, GoogleEarth or GridMap for visualization, Dashboard framework as a glue to integrate all components in a single system  Provide support for SiteView, Google Earth applications.  Collaborate with the LHC experiments in order to improve completeness of the reported information  New functionality to be enabled: Historical distributions of the numeric and status metrics in the SiteView. Enable navigation to the tickets submitted against the site GS Group meeting,

Summary  Development, maintenance and support of the Dashboard monitoring applications. Many of them are generic and can be used by any VO.  Development and support of the Dashboard framework which is used outside Dashboard project, not necessary for the development of the monitoring applications  Guidance in instrumentation of the VO-specific systems ( for instance, workload management systems) for publishing of the monitoring data  Integration of the VO-specific monitoring systems  Contribution to the improvement of the overall WLCG monitoring infrastructure (FTS Dashboard, instrumentation of the GRID services for publishing of the job status information) GS Group meeting,