HPDC Grid Monitoring Workshop June 25, 2007 Grid monitoring from the VO/user perspectives Shava Smallen.

Slides:



Advertisements
Similar presentations
Monitoring and performance measurement in Production Grid Environments David Wallom.
Advertisements

Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES News on monitoring for CMS distributed computing operations Andrea.
LHC Experiment Dashboard Main areas covered by the Experiment Dashboard: Data processing monitoring (job monitoring) Data transfer monitoring Site/service.
Grid Information Systems. Two grid information problems Two problems  Monitoring  Discovery We can use similar techniques for both.
CERN - IT Department CH-1211 Genève 23 Switzerland t Monitoring the ATLAS Distributed Data Management System Ricardo Rocha (CERN) on behalf.
OSG Public Storage and iRODS
CERN IT Department CH-1211 Geneva 23 Switzerland t The Experiment Dashboard ISGC th April 2008 Pablo Saiz, Julia Andreeva, Benjamin.
Enabling Grids for E-sciencE Overview of System Analysis Working Group Julia Andreeva CERN, WLCG Collaboration Workshop, Monitoring BOF session 23 January.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks VO-specific systems for the monitoring of.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Julia Andreeva CERN (IT/GS) CHEP 2009, March 2009, Prague New job monitoring strategy.
CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Job Monitoring for the LHC experiments Irina Sidorova (CERN, JINR) on.
ALICE-USA Grid-Deployment Plans (By the way, ALICE is an LHC Experiment, TOO!) Or (We Sometimes Feel Like and “AliEn” in our own Home…) Larry Pinsky—Computing.
CERN IT Department CH-1211 Genève 23 Switzerland t Monitoring: Tracking your tasks with Task Monitoring PAT eLearning – Module 11 Edward.
HPDC Report Domenico Vicinanza CERN IT-GD-OPS CERN, July 12 th weekly OPS section meeting.
Enabling Grids for E-sciencE System Analysis Working Group and Experiment Dashboard Julia Andreeva CERN Grid Operations Workshop – June, Stockholm.
Cracow Grid Workshop ‘06 17 October 2006 Execution Management and SLA Enforcement in Akogrimo Antonios Litke Antonios Litke, Kleopatra Konstanteli, Vassiliki.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Overview of STEP09 monitoring issues Julia Andreeva, IT/GS STEP09 Postmortem.
SAN DIEGO SUPERCOMPUTER CENTER Inca TeraGrid Status Kate Ericson November 2, 2006.
NEES Cyberinfrastructure Center at the San Diego Supercomputer Center, UCSD George E. Brown, Jr. Network for Earthquake Engineering Simulation NEES TeraGrid.
Julia Andreeva, CERN IT-ES GDB Every experiment does evaluation of the site status and experiment activities at the site As a rule the state.
GridLab Resource Management System (GRMS) Jarek Nabrzyski GridLab Project Coordinator Poznań Supercomputing and.
SAN DIEGO SUPERCOMPUTER CENTER Inca Control Infrastructure Shava Smallen Inca Workshop September 4, 2008.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Ricardo Rocha CERN (IT/GS) EGEE’08, September 2008, Istanbul, TURKEY Experiment.
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
Site Manageability & Monitoring Issues for LCG Ian Bird IT Department, CERN LCG MB 24 th October 2006.
Korea Workshop May GAE CMS Analysis (Example) Michael Thomas (on behalf of the GAE group)
CMS Usage of the Open Science Grid and the US Tier-2 Centers Ajit Mohapatra, University of Wisconsin, Madison (On Behalf of CMS Offline and Computing Projects)
Julia Andreeva on behalf of the MND section MND review.
WP1 WP2 WP3 WP4 WP5 COORDINATOR WORK PACKAGE LDR RESEARCHER ACEOLE MID TERM REVIEW CERN 3 RD AUGUST 2010 Magnoni Luca Early Stage Researcher WP5 - ATLAS.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Monitoring of the LHC Computing Activities Key Results from the Services.
CERN IT Department CH-1211 Genève 23 Switzerland t CERN IT Monitoring and Data Analytics Pedro Andrade (IT-GT) Openlab Workshop on Data Analytics.
Global ADC Job Monitoring Laura Sargsyan (YerPhI).
SAN DIEGO SUPERCOMPUTER CENTER Welcome to the 2nd Inca Workshop Sponsored by the NSF September 4 & 5, 2008 Presenters: Shava Smallen
An attempt to summarize…or … some highly subjective observations Matthias Kasemann, CERN & DESY.
Enabling Grids for E-sciencE Grid monitoring from the VO/User perspective. Dashboard for the LHC experiments Julia Andreeva CERN, IT/PSS.
CERN - IT Department CH-1211 Genève 23 Switzerland t Grid Reliability Pablo Saiz On behalf of the Dashboard team: J. Andreeva, C. Cirstoiu,
MND section. Summary of activities Job monitoring In collaboration with GridView and LB teams enabled full chain from LB harvester via MSG to Dashboard.
INFSO-RI Enabling Grids for E-sciencE File Transfer Software and Service SC3 Gavin McCance – JRA1 Data Management Cluster Service.
Update on CHEP from the Computing Speaker Committee G. Carlino (INFN Napoli) on behalf of the CSC ICB, October
TIFR, Mumbai, India, Feb 13-17, GridView - A Grid Monitoring and Visualization Tool Rajesh Kalmady, Digamber Sonvane, Kislay Bhatt, Phool Chand,
Efi.uchicago.edu ci.uchicago.edu Sharing Network Resources Ilija Vukotic Computation and Enrico Fermi Institutes University of Chicago Federated Storage.
A Statistical Analysis of Job Performance on LCG Grid David Colling, Olivier van der Aa, Mona Aggarwal, Gidon Moont (Imperial College, London)
1 The Life-Science Grid Community Tristan Glatard 1 1 Creatis, CNRS, INSERM, Université de Lyon, France The Spanish Network for e-Science 2/12/2010.
Daniele Bonacorsi Andrea Sciabà
Monitoring Evolution and IPv6
Understanding the New PTC System Monitor (PSM/Dynatrace) Application’s Capabilities and Advanced Usage Stephen Vaillancourt PTC Technical Support –Technical.
WLCG Workshop 2017 [Manchester] Operations Session Summary
Chapter 19: Network Management
James Casey, CERN IT-GD WLCG Workshop 1st September, 2007
Monitoring Storage Systems for Oracle Enterprise Manager 12c
WLCG Network Discussion
Discovering Computers 2010: Living in a Digital World Chapter 14
Report from WLCG Workshop 2017: WLCG Network Requirements GDB - CERN 12th of July 2017
Key Activities. MND sections
ALICE Monitoring
POW MND section.
FTS Monitoring Ricardo Rocha
Experiment Dashboard overviw of the applications
GridICE monitoring for the EGEE infrastructure
Deploying ArcGIS at a Telecommunication Organization
Online Steering in gLite with RMOST
THE STEPS TO MANAGE THE GRID
Simulation use cases for T2 in ALICE
CernVM Status Report Predrag Buncic (CERN/PH-SFT).
Monitoring Storage Systems for Oracle Enterprise Manager 12c
LCG Operations Workshop, e-IRG Workshop
OPERATING SYSTEM OVERVIEW
Monitoring of the infrastructure from the VO perspective
Leigh Grundhoefer Indiana University
Danilo Dongiovanni INFN-CNAF
Presentation transcript:

HPDC Grid Monitoring Workshop June 25, 2007 Grid monitoring from the VO/user perspectives Shava Smallen

HPDC Grid Monitoring Workshop June 25, 2007 User Perspective Users want to do science  Can I submit a job?  Access my data?  Reasonable performance?  What is happened (happening) to my jobs ? A single Grid supports many user communities A VO focuses on a single user community and utilizes one or more Grids

HPDC Grid Monitoring Workshop June 25, 2007 VO Management Perspective Determine users’ needs Deliver resources to address the important use cases Educate and trains the users and the sites Understand future needs and patterns to plan and negotiate resources VO manager  How much resources are we using ?  Where ? To do what ?  How efficient is the usage ?  How can efficiency be improved ?  Do we have enough resources ? If not, why ? Grid/VO administrators:  VO data transfer coordinator: Status of the file transfers? Performance, problems, etc…  Production manager: How well is production progressing?  Site administrator: Do my resources work? Are they being well utilized? How well is my site serving a given VO?

HPDC Grid Monitoring Workshop June 25, 2007 Session Papers The Experiment Dashboard - The Monitoring System for the LHC Experiments  Presented by: Ricardo Brito Da Rocha (CERN) Monitoring, accounting and automated decision support for the ALICE experiment based on MonAlisa framework  Presented by: Catalin Cirstoiu (University of Bucharest) User-level Grid monitoring with Inca 2  Presented by: Shava Smallen (San Diego Supercomputer Center) Overview of the user tasks monitoring by the Job Submission Systems  Presented by: Suchandra Thapa (University of Chicago)

HPDC Grid Monitoring Workshop June 25, 2007 Monitored Information Categories:  Software/services  Jobs  Data access  Network (data transfers)  Storage Types:  Functionality/Reliability  Performance  Usage Scope:  Component  End-to-end

HPDC Grid Monitoring Workshop June 25, 2007 Data Collection Techniques Active Instrumented Passive Monitoring system interface

HPDC Grid Monitoring Workshop June 25, 2007 Centralized Data Storage Relational database

HPDC Grid Monitoring Workshop June 25, 2007 Data Display/Notifications Views Publishing formats Tools used Notifications

HPDC Grid Monitoring Workshop June 25, 2007 Discussion 1.Best practices? 2.What are the challenges? 3.Effective data display formats? 4.Can we navigate from one tool to another ?

HPDC Grid Monitoring Workshop June 25, 2007 Discussion hints Monitor of services vs. monitor of activities  Are all services OK, are the various things I can test working  How is the grid used ? Are all my users seeing same performance/problems ?  How to connect overall patterns of success/failures to monitoring of single services ?  How to decouple the failures due to the problems in the user code from the failures due to the problems of the Grid infrastructure?