

Xrootd Monitoring for the CMS Experiment

Abstract: During spring and summer 2011 CMS deployed Xrootd front-end servers on all US T1 and T2 sites. This allows for remote access to all experiment data and is used for user-analysis, visualization, running of jobs at T2s and T3s when data is not available at local sites, and as a fail-over mechanism for data-access in CMSSW jobs.

Monitoring of Xrootd infrastructure is implemented on three levels:
1. Service and data availability checks
2. Xrootd summary monitoring
   - Custom analyzer
   - MonALISA
3. Xrootd detailed monitoring
   - GLED
   - Web, Gratia, ROOT Trees, …

L.A.T. Bauerdick [1], K. Bloom [3], B.P. Bockelman [3], D.C. Bradley [4], S. Dasu [4], I. Sfiligoi [2], A. Tadel [2], M. Tadel [2], F. Wuerthwein [2], A. Yagil [2]
[1] FNAL, [2] UC San Diego, [3] University of Nebraska-Lincoln, [4] University of Wisconsin-Madison

#begin
unique_id=xrd
file_lfn=/store/data/Run2011B/…/XXXX.root
file_size=
start_time=
end_time=
read_bytes=
read_operations=196
read_min=300
read_max=
read_average=
read_sigma=
# single-read operation statistics removed
read_vector_bytes=
read_vector_operations=64
read_vector_min=
read_vector_max=
read_vector_average=
read_vector_sigma=
read_vector_count_min=3
read_vector_count_max=512
read_vector_count_average=
read_vector_count_sigma=
read_bytes_at_close=
# write operation statistics removed
user_dn=XXXX
user_vo=
user_role=
user_fqan=
client_domain=hep.wisc.edu
client_host=g22n10
server_username=cmsuser127
app_info=
server_domain=t2.ucsd.edu
server_host=uaf-7
#end

References:
- AAA & FAX: at this CHEP
- GLED: http://gled.org/
- MonALISA
- ROOT: http://root.cern.ch/
- Xrootd: http://xrootd.org/

1. Service & Data Availability

Nagios probes track the following core operations:
- Check redirection from sites
- Check authentication with CERN & OSG certificates
- Check that files can actually be read (get first 1kB)
- Mail alarms sent in case of problems

Checking of individual Xrootd servers:
- Some sites also use (historically)
- The plan is to delegate this to sites (RSV probes exist)
- Summary monitoring also reveals a lot about server state

2. Xrootd Summary Monitoring

All redirectors and servers send their summary monitoring UDP packets to a collector at UCSD where data is pre-processed and stored into MonALISA repository.

Examples of collected data:
- Number of connected clients
- Rates of new connections, authentications, and various errors
- Incoming and outgoing network traffic caused by Xrootd
- Server’s usage of system resources

Processing with ML plugins:
- Calculating per-site quantities, e.g. total traffic for each site
- Detecting error conditions and sending notifications

Presentation options:
- Standard ML graphs – for individual sites / host, totals
- Dashboard

[Diagram: monitoring data flow. Labels: Summary UDP packets, Detailed UDP packets, UDP ➙ TCP multiplexer, GLED, TTree writer, MonALISA, xrd-rep-snatcher.pl, Development / testing]

3. Xrootd Detailed Monitoring

As with summary data, detailed monitoring UDP packets are also sent to UCSD. The streams are merged and made available via a UDP to TCP converter / multiplexer.

Contents of detailed monitoring streams:
- User authentication records, including their DN and VOMS info
- File-open records, including LFN by which the file was requested
- All read and write requests (offset, length, and timestamp)
- Vector-read requests (# of elements, total length, timestamp)
  - Optionally, servers can send offset & length info for each element
- Redirection records

Default processing with GLED:
- Complete in-memory representation of all servers, sessions and open files is required as packets are highly encoded.
- Embedded http server shows currently ongoing user sessions
- When a file is closed a detailed report is generated
  - Sent to OSG Gratia and written into ROOT trees for further analysis