Tier3 monitoring. Initial issues. Danila Oleynik. Artem Petrosyan. JINR.

What is a Tier3 site?
A Tier 3 site is a computing facility used by local groups for their analysis work.
Working definition:
– "Non-pledged resources"
– "Analysis facilities" at your university/institute/...
Specifics:
– Final analysis vs. simulation and reconstruction
– Local control vs. ATLAS central control
– Operational load falls more on local resources (i.e. people) than on the central team (i.e. other people)

Types of Tier 3s
Tier 3 gs (grid services):
– Same services as a Tier 2
Tier 3 w (workstation):
– Interactive workstation with the ATLAS software
– No batch system
– Can submit grid jobs
– Data retrieved using client tools (dq2-get)

Tier 3g (most common Tier 3)
– Interactive nodes
– Can submit grid jobs
– Batch system
– ATLAS code available
– Client tools used to fetch data (dq2-ls, dq2-get)
– Storage can be one of two types:
  – Located on the worker nodes
  – Located on dedicated file servers

T3g Software Services
Generic services (services used to maintain the cluster):
– LDAP: a directory database, in this case used to manage the users on the cluster
– Ganglia: infrastructure monitoring system
– Web server
– NFS (network file system)
Job submission services:
– PROOF (most common; see the sketch below)
– Arcond (more exotic)
Distributed storage service:
– XRootD
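As an illustration of the PROOF job submission service, here is a minimal PyROOT sketch of how a user session would drive it. The master URL, tree name, and selector are placeholders: "lite://" starts a local PROOF-Lite session rather than contacting a cluster master.

```python
# Minimal sketch: driving the PROOF service from PyROOT.
# "lite://" starts a local PROOF-Lite session; a real Tier3 cluster
# would use its master URL, e.g. "master.example.org" (placeholder).
import ROOT

proof = ROOT.TProof.Open("lite://")

# Build a chain over the locally fetched dataset and process it on
# the PROOF workers with a user-supplied TSelector (hypothetical).
chain = ROOT.TChain("CollectionTree")   # tree name is illustrative
chain.Add("data/*.root")                # file pattern is illustrative
chain.SetProof()                        # route Process() through PROOF
chain.Process("MySelector.C+")          # "+" compiles the selector with ACLiC
```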

Monitoring issues for Tier3
Local monitoring:
– Infrastructure monitoring
– Monitoring of the job submission system
– Monitoring of the data management (storage) system
– Requirement: easy installation and support
Global monitoring (monitoring of all Tier3 activities):
– We do not yet have a full picture of which data should be represented at this layer

Local Tier3 Monitoring
Ganglia and Nagios are the most commonly recommended systems for infrastructure monitoring (many Tier3 sites in the US already use Ganglia). These systems provide a wide range of monitoring parameters, and a significant advantage is that they are built around plug-in technology: to monitor site-specific parameters, only the sensors need to be developed (a sketch of such a sensor follows below). Most Tier3 sites will use PROOF as the job management system and XRootD as the data management system; both come with built-in monitoring interfaces. PROOF and XRootD can also be monitored with MonALISA, but we have collected mixed opinions about it: MonALISA can be a very heavy solution for small sites. This is still under investigation.
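Ganglia's gmond accepts metric modules written in Python; below is a minimal sketch of such a sensor, assuming the standard gmond Python module interface (metric_init/metric_cleanup plus a per-metric callback). The metric itself, counting running PROOF server processes, is an illustrative stand-in for whatever site-specific parameter needs monitoring.

```python
# Minimal sketch of a Ganglia gmond Python metric module. gmond calls
# metric_init() once, then polls each metric's callback. The PROOF
# session count below is illustrative, not a required sensor.
import os

def proof_sessions(name):
    """Callback: count running PROOF server processes (illustrative)."""
    count = 0
    for pid in os.listdir('/proc'):
        if not pid.isdigit():
            continue
        try:
            with open('/proc/%s/comm' % pid) as f:
                if f.read().strip() == 'proofserv':
                    count += 1
        except IOError:
            pass  # process exited while we were scanning
    return count

def metric_init(params):
    """Called once by gmond; returns the list of metric descriptors."""
    return [{
        'name': 'proof_sessions',
        'call_back': proof_sessions,
        'time_max': 90,
        'value_type': 'uint',
        'units': 'sessions',
        'slope': 'both',
        'format': '%u',
        'description': 'Number of running PROOF server processes',
        'groups': 'tier3',
    }]

def metric_cleanup():
    pass

if __name__ == '__main__':
    # Standalone test outside gmond.
    for d in metric_init({}):
        print('%s = %u' % (d['name'], d['call_back'](d['name'])))
```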

Global Tier3 monitoring
Since we do not yet know which data from Tier3 sites should be presented in global monitoring, nor what the data flow will look like, we can only describe initial concepts for this service. The service should be based on an agent model: each agent works with the local monitoring system, collects and aggregates the needed data, and sends it to a central monitoring service (a sketch of such an agent follows below). Depending on the data flow, different technologies can be applied (REST web services, ActiveMQ). The central service should be able to collect and store this information from all sites, and should provide different interfaces to the data, both human-oriented and machine-oriented.
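A minimal sketch of what such an agent could look like, pushing summaries over plain REST. The central endpoint URL, the site name, and the JSON payload schema are all illustrative assumptions, since the actual interface is not yet defined.

```python
# Sketch of a Tier3 monitoring agent following the agent model above:
# poll the local monitoring system, aggregate, and push one summary
# record to a central service over REST. Endpoint and schema are
# hypothetical placeholders.
import json
import time
import urllib.request

CENTRAL_URL = 'https://t3mon.example.org/api/site-summary'  # hypothetical
SITE_NAME = 'JINR-T3'                                       # hypothetical

def collect_local_metrics():
    """Gather aggregated numbers from the local monitoring system.
    A placeholder here; a real agent would query Ganglia/Nagios/PROOF."""
    return {
        'cpu_load': 0.42,        # e.g. cluster-averaged load from Ganglia
        'running_jobs': 17,      # e.g. from the PROOF master
        'storage_used_tb': 3.8,  # e.g. from XRootD
    }

def send_summary(metrics):
    """POST one aggregated summary record to the central service."""
    payload = json.dumps({
        'site': SITE_NAME,
        'timestamp': int(time.time()),
        'metrics': metrics,
    }).encode('utf-8')
    req = urllib.request.Request(
        CENTRAL_URL, data=payload,
        headers={'Content-Type': 'application/json'})
    with urllib.request.urlopen(req) as resp:
        return resp.status

if __name__ == '__main__':
    while True:
        send_summary(collect_local_metrics())
        time.sleep(300)  # publish an aggregated summary every 5 minutes
```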

Interfaces for the Tier3 monitoring system
Local monitoring system: Ganglia and Nagios provide their own web interfaces; all new parameters collected by these systems will be presented through those interfaces.
Global monitoring system: for the presentation of monitored data at the global level we propose to use web-based technology. In developing this application we are ready to use Ajax (jQuery) and Django as the data-model layer (a sketch of a possible model follows below). Integration with other applications can be done via the REST protocol.
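As an illustration of the Django data-model layer, here is a minimal sketch of the models the central service could use to store per-site summaries. The fields mirror the hypothetical agent payload above and are assumptions, not a settled schema.

```python
# Minimal sketch of the Django data-model layer mentioned above, for
# the central service that stores per-site summaries. The models and
# their fields are illustrative assumptions about the schema.
from django.db import models

class Site(models.Model):
    """A Tier3 site known to the central monitoring service."""
    name = models.CharField(max_length=64, unique=True)

    def __str__(self):
        return self.name

class SiteSummary(models.Model):
    """One aggregated monitoring record pushed by a site agent."""
    site = models.ForeignKey(Site, on_delete=models.CASCADE)
    timestamp = models.DateTimeField()
    cpu_load = models.FloatField()
    running_jobs = models.IntegerField()
    storage_used_tb = models.FloatField()

    class Meta:
        ordering = ['-timestamp']  # newest records first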