Control and monitoring of trigger algorithms using Gaucho

Slides:



Advertisements
Similar presentations
Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved Managing Server.
Advertisements

CHEP 2012 – New York City 1.  LHC Delivers bunch crossing at 40MHz  LHCb reduces the rate with a two level trigger system: ◦ First Level (L0) – Hardware.
HV for SM surface testing 2 nd Workshop on the Detector Control System for TRD University of Tsukuba Kengo Watanabe.
Clara Gaspar, May 2010 The LHCb Run Control System An Integrated and Homogeneous Control System.
L. Granado Cardoso, F. Varela, N. Neufeld, C. Gaspar, C. Haen, CERN, Geneva, Switzerland D. Galli, INFN, Bologna, Italy ICALEPCS, October 2011.
Control and monitoring of on-line trigger algorithms using a SCADA system Eric van Herwijnen Wednesday 15 th February 2006.
CERN - IT Department CH-1211 Genève 23 Switzerland t The High Performance Archiver for the LHC Experiments Manuel Gonzalez Berges CERN, Geneva.
Clara Gaspar, October 2011 The LHCb Experiment Control System: On the path to full automation.
1 Alice DAQ Configuration DB
C.Combaret, L.Mirabito Lab & beamtest DAQ with XDAQ tools.
Peter Chochula ALICE DCS Workshop, October 6,2005 PVSSII Alert Handling.
Nick Brook Current status Future Collaboration Plans Future UK plans.
Dynamic Resource Monitoring and Allocation in a virtualized environment.
N ATIONAL E NERGY R ESEARCH S CIENTIFIC C OMPUTING C ENTER Charles Leggett The Athena Control Framework in Production, New Developments and Lessons Learned.
Report on the Commissioning Task Force activity Global and sub-detector views on ECS Histogram handling : – histogram handling and PVSS – the ALEPH and.
ALICE, ATLAS, CMS & LHCb joint workshop on
1 Control Software (CAT) Introduction USB Interface implementation Calorimeter Electronics Upgrade Meeting Frédéric Machefert Wednesday 5 th May, 2010.
7. CBM collaboration meetingXDAQ evaluation - J.Adamczewski1.
Management of the LHCb DAQ Network Guoming Liu * †, Niko Neufeld * * CERN, Switzerland † University of Ferrara, Italy.
Clara Gaspar, March 2005 LHCb Online & the Conditions DB.
Peter Chochula ALICE Offline Week, October 04,2005 External access to the ALICE DCS archives.
L0 DAQ S.Brisbane. ECS DAQ Basics The ECS is the top level under which sits the DCS and DAQ DCS must be in READY state before trying to use the DAQ system.
Status of Farm Monitor and Control CERN, February 24, 2005 Gianluca Peco, INFN Bologna.
CHEP March 2003 Sarah Wheeler 1 Supervision of the ATLAS High Level Triggers Sarah Wheeler on behalf of the ATLAS Trigger/DAQ High Level Trigger.
HIGUCHI Takeo Department of Physics, Faulty of Science, University of Tokyo Representing dBASF Development Team BELLE/CHEP20001 Distributed BELLE Analysis.
Clara Gaspar, July 2005 RTTC Control System Status and Plans.
Root – LHCb Online meeting Eric van Herwijnen Thursday June 14, 2006.
Management of the LHCb Online Network Based on SCADA System Guoming Liu * †, Niko Neufeld † * University of Ferrara, Italy † CERN, Geneva, Switzerland.
1 Calorimeters LED control LHCb CALO meeting Anatoli Konoplyannikov /ITEP/ Status of the calorimeters LV power supply and ECS control Status of.
Alarm Handling Oliver Holme 7 th November Guidelines & Strategies All provide recommendation to use Framework Classes Standard severities/behaviour/colours.
Configuration database status report Eric van Herwijnen September 29 th 2004 work done by: Lana Abadie Felix Schmidt-Eisenlohr.
CERN, Artur Barczyk, CERN/PH-LBC 1 Network Monitoring and Control in LHCb A (very) brief introduction.
TELL1 command line tools Guido Haefeli EPFL, Lausanne Tutorial for TELL1 users : 25.February
Clara Gaspar, April 2006 LHCb Experiment Control System Scope, Status & Worries.
INFSO-RI Enabling Grids for E-sciencE Using of GANGA interface for Athena applications A. Zalite / PNPI.
LHCb Configuration Database Lana Abadie, PhD student (CERN & University of Pierre et Marie Curie (Paris VI), LIP6.
Management of the LHCb DAQ Network Guoming Liu *†, Niko Neufeld * * CERN, Switzerland † University of Ferrara, Italy.
Alignment in real-time in current detector and upgrade 6th LHCb Computing Workshop 18 November 2015 Beat Jost / Cern.
The DCS Databases Peter Chochula. 31/05/2005Peter Chochula 2 Outline PVSS basics (boring topic but useful if one wants to understand the DCS data flow)
Update on Farm Monitor and Control Domenico Galli, Bologna RTTC meeting Genève, 14 april 2004.
Markus Frank (CERN) & Albert Puig (UB).  An opportunity (Motivation)  Adopted approach  Implementation specifics  Status  Conclusions 2.
Calibration & Monitoring M.N Minard Monitoring News Status of monitoring tools Histogramm and monitoring meeting 6/02/08 Calibration farm brainstorming.
Maria del Carmen Barandela Pazos CERN CHEP 2-7 Sep 2007 Victoria LHCb Online Interface to the Conditions Database.
M. Caprini IFIN-HH Bucharest DAQ Control and Monitoring - A Software Component Model.
Clara Gaspar, February 2010 DIM A Portable, Light Weight Package for Information Publishing, Data Transfer and Inter-process Communication.
ANALYSIS TRAIN ON THE GRID Mihaela Gheata. AOD production train ◦ AOD production will be organized in a ‘train’ of tasks ◦ To maximize efficiency of full.
Do-more Technical Training
Moving the LHCb Monte Carlo production system to the GRID
Online Control Program: a summary of recent discussions
Tango Administrative Tools
Online Displays for the World
Controlling a large CPU farm using industrial tools
Distributed object monitoring for ROOT analyses with Go4 v.3
MiniDAQ2 Workshop Control System.
Handling online information in the LHCb experiment
Publishing PVSS data on the Web
The LHCb Run Control System
GAUSS Work plan Software Week 6 March 2002 F.Ranjard / CERN.
New FSM v24r1.
Philippe Vannerem CERN / EP ICALEPCS - Oct03
Pierluigi Paolucci & Giovanni Polese
IEEE - Nuclear Science Symposium San Diego, Oct. 31st 2006
The LHCb High Level Trigger Software Framework
Gridifying the LHCb Monte Carlo production system
Eric van Herwijnen March 10th 2005
Tools for the Automation of large distributed control systems
Use Of GAUDI framework in Online Environment
Configuration DB Status report Lana Abadie
Zhangxy Zhangxm Huangxt Dec 17 ,2003
Some thoughts on scripting languages
Presentation transcript:

Control and monitoring of trigger algorithms using Gaucho Eric van Herwijnen Wednesday 12th October 2005

Contents The Problem Gaucho architecture Implementation Experience Conclusions Wed 12th October 2005 Control and monitoring of trigger algorithms using Gaucho

The problem Control and monitor trigger (Gaudi) processes on event filter farm Send monitoring data (counters, rates, histograms, status, error messages) to ECS Configure jobs on the fly Combine information from individual CPUs Wed 12th October 2005 Control and monitoring of trigger algorithms using Gaucho

running configure ready start GAUCHO architecture (1 process) running configure ready start PVSS runs DIM clients to send comands and get data from Gaudi jobs PVSS runs DIM server to send accumulated data to ROOT PVSS project with FSM Gaudi Jobs are device units FSM command “start” sends DIM command “start” to Gaudi Job FSM command (“configure”) starts execution of job Gaudi Job starts event loop and sends state “running” to PVSS Gaudi Job sends state “ready” to PVSS Gaudi Job creates a DIM server Counters and histograms Sent to PVSS

Implementation C++ Gaudi MonitorSvc allows same online/offline code PVSS Panel structure: Per job (counters, configuration, dynamic subscription to histograms on the transient store) Per node (two jobs, counters and histograms summed/averaged) Per subfarm (n nodes) ~30 datapoints/job, ~10 dpe’s each ~100 Dim services/job (some internal) Dim services setup in a PVSSCTRL PVSS library to manipulate histograms (executed when panels are open) Packaged as LHCb JCOP Framework compatible tool Root viewer for 2D histograms and further analysis Wed 12th October 2005 Control and monitoring of trigger algorithms using Gaucho

Experience First experience during RTTC bad: too much CPU usage on PVSS machine Scripts rewritten, latest tests with 20 jobs on 10 lxplus nodes better Tests with dummy Gaudi job Idle configuration for 1 node (2 jobs) 80 Mb, 4% CPU (excluding PVSS itself) Wed 12th October 2005 Control and monitoring of trigger algorithms using Gaucho

Experience 1 node, 2 jobs: 71 % CPU usage on PVSS machine 38% PVSSCTRL, 20% PVSSEvent, 10% PVSSData, rest other PVSS processes Stopping jobs takes 2 secs, all processes reduce CPU consumption as expected Now try 20 jobs over 10 lxplus nodes Idle configuration: 225 Mb, 8% CPU Wed 12th October 2005 Control and monitoring of trigger algorithms using Gaucho

Experience with 20 jobs 2205 dim services CPU usage 100% on PVSS machine Viewing counters (10 secs) and histograms (20 secs) OK Proportion between PVSSCTRL, PVSSEvent, PVSSDim the same Stopping jobs takes about 2 minutes CPU usage correctly drops Some unexplained crashes of PVSSDIM, memory usage after stopping stays high Wed 12th October 2005 Control and monitoring of trigger algorithms using Gaucho

Conclusions Performance is now reasonable Next step: integration of Gaucho into run control system of LHCb event filter farm (November) http://lhcb-comp.web.cern.ch/lhcb-comp/ECS/Gaucho/default.htm Wed 12th October 2005 Control and monitoring of trigger algorithms using Gaucho