Control in ATLAS TDAQ Dietrich Liko on behalf of the ATLAS TDAQ Group.

Slides:



Advertisements
Similar presentations
Database System Concepts and Architecture
Advertisements

The Control System for the ATLAS Pixel Detector
Software Modeling SWE5441 Lecture 3 Eng. Mohammed Timraz
L4-1-S1 UML Overview © M.E. Fayad SJSU -- CmpE Software Architectures Dr. M.E. Fayad, Professor Computer Engineering Department, Room #283I.
CS 501: Software Engineering Fall 2000 Lecture 16 System Architecture III Distributed Objects.
CHEP04 - Interlaken - Sep. 27th - Oct. 1st 2004T. M. Steinbeck for the Alice Collaboration1/27 A Control Software for the ALICE High Level Trigger Timm.
Chapter 9: Moving to Design
9 1 Chapter 9 Database Design Database Systems: Design, Implementation, and Management, Seventh Edition, Rob and Coronel.
Clara Gaspar, May 2010 The LHCb Run Control System An Integrated and Homogeneous Control System.
March 2003 CHEP Online Monitoring Software Framework in the ATLAS Experiment Serguei Kolos CERN/PNPI On behalf of the ATLAS Trigger/DAQ Online Software.
First year experience with the ATLAS online monitoring framework Alina Corso-Radu University of California Irvine on behalf of ATLAS TDAQ Collaboration.
Java Based Run Control for CMS Small DAQ Systems Michele Gulmini CHEP2000 February 2000 M. Bellato (INFN – Padova) L. Berti (INFN – Legnaro) D. Ceccato.
Introduction To System Analysis and design
Large Scale and Performance Tests of the ATLAS Online Software CERN ATLAS TDAQ Online Software System D.Burckhart-Chromek, I.Alexandrov, A.Amorim, E.Badescu,
Chapter 9 Elements of Systems Design
Framework for Automated Builds Natalia Ratnikova CHEP’03.
1 CMPT 275 Software Engineering Software life cycle.
TDAQ ATLAS Reimplementation of the ATLAS Online Event Monitoring Subsystem Ingo Scholtes Summer Student University of Trier Supervisor: Serguei Kolos.
CIT UPES | Sept 2013 | Unified Modeling Language - UML.
Designing a HEP Experiment Control System, Lessons to be Learned From 10 Years Evolution and Operation of the DELPHI Experiment. André Augustinus 8 February.
ITEC 3220M Using and Designing Database Systems
What is a life cycle model? Framework under which a software product is going to be developed. – Defines the phases that the product under development.
1 Chapter 9 Database Design. 2 2 In this chapter, you will learn: That successful database design must reflect the information system of which the database.
Cluster Reliability Project ISIS Vanderbilt University.
Magnetic Field Measurement System as Part of a Software Family Jerzy M. Nogiec Joe DiMarco Fermilab.
JCOP Workshop September 8th 1999 H.J.Burckhart 1 ATLAS DCS Organization of Detector and Controls Architecture Connection to DAQ Front-end System Practical.
Applications of Advanced Data Analysis and Expert System Technologies in the ATLAS Trigger-DAQ Controls Framework G. Avolio – University of California,
Clara Gaspar, October 2011 The LHCb Experiment Control System: On the path to full automation.
Ramiro Voicu December Design Considerations  Act as a true dynamic service and provide the necessary functionally to be used by any other services.
Chapter 10 Information Systems Analysis and Design
Information System Development Courses Figure: ISD Course Structure.
Middleware for FIs Apeego House 4B, Tardeo Rd. Mumbai Tel: Fax:
ALICE, ATLAS, CMS & LHCb joint workshop on
Unified Modeling Language* Keng Siau University of Nebraska-Lincoln *Adapted from “Software Architecture and the UML” by Grady Booch.
And Tier 3 monitoring Tier 3 Ivan Kadochnikov LIT JINR
L6-S1 UML Overview 2003 SJSU -- CmpE Advanced Object-Oriented Analysis & Design Dr. M.E. Fayad, Professor Computer Engineering Department, Room #283I College.
9 Systems Analysis and Design in a Changing World, Fourth Edition.
9 Systems Analysis and Design in a Changing World, Fourth Edition.
1 CMPT 275 High Level Design Phase Modularization.
The Process Manager in the ATLAS DAQ System G. Avolio, M. Dobson, G. Lehmann Miotto, M. Wiesmann (CERN)
U.S. ATLAS Executive Committee August 3, 2005 U.S. ATLAS TDAQ FY06 M&O Planning A.J. Lankford UC Irvine.
CHEP March 2003 Sarah Wheeler 1 Supervision of the ATLAS High Level Triggers Sarah Wheeler on behalf of the ATLAS Trigger/DAQ High Level Trigger.
Architecture View Models A model is a complete, simplified description of a system from a particular perspective or viewpoint. There is no single view.
1 Technical & Business Writing (ENG-715) Muhammad Bilal Bashir UIIT, Rawalpindi.
CMS Luigi Zangrando, Cern, 16/4/ Run Control Prototype Status M. Gulmini, M. Gaetano, N. Toniolo, S. Ventura, L. Zangrando INFN – Laboratori Nazionali.
Computer Simulation of Networks ECE/CSC 777: Telecommunications Network Design Fall, 2013, Rudra Dutta.
Setup/RunControl Andrei and Giovanna. Setup Requirements (Feb 2002, by Sergei) DAQ Active state: DAQ is ready to accept RC commands from Operator DAQ.
25 April Unified Cryptologic Architecture: A Framework for a Service Based Architecture Unified Cryptologic Architecture: A Framework for a Service.
TDAQ Experience in the BNL Liquid Argon Calorimeter Test Facility Denis Oliveira Damazio (BNL), George Redlinger (BNL).
WP1 WP2 WP3 WP4 WP5 COORDINATOR WORK PACKAGE LDR RESEARCHER ACEOLE MID TERM REVIEW CERN 3 RD AUGUST 2010 Magnoni Luca Early Stage Researcher WP5 - ATLAS.
The ATLAS DAQ System Online Configurations Database Service Challenge J. Almeida, M. Dobson, A. Kazarov, G. Lehmann-Miotto, J.E. Sloper, I. Soloviev and.
TM & DVS status 29/10/2004. TM requirements (June 98) UR TMGR-1: TM shall have the ability to execute individual tests. UR TMGR-2: TM shall run on all.
G. Anders, G. Avolio, G. Lehmann Miotto, L. Magnoni CERN, Geneva, Switzerland The Run Control System and the Central Hint and Information Processor of.
14 th IEEE-NPSS Real Time Stockholm - June 9 th 2005 P. F. Zema The GNAM monitoring system and the OHP histogram presenter for ATLAS 14 th IEEE-NPSS Real.
CMS Luigi Zangrando, Cern, 16/4/ Run Control Prototype Status M. Gulmini, M. Gaetano, N. Toniolo, S. Ventura, L. Zangrando INFN – Laboratori Nazionali.
A Validation System for the Complex Event Processing Directives of the ATLAS Shifter Assistant Tool G. Anders (CERN), G. Avolio (CERN), A. Kazarov (PNPI),
Basic Characteristics of Object-Oriented Systems
IST 210 Database Design Process IST 210, Section 1 Todd S. Bacastow January 2004.
M. Caprini IFIN-HH Bucharest DAQ Control and Monitoring - A Software Component Model.
9 Systems Analysis and Design in a Changing World, Fifth Edition.
The Control and Hardware Monitoring System of the CMS Level-1 Trigger Ildefons Magrans, Computing and Software for Experiments I IEEE Nuclear Science Symposium,
Chapter 8 Environments, Alternatives, and Decisions.
Intelligent Systems Development
Unified Modeling Language
Controlling a large CPU farm using industrial tools
Chapter 6 Database Design
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
Computer Simulation of Networks
The Performance and Scalability of the back-end DAQ sub-system
Tools for the Automation of large distributed control systems
Presentation transcript:

Control in ATLAS TDAQ Dietrich Liko on behalf of the ATLAS TDAQ Group

CHEP04 - Interlaken Control of the ATLAS TDAQ system2 Overview The ATLAS TDAQ System Dataflow & HLT Control Subsystem of the Online Software Architecture TDAQ Wide Run Control Group Technology Choice CLIPS Design & Implementation Expert System Framework Run Control, Supervision & Verification Testing & Verification Test beam Scalability Tests

CHEP04 - Interlaken Control of the ATLAS TDAQ system3 The ATLAS TDAQ System Dataflow ROD ROS LVL1 HLT LVL2 Event Filter Online System Operation DCS Detector control Test beam: see [331] Event Building Performance: see [217]

CHEP04 - Interlaken Control of the ATLAS TDAQ system4 Control Aspects Dataflow Fixed configuration Synchronization, classical Run Control Error handling High level Triggers Flexible configuration Synchronization Error Handling

CHEP04 - Interlaken Control of the ATLAS TDAQ system5 ATLAS Online Software Component Architecture Object Oriented, C++ and Java Distributed system (CORBA) XML for Configuration Specialized services for a TDAQ system Information sharing, Message Reporting, Configuration Iterative Development Model Prototype already in use Laboratories, Test beam, Scalability tests Evolvement into the systems for initial ATLAS system

CHEP04 - Interlaken Control of the ATLAS TDAQ system6 Online Software Architecture In the context of the iterative development cycle and the Technical Design Review Reevaluation of requirements and architecture Several high level packages & corresponding subsystems Control Supervision, Verification Databases: see [130] Configuration, Conditions Information Sharing: see [166] Information Service, Message Service, Monitoring

CHEP04 - Interlaken Control of the ATLAS TDAQ system7 Control Subsystem In the following only the Supervision subsystem is discussed

CHEP04 - Interlaken Control of the ATLAS TDAQ system8 Supervision The Initialization and Shutdown is responsible for: initialization of TDAQ hardware and software components; re-initialization of a part of the TDAQ partition when necessary; shutting the TDAQ partition down gracefully; TDAQ process supervision. The Run Control is responsible for controlling the Run by accepting commands from the user and sending commands to TDAQ sub-systems; analyzing the status of controlled sub-systems and presenting the status of the whole TDAQ to the Operator The Error Handling is concerned with analyzing run-time error messages coming from TDAQ sub-systems; diagnosing problems, proposing recovery actions to the operator, or performing automatic recovery if requested.

CHEP04 - Interlaken Control of the ATLAS TDAQ system9 TDAQ Wide Run Control group Examines the requirements from the subsystem side Dataflow, HLT Hierarchical concept Follows the overall organization of the TDAQ system Controller central element All control functionality in combined controller State machine concept for synchronization Flexibility in error handling User customization

CHEP04 - Interlaken Control of the ATLAS TDAQ system10 Initial Design & Technology Choice A Run Control implementation is based on a State Machine model and uses the State Machine compiler, CHSM, as underlying technology. P.J. Lucas, An Object-Oriented language system for implementing concurrent hierarchical, finite state machines, MS Thesis, University of Illinois, (1993) A Supervisor is mainly concerned with process management. It has been built using the Open Source expert system CLIPS CLIPS, A tool for building expert systems, A Verification system (DVS) performs tests and provides diagnosis. It is also based on CLIPS.

CHEP04 - Interlaken Control of the ATLAS TDAQ system11 Experiences PLUS Scalability test in 2002 demonstrated that a system of the size of ATLAS TDAQ system can be controlled MINUS Lack of flexibility (CHSM)

CHEP04 - Interlaken Control of the ATLAS TDAQ system12 Technologies CLIPS Production system, standard open source expert system So-called Rete algorithm drives the evaluation rules on a set of facts In house experience General purpose scripting language, OO features C language bindings Alternatives Jess: Java based, very similar to CLIPS Eclipse: Commercial evolution of CLIPS SMI++ State Machine No general purpose scripting language Difficult to integrate in our environment Python Excellent scripting language No expert system

CHEP04 - Interlaken Control of the ATLAS TDAQ system13 Design & Implementation General Framework embedding CLIPS in a CORBA server Periodic evaluation of knowledge base Extension mechanism Online Software Components embedded as plug ins Control functionality fully described by CLIPS rules

CHEP04 - Interlaken Control of the ATLAS TDAQ system14 Proxy Objects Represent external entities Other controllers, processes etc Member attributes exposed to expert system as facts Member functions implement functionality in terms of Online software components Example Proxy objects represents child controllers State of the object corresponds to state of the child (idle, configured, running) Commands are forwarded to child controllers

CHEP04 - Interlaken Control of the ATLAS TDAQ system15 Controller Proxy Objects Other Controllers External processes Rules drive interactions between objects

CHEP04 - Interlaken Control of the ATLAS TDAQ system16 Status Supervisor Uses Framework Run Control Uses Framework Verification system CLIPS based Choice of a common technology drives the path to an unified control system based on Controllers

CHEP04 - Interlaken Control of the ATLAS TDAQ system17 Scalability Test 2004 Test bed Up to 330 PCs of the CERN IT LXSHARE 600 to 800 MHz to 2.4 GHZ Dual Pentium III 256 to 512 MB Linux RedHat 7.3 Only control aspect verified No Dataflow network Various configurations Servers on standard machines Servers on dedicated high end machines

CHEP04 - Interlaken Control of the ATLAS TDAQ system18 Supervisor – Process Management Supervisor P P P One Supervisor PMG Agents Startup limited by initialization of processes Enhanced recovery procedures

CHEP04 - Interlaken Control of the ATLAS TDAQ system19 Startup with 1000 Controllers & 3000 processes in 40 to 100 seconds Several configurations: mon_standard has two additional processes for a controller

CHEP04 - Interlaken Control of the ATLAS TDAQ system20 Run Control Usual RC tree Actually 10 controllers on the lowest level Variation of the number of intermediate nodes Some central infrastructure Name Service (IPC) Information Sharing

CHEP04 - Interlaken Control of the ATLAS TDAQ system21 Transitions 7 internal phases With 1000 Controllers 2 to 6 seconds No “real life” actions Again: More flexible error handling

CHEP04 - Interlaken Control of the ATLAS TDAQ system22 Combined Testbeam 2004 Stable operation from the start – Advantage of the component model

CHEP04 - Interlaken Control of the ATLAS TDAQ system23 Conclusions New assessment of requirements Overall Architecture Controller studied in detail CLIPS confirmed as technology choice Design and implementation of a new framework First test of new systems Test beam Scalability test We can control a system of the size of the ATLAS TDAQ system Much more flexible system Common technology in various control components Unified controllers in the future