Matthias Clausen, Gongfa Liu, Bernd Schoeneburg (DESY), ICALEPCS, 2007 XFEL The European X-Ray Laser Project X-Ray Free-Electron Laser Redundant EPICS.

Slides:



Advertisements
Similar presentations
1 1999/Ph 514: Channel Access Concepts EPICS Channel Access Concepts Bob Dalesio LANL.
Advertisements

XFEL The European X-Ray Laser Project DESY/ XFEL An Overview EPICS Collaboration Meeting San Francisco, October 5th, 2013 Matthias Clausen, DESY.
Matthias Clausen, DESY CSS GSI Feb. 2009: Introduction XFEL The European X-Ray Laser Project X-Ray Free-Electron Laser 1 CSS – Control System.
16 - Nov. 2000EPICS Workshop Oak Ridge1 Epics to TINE translator Matthias Clausen, DESY Hamburg Phil Duval, DESY Hamburg Zoltan Kakucs, DESY Hamburg.
1 ITC242 – Introduction to Data Communications Week 12 Topic 18 Chapter 19 Network Management.
CS526 Dr.Chow1 HIGH AVAILABILITY LINUX VIRTUAL SERVER By P. Jaya Sunderam and Ankur Deshmukh.
Operating Systems CS208. What is Operating System? It is a program. It is the first piece of software to run after the system boots. It coordinates the.
Brian Martlew 25 th Sept 2006 MICE Control & Monitoring Plan Brian Martlew.
Computer Networks,Internet,& SMTP Priyanka Daultani Department of Computer Science USF, San Francsico, CA.
Cambodia-India Entrepreneurship Development Centre - : :.... :-:-
Introduction to client/server architecture
ABC Co. Network Implementation High reliability is primary concern – near 100% uptime required –Customer SLA has stiff penalty clauses –Everything is designed.
Automatic Software Testing Tool for Computer Networks ARD Presentation Adi Shachar Yaniv Cohen Dudi Patimer
Firewall Typical Networking and Troubleshooting Common Faults.
A modern NM registration system capable of sending data to the NMDB Helen Mavromichalaki - Christos Sarlanis NKUA TEAM National & Kapodistrian University.
XFEL The European X-Ray Laser Project X-Ray Free-Electron Laser Matthias Clausen, DESY XFEL Refrigerator Controls – April Alarm Training Making.
1 Fault Tolerance in the Nonstop Cyclone System By Scott Chan Robert Jardine Presented by Phuc Nguyen.
Jan Hatje, DESY CSS ITER March 2009: Alarm System, Authorization, Remote Management XFEL The European X-Ray Laser Project X-Ray Free-Electron.
December 2009 Target Group Control System of production furnace for SPES project Results and progress status december 2009.
Deutsches Elektronen-Synchrotron DESY Helmholtz Association of German Research Centres Hamburg, Germany The European X-Ray Laser Project.
XFEL The European X-Ray Laser Project X-Ray Free-Electron Laser Matthias Clausen, DESY XFEL Refrigerator Controls – April CSS-DCT (SNL) Training.
Matthias Clausen, DESY CSS GSI Feb. 2009: Introduction XFEL The European X-Ray Laser Project X-Ray Free-Electron Laser 1 CSS – Control System.
X one. Mobile development platform We mobilize your process IP. ALARMS AUTOMOTION.
Redundancy. 2. Redundancy 2 the need for redundancy EPICS is a great software, but lacks redundancy support which is essential for some highly critical.
XFEL The European X-Ray Laser Project CSS Core Meeting Introduction into CSS DESY, August 7, 2006 Matthias Clausen MKS-2.
1 SNL Current State (in transition state) By Matthias Clausen Ron Chestnut.
ATCA based LLRF system design review DESY Control servers for ATCA based LLRF system Piotr Pucyk - DESY, Warsaw University of Technology Jaroslaw.
Jan Hatje, DESY CSS ITER March 2009: Technology and Interfaces XFEL The European X-Ray Laser Project X-Ray Free-Electron Laser 1 CSS – Control.
Redundant IOC with ATCA(HPI) support Utilizing modern hardware for better availability Artem Kazakov, KEK/SOKENDAI.
Advanced Computer Networks Topic 2: Characterization of Distributed Systems.
Final Review of ITER PBS 45 CODAC – PART 1 – 14 th, 15 th and 16 th of January CadarachePage 1 FINAL DESIGN REVIEW OF ITER PBS 45 CODAC – PART 1.
CE Operating Systems Lecture 3 Overview of OS functions and structure.
XFEL The European X-Ray Laser Project X-Ray Free-Electron Laser Matthias Clausen, DESY XFEL Refrigerator Controls – April CSS Core Applications.
Jan Hatje, DESY CSS GSI Feb. 2009: Alarm System, Authorization, Remote Management XFEL The European X-Ray Laser Project X-Ray Free-Electron.
XFEL The European X-Ray Laser Project X-Ray Free-Electron Laser Matthias Clausen, DESY XFEL Refrigerator Controls – April CSS Introduction 2 nd.
 Load balancing is the process of distributing a workload evenly throughout a group or cluster of computers to maximize throughput.  This means that.
EPICS EPICS Collaboration Meeting Argonne National Laboratory drvTS improvements for soft timing EPICS Collaboration Meeting Argonne National Laboratory.
A XML 1 An XML Driven Graphical User Interface and Application Management Toolkit Matthias Clausen (DESY & SLAC) Piotr Bartkiewicz (DESY & FPNT/AGH) Alexei.
XFEL The European X-Ray Laser Project X-Ray Free-Electron Laser Matthias Clausen, DESY ICFA Seminar, 29.October 2008 Global Collaboration on Accelerator.
Reliability/ Secure IOC / Outlook M. Clausen / DESY 1 CA-Put Logging BurtSave Warm Reboot Matthias Clausen DESY/ MKS.
1 1999/Ph 514: Flow of Control EPICS Flow of Control Marty Kraimer APS.
EPICS at DESY M. Clausen EPICS workshop Trieste’99 1 EPICS at DESY Matthias Clausen.
ICALEPCS 2005 Geneva, Oct. 12 The ALMA Telescope Control SystemA. Farris The ALMA Telescope Control System Allen Farris Ralph Marson Jeff Kern National.
EPICS EPICS Collaboration Meeting Argonne June 2006 IOC Redundancy: Redundancy Monitor Task EPICS Meeting - Redundancy Argonne, June 16, 2006 Matthias.
Jan Hatje, DESY CSS GSI Feb. 2009: Technology and Interfaces XFEL The European X-Ray Laser Project X-Ray Free-Electron Laser 1 CSS – Control.
Lecture 4 Mechanisms & Kernel for NOSs. Mechanisms for Network Operating Systems  Network operating systems provide three basic mechanisms that support.
Jan Hatje, DESY CSS – Control System Studio EPICS collaboration meeting CSS – Control System Studio Update EPICS collaboration meeting 2008 Shanghai.
Group Communication Theresa Nguyen ICS243f Spring 2001.
Jan Hatje, DESY SNL Editor and Debugger EPICS collaboration meeting SNL Editor and Debugger EPICS collaboration meeting 2008 Shanghai Institute.
XFEL The European X-Ray Laser Project X-Ray Free-Electron Laser DESY Status and Possible Controls Contributions to the ILC Kay Rehlich.
1 Channel Access Concepts – IHEP EPICS Training – K.F – Aug EPICS Channel Access Concepts Kazuro Furukawa, KEK (Bob Dalesio, LANL)
EPICS and LabVIEW Tony Vento, National Instruments
1 EPICS Flow of Control: EPICS Workshop at IHEP, Beijing, August 2001 EPICS Flow of Control Marty Kraimer APS.
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
Matthias Clausen, DESY EPICS Training – Client Tools/ CSS EPICS collaboration meeting EPICS Training Client Tools EPICS collaboration meeting 2008.
XFEL The European X-Ray Laser Project X-Ray Free-Electron Laser Wojciech Jalmuzna, Technical University of Lodz, Department of Microelectronics and Computer.
Monitoring Dynamic IOC Installations Using the alive Record Dohn Arms Beamline Controls & Data Acquisition Group Advanced Photon Source.
Bastian Knerr, MKS2, DESY March 2011 XFEL The European X-Ray Laser Project X-Ray Free-Electron Laser 1 iPhone Apps for EPICS EPICS.
Redundancy in the Control System of DESY’s Cryogenic Facility. M. Bieler, M. Clausen, J. Penning, B. Schoeneburg, DESY ARW 2013, Melbourne,
Introduction to client/server architecture
Fault Tolerance Distributed Web-based Systems
Computer Networks,Internet,& SMTP
Chapter 2: Operating-System Structures
Channel Access Concepts
An XML-based System Architecture for IXA/IA Intercommunication
Chapter 2: Operating-System Structures
Commissioning the European XFEL cryogenic system and best practice
Channel Access Concepts
Presentation transcript:

Matthias Clausen, Gongfa Liu, Bernd Schoeneburg (DESY), ICALEPCS, 2007 XFEL The European X-Ray Laser Project X-Ray Free-Electron Laser Redundant EPICS IOCs Matthias Clausen Gongfa Liu Bernd Schoeneburg (DESY)

Matthias Clausen, Gongfa Liu, Bernd Schoeneburg (DESY), ICALEPCS, 2007 XFEL The European X-Ray Laser Project X-Ray Free-Electron Laser 2 Agenda High Availability Choose the ‘right’ approach Design Implementation Functionality –Redundancy Monitor Task –Continuous Control Executive –SNL Executive Redundancy management –Diagnostic –SNL Debugger Outlook

Matthias Clausen, Gongfa Liu, Bernd Schoeneburg (DESY), ICALEPCS, 2007 XFEL The European X-Ray Laser Project X-Ray Free-Electron Laser 3 Main Service: NFS File Server (1) –must be cluster service Adding Services: Archiver (2) –as cluster service? –as ‘managed’ service! Adding Services: LDAP-HA –must be cluster service right choice for a file server?  Keep your eye on the main service. Do not allow other services to interfere with the main service. High Availability Example: File-Server Archiver (2) Archiver (2) (1)

Matthias Clausen, Gongfa Liu, Bernd Schoeneburg (DESY), ICALEPCS, 2007 XFEL The European X-Ray Laser Project X-Ray Free-Electron Laser 4 High Availability: How to implement it? Increase Availability by Following Mill Specs ? Redundant Components !

Matthias Clausen, Gongfa Liu, Bernd Schoeneburg (DESY), ICALEPCS, 2007 XFEL The European X-Ray Laser Project X-Ray Free-Electron Laser 5 High Availability: Why? In our case: 24/7 Cryogenic operations for more then one year of operation without any interruption Necessary for: FLASH Cryogenic Plant Will be converted from (redundant) commercial to redundant EPICS next year.(1/3 of the system) XFEL Cryogenic Plant Will be converted from (redundant) commercial to redundant EPICS in 2010.(remaining part) XFEL Cryogenic (and possibly Utility) Controls in the XFEL Tunnel

Matthias Clausen, Gongfa Liu, Bernd Schoeneburg (DESY), ICALEPCS, 2007 XFEL The European X-Ray Laser Project X-Ray Free-Electron Laser When using redundant IOCs? In applications, where high availability is needed and the failure of an IOC can cause a long plant breakdown. If you have to be able to maintain the system during operation. Like exchanging a power supply, or loading new software versions. If a risk of failure is increased; e.g. in areas, where ionizing radiation can be present. Design goal: The redundant IOC pair must be more reliable than a standalone system!

Matthias Clausen, Gongfa Liu, Bernd Schoeneburg (DESY), ICALEPCS, 2007 XFEL The European X-Ray Laser Project X-Ray Free-Electron Laser 7 Project Schedule: Design Phase (June 2005) Identifying the main components: –Redundancy Monitor Task –Continuous Control Executive –SNL Executive Implementation Phase (March-September 2006) –Redundancy Monitor Task: Industry –Continuous Control Executive: Bob and his brother –SNL Executive: DESY with SLAC support Testing ( ) Porting RMT and CCE to other OS (cooperation with KEK) Testing (2007) Porting to uTCA System (planned) Production: Middle of 2008

Matthias Clausen, Gongfa Liu, Bernd Schoeneburg (DESY), ICALEPCS, 2007 XFEL The European X-Ray Laser Project X-Ray Free-Electron Laser Layout: RMT, CC-Exec, SNL-Exec Control Update Process QUEQUE SNL Update Process QUEQUE Contr ol Exec Scan Task Scan Task Scan Task :::: SNL Prog SNL Prog SNL Prog :::: SNL Exec Redundancy Monitor Process I/O Driver I/O Driver I/O Driver LAN CCE RMT Driver SNL The Redundancy Monitor Task (RMT) is an implementation Independent from EPICS

Matthias Clausen, Gongfa Liu, Bernd Schoeneburg (DESY), ICALEPCS, 2007 XFEL The European X-Ray Laser Project X-Ray Free-Electron Laser 9 Basic Layout Two IOC form a redundant pair One controller is active (Master state) The other IOC keeps synchronized with the Master controller (IOC) A RMT controller (IOC) B RMT Eternet private Eternet redundant pair The RMT is a process which handles all redundancy issues within the IOC

Matthias Clausen, Gongfa Liu, Bernd Schoeneburg (DESY), ICALEPCS, 2007 XFEL The European X-Ray Laser Project X-Ray Free-Electron Laser 10 Ethernet Connectivity controller (IOC) A RMT controller (IOC) B RMT Eternet private Eternet redundant pair Global services: Time Alarming..... CA Clients Master check global Ethernet check public Ethernet check private Ethernet RMT communication synchronization of data CA broadcast Master replies only

Matthias Clausen, Gongfa Liu, Bernd Schoeneburg (DESY), ICALEPCS, 2007 XFEL The European X-Ray Laser Project X-Ray Free-Electron Laser RMT: the Redundancy Supervisor

Matthias Clausen, Gongfa Liu, Bernd Schoeneburg (DESY), ICALEPCS, 2007 XFEL The European X-Ray Laser Project X-Ray Free-Electron Laser How the RMT works Control the processes of interest for redundancy Processes register themselves to be controlled Communicate with the RMT in the other IOC Set the IOC in the Master- or Slave-state (manage switch-over) Monitor network connections (slide before) In a redundant IOC PRR-processes register by calling a RMT function and wait for a start command. Some processes need to be synchronized with their partner process in the other IOC. Synchronization over the private Ethernet is controlled by the RMT.

Matthias Clausen, Gongfa Liu, Bernd Schoeneburg (DESY), ICALEPCS, 2007 XFEL The European X-Ray Laser Project X-Ray Free-Electron Laser 13 When to fail over? From the Preamble of the design Document: “Any redundant implementation must make the system more reliable than the non redundant one. Precaution must be taken especially for the detection of errors which shall initiate the failover. This operation should only be activated if there is no doubt that keeping the actual mastership definitely causes more damage to the controlled system than an automatic failover.”

Matthias Clausen, Gongfa Liu, Bernd Schoeneburg (DESY), ICALEPCS, 2007 XFEL The European X-Ray Laser Project X-Ray Free-Electron Laser EPICS Specific Parts Control Update Process QUEQUE SNL Update Process QUEQUE Contr ol Exec Scan Task Scan Task Scan Task :::: SNL Prog SNL Prog SNL Prog :::: SNL Exec Redundancy Monitor Process I/O Driver I/O Driver I/O Driver LAN CCE RMT Driver SNL RMT is an implementation Independent from EPICS CCE: The Continuous Control Executive permanently collects changes on the master to update the client SNL Executive: Permanently collects states and values from SNL programs on the master. Sending changes to the slave.

Matthias Clausen, Gongfa Liu, Bernd Schoeneburg (DESY), ICALEPCS, 2007 XFEL The European X-Ray Laser Project X-Ray Free-Electron Laser Remote Diagnostic Control Update Process QUEQUE SNL Update Process QUEQUE Contr ol Exec Scan Task Scan Task Scan Task :::: SNL Prog SNL Prog SNL Prog :::: SNL Exec Redundancy Monitor Process I/O Driver I/O Driver I/O Driver Client CCE RMT Driver SNL XML Request files can be passed though the RMT to any registered underlying process. The final destination of the Message will generate an answer. SNL Executive: In this special case the remote diagnostic protocol is used to debug the SLS programs actually Executive running on the remote IOC.

Matthias Clausen, Gongfa Liu, Bernd Schoeneburg (DESY), ICALEPCS, 2007 XFEL The European X-Ray Laser Project X-Ray Free-Electron Laser 16 Status Implementing support for redundant systems is quite a challenging task. The current implementation is improving its maturity by continuous testing. (Gongfa Liu – (Hefei-China) DESY) The RMT and CCE code has bee ported to Linux by Artem Kazakov (KEK) to run redundant (soft)-IOCs on Linux. (see TPPA31) The ported RMT has been used to implement redundant CA-Gateways. (Artem Kazakov) (An option for load balancing is also available)

Matthias Clausen, Gongfa Liu, Bernd Schoeneburg (DESY), ICALEPCS, 2007 XFEL The European X-Ray Laser Project X-Ray Free-Electron Laser 17 Outlook RMT can be used independent from EPICS to implement redundant applications. The SNL debugging features will be improved. (Joint development of SLAC and DESY) First production of a redundant IOC is foreseen for middle of 2008 An implementation based/ running on uTCA is desired.

Matthias Clausen, Gongfa Liu, Bernd Schoeneburg (DESY), ICALEPCS, 2007 XFEL The European X-Ray Laser Project X-Ray Free-Electron Laser Test System The test system consists of two Compact PCI CPUs in separated crates with redundant power supplies each. Thank you!