Best Ever Alarm System Toolkit Xihui Chen, Katia Danilova, Kay Kasemir SNS/ORNL July 2009.

Slides:



Advertisements
Similar presentations
MFA for Business Banking – Security Code Multifactor Authentication: Quick Tip Sheets Note to Financial Institutions: We are providing these QT sheets.
Advertisements

MFA for Business Banking – Security Questions with Reset Multifactor Authentication: Quick Tip Sheets Note to Financial Institutions: We are providing.
Managed by UT-Battelle for the Department of Energy Best Ever Archive Utility, Yet (BEAUtY) Kay Kasemir April 2013.
Control System Studio (CSS)
Introduction to Alarm Handlers Randy Flood Karen Schroeder AOD/OPS.
Jan Hatje, DESY AMS – Alarm Management System PCaPAC AMS – Alarm Management System and CSS – Control System Studio Update PCaPAC 2008 J.Stefan Institute,
Managed by UT-Battelle for the Department of Energy Kay Kasemir, Xihui Chen ORNL/SNS April, 2013 Getting Started with CSS.
Getting Started. Edline Web Site Requirements Provide Students and Parents With: 1.A Brief Course Description 2.Your Address 3.Course Syllabus 4.Major.
Best Ever Alarm System Toolkit Kay Kasemir, Xihui Chen, Katia Danilova SNS/ORNL April, 2013.
Update on CSS at SNS Xihui Chen, Kay Kasemir June 2010.
April, 2005 EPICS Collaboration Controls Group Alarm Management at Jefferson Lab A New Configuration and Extended Capabilities B. Bevins, M. Joyce, J.
The Soft-IOC Based Alarm Handler – an Operations View Pam Gurd October 31, 2007.
Managed by UT-Battelle for the Department of Energy Kay Kasemir ORNL/SNS Jan Control System Studio Training - Alarm System Use.
Managed by UT-Battelle for the Department of Energy Kay Kasemir ORNL/SNS April 2013 Control System Studio Training - Alarm System Use.
DEMONSTRATION FOR SIGMA DATA ACQUISITION MODULES Tempatron Ltd Data Measurements Division Darwin Close Reading RG2 0TB UK T : +44 (0) F :
Managed by UT-Battelle for the Department of Energy Kay Kasemir ORNL/SNS April 2013 Control System Studio Training - Workspaces, Shared.
ACNET Alarms Brian Hendricks October 18, Alarm Types analog – simple comparison to minimum and maximum values single set of limits digital – nominal.
Managed by UT-Battelle for the Department of Energy Best Ever Archive Utility, Yet Creating a BEAST was easy. BEAUtY seems harder. April.
NWU: Helpdesk Call handling ITC Training: Session 1 -Call Logging and Remedy -Campus Helpdesks and Escalation -Remedy Solution Database -Remedy Mechanisms.
Scan System Kay Kasemir, Xihui Chen Jan Managed by UT-Battelle for the U.S. Department of Energy Automated Experiment Control “Scan” should be.
Scan System: Experiment Automation Kay Kasemir, Xihui Chen RAL EPICS Meeting, May 2013.
XFEL The European X-Ray Laser Project X-Ray Free-Electron Laser Matthias Clausen, DESY XFEL Refrigerator Controls – April Alarm Training Making.
From the ChannelArchiver to the Best Ever Archive Utility, Yet July 2009.
Jan Hatje, DESY CSS ITER March 2009: Alarm System, Authorization, Remote Management XFEL The European X-Ray Laser Project X-Ray Free-Electron.
Managed by UT-Battelle for the Department of Energy Kay Kasemir, Ph.D. ORNL/SNS July 2011 at Control System Studio - CSS - Overview.
Managed by UT-Battelle for the Department of Energy Kay Kasemir ORNL/SNS 2012, April at SLAC Control System Studio - Introduction.
Managed by UT-Battelle for the Department of Energy Kay Kasemir, Ph.D. ORNL/SNS July 2011 at Control System Studio - CSS - Overview.
7.09 Features Detroit, Sept Department
Moodle (Course Management Systems). Assignments 1 Assignments are a refreshingly simple method for collecting student work. They are a simple and flexible.
Service Request Desk How we can help each other, help each other.
Managed by UT-Battelle for the Department of Energy Kay Kasemir, Ph.D. ORNL/SNS June 2011 at KEK Control System Studio - CSS - Alarm.
TEAM Basic TotalElectrostatic ManagementAwareness&
Thomas Jefferson National Accelerator Facility Page 1 Slow Controls Ken Livingston University of Glasgow.
Managed by UT-Battelle for the Department of Energy Kay Kasemir ORNL/SNS Oct EPICS Meeting, PAL, Korea Control System Studio Training.
Control System Studio (CSS) Overview Kay Kasemir, July 2009.
The CSS Scan System Kay-Uwe Kasemir SNS/ORNL Dec
Managed by UT-Battelle for the Department of Energy Kay Kasemir ORNL/SNS Oct EPICS Meeting, PAL, Korea Control System Studio Training.
Managed by UT-Battelle for the Department of Energy Kay Kasemir ORNL/SNS Jan Control System Studio, CSS Overview.
Diagnostic Pathfinder for Instructors. Diagnostic Pathfinder Local File vs. Database Normal operations Expert operations Admin operations.
XFEL The European X-Ray Laser Project X-Ray Free-Electron Laser Matthias Clausen, DESY XFEL Refrigerator Controls – April CSS Core Applications.
Jan Hatje, DESY CSS GSI Feb. 2009: Alarm System, Authorization, Remote Management XFEL The European X-Ray Laser Project X-Ray Free-Electron.
Operational SNS Karen S. White Controls Group Leader 10/11/10.
Managed by UT-Battelle for the Department of Energy Kay Kasemir ORNL/SNS Oct EPICS Meeting, PAL, Korea Control System Studio Training.
SNS Alarm System Status Curtis Dunn Control System Suite/Eclipse Frameworks Workshop EPICS Collaboration Meeting June 12-16, 2006.
Managed by UT-Battelle for the Department of Energy CSS Update Matthias Clausen, Helge Rickens, Jan Hatje and DESY Delphy Armstrong, Xihui Chen,
EPICS EPICS Limitations Bob Dalesio Marty Kraimer.
Managed by UT-Battelle for the Department of Energy Kay Kasemir ORNL/SNS 2012, January 9-12 at NSRRC, Taiwan Control System Studio Training.
Managed by UT-Battelle for the Department of Energy Best Ever Alarm System Tool Xihui Chen, Katia Danilova, Kay Kasemir SNS/ORNL April.
Managed by UT-Battelle for the Department of Energy Kay Kasemir ORNL/SNS 2011, October at CEA Saclay, France Control System Studio.
Managed by UT-Battelle for the Department of Energy Kay Kasemir ORNL/SNS April 2013 Control System Studio, CSS Overview.
Office of Housing Choice Voucher Program Voucher Management System – VMS Version Released October 2011.
Jan Hatje, DESY CSS – Control System Studio EPICS collaboration meeting CSS – Control System Studio Update EPICS collaboration meeting 2008 Shanghai.
ROCS Web Based Reporting Tool Using SNS Relational Database By Katia Danilova, Ernest L. Williams Jr. Control Systems group, ASD, SNS.
Managed by UT-Battelle for the Department of Energy Kay Kasemir ORNL/SNS A lot of material from Nadine Utzel, ITER and BOY online help.
Managed by UT-Battelle for the Department of Energy Kay Kasemir ORNL/SNS 2012, January 9-12 at NSRRC, Taiwan Control System Studio Training.
Matthias Clausen, Jan Hatje, DESY CSS Overview – Alarm System and Management CSS Overview - GSI, 11 Februrary CSS Overview Alarm System and CSS.
Applications Kay Kasemir ORNL/SNS Using Information and pictures from Matthias Clausen, Jan Hatje, and Helge Rickens (DESY) October 2007.
CSS Synoptic Display Comments Kay Kasemir, July 2009.
Managed by UT-Battelle for the Department of Energy Kay Kasemir ORNL/SNS May 2014 Control System Studio Training - Hierarchical Preferences.
Managed by UT-Battelle for the Department of Energy Kay Kasemir ORNL/SNS 2012, April at SLAC Control System Studio Training - Alarm System.
1 Top Level of CSC DCS UI 2nd PRIORITY ERRORS 3rd PRIORITY ERRORS LV Primary - MaratonsHV Primary 1 st PRIORITY ERRORS CSC_COOLING CSC_GAS CSC – Any Single.
Managed by UT-Battelle for the Department of Energy Quest for the Best Ever Alarm System Tool Kay Kasemir Oct
Best Ever Alarm System Toolkit Kay Kasemir, Xihui Chen, Katia Danilova, SNS/ORNL ICALEPCS 2009, Kobe, Japan, Oct 2009.
BOY, A Modern Graphical Operator Interface Editor and Runtime Xihui Chen, Kay Kasemir RAD Control Group.
Monitoring Dynamic IOC Installations Using the alive Record Dohn Arms Beamline Controls & Data Acquisition Group Advanced Photon Source.
Introduction to Control System Studio (CSS) Kay Kasemir, Kunal Shroff EPICS Fall Collaboration Meeting, October 2011 PSI.
Control System Studio - CSS - Alarm Handling
How we can help each other, help each other.
Banafsheh Hajinasab Based on presentation by K. Strnisa, Cosylab
Alarm information in CS-Studio
Presentation transcript:

Best Ever Alarm System Toolkit Xihui Chen, Katia Danilova, Kay Kasemir SNS/ORNL July 2009

2Managed by UT-Battelle for the U.S. Department of Energy Previous Attempts at SNS ALH, soft-IOCs and EDM screens Issues – GUI Static Layouts N clicks to see (some of the) active alarms – Configuration.. was bad  Always too many alarms Changes required contacting one of the 2 experts, restart ALH, … – Information Operator guidance? Related displays? Most frequent alarm? Timeline of alarm?

3Managed by UT-Battelle for the U.S. Department of Energy New End-User View: Alarm Table All current alarms – new, ack’ed Sort by PV, Descr., Time, Severity, … Optional: Annunciate Acknowledge one or multiple alarms – Select by PV or description – BNL/RHIC type un-ack’

4Managed by UT-Battelle for the U.S. Department of Energy Another View: Alarm Tree All alarms – Disabled, inactive, new, ack’ed Hierarchical – Optionally only show active alarms – Ack’/Un-ack’ PVs or sub-tree

5Managed by UT-Battelle for the U.S. Department of Energy Guidance, Related Displays, Commands  Basic Text  Start EDM screen  Open web page  Run ext. command Hierarchical: Including info of parent entries Merges Guidance etc. from all selected alarms

6Managed by UT-Battelle for the U.S. Department of Energy.. Within CSS  Alarms  History of PV  EPICS Config.

7Managed by UT-Battelle for the U.S. Department of Energy E-Log Entries “Logbook” from context menu creates text w/ basic info about selected alarms. Edit, submit. Pluggable implementation, not limited to Oracle-based SNS ELog

8Managed by UT-Battelle for the U.S. Department of Energy.. may require Authentication/Authorization  Log in/out while CSS is running Online Configuration Changes

9Managed by UT-Battelle for the U.S. Department of Energy Configure PV Again online Especially useful for operators to update guidance and related screens.

10Managed by UT-Battelle for the U.S. Department of Energy Logging..into generic CSS log also used for error/warn/info/debug messages Alarm Server: State transitions, Annunciations Alarm GUI: Ack/Un-Ack requests, Config changes Generic Message History Viewer – Example w/ Filter on TEXT=CONFIG

11Managed by UT-Battelle for the U.S. Department of Energy Logging: Get timeline Example: Filter on TYPE, PV 1. PV triggers, clears, triggers again 2. Alarm Server latches alarm 4. Problem fixed 3. Alarm Server annunciates 5. Ack’ed by operator 6. All OK

12Managed by UT-Battelle for the U.S. Department of Energy All Sorts of Web Reports

13Managed by UT-Battelle for the U.S. Department of Energy Technical View Alarm Cfg & State RDB Alarm Cfg & State RDB IOCs Alarm Server Current Alarms: Acknowledged? Transient? Annunciated? Alarm Server Current Alarms: Acknowledged? Transient? Annunciated? LOG Message RDB JMS 2 Speech JMS 2 RDB Tomcat -Reports Tomcat -Reports CSS Applications Alarm Client GUI JMS Alarm Updates Ack’; Config Updates Annunciations Log Messages TALK ALARM_CLIENT ALARM_SERVER PV Updates (Channel Access, …)

14Managed by UT-Battelle for the U.S. Department of Energy General Alarm Server Behavior Latch highest severity, or non-latching – like ALH “ack. transient” Annunciate Chatter filter ala ALH  Alarm only if severity persists some minimum time .. or alarm happens >=N times within period Optional formula-based alarm enablement: – Enable if “(pv_x > 5 && pv_y < 7) || pv_z==1” – … but we prefer to move that logic into IOC When acknowledging MAJOR alarm, subsequent MINOR alarms not annunciated – ALH would again blink/require ack’

15Managed by UT-Battelle for the U.S. Department of Energy Best Ever Alarm System Tools, Indeed.. but Tools are only half the issue Good configuration requires plan & follow-up. B. Hollifield, E. Habibi, "Alarm Management: Seven (??) Effective Methods for Optimum Performance", ISA, 2007

16Managed by UT-Battelle for the U.S. Department of Energy Alarm Philosophy Goal: Help operators take correct actions – Alarms with guidance, related displays – Manageable alarm rate (<150/day) – Operators will respond to every alarm (corollary to manageable rate)

17Managed by UT-Battelle for the U.S. Department of Energy  DOES IT REQUIRE IMMEDIATE OPERATOR ACTION? – What action? Alarm guidance!  Not “make elog entry”, “tell next shift”, …  Consider consequence of no action  Is it the best alarm? – Would other subsystems, with better PVs, alarm at the same time? What’s a valid alarm?

18Managed by UT-Battelle for the U.S. Department of Energy How are alarms added? Alarm triggers: PVs on IOCs – But more than just setting HIGH, HIHI, HSV, HHSV – HYST is good idea – Dynamic limits, enable based on machine state,... Requires thought, communication, documentation Added to alarm server with – Guidance: How to respond – Related screen: Reason for alarm (limits, …), link to screens mentioned in guidance – Link to rationalization info (wiki)

19Managed by UT-Battelle for the U.S. Department of Energy Impact/Consequence Grid CategorySo WhatMinor ConsequenceMajor Consequence Personnel SafetyPPS independent from EPICS? Environment, Public Can EPICS cause contained spill of mercury? Uncontained spill?? Cost: Beam Production, Downtime, Beam Quality No effect Beam off < 1 sec? Beam off <10 min <$10000 Beam off >10min >$10000 Mostly: How long will beam be off?

20Managed by UT-Battelle for the U.S. Department of Energy.. combined with Response Time Time to RespondMinor ConsequenceMajor Consequence >30 MinutesNO_ALARMMINOR minutesMINORMAJOR <10 minutesMAJORMAJOR + Annunciate – This part is still evolving…

21Managed by UT-Battelle for the U.S. Department of Energy Example: Elevated Temp/Press/Res.Err./… Immediate action required? – Do something to prevent interlock trip Impact, Consequence? – Beam off: Reset & OK, 5 minutes? – Cryo cold box trip: Off for a day? Time to respond? – 10 minutes to prevent interlock?  MINOR? MAJOR? Guidance: “Open Valve 47 a bit, …” Related Displays: Screen that shows Temp, Valve, …

22Managed by UT-Battelle for the U.S. Department of Energy “Safety System” Alarms  Protection Systems not per se high priority – Action is required, but we’re safe for now, it won’t get worse if we wait  Pick One  “Mommy, I need to gooo!”  “Mommy, I went” (Does it require operator action? How much time is there?)

23Managed by UT-Battelle for the U.S. Department of Energy Avoid Multiple Alarm Levels

24Managed by UT-Battelle for the U.S. Department of Energy Bad Example: Old SNS ‘MEBT’ Alarms Each amplifier trip: ≥ 3 ~identical alarms, no guidance Rethought w/ subsystem engineer, IOC programmer and operators: 1 better alarm

25Managed by UT-Battelle for the U.S. Department of Energy Alarms for Redundant Pumps

26Managed by UT-Battelle for the U.S. Department of Energy Alarm Generation: Redundant Pumps the wrong way Control System – Pump1 on/off status – Pump2 on/off status Simple Config setting: Pump Off => Alarm: – It’s normal for the ‘backup’ to be off – Both running is usually bad as well Except during tests or switchover – During maintenance, both can be off

27Managed by UT-Battelle for the U.S. Department of Energy Redundant Pumps Control System – Pump1 on/off status – Pump2 on/off status – Number of running pumps – Configurable number of desired pumps Alarm System: Running == Desired? – … with delay to handle tests, switchover Same applies to devices that are only needed on-demand 1 1 Required Pumps:

28Managed by UT-Battelle for the U.S. Department of Energy Weekly Review: How Many? Top 10?

29Managed by UT-Battelle for the U.S. Department of Energy A lot of information available  How often did PV trigger?  For how long?  When?  Temporary issue? Or need HYST, alarm delay, fix to hardware?

30Managed by UT-Battelle for the U.S. Department of Energy Weekly Check: Stale, Forgotten?

31Managed by UT-Battelle for the U.S. Department of Energy GUI: Similar to SNS GUI shown here JMS CSS Other RDB LOG ALARM JMS2RDB IOC LDAP Interconnection Server What about the DESY Alarm System? Filters Filt.Alrm No Channel Access Monitor of selected alarm PVs! IOCs push all alarms via new protocol into Interconn. Server. No Channel Access Monitor of selected alarm PVs! IOCs push all alarms via new protocol into Interconn. Server.

32Managed by UT-Battelle for the U.S. Department of Energy Design Choices Similar alarm table and tree GUIs JMS for communication – slightly different messages, though DESY IOCs send all alarms, then filtered in AMS – DESY: All IOC alarms should show up in AMS, zero additional configuration – At SNS, how many of the PVs would send alarms? We want to make the addition of alarms simple, but not automatic, and encourage guidance, related displays. DESY/SNS: LDAP vs. RDB for configuration/state – Choice was based on available infrastructure. JMS Listeners – SNS: Logger, Annunciator – DESY: Logger, Send SMS, , Voice Mail

33Managed by UT-Battelle for the U.S. Department of Energy Summary BEAST operational at SNS since Feb’09 – DESY AMS is similar and has been operational for longer Pick either, but good configuration requires work in any case – Started with previous “annunciated” alarms ~300, no guidance, no related displays Now ~400, all with guidance, rel. displays, links to operational procedures – “Philosophy” helps decide what gets added and how Immediate Operator Action? Consequence? Response Time? – Weekly review spots troubles and tries to improve configuration