1 Availability and Controls Tom Himel SLAC Controls GG meeting January 20, 2006.

Slides:



Advertisements
Similar presentations
Using Telemedicine Equipment. Overview The ‘Big’ Picture The ‘Bigger’ Picture Equipment Setting Up A Video Conference Conducting A Video Conference Common.
Advertisements

Q11: Describe how the effects of power supply failures on integrated luminosity will be mitigated. TESLA Response : –Mainly consider two types of magnet.
2-May-15 GUI Design. 2 HMI design There are entire college courses taught on HMI (Human-Machine Interface) design This is just a very brief presentation.
Operations and Availability GG3. Key decisions Summary of Key Decisions for the Baseline Design The linac will have two parallel tunnels so that the support.
Help Desk Troubleshooting Computer Problems. 2 Certificate III Software Applications Troubleshooting Computer Problems Solving computer problems is one.
Applied Software Project Management INTRODUCTION Applied Software Project Management 1 5/20/2015.
Basic guidelines for the creation of a DW Create corporate sponsors and plan thoroughly Determine a scalable architectural framework for the DW Identify.
A U.S. Department of Energy Office of Science Laboratory Operated by The University of Chicago Argonne National Laboratory Office of Science U.S. Department.
Tom Himel Linac Controls Upgrade Nov SLAC National Accelerator Laboratory 1 Linac Controls Upgrade Status Tom Himel.
F.Brinker, DESY, July 17 st 2008 Injection to Doris and Petra Fitting the detector in the IP-region Radiation issues Beam optic, Target cell Polarisation.
1 Tom Himel 1 Status of the Linac Controls Upgrade LCLS FAC review Status of the Linac Controls Upgrade Tom Himel June 8, 2009.
4 Dec 2001First ideas for readout/DAQ1 Paul Dauncey Imperial College Contributions from all of UK: result of brainstorming meeting in Birmingham on 13.
RF Systems and Stability Linac Coherent Light Source Stanford Synchrotron Radiation Laboratory Stanford Linear Accelerator Center.
Tom Kubicki.  Booster HLRF System to be upgraded using 1kW Solid State Driver (SSD) Amplifiers.  Used to drive the 200kW Power Amplifier  Eliminates.
ATF2 Power Supply Availability Comparison February 5, ATF2 Power Supply Availability C0mparison Paul Bellomo and Briant lam.
LSU 07/24/2004Defining Project Tasks1 Defining the Project Tasks Project Management Unit, Lecture 4.
© SAIC. All rights reserved. NATIONAL SECURITY ENERGY & ENVIRONMENT HEALTH CYBERSECURITY The Potential High Cost of Simple Systems Engineering Errors Jim.
CLIC Kickers: Status and activities for 2013 in view of the budget discussions for 2013 CTC #65 M.J. Barnes CLIC Technical Committee, November M.J.
The Project AH Computing. Functional Requirements  What the product must do!  Examples attractive welcome screen all options available as clickable.
Jon Curwin and Roger Slater, QUANTITATIVE METHODS: A SHORT COURSE ISBN © Cengage Chapter 3: Using Graphs.
Miscellaneous Notes: This is a bare-bones template – make it fancier if you wish, but be sure to address at least the items listed here. Basically this.
INFO 637Lecture #81 Software Engineering Process II Integration and System Testing INFO 637 Glenn Booker.
ICS – Software Engineering Group 1 The SNS General Time Timestamp Driver Sheng Peng & David Thompson.
TSS First steps with Fwin - Maintenance TSS - Software.
Global Design Effort U.S. ILC Cost Translation R. Stanek, et al. Vic Kuchler 1/26/07.
Dec 8-10, 2004EPICS Collaboration Meeting – Tokai, Japan MicroIOC: A Simple Robust Platform for Integrating Devices Mark Pleško
Real world measurements. Measuring things Making accurate measurements is an essential part of all branches science and engineering. Much (all?)of our.
1 Status of EMMA Shinji Machida CCLRC/RAL/ASTeC 23 April, ffag/machida_ ppt & pdf.
1 Availsim DRFS and klyClus setup, assumptions, questions Tom Himel August 11, 2009.
ASTA roadmap to EPICS and other upgrades Remote monitoring and control DAQ Machine protection Dark current energy spectrometer.
Operations, Test facilities, CF&S Tom Himel SLAC.
Date Event Global Design Effort 1 ILC UPDATE Vancouver to Valencia Ewan Paterson Personal Report to SiD Collaboration Oct 27, 2006.
Project Management Mark Palmer Cornell Laboratory for Accelerator-Based Sciences and Education.
Global Design Effort 1 Possible Minimum Machine Studies of Central Region for 2009 Reference, ILC Minimum Machine Study Proposal V1, January 2009 ILC-EDMS.
MKI Erratics: Hardware and Electronics Related Aspects M.J. Barnes Acknowledgements: A.Antoine, R.A. Barlow, P. Burkel, E. Carlier, N. Magnin, V. Mertens.
Dmitri Denisov D0 Weekly Summary: August 18 to August 24  Delivered Luminosity and operating efficiency u Delivered 6.9pb -1 u Recorded 6.0pb -1 (87%)
Reliability and availability considerations for CLIC modulators Daniel Siemaszko OUTLINE : Give a specification on the availability of the powering.
18 March 2002 All Experimenters’ Meeting Alan L. Stone Louisiana Tech University 1 DØ Status: 03/11 – 03/18 Week integrated luminosity –1.1 pb -1 delivered.
Click to add text Systems Analysis, Prototyping and Iteration.
Status of KEK 150MeV FFAG M. Aiba (KEK) For KEK FFAG Gr. FFAG’05, 5 to 9 Dec., KURRI.
The recent history and current state of the linac control system Tom Himel Dec 1,
20 July 2006 Vancouver GDE Meeting Global Design Effort 1 Ring to Main Linac Peter Tenenbaum SLAC.
GG3 Operations & Reliability (Availability) Eckhard Elsen Tom Himel
All Experimenters MeetingDmitri Denisov Week of September 2 to September 8 D0 Summary  Delivered luminosity and operating efficiency u Delivered: 3.9pb.
1 The ILC Control Work Packages. ILC Control System Work Packages GDE Oct Who We Are Collaboration loosely formed at Snowmass which included SLAC,
GAN: remote operation of accelerator diagnosis systems Matthias Werner, DESY MDI.
1 Object-Oriented Analysis and Design with the Unified Process Figure 13-1 Implementation discipline activities.
John Carwardine, Ewan Paterson, Marc Ross 14/15 July 2009 Availability Task Force: Subgroup #2.
SNuMI: WBS 1.1 Booster Upgrades Eric Prebys $642K FY06$ (no contingency, no G&A) xx% contingency Main Injector & Recycler BNB NuMI Tunnel Booster Ring.
Upgrade PO M. Tyndel, MIWG Review plans p1 Nov 1 st, CERN Module integration Review – Decision process  Information will be gathered for each concept.
Calorimeter global commissioning: progress and plans Patrick Robbe, LAL Orsay & CERN, 25 jun 2008.
LC availability Simulation done for the LC comparison task force Tom Himel SLAC.
DAQ ELECTRONICS 18 March 2015MEG Collaboration Meeting, Tokyo Stefan Ritt.
SuperB Integration SuperB Experimental hall. Related topics of Slac D&D activities.
Design process of the Interlock Systems Patrice Nouvel - CERN / Institut National Polytechnique de Toulouse CLIC Workshop Accelerator / Parameters.
Mobile Testing - Bug Report
Debugging Intermittent Issues
Engineering in High Availability
SNS Status Report Karen S. White 10/15/08.
Outline What does the OS protect? Authentication for operating systems
The ILC Control Work Packages
Availsim runs and questions
Outline What does the OS protect? Authentication for operating systems
Availability and Reliability Issues for the XFEL Tom Himel SLAC/DESY
Commissioning, Operations, and Reliability Status
4-6 GeV CEBAF Reliability Overview
The Troubleshooting theory
NAND/NOR Logic Gate Replacement Training tool
Fwin - Maintenance TSS - Software
Reliability, MPS, Operations and Tuning Work Packages
Presentation transcript:

1 Availability and Controls Tom Himel SLAC Controls GG meeting January 20, 2006

Controls GG meeting Tom Himel 1/20/06 2 Contents The unavailability budget Major tools: Hot swappability Redundancy Helping other systems diagnose their problems

Controls GG meeting Tom Himel 1/20/06 3 Availability Design Philosophy Design it in up front. Budget 15% downtime total. Keep an extra 10% as contingency. Try to get the high availability for the minimum cost. First stab given here. Will need to iterate as design progresses. Quantities are not final Engineering studies may show that the cost minimum would be attained by moving some of the unavailability budget from one item to another. This means some MTBFs may be allowed to go down, but others will have to go up.

Controls GG meeting Tom Himel 1/20/06 4 Will Need Improvement Program Must design to meet the budget on the first pass. Assume we are only partly successful and unavailability will be too high when we turn on. Will need operations budget and engineering to make the necessary improvements.

Controls GG meeting Tom Himel 1/20/06 5 Unavailability budget by region Global Controls = 2%*15% = 0.3%

Controls GG meeting Tom Himel 1/20/06 6 Availability budget by system Controls = 17%*15% = 2.5%

MTBF/MTTR requirements

Controls GG meeting Tom Himel 1/20/06 8 Hot-swappability Definition: An item is hot-swappable if it can be replaced without making anything else temporarily not work. If 1 channel of a 16 channel ADC is dead, and it requires a full module replacement to fix the bad channel, then it is not hot- swappable. Tried to make reasonable assumptions about what was hot-swappable. Both hardware and software need to support hot-swapping.

Controls GG meeting Tom Himel 1/20/06 9 Parts spreadsheet (double click to view)

Redundancy Complex redundancies (energy overhead, extra DR kickers, spare klystron/modulator) are directly accounted for in availsim. Simpler redundancies like having 5 power supply regulators where only 4 are needed are not modeled in detail by availsim. Their effect is put in by having a longer MTBF for the overall power supply. That is you must calculate the system MTBF given the redundancy and the time to repair a bad part that didn’t bring the full system down. If one part of your system is redundant, it is likely that single points of failure in the non-redundant parts will dominate the MTBF. In the calculation of the overall MTBF, it is necessary to know the time to repair the redundant part that broke. Some rules of thumb to use: If it is in the accelerator tunnel: 2 months If it is accessible with beam on but not hot-swappable (repairable without bringing the beam down): 1 week If it accessible with beam on and hot-swappable: 2 hours

Controls GG meeting Tom Himel 1/20/06 11 Software Haven’t explicitly budgeted for software downtime. Probably need to. How much? Good to make ILC able to run through changes in DB structure, software downloads, and maybe even boots of computers. Have uniform user interface with all (even “expert”) diagnostics available. Strive to not need experts.

Controls GG meeting Tom Himel 1/20/06 12 Two aspects to control and availability Controls itself should not go down often. That is what we have been addressing above Controls needs to give tools to help discover what is wrong in other systems.

Controls GG meeting Tom Himel 1/20/06 13 Tools to help other systems Network access for laptops and diagnostic equipment near all hardware Readout and recording of diagnostic information built into other systems (e.g. a power supply may record its voltage and current at a megahertz) Either record everything very often or allow flexibly triggered readout of everything or both. Provide analysis tools of the data that is recorded.

Controls GG meeting Tom Himel 1/20/06 14 Example of need for sync readout Based on SLC “flyer pulses” Infrequently a single bunch causes very high backgrounds. Need to figure out why. Only know few seconds after the fact that a bunch was bad. Could be caused by bad kicker pulse. Need to know kicker strength on each bunch. Could be caused by DR phase instability (saw tooth). Need to know orbit and phase of that bunch on many turns prior to extraction

Controls GG meeting Tom Himel 1/20/06 15 Help Solve Subtle Problems Phase drifts – compare redundant readouts Lying BPMs – chisquared? Redundancy? Drifting BPMs (both mechanical and electrical) Difficult to localize problems (normal module swaps don’t fix it). E.g. noise coupling in on long cable or flakey connector. Vacuum bursts in DRs (present PEP problem) – read 1/sec, provide good analysis package

Controls GG meeting Tom Himel 1/20/06 16 Summary We have a starting unavailability budget. It will be refined as engineering continues Upfront planning is essential to achieve the challenging budget. Must also help other systems diagnose their problems