Reliability and availability considerations for CLIC modulators. Daniel Siemaszko, 11.05.2010

Presentation transcript:

Reliability and availability considerations for CLIC modulators
Daniel Siemaszko, TE EPC
Outline: Specify the availability of the powering system of the drive beam linac klystrons. Evaluate the reliability of a given topology or solution. Evaluate the reliability of modular and redundant systems.

Hypothesis (decelerator example)
N+1 redundancy allows one module failure in a power converter; the whole converter fails when two module failures occur. A factor κ describes the fraction of converter failures that are saved by redundancy. The trimmers tolerate up to twenty failures [ref: Adli]. The estimated repair time includes machine cool-down and walking time in the tunnel (4 h).

Composite MTBF model
Failure rates λ = 1/MTBF are combined with the same association rules as impedances. Reliability is calculated as a function of the failure rate and the mean time between preventive maintenance (technical stops, or horizon h). Reliabilities of components in series are multiplied.
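The series case of the composite model can be sketched as follows. This is a minimal illustration, not the author's calculation: it assumes constant failure rates, so that the reliability over a horizon h is exp(-λh), and the three MTBF values are hypothetical. It checks that adding failure rates in series gives the same result as multiplying the individual reliabilities.

```python
import math

def series_rate(rates_per_h):
    """Series association: failure rates add, like impedances in series."""
    return sum(rates_per_h)

def reliability(rate_per_h, horizon_h):
    """Probability of no failure over the horizon (constant-rate assumption)."""
    return math.exp(-rate_per_h * horizon_h)

# Hypothetical component MTBFs in hours (not taken from the slides).
mtbfs_h = [100_000.0, 200_000.0, 400_000.0]
rates = [1.0 / m for m in mtbfs_h]
horizon_h = 100 * 24.0  # 100 days between preventive maintenance

# Reliability from the combined failure rate...
r_series = reliability(series_rate(rates), horizon_h)
# ...equals the product of the individual reliabilities.
r_product = math.prod(reliability(r, horizon_h) for r in rates)

print(f"series MTBF: {1.0 / series_rate(rates):.0f} h, reliability: {r_series:.4f}")
```

Under the constant-rate assumption the two routes are equivalent, so the "impedance" view and the "multiply the reliabilities" view describe the same series model.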

Markov chains
Each converter is defined as a set of states, with transition probabilities applied at each time step. Failure probabilities (F = 1 - R) are defined as a function of the failure rate and the time step. The matrix P_m contains all transition probabilities. The failure probability of the whole system is a combination of the probabilities of all components.
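A discrete-time Markov chain of this kind might look like the sketch below for one N+1 redundant converter. The three-state layout, the way κ (the probability that a first failure is saved by redundancy) enters the matrix, and all numerical values are assumptions made for illustration; the slides do not give the actual matrix.

```python
import math

def step_matrix(rate_per_h, dt_h, n_modules, kappa):
    """Transition matrix P_m for states: 0 = all modules up,
    1 = one module failed and bypassed, 2 = converter down (absorbing)."""
    r = math.exp(-rate_per_h * dt_h)        # module reliability R over one step
    f_all = 1.0 - r ** (n_modules + 1)      # F: a failure among the N+1 modules
    f_deg = 1.0 - r ** n_modules            # F: a failure among the N remaining
    return [
        [1.0 - f_all, kappa * f_all, (1.0 - kappa) * f_all],
        [0.0, 1.0 - f_deg, f_deg],
        [0.0, 0.0, 1.0],
    ]

def propagate(state, matrix, steps):
    """Apply the transition matrix to a state distribution 'steps' times."""
    for _ in range(steps):
        state = [sum(state[i] * matrix[i][j] for i in range(len(state)))
                 for j in range(len(matrix[0]))]
    return state

# Hypothetical values: module MTBF 100,000 h, daily steps, N = 4, kappa = 0.8.
P_m = step_matrix(rate_per_h=1.0 / 100_000.0, dt_h=24.0, n_modules=4, kappa=0.8)
state = propagate([1.0, 0.0, 0.0], P_m, steps=100)  # 100-day horizon
failure_probability = state[2]
print(f"P(converter down after 100 days) = {failure_probability:.4f}")
```

For a whole system, one such chain per converter would be evaluated and the resulting failure probabilities combined.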

Poisson modelling
Failure times are assumed to be independent, identically distributed exponential variables. The expected number of failures is given as a function of the horizon time (the number of days between preventive maintenance), with an envelope corresponding to 95% probability. Down time is a function of the number of failures and the MTTR (mean time to repair), and also counts the down time of the maintenance days.
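The Poisson estimate can be sketched as below. The unit count, MTBF and horizon are the drive beam linac figures quoted in this talk; the 4 h MTTR is an assumption borrowed from the decelerator hypothesis, and treating every failure as a full stop repaired one at a time is also an assumption. The helper name poisson_quantile is hypothetical.

```python
import math

def poisson_quantile(mean, prob):
    """Smallest k such that P(N <= k) >= prob for N ~ Poisson(mean)."""
    term = math.exp(-mean)  # P(N = 0)
    cdf = term
    k = 0
    while cdf < prob:
        k += 1
        term *= mean / k    # P(N = k) from P(N = k - 1)
        cdf += term
    return k

units = 1638           # klystron modulators
mtbf_h = 50_000.0      # MTBF per modulator
horizon_h = 100 * 24.0 # 100 days between preventive maintenance
mttr_h = 4.0           # assumed mean time to repair

expected_failures = units * horizon_h / mtbf_h
upper_95 = poisson_quantile(expected_failures, 0.95)   # 95% envelope
expected_repair_downtime_h = expected_failures * mttr_h

print(f"expected failures: {expected_failures:.1f}, 95% bound: {upper_95}")
print(f"expected repair down time: {expected_repair_downtime_h:.0f} h")
```

The maintenance days themselves would be added on top of the repair down time when computing the total.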

Modularity / Redundancy (1)
Modularity and redundancy are ways of increasing the reliability of a power converter. However, modularity decreases the overall MTBF of a system by increasing the number of components. Redundancy increases reliability if and only if failures can be saved and the added redundant system does not itself add failures. When a converter fails, depending on the redundant structure, either a short circuit or an open circuit must be ensured. The short circuit is ensured by a dedicated crowbar that must not fire under normal operation. The open circuit is ensured by a dedicated breaker that must not open under operation. The factor κ stands for the probability of saving a failure by redundancy. Its value is crucial when predicting the global reliability of a system.

Modularity / Redundancy (2)
If κ = 60% (left-hand graph), modularity adds more failures to the system than an individual converter would have. Only one case (one converter and one redundant module) increases the system reliability. If κ = 80% (right-hand graph), modularity can help increase reliability, but the price to pay stays high for a small gain. For higher values of κ, modularity and redundancy increase the reliability of the system decisively.

Drive Beam Linac
If the klystron modulators are designed with an MTBF of 50,000 hours, the powering of the 1,638 units reaches about 93.5% availability with individual powering. With one hot spare for every 20 modulators (and a few minutes to remotely swap a failed converter), 97% is reached for a horizon time of 100 days. The design of the modulator is still under research. It will include reliability optimisation with either redundancy or hot swap. The modulator MTBF target is 100,000 hours.

CLIC machine availability (1)
The minimum expected machine availability with individual powering of all magnets does not reach 80% when considering failure tolerance in the drive beam decelerator (which comes for free). The availability due to maintenance is defined by one maintenance day out of every h days, where h is the horizon used in the reliability calculations. Availability is set by the proportion between MTBF and MTTR.
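The two availability terms can be combined as in the sketch below. It assumes the usual steady-state definition MTBF / (MTBF + MTTR) for the repair term and reads "one maintenance day out of h days" as a factor 1 - 1/h; the slides do not spell out either formula, and the system-level MTBF used here is hypothetical.

```python
def repair_availability(mtbf_h, mttr_h):
    """Steady-state availability from MTBF and MTTR."""
    return mtbf_h / (mtbf_h + mttr_h)

def maintenance_availability(horizon_days):
    """One maintenance day out of every horizon_days days."""
    return 1.0 - 1.0 / horizon_days

def machine_availability(system_mtbf_h, mttr_h, horizon_days):
    """Product of the repair and maintenance terms (assumed independent)."""
    return (repair_availability(system_mtbf_h, mttr_h)
            * maintenance_availability(horizon_days))

# Hypothetical system-level MTBF of 50 h, 4 h repairs, 60-day horizon.
availability = machine_availability(system_mtbf_h=50.0, mttr_h=4.0, horizon_days=60)
print(f"machine availability: {availability:.3f}")
```

With a fixed MTBF this sketch only improves as the horizon grows. In the talk's model the reliability also degrades with the horizon, which is what makes an optimum horizon appear.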

CLIC machine availability (2)
With failure tolerance in the main beam quadrupoles, hot swap in the klystron modulators, and hot-swap redundancy for each magnet, availability peaks at 93% for a horizon of about 60 days.

Towards specifications
The modulators have the same requirements on voltage cycles as in traction, but over a shorter length of time. Given the specification of 100,000 hours for one modulator's MTBF, what are the possibilities?
Single individual modulator: high reliability required of every single component.
Modular-redundant approach on single modulators: needs a high κ factor.
Hot spare approach: needs ready hot spares for quick replacement of bulky modulators.

CLIC lifetime: 10 years
Availability: 97%
Thermal cycles: 1500? (modulator stops every two days?)
Voltage cycles: 1.3·10^10 (50 Hz operation)
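As a rough cross-check of the voltage-cycle figure, the sketch below counts pulses at 50 Hz over ten years. It is plain arithmetic on the numbers above, not a statement of how the author derived them; the variable names and the "implied duty" reading are assumptions.

```python
SECONDS_PER_YEAR = 365.25 * 24 * 3600

def pulses(rep_rate_hz, years, duty=1.0):
    """Number of voltage cycles for a given repetition rate and operating fraction."""
    return rep_rate_hz * years * SECONDS_PER_YEAR * duty

upper_bound = pulses(50.0, 10.0)      # pulsing continuously for ten years
quoted = 1.3e10                       # figure given in the specification
implied_duty = quoted / upper_bound   # fraction of the ten years spent pulsing

print(f"upper bound: {upper_bound:.3e} cycles, implied duty: {implied_duty:.2f}")
```

The quoted figure sits below the continuous-operation bound, which is consistent with the machine not pulsing all year round.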

Modulator reliability
The reliability of a single modulator, or of one module of a modulator, is a function of the reliability of the charger, pulse transformer, solid-state switch, bouncer, capacitors, control, and measurement. IGBT reliability depends on a thermal factor, an environmental factor, a quality factor, and a voltage stress factor; together they define a number of cycles to failure. When evaluating the reliability of a topology, the reliability or failure rate in FIT of each component should be known, so that a global reliability can be derived for the modulator. However, these figures will rest on assumptions, notably about thermal effects and environment. In a modular-redundant approach, the reliability of the bypass system must be high enough to ensure high availability.
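A FIT budget for one modulator could be tallied as in the sketch below. One FIT is one failure per 10^9 device-hours, and the components are treated as a series system so their FIT values add. The per-component figures are entirely hypothetical, chosen only so that the total meets the 100,000 h MTBF target; they are not measured or quoted values.

```python
FIT_HOURS = 1e9  # 1 FIT = one failure per 1e9 device-hours

def mtbf_from_fit(total_fit):
    """MTBF in hours for a series system with the given total FIT."""
    return FIT_HOURS / total_fit

# Hypothetical FIT allocation per subsystem (illustration only).
component_fit = {
    "charger": 2000.0,
    "pulse transformer": 500.0,
    "solid-state switch": 3000.0,
    "bouncer": 1000.0,
    "capacitors": 1500.0,
    "control": 1500.0,
    "measurement": 500.0,
}

total_fit = sum(component_fit.values())
modulator_mtbf_h = mtbf_from_fit(total_fit)
print(f"total: {total_fit:.0f} FIT, MTBF: {modulator_mtbf_h:.0f} h")
```

Read the other way round, a 100,000 h MTBF target leaves a total budget of 10,000 FIT to share among all subsystems, including any bypass hardware added for redundancy.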