Presentation is loading. Please wait.

Presentation is loading. Please wait.

Background on Reliability and Availability Slides prepared by Wayne D. Grover and Matthieu Clouqueur TRLabs & University of Alberta © Wayne D. Grover 2002,

Similar presentations


Presentation on theme: "Background on Reliability and Availability Slides prepared by Wayne D. Grover and Matthieu Clouqueur TRLabs & University of Alberta © Wayne D. Grover 2002,"— Presentation transcript:

1 Background on Reliability and Availability Slides prepared by Wayne D. Grover and Matthieu Clouqueur TRLabs & University of Alberta © Wayne D. Grover 2002, 2003 E E 681 - Module 2 ( Version for book website )

2 E E 681 Lecture #2 © Wayne D. Grover 2002, 2003 2 Overview of the lecture Concept of Reliability –Reliability function, Failure density function, hazard rate Concept of Availability: –Availability function, unavailability, availability of elements in series/parallel Methodology for Availability Analysis –Quick Unavailability Lower bound estimation –Cut sets method –Tie paths method Automatic Protection Switching (APS) Systems –Principle –Availability Analysis of an APS system

3 E E 681 Lecture #2 © Wayne D. Grover 2002, 2003 3 Reliability is a mission-oriented question Technical meaning of Reliability In everyday English: –“My car is very reliable”  It works well, it starts every time (even at -30°). Technical meaning: –Reliability is the probability of a device performing its purpose adequately for the period of time intended under the operating conditions intended. –Example: Reliability of a fuel-pump during a rocket launch

4 E E 681 Lecture #2 © Wayne D. Grover 2002, 2003 4 –Q(t) = probability { at least one failure in interval [0,t] } –Q(t) = 1 - R(t) R(t) t (R(t) is always a non- increasing function) Reliability The reliability function R(t): –R(t) = probability { no failure in interval [0,t] } R(0) = 1 R(  ) = 0 1

5 E E 681 Lecture #2 © Wayne D. Grover 2002, 2003 5 f(t) can be seen as the pdf of time to next failure Reliability R(t) = prob { no failure in [0,t] } Related function: failure density function, f(t)

6 E E 681 Lecture #2 © Wayne D. Grover 2002, 2003 6 Rate of failures Given that the element has survived this long Reliability Hazard rate (t) (age specific failure rate) : Rate of failure of an element given that this element has survived this long integration

7 E E 681 Lecture #2 © Wayne D. Grover 2002, 2003 7 TTF 1 Failure0t Reliability Expected Time To Failure or Mean Time To Failure (MTTF): –It is the expected value of the random variable with pdf f(t):

8 E E 681 Lecture #2 © Wayne D. Grover 2002, 2003 8 Reliability Special case: constant hazard rate (memoryless) –In this case we can apply the Poisson distribution:

9 E E 681 Lecture #2 © Wayne D. Grover 2002, 2003 9 Reliability Numerical example: –Poisson distribution with = 1 / (5 years) Probability of 1 failure in the first year: P = 16.4% Probability of at least one failure in the first year: P = 18.1% Probability of 1 failure in the first 5 years: P = 36.8% Probability of at least one failure in the first 5 years: P = 63.2%

10 E E 681 Lecture #2 © Wayne D. Grover 2002, 2003 10 “What is the probability that the engine of a formula 1 car will work during the whole race?” –This is a reliability question “How often do I hear the dial tone when I pick up the phone?” –This is an availability question Availability is the probability of finding the system in the operating state at any arbitrary time in the future Unlike in the context of reliability we now consider systems that can be repaired Availability (Repairable systems)

11 E E 681 Lecture #2 © Wayne D. Grover 2002, 2003 11 Region 1: R(t) and A(t) are the same Region 2: Repair actions begin to hold up A Region 3: A reaches a steady state Availability Comparison of Availability and Reliability Functions:

12 E E 681 Lecture #2 © Wayne D. Grover 2002, 2003 12 Time to Repair Time to Failure Time to Repair Time Between Failures t MTTF: Mean-Time To Failure MTBF: Mean-Time Between Failures MTTR: Mean-Time To Repair FailureRepair Failure Availability

13 E E 681 Lecture #2 © Wayne D. Grover 2002, 2003 13 In availability analysis we usually work with unavailability quantities because of some simplifications that can be done on the unavailability of elements in series and in parallel FIT: Unit corresponding to 1 failure in 10 9 hours 1 FIT = 1 failure in 114,155 years 1 failure / year = 114,155 FITS (high!) Typical value for telecom equipment: 1500 FITS ( MTTF = 76 years ) Availability Unavailability:

14 E E 681 Lecture #2 © Wayne D. Grover 2002, 2003 14 1 2 3 n... 2 1 n Approximation based on the fact that U i << 1 Numerical examp. U i = 10 -3, n = 3  U s = 3. 10 -3  U s = 10 -9 Availability Series elements unavailability reduction: Parallel elements unavailability reduction:

15 E E 681 Lecture #2 © Wayne D. Grover 2002, 2003 15 Availability Analysis The reliability engineer can use different techniques to evaluate the availability of a system: – 1) Quick estimate of a lower bound for the unavailability –2) Series and parallel unavailability reductions –3) Cut set method –4) Tie paths method –5) Conditional decomposition The general methodology is explained next…

16 E E 681 Lecture #2 © Wayne D. Grover 2002, 2003 16 Availability Analysis General Methodology: 1) Get unavailability values of all components and sub-systems. 2) Draw parallel and series availability relationships 3) Reduce the system availability model by repeated applications of the parallel/series availability simplifications. 4) If not completely reduced, do quick unavailability lower bound estimation, use the tie paths method, the cut sets method or the conditional decomposition

17 E E 681 Lecture #2 © Wayne D. Grover 2002, 2003 17 A B C D E F G H Lower bound of U s : U A +U H Availability Analysis Lower bound on unavailability –The contributions of parallel elements to the unavailability is not taken into account –In some cases this quick evaluation of a lower bound on U can be enough to conclude that the system does not meet the availability requirements

18 E E 681 Lecture #2 © Wayne D. Grover 2002, 2003 18 1 A 2 3 5 B 4 I O Availability Analysis Tie paths method: –We enumerate all the paths from I to O 8 || 96+7 1 2 3 5 4 A B IO –The availability of each paths is calculated: –The availability of the system is:

19 E E 681 Lecture #2 © Wayne D. Grover 2002, 2003 19 Availability Analysis Cut sets method: –Which combinations of element failures can bring the system down? –The probability of each cut is calculated: – The availability of the system is : 1 A 2 3 5 B 4

20 E E 681 Lecture #2 © Wayne D. Grover 2002, 2003 20 A syst 1 A syst 2 A d low Availability Analysis Conditional decomposition (High Unavailability Elements): –When some elements have high U, it becomes less acceptable to sum unavailabilities. –Solution: Conditional decomposition: –The availability of the system is :

21 E E 681 Lecture #2 © Wayne D. Grover 2002, 2003 21 Automatic Protection Switching (APS) Systems Basic idea: –to provide a standby transmission channel that is kept in fully operating condition and used to replace any of the other traffic bearing channels in the event of their failure Characteristics of an APS system: –spare to working ratio: ‘1-to-1’ or ‘1-to-N’ –co-routed / diversely routed: ‘1-to-1’ or ‘1-to-1 /DP’ ‘1-to-N’ or ‘1-to-N /DP’ –1+1 or 1:1: ‘1+1’: Signal always sent on the spare channel ‘1:1’: Signal sent on spare channel upon failure of the working channel

22 E E 681 Lecture #2 © Wayne D. Grover 2002, 2003 22 For Head End Bridge(HEB) and Tail End Transfer(TET): Mode 1 failure: working signal is not relayed Mode 2 failure: no bridging or transfer to/from spare channel Automatic Protection Switching (APS) Systems 1:N APS system:

23 E E 681 Lecture #2 © Wayne D. Grover 2002, 2003 23 Automatic Protection Switching (APS) Systems Cut sets approach to 1:N APS availability analysis: –Combinations creating outage for a specific channel (cut sets): Cut set 1: Failure that channel with prior failure of at least one other working channel Cut set 2: Failure of that working channel plus the spare channel or head end bridge or tail end transfer in mode 2 Cut set 3: Failure of head end bridge or tail end transfer in mode 1 –The probability of each cut set is: Cut set 1: U w  (N-1)U w  0.5 Cut set 2: U w  (U s + U b2 + U t2 ) Cut set 3: U b1 + U t1

24 E E 681 Lecture #2 © Wayne D. Grover 2002, 2003 24 O(U) A B O(Uc)O(Uc) U A = U B = 10 -3 U C = 10 -5 U S = U A U B + 2 U C  2 U C  c c Automatic Protection Switching (APS) Systems 1:N APS Unavailability –The unavailability of a channel is: –The term in O(U) reflects the irreducible series-availability elements: the HEB and the TET in their mode 1 failure. It is impossible to make a perfectly redundant system. There is always some parallelism-accessing device c that brings series unavailability contribution

25 E E 681 Lecture #2 © Wayne D. Grover 2002, 2003 25 Summary Reliability is a mission oriented question for non-repairable systems In telecom engineering we are interested in the availability of the system designed There are several techniques that can be used for availability analysis. The one we will use in the rest of the course is the algebraic approach (equivalent to cut sets) APS is a protection scheme that enhances availability by providing a spare channel for restoration of failed working channels


Download ppt "Background on Reliability and Availability Slides prepared by Wayne D. Grover and Matthieu Clouqueur TRLabs & University of Alberta © Wayne D. Grover 2002,"

Similar presentations


Ads by Google