Markov Method of Availability Analysis, APS systems, and Availability Simulation Methods E E 681 Module 21 W. D. Grover TRLabs & University of Alberta.

Slides:



Advertisements
Similar presentations
Lecture 5 This lecture is about: Introduction to Queuing Theory Queuing Theory Notation Bertsekas/Gallager: Section 3.3 Kleinrock (Book I) Basics of Markov.
Advertisements

ST3236: Stochastic Process Tutorial 3 TA: Mar Choong Hock Exercises: 4.
Lect.3 Modeling in The Time Domain Basil Hamed
Generating Random Numbers
SE503 Advanced Project Management Dr. Ahmed Sameh, Ph.D. Professor, CS & IS Project Uncertainty Management.
Random Variable A random variable X is a function that assign a real number, X(ζ), to each outcome ζ in the sample space of a random experiment. Domain.
1 Chapter 5 Continuous time Markov Chains Learning objectives : Introduce continuous time Markov Chain Model manufacturing systems using Markov Chain Able.
Intensity Transformations (Chapter 3)
5.2 Continuous Random Variable
STAT 497 APPLIED TIME SERIES ANALYSIS
Markov Analysis Jørn Vatn NTNU.
Topics Review of DTMC Classification of states Economic analysis
Continuous Random Variables. L. Wang, Department of Statistics University of South Carolina; Slide 2 Continuous Random Variable A continuous random variable.
Our Group Members Ben Rahn Janel Krenz Lori Naiberg Chad Seichter Kyle Colden Ivan Lau.
Reliable System Design 2011 by: Amir M. Rahmani
Planning under Uncertainty
Resampling techniques Why resampling? Jacknife Cross-validation Bootstrap Examples of application of bootstrap.
Test 2 Stock Option Pricing
Probability Theory Part 2: Random Variables. Random Variables  The Notion of a Random Variable The outcome is not always a number Assign a numerical.
OS Fall ’ 02 Performance Evaluation Operating Systems Fall 2002.
A. BobbioReggio Emilia, June 17-18, Dependability & Maintainability Theory and Methods Part 2: Repairable systems: Availability Andrea Bobbio Dipartimento.
E E Module 18 M.H. Clouqueur and W. D. Grover TRLabs & University of Alberta © Wayne D. Grover 2002, 2003 Analysis of Path Availability in Span-Restorable.
3-1 Introduction Experiment Random Random experiment.
Computer vision: models, learning and inference Chapter 10 Graphical Models.
Properties of Random Numbers
Lecture 11 – Stochastic Processes
1 Logistics Systems Engineering Availability NTU SY-521-N SMU SYS 7340 Dr. Jerrell T. Stracener, SAE Fellow.
1 Performance Evaluation of Computer Systems By Behzad Akbari Tarbiat Modares University Spring 2009 Introduction to Probabilities: Discrete Random Variables.
Discrete Distributions
1 Reading Report 9 Yin Chen 29 Mar 2004 Reference: Multivariate Resource Performance Forecasting in the Network Weather Service, Martin Swany and Rich.
 1  Outline  stages and topics in simulation  generation of random variates.
Background on Reliability and Availability Slides prepared by Wayne D. Grover and Matthieu Clouqueur TRLabs & University of Alberta © Wayne D. Grover 2002,
Software Reliability SEG3202 N. El Kadri.
Tch-prob1 Chap 3. Random Variables The outcome of a random experiment need not be a number. However, we are usually interested in some measurement or numeric.
Monte Carlo Simulation CWR 6536 Stochastic Subsurface Hydrology.
1 7. Two Random Variables In many experiments, the observations are expressible not as a single quantity, but as a family of quantities. For example to.
Chapter 7 Random-Number Generation
Ch5. Probability Densities II Dr. Deshi Ye
1 Let X represent a Binomial r.v as in (3-42). Then from (2-30) Since the binomial coefficient grows quite rapidly with n, it is difficult to compute (4-1)
CDA6530: Performance Models of Computers and Networks Chapter 8: Discrete Event Simulation Example --- Three callers problem in homwork 2 TexPoint fonts.
1 Topic 3 - Discrete distributions Basics of discrete distributions Mean and variance of a discrete distribution Binomial distribution Poisson distribution.
Probability Theory for Networking Application Rudra Dutta ECE/CSC Fall 2010, Section 001, 601.
Lecture 4: State-Based Methods CS 7040 Trustworthy System Design, Implementation, and Analysis Spring 2015, Dr. Rozier Adapted from slides by WHS at UIUC.
Fault-Tolerant Computing Systems #4 Reliability and Availability
1 Markov chains and processes: motivations Random walk One-dimensional walk You can only move one step right or left every time unit Two-dimensional walk.
Random Variable The outcome of an experiment need not be a number, for example, the outcome when a coin is tossed can be 'heads' or 'tails'. However, we.
1 CS 552/652 Speech Recognition with Hidden Markov Models Winter 2011 Oregon Health & Science University Center for Spoken Language Understanding John-Paul.
The generalization of Bayes for continuous densities is that we have some density f(y|  ) where y and  are vectors of data and parameters with  being.
Robust Estimation With Sampling and Approximate Pre-Aggregation Author: Christopher Jermaine Presented by: Bill Eberle.
Topic 5: Continuous Random Variables and Probability Distributions CEE 11 Spring 2002 Dr. Amelia Regan These notes draw liberally from the class text,
Advanced Engineering Mathematics, 7 th Edition Peter V. O’Neil © 2012 Cengage Learning Engineering. All Rights Reserved. CHAPTER 4 Series Solutions.
1 Introduction to Statistics − Day 4 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Lecture 2 Brief catalogue of probability.
STOCHASTIC HYDROLOGY Stochastic Simulation of Bivariate Distributions Professor Ke-Sheng Cheng Department of Bioenvironmental Systems Engineering National.
STA347 - week 91 Random Vectors and Matrices A random vector is a vector whose elements are random variables. The collective behavior of a p x 1 random.
CS433 Modeling and Simulation Lecture 11 Continuous Markov Chains Dr. Anis Koubâa 01 May 2009 Al-Imam Mohammad Ibn Saud University.
Stochastic Processes and Transition Probabilities D Nagesh Kumar, IISc Water Resources Planning and Management: M6L5 Stochastic Optimization.
11. Markov Chains (MCs) 2 Courtesy of J. Bard, L. Page, and J. Heyl.
Engineering Probability and Statistics - SE-205 -Chap 3 By S. O. Duffuaa.
1 Chapter 5 Continuous time Markov Chains Learning objectives : Introduce continuous time Markov Chain Model manufacturing systems using Markov Chain Able.
Generating Random Variates
Chapter 6 Random Processes
Reliability Engineering
Discrete-time Markov chain (DTMC) State space distribution
Availability Availability - A(t)
Continuous Random Variables
Fault-Tolerant Computing Systems #5 Reliability and Availability2
T305: Digital Communications
Dept. of Electrical & Computer engineering
Autonomous Cyber-Physical Systems: Probabilistic Models
Lecture 5 This lecture is about: Introduction to Queuing Theory
Presentation transcript:

Markov Method of Availability Analysis, APS systems, and Availability Simulation Methods E E 681 Module 21 W. D. Grover TRLabs & University of Alberta © Wayne D. Grover 2002, 2003

E E Module 21 © Wayne D. Grover 2002, Markov chain: A system with discrete states in statistical equilibrium and with constant memoryless probabilities of making transition between states... Example: two-state working-failed system Markov Techniques for Availability Analysis 1 2 p 12 = p 21 =  p 22 = 1-  p 11 = 1- State 1: working State 2: failed = 1/MTTF = failure rate  = 1/MTTR = repair rate

E E Module 21 © Wayne D. Grover 2002, Consider that: This means that:  1 = prob. of being in state 1 n = time-step p ij = state transition probability Thus.... “spinning” the transition matrix Once a vector is known, availability is known or computable over non-operating states Etc. Markov Techniques for Availability Analysis

E E Module 21 © Wayne D. Grover 2002, That is, that the state probability vector doesn’t change under repeated multiplication by matrix P. Which can be solved numerically for . A separate computational method is based on observing that the system must reach steady-state probabilities wherein, as every time epoch passes, the state occupancy probabilities remain unchanged, implying that:...or more generally that : is reached in the limit. Markov Techniques for Availability Analysis

E E Module 21 © Wayne D. Grover 2002, Functional view of 1+1 APS Markov chain model of APS Markov Techniques: Model of a 1+1 APS System State transition probability matrix Failure state

E E Module 21 © Wayne D. Grover 2002, Functional view of 1:N APS K1-K2 byte signalling: K1 - Head end bridge request (channel number) K2- Head end bridge confirm (channel number) Markov Techniques: Model of 1:N APS Systems

E E Module 21 © Wayne D. Grover 2002, Availability model view of 1:N APS U w = working system unavailability U s = spare system unavailability U b1 = head end bridge unavailability, failure mode 1 U b2 = head end bridge unavailability, failure mode 2 U t1 = tail end transfer unavailability, failure mode 1 U t2 = tail end transfer unavailability, failure mode 2 Markov Techniques: Models of APS Systems

E E Module 21 © Wayne D. Grover 2002, Method: List outage-causing failure combinations from functional inspection / system knowledge: Outage if: (1) two or more working systems fail. (2) If HEB or TET fail at any time in mode 2, or the spare span fails, and there is one or more working system failure. (3) If HEB or TET fail at any time in mode 1 there is outage. 2 or more working systems 1 working system and spare or HEB / TET active-function HEB / TET through-function by itself: “series element” “Cutset” (algebraic) model of APS availability

E E Module 21 © Wayne D. Grover 2002, Markov chain model of 1:N APSOutage-causing states Markov Techniques: Models of APS Systems

E E Module 21 © Wayne D. Grover 2002, It is usual to provide redundancy in the form of a “standby” system to enhance availability. It is also easy to overlook the impact of “silent failure” of the protection system. 1) Both working and spare sys- tems up 2) “working sys- tem” down, spare takes over l l m 3) Both working and spare down 1- m l l - m 2m Ideal view: (spare assumed always to be ready) Actual system (spare can have “silent failure”) This can be approximated by the above iff the spare unit is routinely “exercised” An engineering issue: “exercising the spare”

E E Module 21 © Wayne D. Grover 2002, Basic idea: - Represent system in Markov chain form - Simulate random transitions and observe state probabilities (instead of attempting symbolic analysis of the model) Some Advantages: - can obtain experimental distributions of frequency / duration data - can model transition rates that are state dependent - can drive system through transient episodes on non-stationary statistical behaviour Limitation: - strictly only valid model of real systems for negative exponential (i.e., memoryless) failure and repair models - reason is that transitions are generated into / out-of states, they are not generated based on individual equipment items Simulation techniques inspired by Markov modelling

E E Module 21 © Wayne D. Grover 2002, Basic simulation engine: t= 0; assign initial state S(t=0); repeat generate random variable x i for each transition type exiting current state; t := t+ min{x 1, x 2,....X n }, k <- i : x i is min. (i.e. k indicates transition selected) ; next_state := case of {current_state, transition k} record / update any relevant statistical data -examples: time since last entering a failure state,.... Until {forever} General method for generating r.v.s with any desired probability density function: For “memoryless” processes: u* is uniform r.v. on (0,1) t* is then distributed as: Simulation inspired by Markov modelling(2)

E E Module 21 © Wayne D. Grover 2002, Method: - Identify each independent subsystem or component that is subject to failure and repair. - Specify (in CDF form) the distribution of times between failure and repair times for that element. - For each element individually simulate a time-line history of its failure and repair. - Merged the event-sequences for all elements into one time line - Use the merged time-line to drive the walk through the system state model. - Accumulate all desired statistics such as: total outage time, times between system outage, outage times duration, etc. Even more general simulation method....

E E Module 21 © Wayne D. Grover 2002, Issue is: virtually all analytically obtainable results provide only the mean (not distribution) of outage times duration or times between failure and... simulation based on memorylessness, always implies times between failure and repair times duration are both negative exponentially distributed p r o b Time between failure or time to repair Believable for times between failures but certainly not accurate pdf shape for distribution of repair times. Q. (project): How different is the accurately simulated distribution of outage-time durations from the analytical model ? Q. How much might this affect SLA policies ? (Service Level Agreements) Repair times more like Issue (and related project topic)...