Oct. 2007State-Space ModelingSlide 1 Fault-Tolerant Computing Motivation, Background, and Tools.

Slides:



Advertisements
Similar presentations
Lecture 5 This lecture is about: Introduction to Queuing Theory Queuing Theory Notation Bertsekas/Gallager: Section 3.3 Kleinrock (Book I) Basics of Markov.
Advertisements

LAPLACE TRANSFORMS.
Apr. 2007SatisfiabilitySlide 1 Satisfiability A Lecture in CE Freshman Seminar Series: Ten Puzzling Problems in Computer Engineering.
Lecture 6  Calculating P n – how do we raise a matrix to the n th power?  Ergodicity in Markov Chains.  When does a chain have equilibrium probabilities?
The Laplace Transform Let f(x) be defined for 0≤x
Fault Tree Analysis Part 12 – Redundant Structure and Standby Units.
Chapter 8 Continuous Time Markov Chains. Markov Availability Model.
1 Chapter 5 Continuous time Markov Chains Learning objectives : Introduce continuous time Markov Chain Model manufacturing systems using Markov Chain Able.
INDR 343 Problem Session
Probability and Statistics with Reliability, Queuing and Computer Science Applications: Chapter 6 on Stochastic Processes Kishor S. Trivedi Visiting Professor.
ECE 8443 – Pattern Recognition ECE 3163 – Signals and Systems Objectives: Review Resources: Wiki: State Variables YMZ: State Variable Technique Wiki: Controllability.
TCOM 501: Networking Theory & Fundamentals
Dependability Evaluation through Markovian model.
NETE4631:Capacity Planning (3)- Private Cloud Lecture 11 Suronapee Phoomvuthisarn, Ph.D. /
Reliable System Design 2011 by: Amir M. Rahmani
Lecture 13 – Continuous-Time Markov Chains
Oct State-Space Modeling Slide 1 Fault-Tolerant Computing Motivation, Background, and Tools.
June 2007Malfunction DiagnosisSlide 1 Malfunction Diagnosis A Lecture in CE Freshman Seminar Series: Ten Puzzling Problems in Computer Engineering.
Oct Combinational Modeling Slide 1 Fault-Tolerant Computing Motivation, Background, and Tools.
Ch 7.9: Nonhomogeneous Linear Systems
A. BobbioBertinoro, March 10-14, Dependability Theory and Methods 5. Markov Models Andrea Bobbio Dipartimento di Informatica Università del Piemonte.
Oct Terminology, Models, and Measures Slide 1 Fault-Tolerant Computing Motivation, Background, and Tools.
Laplace Transforms Important analytical method for solving linear ordinary differential equations. - Application to nonlinear ODEs? Must linearize first.
TCOM 501: Networking Theory & Fundamentals
Transfer Functions Convenient representation of a linear, dynamic model. A transfer function (TF) relates one input and one output: The following terminology.
Queueing Theory: Part I
Question 11 – 3.
2003 Fall Queuing Theory Midterm Exam(Time limit:2 hours)
Modeling and Analysis of Manufacturing Systems Session 2 QUEUEING MODELS January 2001.
Transfer Functions Convenient representation of a linear, dynamic model. A transfer function (TF) relates one input and one output: The following terminology.
Oct Fault Masking Slide 1 Fault-Tolerant Computing Dealing with Low-Level Impairments.
Oct. 2007Combinational ModelingSlide 1 Fault-Tolerant Computing Motivation, Background, and Tools.
Chapter 3 1 Laplace Transforms 1. Standard notation in dynamics and control (shorthand notation) 2. Converts mathematics to algebraic operations 3. Advantageous.
1 Chapter 2 Matrices Matrices provide an orderly way of arranging values or functions to enhance the analysis of systems in a systematic manner. Their.
5.7 Impulse Functions In some applications, it is necessary to deal with phenomena of an impulsive nature—for example, voltages or forces of large magnitude.
ORDINARY DIFFERENTIAL EQUATION (ODE) LAPLACE TRANSFORM.
ECE 2300 Circuit Analysis Dr. Dave Shattuck Associate Professor, ECE Dept. Lecture Set #12 Natural Response W326-D3.
ECE 2300 Circuit Analysis Dr. Dave Shattuck Associate Professor, ECE Dept. Lecture Set #13 Step Response W326-D3.
NETE4631:Capacity Planning (2)- Lecture 10 Suronapee Phoomvuthisarn, Ph.D. /
ECE 8443 – Pattern Recognition ECE 3163 – Signals and Systems Objectives: First-Order Second-Order N th -Order Computation of the Output Signal Transfer.
Chapter 3 mathematical Modeling of Dynamic Systems
Prepared by Mrs. Azduwin Binti Khasri
1 Elements of Queuing Theory The queuing model –Core components; –Notation; –Parameters and performance measures –Characteristics; Markov Process –Discrete-time.
Chapter 6: Sampled Data Systems and the z-Transform 1.
Why Wait?!? Bryan Gorney Joe Walker Dave Mertz Josh Staidl Matt Boche.
Motivation Thus far we have dealt primarily with the input/output characteristics of linear systems. State variable, or state space, representations describe.
CS433 Modeling and Simulation Lecture 07 – Part 01 Continuous Markov Chains Dr. Anis Koubâa 14 Dec 2008 Al-Imam.
NETE4631: Network Information System Capacity Planning (2) Suronapee Phoomvuthisarn, Ph.D. /
Linear Constant-Coefficient Difference Equations
Discretization of Continuous-Time State Space Models
REPAIRABLE SYSTEMS 1 AVAILABILITY ENGINEERING. A Single Repairable Component With Failure Rate λ and Repair Rate μ The component may exist in one of Two.
ERT 210/4 Process Control & Dynamics DYNAMIC BEHAVIOR OF PROCESSES :
1 Chapter 5 Continuous time Markov Chains Learning objectives : Introduce continuous time Markov Chain Model manufacturing systems using Markov Chain Able.
Math for CS Fourier Transforms
Dynamic Presentation of Key Concepts Module 6 – Part 2 Natural Response of First Order Circuits Filename: DPKC_Mod06_Part02.ppt.
DYNAMIC BEHAVIOR OF PROCESSES :
Class 3 Linear System Solution Using the Laplace Transform
Copyright 2007 Koren & Krishna, Morgan-Kaufman Part.4.1 FAULT TOLERANT SYSTEMS Part 4 – Analysis Methods Chapter 2 – HW Fault Tolerance.
Transfer Functions Convenient representation of a linear, dynamic model. A transfer function (TF) relates one input and one output: The following terminology.
Discrete-time Markov chain (DTMC) State space distribution
Teknik kendali.
Availability Availability - A(t)
Much More About Markov Chains
SIGMA INSTITUTE OF ENGINEERING
Mathematical Modeling of Control Systems
Fundamentals of Electric Circuits Chapter 15
Equivalent State Equations
Reliability Engineering
Mathematical Models of Control Systems
Chapter 3 Modeling in the Time Domain
Presentation transcript:

Oct. 2007State-Space ModelingSlide 1 Fault-Tolerant Computing Motivation, Background, and Tools

Oct. 2007State-Space ModelingSlide 2 About This Presentation EditionReleasedRevised FirstOct. 2006Oct This presentation has been prepared for the graduate course ECE 257A (Fault-Tolerant Computing) by Behrooz Parhami, Professor of Electrical and Computer Engineering at University of California, Santa Barbara. The material contained herein can be used freely in classroom teaching or any other educational setting. Unauthorized uses are prohibited. © Behrooz Parhami

Oct. 2007State-Space ModelingSlide 3 State-Space Modeling

Oct. 2007State-Space ModelingSlide 4

Oct. 2007State-Space ModelingSlide 5 What Is State-Space Modeling? With respect to availability of resources and computational capabilities, a system can be viewed as being in one of several possible states State transitions: System moves from one state to another as resource availability and computational power change due to various events State-space modeling entails quantifying transition probabilities so as to determine the probability of the system being in each state; from this, we derive reliability, availability, safety, and other desired parameters The number of states can be large, if we want to make fine distinctions, or it can be relatively small if we lump similar states together Great Lousy Good So-so

Oct. 2007State-Space ModelingSlide 6 Markov Chains Represented by a state diagram with transition probabilities Sum of all transition probabilities out of each state is 1 s(t + 1) = s(t) M s(t + h) = s(t) M h The state of the system is characterized by the vector (s 0, s 1, s 2, s 3 ) (1, 0, 0, 0) means that the system is in state 0 (0.5, 0.5, 0, 0) means that the system is in state 0 or 1 with equal prob’s (0.25, 0.25, 0.25, 0.25) represents complete uncertainty Must sum to 1 Transition matrix: M = Example: (s 0, s 1, s 2, s 3 ) = (0.5, 0.5, 0, 0) M = (0.4, 0.4, 0.15, 0.05) (s 0, s 1, s 2, s 3 ) = (0.4, 0.4, 0.15, 0.05) M = (0.34, 0.365, 0.225, 0.07) Markov matrix (rows sum to 1) Self loops not shown

Oct. 2007State-Space ModelingSlide 7 Stochastic Sequential Machines Transition taken from state s under input j is not uniquely determined Rather, a number of states may be entered with different probabilities There will be a separate transition (Markov) matrix for each input value Transitions, j = 0: M = A Markov chain can be viewed as a stochastic sequential machine with no input Self loops and transitions for j = 1 not shown Transitions, j = 1: M =

Oct. 2007State-Space ModelingSlide 8 Sample Applications of Markov Modeling “Hidden Markov Model” for recognition problems Markov model for programmer workflow

Oct. 2007State-Space ModelingSlide 9 Merging States in a Markov Model  All solid lines Dashed lines    0 Failed state if TMR Simpler equivalent model for 3-unit fail-soft system Whether or not states are merged depends on the model’s semantics There are three identical units 1 = Unit is up 0 = Unit is down

Oct. 2007State-Space ModelingSlide 10 Two-State Nonrepairable Systems Reliability as a function of time: R(t) = p 1 (t) = e – t Rate of change for the probability of being in state 1 is – p 1 = – p 1 p 1 + p 0 = 1 10 The label on this transition means that over time dt, the transition will occur with probability dt (we are dealing with a continuous-time Markov model) p 0 = 1 – e – t p 1 = e – t 1 Time Initial condition: p 1 (0) = 1 Start state Failure UpDown

Oct. 2007State-Space ModelingSlide 11 k-out-of-n Nonrepairable Systems nn – 1 n n – 2 (n–1) kk – 1 k … 0 … p n = –n p n p n–1 = n p n – (n – 1) p n–1. p k = (k + 1) p k+1 – k p k p n + p n– p k + p F = 1 F p n = e –n t p n–1 = ne –(n–1) t (1 – e – t ). p k = ( ) e –(n–k) t (1 – e – t ) k p F = 1 –  j=k to n p j nknk Initial condition: p n (0) = 1 In this case, we do not need to resort to more general method of solving linear differential equations (LaPlace transform, to be introduced later) The first equation is solvable directly, and each additional equation introduces only one new variable

Oct. 2007State-Space ModelingSlide 12 Two-State Repairable Systems Availability as a function of time: A(t) = p 1 (t) =  /( +  ) + /( +  ) e –( +  )t Derived in the next slide In steady state (equilibrium), transitions into/out-of each state must “balance out” – p 1 +  p 0 = 0 p 1 + p 0 = 1 10  The label  on this transition means that over time dt, repair will occur with probability  dt (constant repair rate as well as constant failure rate) p 1 =  /( +  ) p 0 = /( +  ) 1 Steady-state availability Time Repair Start state Failure UpDown

Oct. 2007State-Space ModelingSlide 13 Solving the State Differential Equations 10  To solve linear differential equations with constant coefficients: 1. Convert to algebraic equations using LaPlace transform 2. Solve the algebraic equations 3. Use inverse LaPlace transform to find original solutions p 1 (t) = – p 1 (t) +  p 0 (t) p 0 (t) = –  p 0 (t) + p 1 (t) sP 1 (s) –  p 1 (0) = – P 1 (s) +  P 0 (s) sP 0 (s) –  p 0 (0) = –  P 0 (s) + P 1 (s) P 1 (s) = (s +  ) / [s 2 + ( +  )s] P 0 (s) = / [s 2 + ( +  )s] 1 0 p 1 (t) =  /( +  ) + /( +  ) e –( +  )t p 0 (t) = /( +  ) – /( +  ) e –( +  )t LaPlace Transform Table h(t) H(s) kk/s e –at 1/(s + a) t n–1 e –at /(n – 1)!1/(s + a) n k h(t) k H(s) h(t) + g(t) H(s) + G(s) h(t) s H(s) – h(0) Start state

Oct. 2007State-Space ModelingSlide 14 Inverse LaPlace Transform 10  To find the solutions via inverse LaPlace transform: 1. Manipulate expressions into sum of terms, each of which takes one of the forms shown under H(s) 2. Find the inverse transform for each term (s +  ) / [s 2 + ( +  )s] = 1/[s + ( +  )] +  /[s 2 + ( +  )s] P 1 (s) = (s +  ) / [s 2 + ( +  )s] P 0 (s) = / [s 2 + ( +  )s] LaPlace Transform Table h(t) H(s) kk/s e –at 1/(s + a) t n–1 e –at /(n – 1)!1/(s + a) n k h(t) k H(s) h(t) + g(t) H(s) + G(s) h(t) s H(s) – h(0) Start state 1/[s 2 + ( +  )s] = a/s + b/[s + ( +  )] 1 = a[s + ( +  )] + bs  a + b = 0 a = 1/( +  ) b = –1/( +  )

Oct. 2007State-Space ModelingSlide 15 Systems with Multiple Failure States Safety evaluation: Total risk of system is  failure states c j p j In steady state (equilibrium), transitions into/out-of each state must “balance out” – p 2 +  p 1 +  p 0 = 0 –  p p 2 = 0 p 2 + p 1 + p 0 = 1 p 2 =  /( +  ) p 1 = 1 /( +  ) p 0 = 0 /( +  ) Failure state j has a cost (penalty) c j associated with it 1   = 1 Time p2(t)p2(t) p1(t)p1(t) p0(t)p0(t) Failed, type 1 Failed, type 2 Failure Start state Good Repair

Oct. 2007State-Space ModelingSlide 16 Systems with Multiple Operational States Performability evaluation: Performability =  operational states b j p j – 2 p 2 +  2 p 1 = 0 1 p 1 –  1 p 0 = 0 p 2 + p 1 + p 0 = 1 p 2 =  p 1 =  2 /  2 p 0 =  1 2 /(  1  2 ) Operational state j has a benefit b j associated with it 2 1  2 Let  = 1/[1 + 2 /  /(  1  2 )] Example: 2 = 2, 1 =,  1 =  2 =  (single repairperson or facility), b 2 = 2, b 1 = 1, b 0 = 0 P = 2p 2 + p 1 = 2  + 2  /  = 2(1 + /  )/(1 +2 /  /  2 ) 20 1 Start state Repair Failure Partial repair Partial failure

Oct. 2007State-Space ModelingSlide 17 TMR System with Repair Mean time to failure evaluation: See [Siew92], pp , for derivation MTTF = 5/(6 ) +  /(6 2 ) = [5/(6 )](  / ) –3 p 3 +  p 2 = 0 –(  + 2 )p p 3 = 0 p 3 + p 2 + p F = 1 Assume the voter is perfect Upon first module malfunction, we switch to duplex operation with comparison 3 3F2 2  Steady-state analysis of no use p 3 = p 2 = 0, p F = 1 MTTF Comparisons ( = 10 –6 /hr,  = 0.1/hr) Nonredundant1/ 1 M hr TMR5/(6 ) M hr TMR with repair[5/(6 )](  / )16,668 M hr MTTF for TMR Improvement due to repair Improvement factor

Oct. 2007State-Space ModelingSlide 18 Fail-Soft System with Imperfect Coverage – 2 p 2 +  2 p 1 = 0 2 (1 – c)p p 1 –  1 p 0 = 0 p 2 + p 1 + p 0 = 1 p 2 =  p 1 = 2  p 0 = 2  (1 – c +  )  If malfunction of one unit goes undetected, the system fails 2 c 1  2 We solve this in the special case of 2 = 2, 1 =,  2 =  1 =  Let  = /  and  = 1 / (1 + 4  – 2c  + 2  2 ) We can also consider coverage for the repair direction 2 (1 – c) 20 1 Start state Repair Failure Partial repair Partial failure

Oct. 2007State-Space ModelingSlide 19 Birth-and-Death Processes Special case of Markov model with states appearing in a chain and transitions allowed only between adjacent states This model is used in queuing theory, where the customers’ arrival rate and provider’s service rate determine the queue size and waiting time Closed-form expression for state probabilities can be found, assuming n + 1 states and s service providers (repair persons): M/M/s/n/n queue p j = (n – j + 1) ( /  ) p j–1 / j for j = 1, 2,..., r p j = (n – j + 1) ( /  ) p j–1 / r for j = r + 1, r + 2,..., n 4  01 3  2 3  24 22 22 33  22 33 44 Equation for p 0 [Siew92], p. 347

Oct. 2007State-Space ModelingSlide 20 The Dependability Modeling Process Choose modeling approach Combinational State-space Solve model Derive model parameters Interpret results Validate model and results Construct model Iterate until results are satisfactory

Oct. 2007State-Space ModelingSlide 21 Software Aids for Reliability Modeling Relex (company specializing in reliability engineering) Reliability block diagram: Markov: Iowa State University HIMAP: University of Virginia Galileo: See Appendix D, pp , of [Shoo02] for more programs Dept. of Mechanical Engineering, Univ. of Maryland: List of reliability engineering tools: