G12: Management Science Markov Chains.

Slides:

Advertisements

Similar presentations

Discrete time Markov Chain

Advertisements

The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL Chapter 4. Discrete Probability Distributions Section 4.11: Markov Chains Jiaping Wang Department of Mathematical.

Lecture 6  Calculating P n – how do we raise a matrix to the n th power?  Ergodicity in Markov Chains.  When does a chain have equilibrium probabilities?

Queueing Models and Ergodicity. 2 Purpose Simulation is often used in the analysis of queueing models. A simple but typical queueing model: Queueing models.

CS433 Modeling and Simulation Lecture 06 – Part 03 Discrete Markov Chains Dr. Anis Koubâa 12 Apr 2009 Al-Imam Mohammad Ibn Saud University.

Operations Research: Applications and Algorithms

Chapter 4 Mathematical Expectation.

1 Chapter 5 Continuous time Markov Chains Learning objectives : Introduce continuous time Markov Chain Model manufacturing systems using Markov Chain Able.

IERG5300 Tutorial 1 Discrete-time Markov Chain

Discrete Time Markov Chains

Markov Chains 1.

. Computational Genomics Lecture 7c Hidden Markov Models (HMMs) © Ydo Wexler & Dan Geiger (Technion) and by Nir Friedman (HU) Modified by Benny Chor (TAU)

Topics Review of DTMC Classification of states Economic analysis

11 - Markov Chains Jim Vallandingham.

Lecture 12 – Discrete-Time Markov Chains

TCOM 501: Networking Theory & Fundamentals

Chapter 17 Markov Chains.

Flows and Networks (158052) Richard Boucherie Stochastische Operations Research -- TW wwwhome.math.utwente.nl/~boucherierj/onderwijs/158052/ html.

1 Part III Markov Chains & Queueing Systems 10.Discrete-Time Markov Chains 11.Stationary Distributions & Limiting Probabilities 12.State Classification.

Андрей Андреевич Марков. Markov Chains Graduate Seminar in Applied Statistics Presented by Matthias Theubert Never look behind you…

Lecture 3: Markov processes, master equation

Markov Processes MBAP 6100 & EMEN 5600 Survey of Operations Research Professor Stephen Lawrence Leeds School of Business University of Colorado Boulder,

Markov Chain Part 2 多媒體系統研究群指導老師：林朝興博士學生：鄭義繼. Outline Review Classification of States of a Markov Chain First passage times Absorbing States.

Continuous Time Markov Chains and Basic Queueing Theory

Lecture 13 – Continuous-Time Markov Chains

048866: Packet Switch Architectures Dr. Isaac Keslassy Electrical Engineering, Technion Review.

Computational statistics 2009 Random walk. Computational statistics 2009 Random walk with absorbing barrier.

Chapter 4: Stochastic Processes Poisson Processes and Markov Chains

20. Extinction Probability for Queues and Martingales

Homework 2 Question 2: For a formal proof, use Chapman-Kolmogorov Question 4: Need to argue why a chain is persistent, periodic, etc. To calculate mean.

If time is continuous we cannot write down the simultaneous distribution of X(t) for all t. Rather, we pick n, t 1,...,t n and write down probabilities.

1 Markov Chains Algorithms in Computational Biology Spring 2006 Slides were edited by Itai Sharon from Dan Geiger and Ydo Wexler.

Markov Chains Chapter 16.

Stochastic Process1 Indexed collection of random variables {X t } t   for each t  T  X t is a random variable T = Index Set State Space = range.

CS6800 Advanced Theory of Computation Fall 2012 Vinay B Gavirangaswamy

Lecture 11 – Stochastic Processes

6. Markov Chain. State Space The state space is the set of values a random variable X can take. E.g.: integer 1 to 6 in a dice experiment, or the locations.

1 Performance Evaluation of Computer Networks: Part II Objectives r Simulation Modeling r Classification of Simulation Modeling r Discrete-Event Simulation.

Probability and Statistics with Reliability, Queuing and Computer Science Applications: Chapter 7 on Discrete Time Markov Chains Kishor S. Trivedi Visiting.

Intro. to Stochastic Processes

Decision Making in Robots and Autonomous Agents Decision Making in Robots and Autonomous Agents The Markov Decision Process (MDP) model Subramanian Ramamoorthy.

0 K. Salah 2. Review of Probability and Statistics Refs: Law & Kelton, Chapter 4.

Queuing Theory Basic properties, Markovian models, Networks of queues, General service time distributions, Finite source models, Multiserver queues Chapter.

Markov Chains X(t) is a Markov Process if, for arbitrary times t1 < t2 < < tk < tk+1 If X(t) is discrete-valued If X(t) is continuous-valued i.e.

Chapter 61 Continuous Time Markov Chains Birth and Death Processes,Transition Probability Function, Kolmogorov Equations, Limiting Probabilities, Uniformization.

 { X n : n =0, 1, 2,...} is a discrete time stochastic process Markov Chains.

Chapter 3 : Problems 7, 11, 14 Chapter 4 : Problems 5, 6, 14 Due date : Monday, March 15, 2004 Assignment 3.

Chapter 01 Probability and Stochastic Processes References: Wolff, Stochastic Modeling and the Theory of Queues, Chapter 1 Altiok, Performance Analysis.

1 Markov chains and processes: motivations Random walk One-dimensional walk You can only move one step right or left every time unit Two-dimensional walk.

Chapter 01 Probability and Stochastic Processes References: Wolff, Stochastic Modeling and the Theory of Queues, Chapter 1 Altiok, Performance Analysis.

8/14/04J. Bard and J. W. Barnes Operations Research Models and Methods Copyright All rights reserved Lecture 12 – Discrete-Time Markov Chains Topics.

CS433 Modeling and Simulation Lecture 07 – Part 01 Continuous Markov Chains Dr. Anis Koubâa 14 Dec 2008 Al-Imam.

Random Variable The outcome of an experiment need not be a number, for example, the outcome when a coin is tossed can be 'heads' or 'tails'. However, we.

The generalization of Bayes for continuous densities is that we have some density f(y|  ) where y and  are vectors of data and parameters with  being.

CDA6530: Performance Models of Computers and Networks Chapter 3: Review of Practical Stochastic Processes.

Discrete Time Markov Chains

Flows and Networks (158052) Richard Boucherie Stochastische Operations Research -- TW wwwhome.math.utwente.nl/~boucherierj/onderwijs/158052/ html.

COMS Network Theory Week 5: October 6, 2010 Dragomir R. Radev Wednesdays, 6:10-8 PM 325 Pupin Terrace Fall 2010.

11. Markov Chains (MCs) 2 Courtesy of J. Bard, L. Page, and J. Heyl.

Markov Processes What is a Markov Process?

8/14/04J. Bard and J. W. Barnes Operations Research Models and Methods Copyright All rights reserved Lecture 11 – Stochastic Processes Topics Definitions.

Random Variables r Random variables define a real valued function over a sample space. r The value of a random variable is determined by the outcome of.

1 Chapter 5 Continuous time Markov Chains Learning objectives : Introduce continuous time Markov Chain Model manufacturing systems using Markov Chain Able.

Theory of Computational Complexity Probability and Computing Lee Minseon Iwama and Ito lab M1 1.

Reliability Engineering

Let E denote some event. Define a random variable X by Computing probabilities by conditioning.

Discrete-time markov chain (continuation)

Discrete time Markov Chain

Discrete time Markov Chain

Presentation transcript:

G12: Management Science Markov Chains

Outline Classification of stochastic processes Markov processes and Markov chains Transition probabilities Transition networks and classes of states First passage time probabilities and expected first passage time Long-term behaviour and steady state distribution

Analysing Uncertainty Computer Models of Uncertainty: Building blocks: Random number generators Simulation Models Static (product launch example) Dynamic (inventory example and queuing models) Mathematical Models of Uncertainty: Building blocks: Random Variables Mathematical Models Static: Functions of Random Variables Dynamic: Stochastic (Random) Processes

Stochastic Processes Collection of random variables Xt, t in T Xt’s are typically statistically dependent State space: set of possible values of Xt’s State space is the same for all Xt’s Discrete space: Xt’s are discrete RVs Continuous space: Xt’s are continuous RVs Time domain: Discrete time: T={0,1,2,3,…} Continuous time: T is an interval (possibly unbounded)

Examples from Queuing Theory Discrete time, discrete space Ln: queue length upon arrival of nth customer Discrete time, continuous space Wn: waiting time of nth customer Continuous time, discrete space Lt: queue length at time t Continuous time, continuous space Wt: waiting time for a customer arriving at time t

A gambling example Game: Flip a coin. You win £ 10 if coin shows head and loose £ 10 otherwise You start with £ 10 and you keep playing until you are broke Typical questions What is the expected amount of money after t flips? What is the expected length of the game?

A Branching Process 0.5 …. £30 0.5 0.5 …. £20 0.5 0.5 £10 …. 0.5 10 £ 0

Discrete Time - Discrete State Stochastic Processes Xt: Amount of money you own after t flips Stochastic Process: X1,X2,X3,… Each Xt has its own probability distribution The RVs are dependent: the probability of having £ k after t flips depends on what you had after t’ (<t) flips Knowing Xt’ changes the probability distribution of Xt (conditional probability)

Outline Markov processes and Markov chains Classification of stochastic processes Markov processes and Markov chains Transition probabilities Transition networks and classes of states First passage time probabilities and expected first passage time Long-term behaviour and steady state distribution

Markovian Property Waiting time at time t depends on waiting time at times t’<t Knowing waiting time at some time t’<t changes the probability distribution of waiting time at time t (Conditional probability) Knowledge of history generally improves probability distribution (smaller variance) Generally: The distribution of states at time t depends on the whole history of the process Knowing states of the system at times t1,…tn<t changes the distribution of states at time t Markov property: The distribution of states at time t, given the states at times t1<…<tn<t is the same as the distribution of states at time t, given only knowledge of the state at time tn. The distribution depends only on the last observed state Knowledge about earlier states does not improve probability distribution

Discrete time, discrete space P(Xt+1= j | X0=i0,…,Xt=it) = P(Xt+1= j | Xt=it) In words: The probabilities that govern a transition from state i at time t to state j at time t+1 only depend on the state i at time t and not on the states the process was in before time t

Transition Probabilities The transition probabilites are P(Xt+1= j | Xt=i) Transition probabilities are called stationary if P(Xt+1= j | Xt=i) = P(X1= j | X0=i) If there are only finitely many possible states of the RVs Xt then the stationary transition probabilities are conveniently stored in a transition matrix Pij= P(X1= j | X0=i) Find the transition matrix for our first example if the game ends if the gambler is either broke or has earned £ 30

Markov Chains Stochastic process with a finite number, say n, possible states that has the Markov property Transitions between states in discrete time steps MC is completely characterised by transition probabilities Pij from state i to state j are stored in an n x n transition matrix P Rows of transition matrix sum up to 1. Such a matrix is called a stochastic matrix Initial distribution of states is given by an initial probability vector p(0)=(p1(0),…,pn(0)) We are interested in the change of the probability distribution of the states over time

Markov Chains as Modelling Templates Lawn mower example: Weekly demand D for lawn mowers has distribution P(D=0)=1/3, P(D=1)=1/2, P(D=2)=1/6 Mowers can be ordered at the end of each week and are delivered right at the beginning of the next week Inventory policy: Order two new mowers if stock is empty at the end of the week Currently (beginning of week 0) there are two lawn mowers in stock Determine the transition matrix

Market Shares Two software packages, B and C, enter a market that has so far been dominated by software A C is more powerful than B which is more powerful than A C is a big departure from A, while B has some elements in common with both A and C Market research shows that about 65% of A-users are satisfied with the product and won’t change over the next three months 30% of A-users are willing to move to B, 5% are willing to move to C….

Transition Matrix All transition probabilities over the next three months can be found in the following transition matrix What are the approximate market shares going to be?

Machine Replacement Many identical machines are used in a manufacturing environment They deteriorate over time with the following monthly transition probabilities:

Outline Transition probabilities Classification of stochastic processes Markov processes and Markov chains Transition probabilities Transition networks and classes of states First passage time probabilities and expected first passage time Long-term behaviour and steady state distribution

2-step transition probability (graphically) P0j Pi0 i 1 Pi1 P1j j Pi2 2 P2j

2-step transition probabilities (formally)

Chapman-Kolmogorov Equations Similarly, one shows that n-step transition probabilities Pij(n)=P(Xn=j | X0=i) obey the following law (for arbitrary m<n:) The n-step transition probability matrix P(n) is the n-th power of the 1-step TPM P: P(n) =Pn=P…P (n times)

Example see spreadsheet Markov.xls

P(Xn=i)=P(Xn=i¦X0=1)p1(0)+…+P(Xn=i¦X0=m)pm(0) Distribution of Xn Given Markov chain with m states (1,…,m) and transition matrix P Probability vector for initial state (t=0): p(0)=(p1(0),…, pm(0)) What is the probability that the process is in state i after n transitions? Bayes’ formula: P(Xn=i)=P(Xn=i¦X0=1)p1(0)+…+P(Xn=i¦X0=m)pm(0) Probability vector for Xn: p(n)= p(0)Pn Iteratively: p(n+1)= p(n)P Open spreadsheet Markov.xls for lawn mower, market share, and machine replacement examples

Outline Transition networks and classes of states Classification of stochastic processes Markov processes and Markov chains Transition probabilities Transition networks and classes of states First passage time probabilities and expected first passage time Long-term behaviour and steady state distribution

An Alternative Representation of the Machine Replacement Example 0.6 OK 0.3 0.9 0.1 New Worn 0.1 1 0.6 0.4 Fail

The transition network The nodes of the network correspond to the states There is an arc from node i to node j if Pij > 0 and this arc has an associated value Pij State i is accessible from state j if there is a path in the network from node i to node j A stochastic matrix is said to be irreducible if each state is accessible from each other state

Classes of States State i and j communicate if i is accessible from j and j is accessible from i Communicating states form classes A class is called absorbing if it is not possible to escape from it A class A is said to be accessible from a class B if each state in A is accessible from each state in B Equivalently: …if some state in A is accessible from some state in B

Find all classes in this example and indicate their accessibility from other classes 2 3 1/3 4 1 1/6 1 1/3 1/2 1/2 1 1 5 2/3 2/3 1/2 1/2 6 7

Return to Gambling Example Draw the transition network Find all classes Is the Markov chain irreducible? Indicate the accessibility of the classes Is there an absorbing class?

Outline Classification of stochastic processes Markov processes and Markov chains Transition probabilities Transition networks and classes of states First passage time probabilities and expected first passage time Long-term behaviour and steady state distribution

First passage times The first passage time from state i to state j is the number of transitions until the process hits state j if it starts at state i First passage time is a random variable Define fij(k) = probability that the first passage from state i to state j occurs after k transitions

P(A)=P(A|B1)P(B1)+…+ P(A|Bn)P(Bn) Calculating fij(k) Use Bayes’ formula P(A)=P(A|B1)P(B1)+…+ P(A|Bn)P(Bn) Event A: starting from sate i the process is in state j after n transitions (P(A)=Pij(n)) Event Bk: first passage from i to j happens after k transitions

Calculating fij(k) (cont.) Bayes’ formula gives: This results in the recursion formula:

Alternative: Simulation Do a number of simulations, starting from state i and stopping when you have reached state j Estimate fij(k) = Percentage of runs of length k BUT: This may take a long time if you want to do this for all state combinations (i,j) and many k’s

Expected first passage time If Xij = time of first passage from i to j then E(Xij)=fij(1)+2fij(2)+3fij(3)+…. Use conditional expectation formula E(Xij)=E(Xij|B1)P(B1)+…+ E(Xij|Bn)P(Bn) Event Bk: first transition goes from i to k Notice E(Xij |Bj)=1 and E(Xij|Bk)=1+E(Xkj)

Hence

Example

Outline Long-term behaviour and steady state distribution Classification of stochastic processes Markov processes and Markov chains Transition probabilities Transition networks and classes of states First passage time probabilities and expected first passage time Long-term behaviour and steady state distribution

Long term behaviour We are interested in distribution of Xn as n tends to infinity: lim p(n)=lim p(0)P(n)= p(0) lim P(n) If lim P(n) exists then P is called The limit may not exist, though: See Markov.xls Problem: Process has periodic behaviour Process can only recur to state i after t,2t,3t,… steps There exists t: if n Not in {t,2t,3t} then Pii(n) = 0 Period of a state i: maximal such t

Find the periods of the states 2 3 1/3 4 1 1/6 1 1/3 1/2 1/2 1 1 5 2/3 2/3 1/2 1/2 6 7

Aperiodicity A state with period 1 is called aperiodic State i is aperiodic if and only if there exists N such that Pii(N) > 0 and Pii(N+1) > 0 The Chapman-Kolmogorov Equations therefore imply that Pii(n)>0 for every n>=N Aperiodicity is a class property, i.e. if one state in a class is aperiodic, then so are all others

Regular matrices A stochastic matrix P is called regular if there exists a number n such that all entries of Pn are positive A Markov chain with a regular transition matrix is aperiodic (i.e. all states are aperiodic) and irreducible (i.e. all states communicate)

Back to long-term behaviour Mathematical Fact: If a Markov chain is irreducible and aperiodic then it is ergodic, i.e., all limits exist

Finding the long term probabilities Mathematical Result: If a Markov chain is irreducible and aperiodic then all rows of its long term transition probability matrix are identical to the unique solution p=(p1,…, pm) of the equations

However,... …the latter system is of the form pP=p, p1+…+pm=1 and has m+1 equations and m unknowns It has a solution because P is a stochastic matrix and therefore has 1 as an eigenvalue (with eigenvector x=(1,…,1)). Hence p is just a left eigenvector of P to the eigenvalue 1 and the additional equation normalizes the eigenvector Calculation: solve the system without the first equation - then check first equation

Example Find the steady state probabilities for Solution: (p1,p2)=(0.6,0.4)

Steady state probabilities The probability vector p with pP=p and p1+..+pm=1 is called the steady state (or stationary) probability distribution of the Markov chain A Markov chain does not necessarily have a steady state distribution Mathematical result: an irreducible Markov chain has a steady state distribution

Tending towards steady state If we start with the steady state distribution then the probability distribution of the states does not change over time More importantly: If the Markov chain is irreducible and aperiodic then, independently of the initial distribution, the distribution of states gets closer and closer to the steady state distribution Illustration: see spreadsheet Markov.xls

More on steady state distributions pj can be interpreted as the long-run proportion of time the process is in state j Alternatively: pj=1/E(Xjj) where Xjj is the time of the first recurrence to j E.g. if the expected recurrence time to state j is 2 transitions then, on the long run, the process will be in state j after every 1 out of two transitions,i.e. 1/2 of the time

Average Payoff Per Unit Time Setting: If process hits state i, a payoff of g(i) is realized (costs = negative payoffs) Average payoff per period after n transitions Yn=(g(X1)+…+g(Xn))/n Long-run expected average payoff per time period: lim E(Yn) as n tends to infinity

Calculating long-run average pay-offs Mathematical Fact: If a Markov chain is irreducible and aperiodic then

Example A transition takes place every week. A weekly cost of £ 1 has to be payed if the process is in state 1, while a weekly profit of £ 1 is obtained if the process is in state 1. Find the average payoff per week. (Solution: £ -0.2 per week)

Key Learning Points Markov chains are a template for the analysis of systems with finitely many states where random transitions between states happen at discrete points in time We have seen how to calculate n-step transition probabilities, first passage time probabilities and expected first passage times We have discussed steady state behaviour of a Markov chain and how to calculate steady state distributions