1
Probabilistic Dynamic Programming
2
Gambling Problem
Start with 3 chips
Have 3 plays of the game
On each play, bet any number of chips on hand: a win adds the amount bet, a loss forfeits it
P{win} = 2/3
Objective: have 5 or more chips at game end
3
Gambling Problem
Let n = play of the game (1, 2, 3)
sn = # chips on hand to begin play n
xn = # chips to bet at play n
If s3 = 0, 1, or 2, we cannot bet enough chips to meet the objective
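The definitions above can be tied together with a backward recursion. The following is a sketch of the usual formulation, not something stated on the slides: the symbol f_n(s_n), meaning the best achievable probability of meeting the objective when play n begins with s_n chips, is notation assumed here for illustration.

```latex
f_n(s_n) = \max_{0 \le x_n \le s_n}
  \left[ \tfrac{2}{3}\, f_{n+1}(s_n + x_n) + \tfrac{1}{3}\, f_{n+1}(s_n - x_n) \right],
  \qquad n = 1, 2, 3,
\qquad
f_4(s_4) =
  \begin{cases}
    1 & \text{if } s_4 \ge 5, \\
    0 & \text{otherwise.}
  \end{cases}
```

Under this reading, the quantity the deck works toward is f_1(3).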
4
Gambling Problem
[Diagram: chip levels 1 through 5, with arcs labeled P{win} = 2/3, P{lose} = 1/3, and No bet]
8
Gambling Problem
Consider Bet 3
If s3 ≥ 5, we have met the objective: x3 = 0 (or any x3 ≤ s3 - 5)
If s3 = 4, we could bet 1, 2, 3, or 4
If s3 = 3, we could bet 2 or 3
If s3 ≤ 2, we can't win, no bet
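The case analysis for the last play can be checked by brute force. The sketch below assumes the usual rule that a win adds the bet and a loss removes it; the function name `stage3_options` and the constants are illustrative, not from the slides.

```python
from fractions import Fraction

P_WIN = Fraction(2, 3)   # win probability given on the slides
TARGET = 5               # chips needed at game end

def stage3_options(s3):
    """Return (best probability, bets achieving it) for the final play."""
    values = {}
    for x3 in range(s3 + 1):                      # any bet from 0 up to chips on hand
        win = Fraction(int(s3 + x3 >= TARGET))    # 1 if a win ends at or above target
        lose = Fraction(int(s3 - x3 >= TARGET))   # 1 if a loss still ends at or above target
        values[x3] = P_WIN * win + (1 - P_WIN) * lose
    best = max(values.values())
    return best, [x for x, v in values.items() if v == best]

for s3 in range(7):
    print(s3, *stage3_options(s3))
```

For s3 ≤ 2 every bet ties at probability 0, so the function lists all of them; the slide's "no bet" is one of those equally hopeless choices.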
12
Gambling Problem
Consider Bet 2
If s2 ≥ 5, we have met the objective: x2 = 0 (or any x2 ≤ s2 - 5)
If s2 = 4, we could bet 0, 1, 2, 3, or 4
If s2 = 3, we could bet 0, 1, 2, or 3
If s2 = 2, we can bet 1 or 2
13
Gambling Problem
Bet 2: if s2 = 0 or 1, P{objective} = 0
If s2 = 2, x2 = 1 or 2
14
Gambling Problem
Bet 2: if s2 = 2 and x2 = 1
P{objective} = P{win stage 2 and win stage 3}
15
Gambling Problem
Bet 2: if s2 = 2 and x2 = 1
P{objective} = P{lose 2} P{objective | s3 = 1} + P{win 2} P{objective | s3 = 3}
= (1/3)(0) + (2/3)(2/3) = 4/9
16
Gambling Problem
s2 = 4, x2 = 1
P{objective} = P{win 2} P{objective | s3 = 5} + P{lose 2} P{objective | s3 = 3}
= (2/3)(1) + (1/3)(2/3) = 8/9
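The two stage-2 calculations follow one pattern: weight the stage-3 success probability after a win and after a loss. A minimal sketch, with stage-3 values hard-coded from the Bet 3 analysis (1 for five or more chips, 2/3 for three or four, 0 otherwise); the names `f3` and `stage2_bet_value` are made up for illustration.

```python
from fractions import Fraction

P_WIN = Fraction(2, 3)

def f3(s3):
    """Best success probability entering the final play with s3 chips."""
    if s3 >= 5:
        return Fraction(1)       # objective already met
    if s3 >= 3:
        return Fraction(2, 3)    # one winning bet reaches 5
    return Fraction(0)           # cannot reach 5 in one play

def stage2_bet_value(s2, x2):
    """Success probability when betting x2 chips at play 2 with s2 chips on hand."""
    return P_WIN * f3(s2 + x2) + (1 - P_WIN) * f3(s2 - x2)

print(stage2_bet_value(2, 1))  # 4/9
print(stage2_bet_value(4, 1))  # 8/9
```

The same function also gives 4/9 for s2 = 2, x2 = 2, which is why either bet is acceptable from two chips.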
17
Gambling Problem
s1 = 3, x1 = 1
P{objective} = (2/3) P{meeting objective from s2 = 4} + (1/3) P{meeting objective from s2 = 2}
= (2/3)(8/9) + (1/3)(4/9) = 20/27
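To see why x1 = 1 is the bet worth evaluating, the whole backward recursion can be run and every first bet compared. This is a sketch assuming bets are whole chips between 0 and the chips on hand; `f`, `value_of_bet`, and `PLAYS` are illustrative names, not from the slides.

```python
from fractions import Fraction
from functools import lru_cache

P_WIN = Fraction(2, 3)
TARGET = 5
PLAYS = 3

@lru_cache(maxsize=None)
def f(n, s):
    """Best probability of ending with >= TARGET chips, holding s chips before play n."""
    if n > PLAYS:                                  # game over: check the objective
        return Fraction(1) if s >= TARGET else Fraction(0)
    return max(value_of_bet(n, s, x) for x in range(s + 1))

def value_of_bet(n, s, x):
    """Success probability if x chips are bet at play n and play is optimal afterwards."""
    return P_WIN * f(n + 1, s + x) + (1 - P_WIN) * f(n + 1, s - x)

first_play = {x: value_of_bet(1, 3, x) for x in range(4)}
for x, p in first_play.items():
    print(f"x1 = {x}: {p}")
```

Under these assumptions, betting 0, 2, or 3 chips first each gives 2/3, so betting 1 chip (20/27) is the only optimal opening.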
18
Recap: Optimal Policy
x1* = 1
If win, x2* = 1
If lose, x2* = 1 or 2
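As a cross-check, the recapped policy can be played forward over all eight win/lose sequences. The sketch below makes two assumptions beyond the slide: after a first-play loss it bets 1 (the slide allows 1 or 2), and on the last play it bets just enough to reach 5 when that is affordable, otherwise nothing, which is one of the choices listed in the Bet 3 analysis. The function names are hypothetical.

```python
from fractions import Fraction
from itertools import product

P_WIN = Fraction(2, 3)
TARGET = 5

def recap_bet(n, s):
    """Bet prescribed at play n with s chips (assumed tie-breaks noted above)."""
    if n in (1, 2):
        return min(1, s)                  # x1* = 1, x2* = 1
    need = TARGET - s                     # chips still missing before the last play
    return need if 0 < need <= s else 0   # bet just enough, or nothing

def success_probability(start=3, plays=3):
    """Exact probability of finishing with >= TARGET chips under recap_bet."""
    total = Fraction(0)
    for outcomes in product((True, False), repeat=plays):   # True = win
        s, prob = start, Fraction(1)
        for n, won in enumerate(outcomes, start=1):
            x = recap_bet(n, s)
            s += x if won else -x
            prob *= P_WIN if won else 1 - P_WIN
        if s >= TARGET:
            total += prob
    return total

print(success_probability())  # 20/27
```

The enumeration reproduces the 20/27 obtained by the backward calculation.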