Presentation is loading. Please wait.

Presentation is loading. Please wait.

DP for Optimum Strategies in Games

Similar presentations


Presentation on theme: "DP for Optimum Strategies in Games"— Presentation transcript:

1 DP for Optimum Strategies in Games
J.-S. Roger Jang (張智星) MIR Lab, CSIE Dept. National Taiwan University

2 Outline Game of dice sum Game of colored jenga

3 Game of Dice Sum Description Your goal
Toss a dice 8 times and place the value into 4 double-digit number right after each toss. Find the total of these 4 numbers. If the total is bigger than 150, your score is 0. Otherwise your score is the total. Your goal Find the optimum strategy to play the game such that the expected total is optimized. Credit: Peter Norvig at Google CS283: AI Programming Techniques (1989 at UC Berkeley)

4 Three-step Formula of DP: Step 1
Optimum-value function D(p, q, s)=expected max score when p: No. of ten’s position left q: No. of one’s position left s: current sum of the game Credit: 電機系賀正翔 Game state of (1, 2, 67)

5 Three-step Formula of DP: Steps 2 and 3
Recurrent formula for the optimum-value function Answer: D(4, 4, 0)

6 Strategy during the Game
Recurrent formula for the optimum-value function

7 Game of Colored Jenga Description: Techniques
Techniques Dynamic programming Hash table


Download ppt "DP for Optimum Strategies in Games"

Similar presentations


Ads by Google