Games: Efficiency Andrea Danyluk September 23, 2013.

Games: Efficiency Andrea Danyluk September 23, 2013

Announcements Programming Assignment 1: Search – Autograding done – Please sign up for a code review Programming Assignment 2: Games – Remove your print statements before turning it in – Need help with buggy code? If I’m not in the office, email me your code and a description of the problem.

Today’s Lecture Heuristic evaluation functions for minimax Making the search more efficient: – α-β pruning

Minimax Reality Can rarely explore entire search space to terminal nodes. Choose a depth cutoff – i.e., a maximum ply Need an evaluation function – Returns an estimate of the expected utility of the game from a given position – Must order the terminal states in the same way as the true utility function – Must be efficient to compute Trading off plies for heuristic computation More plies makes a difference Consider iterative deepening

Evaluation Functions Ideal: returns the utility of the position In practice: typically weighted linear sum of features: Eval(s) = w 1 f 1 (s) + w 2 f 2 (s) + … + w n f n (s)

Exercise Evaluation function for Connect Four?

An earlier example revisited 7 7 2 2 6 6 3 3 0 0 6 6 -2 5 5 2 2 9 9 6 6 2 2

An earlier example revisited 7 7 2 2 6 6 3 3 0 0 6 6 -2 5 5 2 2 9 9 6 6 2 2 7

An earlier example revisited 7 7 2 2 6 6 3 3 0 0 6 6 -2 5 5 2 2 9 9 6 6 2 2 73

An earlier example revisited 7 7 2 2 6 6 3 3 0 0 6 6 -2 5 5 2 2 9 9 6 6 2 2 73 3

An earlier example revisited 7 7 2 2 6 6 3 3 0 0 6 6 -2 5 5 2 2 9 9 6 6 2 2 73 3 0 Wow! 0 is a great score!

An earlier example revisited 7 7 2 2 6 6 3 3 0 0 6 6 -2 5 5 2 2 9 9 6 6 2 2 73 3 0 I wonder if I can do even better!

An earlier example revisited 7 7 2 2 6 6 3 3 0 0 6 6 -2 5 5 2 2 9 9 6 6 2 2 73 3 0 Too bad the maximizer won’t ever let me get here!

An earlier example revisited 7 7 2 2 6 6 3 3 0 0 6 6 -2 5 5 2 2 9 9 6 6 2 2 73 3 0 Better not to waste my time!

An earlier example revisited 7 7 2 2 6 6 3 3 0 0 6 6 -2 5 5 2 2 9 9 6 6 2 2 73 3 0

An earlier example revisited 7 7 2 2 6 6 3 3 0 0 6 6 -2 5 5 2 2 9 9 6 6 2 2 73 3 0 0

An earlier example revisited 7 7 2 2 6 6 3 3 0 0 6 6 -2 5 5 2 2 9 9 6 6 2 2 73 3 0 0 6

An earlier example revisited 7 7 2 2 6 6 3 3 0 0 6 6 -2 5 5 2 2 9 9 6 6 2 2 73 3 0 0 6 9 Wow! 9!

An earlier example revisited 7 7 2 2 6 6 3 3 0 0 6 6 -2 5 5 2 2 9 9 6 6 2 2 73 3 0 0 6 9 Oh, wait!

An earlier example revisited 7 7 2 2 6 6 3 3 0 0 6 6 -2 5 5 2 2 9 9 6 6 2 2 73 3 0 0 6 9 Min won’t let me go here!

An earlier example revisited 7 7 2 2 6 6 3 3 0 0 6 6 -2 5 5 2 2 9 9 6 6 2 2 73 3 0 0 6 9 Min won’t let me go here! 6

α-β Pruning If something looks too good to be true, it probably is. One example of the class of branch and bound algorithms with two bounds – α : the value of the best choice for Max – β : the value of the best choice for min

α-β Pruning Given these two bounds – α : the value of the best choice for Max – β : the value of the best choice for min Basic idea of the algorithm – On a minimizing level, if you find a value < α, cut the search – On a maximizing level, if you find a value > β, cut the search

function αβSEARCH(state) returns an action a v = MAX-VALUE(state, -infinity, +infinity) return action a in ACTIONS(state) with value v function MAX-VALUE(state, α, β) returns a utility value v if TERMINAL-TEST(state) then return UTILITY(state) v = -infinity for each a in ACTIONS(state) do v = MAX(v, MIN-VALUE(RESULT(state, a), α, β)) if v ≥ β then return v α = MAX(α, v) return v function MIN-VALUE(state, α, β) returns a utility value v if TERMINAL-TEST(state) then return UTILITY(state) v = infinity for each a in ACTIONS(state) do v = MIN(v, MAX-VALUE(RESULT(state, a), α, β)) if v ≤ α then return v β = MIN(β, v) return v

8 8 4 4 7 7 3 3 8 8 19 11 2 2 6 6 1 1 9 9 5 5 α = -inf Β = +inf

8 8 4 4 7 7 3 3 8 8 19 11 2 2 6 6 1 1 9 9 5 5 α = -inf Β = +inf α = -inf Β = +inf

8 8 4 4 7 7 3 3 8 8 19 11 2 2 6 6 1 1 9 9 5 5 α = -inf Β = +inf α = -inf Β = +inf α = -inf Β = +inf

8 8 4 4 7 7 3 3 8 8 19 11 2 2 6 6 1 1 9 9 5 5 α = -inf Β = +inf α = -inf Β = +inf α = 8 Β = +inf

8 8 4 4 7 7 3 3 8 8 19 11 2 2 6 6 1 1 9 9 5 5 α = -inf Β = +inf α = -inf Β = +inf 8

8 8 4 4 7 7 3 3 8 8 19 11 2 2 6 6 1 1 9 9 5 5 α = -inf Β = +inf α = -inf Β = 8 8

8 8 4 4 7 7 3 3 8 8 19 11 2 2 6 6 1 1 9 9 5 5 α = -inf Β = +inf α = -inf Β = 8 8 α = -inf Β = 8

8 8 4 4 7 7 3 3 8 8 19 11 2 2 6 6 1 1 9 9 5 5 α = -inf Β = +inf α = -inf Β = 8 8 9

8 8 4 4 7 7 3 3 8 8 19 11 2 2 6 6 1 1 9 9 5 5 α = -inf Β = +inf 8 8 9

8 8 4 4 7 7 3 3 8 8 19 11 2 2 6 6 1 1 9 9 5 5 α = 8 Β = +inf 8 8 9

8 8 4 4 7 7 3 3 8 8 19 11 2 2 6 6 1 1 9 9 5 5 α = 8 Β = +inf 8 8 9 α = 8 Β = +inf

8 8 4 4 7 7 3 3 8 8 19 11 2 2 6 6 1 1 9 9 5 5 α = 8 Β = +inf 8 8 9 2

8 8 4 4 7 7 3 3 8 8 19 11 2 2 6 6 1 1 9 9 5 5 α = 8 Β = +inf 8 8 9 2 α = 8 Β = +inf

8 8 4 4 7 7 3 3 8 8 19 11 2 2 6 6 1 1 9 9 5 5 α = 8 Β = +inf 8 8 9 2 α = 8 Β = +inf α = 8 Β = +inf

8 8 4 4 7 7 3 3 8 8 19 11 2 2 6 6 1 1 9 9 5 5 α = 8 Β = +inf 8 8 9 2 α = 8 Β = +inf 8

8 8 4 4 7 7 3 3 8 8 19 11 2 2 6 6 1 1 9 9 5 5 α = 8 Β = +inf 8 8 9 2 α = 8 Β = +inf 8 Is there a problem here???

function αβSEARCH(state) returns an action a v = MAX-VALUE(state, -infinity, +infinity) return action a in ACTIONS(state) with value v function MAX-VALUE(state, α, β) returns a utility value v if TERMINAL-TEST(state) then return UTILITY(state) v = -infinity for each a in ACTIONS(state) do v = MAX(v, MIN-VALUE(RESULT(state, a), α, β)) if v ≥ β then return v α = MAX(α, v) return v function MIN-VALUE(state, α, β) returns a utility value v if TERMINAL-TEST(state) then return UTILITY(state) v = infinity for each a in ACTIONS(state) do v = MIN(v, MAX-VALUE(RESULT(state, a), α, β)) if v ≤ α then return v β = MIN(β, v) return v

Properties of α-β Pruning does not affect the final result (but be careful) Good move ordering improves effectiveness of pruning If search depth is d, what is the time complexity of minimax? – O(b d ) With perfect pruning, get down to O(b d/2 ) – Doubles solvable depth [Adapted from Russell]

Games: Efficiency Andrea Danyluk September 23, 2013.

Similar presentations

Presentation on theme: "Games: Efficiency Andrea Danyluk September 23, 2013."— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

Games: Efficiency Andrea Danyluk September 23, 2013.

Similar presentations

Presentation on theme: "Games: Efficiency Andrea Danyluk September 23, 2013."— Presentation transcript:

Similar presentations

About project

Feedback