ON THE BOREL AND VON NEUMANN POKER MODELS. Comparison with Real Poker  Real Poker:  Around 2.6 million possible hands for 5 card stud  Hands somewhat.

Slides:



Advertisements
Similar presentations
Advanced Strategies for Craps and Poker Billy J. Duke Joel A. Johnson.
Advertisements

How to win at poker using game theory A review of the key papers in this field.
Math and Card Games By: Candace DiBiano.
On Von Neumann Poker with Community Cards Reto Spöhel Joint work with Nicla Bernasconi and Julian Lorenz TexPoint fonts used in EMF. Read the TexPoint.
After the flop – an opponent raised before the flop Strategy: No-Limit.
Lecture 13. Poker Two basic concepts: Poker is a game of skill, not luck. If you want to win at poker, – make sure you are very skilled at the game, and.
Oct 9, 2014 Lirong Xia Hypothesis testing and statistical decision theory.
Introduction to Game Theory
9.1 Strictly Determined Games Game theory is a relatively new branch of mathematics designed to help people who are in conflict situations determine the.
Alpha-Beta Search. 2 Two-player games The object of a search is to find a path from the starting position to a goal position In a puzzle-type problem,
Auction Theory Class 5 – single-parameter implementation and risk aversion 1.
GAME THEORY.
Nash’s Theorem Theorem (Nash, 1951): Every finite game (finite number of players, finite number of pure strategies) has at least one mixed-strategy Nash.
CS4432: Database Systems II
Non myopic strategy Truth or Lie?. Scoring Rules One important feature of market scoring rules is that they are myopic strategy proof. That means that.
Totally Unimodular Matrices
ECO290E: Game Theory Lecture 5 Mixed Strategy Equilibrium.
MIT and James Orlin © Game Theory 2-person 0-sum (or constant sum) game theory 2-person game theory (e.g., prisoner’s dilemma)
Study Group Randomized Algorithms 21 st June 03. Topics Covered Game Tree Evaluation –its expected run time is better than the worst- case complexity.
Game Theory: introduction and applications to computer networks Game Theory: introduction and applications to computer networks Zero-Sum Games (follow-up)
Part 3: The Minimax Theorem
Lecture 10. Simplified roulette European roulette has numbers 0,1,…36. xD0s
A Introduction to Game Theory Xiuting Tao. Outline  1 st a brief introduction of Game theory  2 nd Strategic games  3 rd Extensive games.
Cs6501: Imperfect Information Games Principles Of Knowledge Engineering & Reconstruction Spring 2010 University of Virginia David Evans.
Rational Agents for the Card Game “BS” Jeff Reilly, Brian Munce, Stephen Tanguis, Alex Keeler.
Mathematics and the Game of Poker
Social Learning. A Guessing Game Why are Wolfgang Puck restaurants so crowded? Why do employers turn down promising job candidates on the basis of rejections.
Lectures in Microeconomics-Charles W. Upton Minimax Strategies.
Intro to Probability & Games
Using Probabilistic Knowledge And Simulation To Play Poker (Darse Billings …) Presented by Brett Borghetti 7 Jan 2007.
Games of Chance Introduction to Artificial Intelligence COS302 Michael L. Littman Fall 2001.
Alpha-Beta Search. 2 Two-player games The object of a search is to find a path from the starting position to a goal position In a puzzle-type problem,
Minimax Strategies. Everyone who has studied a game like poker knows the importance of mixing strategies. –With a bad hand, you often fold –But you must.
Games of Incomplete Information. These games drop the assumption that players know each other’s preferences. Complete info: players know each other’s preferences.
Game Theory Statistics 802. Lecture Agenda Overview of games 2 player games representations 2 player zero-sum games Render/Stair/Hanna text CD QM for.
Models of Strategic Deficiency and Poker Workflow Inference: What to do with One Example and no Semantics.
Texas Holdem Poker With Q-Learning. First Round (pre-flop) PlayerOpponent.
Online Poker James Gilman. Topics ●Hand Probabilities ●Betting Odds ●Odds of winning ●Expected Value ●Decision Making ●Poker Statistics ●Variance.
Brian Duddy.  Two players, X and Y, are playing a card game- goal is to find optimal strategy for X  X has red ace (A), black ace (A), and red two (2)
Game Playing.
Auction Seminar Optimal Mechanism Presentation by: Alon Resler Supervised by: Amos Fiat.
Introduction to Artificial Intelligence CS 438 Spring 2008 Today –AIMA, Ch. 6 –Adversarial Search Thursday –AIMA, Ch. 6 –More Adversarial Search The “Luke.
Games. Adversaries Consider the process of reasoning when an adversary is trying to defeat our efforts In game playing situations one searches down the.
1.5 Independent and Dependent Events. Flipping a Coin.
The challenge of poker NDHU CSIE AI Lab 羅仲耘. 2004/11/04the challenge of poker2 Outline Introduction Texas Hold’em rules Poki’s architecture Betting Strategy.
Poker as a Testbed for Machine Intelligence Research By Darse Billings, Dennis Papp, Jonathan Schaeffer, Duane Szafron Presented By:- Debraj Manna Gada.
Outline for the day: 1.Discuss handout / get new handout. 2.Teams 3.Example projects 4.Expected value 5.Pot odds calculations 6.Hansen / Negreanu 7.P(4.
Part 3 Linear Programming
Stat 35b: Introduction to Probability with Applications to Poker Poker Code competition: all-in or fold.   u 
Lecture 12. Game theory So far we discussed: roulette and blackjack Roulette: – Outcomes completely independent and random – Very little strategy (even.
An Introduction to Game Theory Math 480: Mathematics Seminar Dr. Sylvester.
Expected value (µ) = ∑ y P(y) Sample mean (X) = ∑X i / n Sample standard deviation = √[∑(X i - X) 2 / (n-1)] iid: independent and identically distributed.
Auctions serve the dual purpose of eliciting preferences and allocating resources between competing uses. A less fundamental but more practical reason.
Statistics Overview of games 2 player games representations 2 player zero-sum games Render/Stair/Hanna text CD QM for Windows software Modeling.
9.2 Mixed Strategy Games In this section, we look at non-strictly determined games. For these type of games the payoff matrix has no saddle points.
Econ 805 Advanced Micro Theory 1 Dan Quint Fall 2009 Lecture 1 A Quick Review of Game Theory and, in particular, Bayesian Games.
Recap From Previous Classes (I) Games as Schemes of Uncertainty –Macro-level: We don’t know outcome of game –Micro-level: Probability is assigned to outcome.
Extensive-Form Game Abstraction with Bounds
Lecture 13.
The Duality Theorem Primal P: Maximize
Game Theory Just last week:
Lecture 11.
Lecture 10.
Strategies for Poker AI player
Lecture 12.
Instructor: Vincent Conitzer
Lial/Hungerford/Holcomb/Mullins: Mathematics with Applications 11e Finite Mathematics with Applications 11e Copyright ©2015 Pearson Education, Inc. All.
Stat 35b: Introduction to Probability with Applications to Poker
Stat 35b: Introduction to Probability with Applications to Poker
HOW TO PLAY POKER.
Presentation transcript:

ON THE BOREL AND VON NEUMANN POKER MODELS

Comparison with Real Poker  Real Poker:  Around 2.6 million possible hands for 5 card stud  Hands somewhat independent for Texas Hold ‘em  Let’s assume probability of hands comes from a uniform distribution in [0,1]  Assume probabilities are independent

The Poker Models  La Relance Rules:  Each player puts in 1 ante before seeing his number  Each player then sees his/her number  Player 1 chooses to bet B/fold  Player 2 chooses to call/fold  Whoever has the largest number wins.  von Neumann Rules:  Player 1 chooses to bet B/check immediately  Everything else same as La Relance

The Poker Models  rng.html rng.html

La Relance  Who has the edge, P1 or P2? Why?  Betting tree:

La Relance  The optimal strategy and value of the game:  Consider the optimal strategy for player 2 first. It’s no reason for player 2 to bluff/slow roll.  Assume the optimal strategy for player 2 is: Bet when Y>c Fold when Y<c  Nash’s Equilibrium

La Relance

 When to bluff if P1 gets a number X<c?  Intuitively, P1 bluffs with c 2 c and folds with X<c 2.  Why? If P2 is playing with the optimal strategy, how to choose when to bluff is not relevant. This penalizes when P2 is not following the optimal strategy.

La Relance  What if player / opponent is suboptimal?  Assumed Strategy  player 1 should always bet if X > m, fold otherwise  player 2 should always call if Y > n, fold otherwise, Also call if n > m is known (why?)  Assume decisions are not random beyond cards dealt  Alternate Derivations Follow

La Relance

La Relance (Player 2 strategy)

 What can you infer from the properties of this function?  What if m ≈ 0? What if m ≈ 1?

La Relance (Player 1 response)  Player 1 does not have a good response strategy (why?)

La Relance (Player 1 Strategy)  Let’s assume player 2 doesn’t always bet when n > m  This function is always increasing, is zero at n = β / ( β + 2)  What should player 1 do?

La Relance (Player 1 Strategy)  If n is large enough, P1 should always bet (why?)  If n is small however, bet when m >  What if n = β / ( β + 2) exactly?

Von Neumann  Betting tree:

Von Neumann

 Since P1 can check,  now he gets positive value out of the game  P1 now bluff with the worst hand. Why? On the bluff part, it’s irrelevant to choose which section of (0,a) to use if P2 calls (P2 calls only when Y>c) On the check part, it’s relevant because results are compared right away.

Von Neumann

 What if player / opponent is suboptimal?  Assumed Strategy  Player 1 Bet if X b, Check otherwise  Player 2 Call if Y > c, fold otherwise  If c is known, Player 1 wants to keep a c

Von Neumann

Von Neumann (Player 1 Strategy)  Find the maximum of the payoff function  a =  b =  What can we conclude here?

Von Neumann (Player 2 Response)  Player 2 does not have a good response strategy

Von Neumann (Player 2 Strategy)  This analysis is very similar to Borel Poker’s player 1 strategy, won’t go in depth here…  c =

Bellman & Blackwell

FoldLow B High B Low B mLmL mHmH b1b1 b3b3 b2b2

Bellman & Blackwell  Where Or if

La Relance: Non-identical Distribution  Still follows the similar pattern  Where F and G are distributions of P1 and P2, c is still the threshold point for P2. π is still the probability that P1 bets when he has X<c.  What if ?

La Relance: (negative) Dependent hands

 Player 1 bets when X > l  P(Y < c | X = l) = B / (B + 2)  Player 2 bets when Y > c  (2*B + 2)*P(X > c|Y = c) = (B + 2)*P(X > l|Y = c)  Game Value:  P(X > Y) – P(Y < X)  + B * [ P(c c) ]  + 2 * [ P(X l) – P(Y < X < l) ]

Von Neumann: Non-identical Distribution  Also similar to before (just substitute the distribution functions)  a | (B + 2) * G(c) = 2 * G(a) + B  b | 2 * G(b) = G(c) + 1  c | (B + 2) * F(a) = B * (1 – F(b))

Von Neumann: (negative) Dependent hands  Player 2 Optimal Strategy:  Player 1 Optimal Strategy:

Discussion / Thoughts / Questions  Is this a good model for poker?