Strategies for Poker AI player

Slides:



Advertisements
Similar presentations
On Von Neumann Poker with Community Cards Reto Spöhel Joint work with Nicla Bernasconi and Julian Lorenz TexPoint fonts used in EMF. Read the TexPoint.
Advertisements

ON THE BOREL AND VON NEUMANN POKER MODELS. Comparison with Real Poker  Real Poker:  Around 2.6 million possible hands for 5 card stud  Hands somewhat.
After the flop – nobody raised before the flop Strategy: No-Limit.
Lecture 13. Poker Two basic concepts: Poker is a game of skill, not luck. If you want to win at poker, – make sure you are very skilled at the game, and.
Introduction to Game Theory
Short Stack Strategy – How to play after the flop Strategy: No Limit.
Randomized Strategies and Temporal Difference Learning in Poker Michael Oder April 4, 2002 Advisor: Dr. David Mutchler.
No-Regret Algorithms for Online Convex Programs Geoffrey J. Gordon Carnegie Mellon University Presented by Nicolas Chapados 21 February 2007.
Class Project Due at end of finals week Essentially anything you want, so long as it’s AI related and I approve Any programming language you want In pairs.
Warm up 1)We are drawing a single card from a standard deck of 52 find the probability of P(seven/nonface card) 2)Assume that we roll two dice and a total.
Neural Networks for Opponent Modeling in Poker John Pym.
Final Specification KwangMonarchIkhanJamesGraham.
Mathematics and the Game of Poker
Intelligence for Games and Puzzles1 Poker: Opponent Modelling Early AI work on poker used simplified.
Texas Hold’em Poker Simulation Project Monarch ^ James Ikhan Kwang Graham Devin Organized by M^JIK Game Developers Members Include:
Lectures in Microeconomics-Charles W. Upton Minimax Strategies.
Intro to Probability & Games
Using Probabilistic Knowledge And Simulation To Play Poker (Darse Billings …) Presented by Brett Borghetti 7 Jan 2007.
Introduction to Cognition and Gaming 9/22/02: Bluffing.
Games of Chance Introduction to Artificial Intelligence COS302 Michael L. Littman Fall 2001.
Minimax Strategies. Everyone who has studied a game like poker knows the importance of mixing strategies. –With a bad hand, you often fold –But you must.
Poki: The Poker Agent Greg Priebe Zak Knudson. Overview Texas Hold’em poker Architecture and Opponent Modeling of Poki Improvements from past Poki Betting.
Models of Strategic Deficiency and Poker Workflow Inference: What to do with One Example and no Semantics.
Introduction to the Big Stack Strategy (BSS) Strategy: No Limit.
Poker and AI How the most “stable” creature on earth got used to that good old game from the west!
Brian Duddy.  Two players, X and Y, are playing a card game- goal is to find optimal strategy for X  X has red ace (A), black ace (A), and red two (2)
Learning to Play Blackjack Thomas Boyett Presentation for CAP 4630 Teacher: Dr. Eggen.
Games People Play. 4. Mixed strategies In this section we shall learn How to not lose a game when it appears your opponent has a counter to all your moves.
NearOptimalGamblingAdive Matt Morgis Peter Chapman Mitch McCann Temple University.
Introduction Many decision making problems in real life
Poker. Basic card terminology Suits: Hearts, diamonds, clubs, spades Deuce, face cards.
Games with Imperfect Information Bayesian Games. Complete versus Incomplete Information So far we have assumed that players hold the correct belief about.
SARTRE: System Overview A Case-Based Agent for Two-Player Texas Hold'em Jonathan Rubin & Ian Watson University of Auckland Game AI Group
Oklahoma’s Personal Financial Literacy Passport © Oklahoma State Department of Education. All rights reserved. 1 Teacher Presentation Series 12 Standard.
Oklahoma’s Personal Financial Literacy Passport © Oklahoma State Department of Education. All rights reserved. 1 Teacher Presentation Series 12 Standard.
The challenge of poker NDHU CSIE AI Lab 羅仲耘. 2004/11/04the challenge of poker2 Outline Introduction Texas Hold’em rules Poki’s architecture Betting Strategy.
Poker as a Testbed for Machine Intelligence Research By Darse Billings, Dennis Papp, Jonathan Schaeffer, Duane Szafron Presented By:- Debraj Manna Gada.
Neural Network Implementation of Poker AI
Texas Hold’em Playing Styles Team 4 Matt Darryl Alex.
Short stack strategy: Draws in a free play situation Strategy: No Limit.
All In To put all the rest of your money into the pot.
Introduction to Poker Originally created by Albert Wu,
Example 31: Solve the following game by equal gains method: Y I II I II I X II II
Econ 805 Advanced Micro Theory 1 Dan Quint Fall 2009 Lecture 1 A Quick Review of Game Theory and, in particular, Bayesian Games.
© Oklahoma State Department of Education. All rights reserved.1 The Costs and Benefits of Gambling Gambling.
DECISION MODELS. Decision models The types of decision models: – Decision making under certainty The future state of nature is assumed known. – Decision.
Copyright © Cengage Learning. All rights reserved. 8 PROBABILITY DISTRIBUTIONS AND STATISTICS.
Mixed Strategies Keep ‘em guessing.
Unit 1 Day 6: Simulations.
You Bet Your Life - So to Speak
Lecture 13.
Joint work with Sam Ganzfried
The Duality Theorem Primal P: Maximize
Game Theory Just last week:
Lecture 10.
Extensive-form games and how to solve them
The Parameterized Poker Squares EAAI NSG Challenge
Probability of casino games
Lecture 12.
Econ 805 Advanced Micro Theory 1
The Costs and Benefits of Gambling
Chapter 6 Game Theory (Module 4) 1.
Game Theory.
Probability 14.1 Experimental Probability 14.2 Principles of Counting
National 5 Physical Education
Get to know the rating system in the model
Warm Up A tennis tournament with 4 players will be held. In the first round Seed #1 plays Seed #4 and Seed #2 plays Seed #3. The winners meet in the championship.
Games with Imperfect Information Bayesian Games
Lecture 20 Linear Program Duality
Elements of Chance Exemplar – Card Game.
Presentation transcript:

Strategies for Poker AI player Research Paper: Computer Poker by Darse Billings Dept. of CS, University of Alberta 5/2/17

Why study Poker? Bluff and Poker are similar in that, at any stage the players only have partial knowledge about the current state of the game. Incomplete or unreliable knowledge and risk management are factors that make Poker an interesting study for our project. The Computer Poker Research Group (CPRG) of the University of Alberta has contributed greatly towards the study of poker academically. The implemented a neural network for predicting the opponent's next move

Optimal move Koller in his paper "Generating and solving imperfect information games" states that just as there is an optimal move for every perfect information game, existence of optimal strategies can be proved if randomized strategies are allowed. Optimal strategy ensures that the opponent gains no advantage over the player even if his strategy is revealed The theoretical maximum guaranteed profit from a given poker situation can be attained by bluffing, or calling a possible bluff, with a predetermined probability

Poker example Consider an example with 2 players in a game of pot-limit Draw poker where player 2 has called a flush with a draw of just one card against player 1 who has the best hand. Player 2 wins showdown (revealing of cards played) if she makes the flush, but loses otherwise. This situation is similar to when an opponent calls bluff and the player has to display the cards. If the cards are played right, the player is safe, but otherwise he loses and has to take the pile.

Poker example contd. The example further assumes that the probability of completing the flush is exactly 0.2 or one in five, since 5 cards make a flush. Game theoretic analysis can be used to calculate the number of bluffs and the numbers of calls that helps to build an optimal strategy. For each pair of frequencies, the overall expectation can be calculated as shown in the following Table.

Table: expected values for a 4-flush draw in Poker BR = Ratio of Bluffs to Legitimate bets ABF = Absolute Bluff Frequency CFr = Absolute Calling Frequency

Optimal Strategies in Poker From table, to obtain the guaranteed maximum, Player B should bet 30% of time. Player A can ensure his optimal expectation of 0.7 by calling exactly 50% of the time Player B bets. Optimal ratios will change depending on the size of the bet in relation to the pot, but is independent of other factors.

Strategies for Bluff Currently we pursue the optimal strategy which can be used in Bluff as well, where we determine the probabilities of each card being in a particular hand. We know with certainty which cards are in our hand and which cards have been played. We determine the probability of all other cards based on cards that have been played. To start with, we could just use a naive and equal probability that a card is in some player's hand. If there are four players, each player have a probability of 1/39 or 0.025 to have a card, except for those cards which are held by the player himself with a probability of 1.

Maximal Strategies in Poker Maximal strategy is directed towards exploiting weaknesses in opponent Optimal strategy implicitly assumes perfect play on the part of the opponent. Example for Maximal strategy: If faced with a bet from a player who never bluffs, strong player will fold a marginal hand.