S. Ceppi, N. Gatti, and N. Basilico DEI, Politecnico di Milano Computing Bayes-Nash Equilibria through Support Enumeration Methods in Bayesian Two-Player.

Slides:

Advertisements

Similar presentations

Simple Search Methods for Finding a Nash Equilibrium

Advertisements

N. Basilico, N. Gatti, F. Amigoni DEI, Politecnico di Milano Leader-Follower Strategies for Robotic Patrolling in Environments with Arbitrary Topology.

Preprocessing Techniques for Computing Nash Equilibria Vincent Conitzer Duke University Based on: Conitzer and Sandholm. A Generalized Strategy Eliminability.

F. Amigoni, N. Basilico, N. Gatti DEI, Politecnico di Milano Finding the Optimal Strategies in Robotic Patrolling with Adversaries in Topologically-Represented.

Multiagent Technology Solutions for Planning in Ambient Intelligence Nicola Gatti, Francesco Amigoni, Marco Rolando {ngatti,

Concise representations of games Vincent Conitzer

CPS Extensive-form games Vincent Conitzer

Complexity Results about Nash Equilibria Vincent Conitzer, Tuomas Sandholm International Joint Conferences on Artificial Intelligence 2003 (IJCAI’03) Presented.

M9302 Mathematical Models in Economics Instructor: Georgi Burlakov 3.1.Dynamic Games of Complete but Imperfect Information Lecture

Game Theory Assignment For all of these games, P1 chooses between the columns, and P2 chooses between the rows.

Continuation Methods for Structured Games Ben Blum Christian Shelton Daphne Koller Stanford University.

1 University of Southern California Keep the Adversary Guessing: Agent Security by Policy Randomization Praveen Paruchuri University of Southern California.

This Segment: Computational game theory Lecture 1: Game representations, solution concepts and complexity Tuomas Sandholm Computer Science Department Carnegie.

Bilinear Games: Polynomial Time Algorithms for Rank Based Subclasses Ruta Mehta Indian Institute of Technology, Bombay Joint work with Jugal Garg and Albert.

Computation of Nash Equilibrium Jugal Garg Georgios Piliouras.

Game Theoretical Insights in Strategic Patrolling: Model and Analysis Nicola Gatti – DEI, Politecnico di Milano, Piazza Leonardo.

Bargaining in Markets with One-Sided Competition: Model and Analysis Nicola Gatti DEI, Politecnico di Milano, Piazza Leonardo da.

Playing Games for Security: An Efficient Exact Algorithm for Solving Bayesian Stackelberg Games Praveen Paruchuri, Jonathan P. Pearce, Sarit Kraus Catherine.

Algorithms for solving two- player normal form games Tuomas Sandholm Carnegie Mellon University Computer Science Department.

Game-theoretic analysis tools Necessary for building nonmanipulable automated negotiation systems.

Extensive-form games. Extensive-form games with perfect information Player 1 Player 2 Player 1 2, 45, 33, 2 1, 00, 5 Players do not move simultaneously.

NECTAR NECTAR Nash Equilibriam CompuTation Algorithms and Resources  Game Theory provides a rich mathematical framework for analyzing strategic interactions.

Basics on Game Theory For Industrial Economics (According to Shy’s Plan)

Temporal Action-Graph Games: A New Representation for Dynamic Games Albert Xin Jiang University of British Columbia Kevin Leyton-Brown University of British.

Decentralised Coordination of Mobile Sensors using the Max-Sum Algorithm Ruben Stranders, Alex Rogers, Nick Jennings School of Electronics and Computer.

Improving Market-Based Task Allocation with Optimal Seed Schedules IAS-11, Ottawa. September 1, 2010 G. Ayorkor Korsah 1 Balajee Kannan 1, Imran Fanaswala.

Decentralised Coordination of Mobile Sensors using the Max-Sum Algorithm School of Electronics and Computer Science University of Southampton {rs06r2,

Algorithmic Game Theory Nicola Gatti and Marcello Restelli {ngatti, DEI, Politecnico di Milano, Piazza Leonardo da Vinci 32, Milano,

Harsanyi transformation Players have private information Each possibility is called a type. Nature chooses a type for each player. Probability distribution.

1 Computing Nash Equilibrium Presenter: Yishay Mansour.

Computational Methods for Management and Economics Carla Gomes

On Spectrum Selection Games in Cognitive Radio Networks

Developing a Deterministic Patrolling Strategy for Security Agents Nicola Basilico, Nicola Gatti, Francesco Amigoni.

Better automated abstraction techniques for imperfect information games, with application to Texas Hold’em poker * Andrew Gilpin and Tuomas Sandholm, CMU,

Graphical Games Kjartan A. Jónsson. Nash equilibrium Nash equilibrium Nash equilibrium N players playing a dominant strategy is a Nash equilibrium N players.

Towards Automated Bargaining in Electronic Markets: a Partially Two-Sided Competition Model N. Gatti, A. Lazaric, M. Restelli {ngatti, lazaric,

A Game-Theoretic Approach to Determining Efficient Patrolling Strategies for Mobile Robots Francesco Amigoni, Nicola Gatti, Antonio Ippedico.

1 On the Agenda(s) of Research on Multi-Agent Learning by Yoav Shoham and Rob Powers and Trond Grenager Learning against opponents with bounded memory.

Alternating-Offers Bargaining under One-Sided Uncertainty on Deadlines Francesco Di Giunta and Nicola Gatti Dipartimento di Elettronica e Informazione.

Simple search methods for finding a Nash equilibrium Ryan Porter, Eugene Nudelman, and Yoav Shoham Games and Economic Behavior, Vol. 63, Issue 2. pp ,

Computing Equilibria in Electricity Markets Tony Downward Andy Philpott Golbon Zakeri University of Auckland.

Decentralised Coordination of Mobile Sensors School of Electronics and Computer Science University of Southampton Ruben Stranders,

Operations Research Models

Game representations, solution concepts and complexity Tuomas Sandholm Computer Science Department Carnegie Mellon University.

Dynamic Games of complete information: Backward Induction and Subgame perfection - Repeated Games -

CPS LP and IP in Game theory (Normal-form Games, Nash Equilibria and Stackelberg Games) Joshua Letchford.

Standard and Extended Form Games A Lesson in Multiagent System Based on Jose Vidal’s book Fundamentals of Multiagent Systems Henry Hexmoor, SIUC.

Temperature Discovery Search Temperature Discovery Search (TDS) is a new minimaxbased game tree search method designed to compute or approximate the temperature.

Game-theoretic analysis tools Tuomas Sandholm Professor Computer Science Department Carnegie Mellon University.

Regret Minimizing Equilibria of Games with Strict Type Uncertainty Stony Brook Conference on Game Theory Nathanaël Hyafil and Craig Boutilier Department.

A Study of Central Auction Based Wholesale Electricity Markets S. Ceppi and N. Gatti.

Vasilis Syrgkanis Cornell University

Better automated abstraction techniques for imperfect information games Andrew Gilpin and Tuomas Sandholm Carnegie Mellon University Computer Science Department.

Strategic Game Theory for Managers. Explain What is the Game Theory Explain the Basic Elements of a Game Explain the Importance of Game Theory Explain.

Incomplete information: Perfect Bayesian equilibrium

Tommy Messelis * Stefaan Haspeslagh Burak Bilgin Patrick De Causmaecker Greet Vanden Berghe *

Lecture 20 Review of ISM 206 Optimization Theory and Applications.

Keep the Adversary Guessing: Agent Security by Policy Randomization

Nash Equilibrium: P or NP?

Extensive-Form Game Abstraction with Bounds

Communication Complexity as a Lower Bound for Learning in Games

Extensive-form games and how to solve them

Two-player Games (2) ZUI 2013/2014

Risk-informed Decision Making under Incomplete Information

Game Theory in Wireless and Communication Networks: Theory, Models, and Applications Lecture 2 Bayesian Games Zhu Han, Dusit Niyato, Walid Saad, Tamer.

CPS Extensive-form games

Vincent Conitzer Extensive-form games Vincent Conitzer

Normal Form (Matrix) Games

A Technique for Reducing Normal Form Games to Compute a Nash Equilibrium Vincent Conitzer and Tuomas Sandholm Carnegie Mellon University, Computer Science.

Presentation transcript:

S. Ceppi, N. Gatti, and N. Basilico DEI, Politecnico di Milano Computing Bayes-Nash Equilibria through Support Enumeration Methods in Bayesian Two-Player Strategic-Form Games Sofia Ceppi, Nicola Gatti, and Nicola Basilico Dipartimento di Elettronica e Informazione, Politecnico di Milano {ceppi, ngatti,

S. Ceppi, N. Gatti, and N. Basilico DEI, Politecnico di Milano Outline State of the Art –What is a Bayesian game –Why to study Bayesian games Original Contributions –Extensions of existing algorithms for Bayesian games –B-PNS algorithm Experimental Evaluation Conclusions and Future Contributions

S. Ceppi, N. Gatti, and N. Basilico DEI, Politecnico di Milano Bayesian Games What is a Bayesian Game? –Non-cooperative game –A game wherein information is uncertain Type 2.1 2, 7 9, 4 3, 5 2, 3 a b cd ω 2.1 = 0.3 Type 2.2 2, 7 9, 8 3, 5 1, 3 a b cd ω 2.2 = 0.7 ? ? Player 2 Player 1

S. Ceppi, N. Gatti, and N. Basilico DEI, Politecnico di Milano Bayesian Games Why to study Bayesian Games? –Most real world strategic situations present uncertainty and therefore can be modeled as Bayesian games, e.g., Negotiation settings: bilateral bargaining and auctions Security settings: strategic mobile robot patrolling –The literature does not study algorithms for computing Bayes- Nash equilibria in depth [Shoham and Leyton-Brown, 2008]

S. Ceppi, N. Gatti, and N. Basilico DEI, Politecnico di Milano State of the Art Solution concept for Bayesian games is Bayes-Nash equilibrium A Bayesian game is solved by reducing it to a complete-information game and then computing a Nash equilibrium in this game The literature provides a detailed comparison of the algorithms for the computation of Nash equilibria in complete-information games The exact algorithms for two-player complete-information strategic- form games are: –LH: based on linear complementary programming [Lemke- Howson, 1964] –PNS: based on support enumeration [Porter et al., 2004] –SGC: based on mixed integer linear programming [Sandholm et al., 2005]

S. Ceppi, N. Gatti, and N. Basilico DEI, Politecnico di Milano Bayesian Game Peculiarities The experimental results provided from the literature for computing Nash equilibria cannot be generalized to Bayesian case. The main reasons are: –Bayesian games can present characteristics (e.g., existence of equilibria with small supports) different from those of complete- information games –The reduction to complete-information games raises several problems in the application of algorithms for computing Nash equilibria [Koller and Megiddo, 1996]

S. Ceppi, N. Gatti, and N. Basilico DEI, Politecnico di Milano Original Contributions Extension of the algorithms existing in the literature for the computation of Bayes-Nash equilibrium –PNS B-PNS (the main result) –LH B-LC –SGC B-SGC

S. Ceppi, N. Gatti, and N. Basilico DEI, Politecnico di Milano PNS Algorithm The support S i of an agent i is the set of actions played by i with non- null probability The joint support S is the set of single agents support To derive the B-PNS algorithm we modified all the three parts of PNS algorithm STEP 1: Choosing S (Enumeration Criteria) STEP 1: Choosing S (Enumeration Criteria) STEP 2: Pruning (Conditional Dominance) STEP 2: Pruning (Conditional Dominance) STEP 3: Equilibrium Checking (Feasibility Problem) STEP 3: Equilibrium Checking (Feasibility Problem) not dominated feasible dominated not feasible

S. Ceppi, N. Gatti, and N. Basilico DEI, Politecnico di Milano Supports Supports for player 1: S 1 =(a), S 1 =(b),S 1 =(a,b) Supports for type 1 of player 2: S 2.1 =(c), S 2.1 =(d),S 2.1 =(c,d) Supports for type 2 of player 2: S 2.2 =(c), S 2.2 =(d),S 2.2 =(c,d) Joint support: S={S 1,S 2.1,S 2.2 } S={ (a), (d), (c,d) } Goal: enumerate the joint supports and check if they are of equilibrium How to enumerate the joint supports? Type 2.1 2, 7 9, 4 3, 5 2, 3 a b cd ω 2.1 = 0.3 Type 2.2 2, 7 9, 8 3, 5 1, 3 a b cd ω 2.2 = 0.7 Player 2 Player 1

S. Ceppi, N. Gatti, and N. Basilico DEI, Politecnico di Milano Step 1: Heuristics Balance –Non Bayesian games: |S 1 |-|S 2 | If S 1 =(a), S 2 =(c) the balance is 0 –We call –In Bayesian games the balance is If S 1 =(a,b), S 2.1 =(c), S 2.2 =(c,d) the balance is 0 –Increasing order of balance Size –The size of a player is the sum of all the actions played with non-null probability by all the types of the player –Increasing order of size

S. Ceppi, N. Gatti, and N. Basilico DEI, Politecnico di Milano Step 1: Peak Criterion (1) Open Issue: –Given the values of balance and size, ranking a players supports –Example: Balance = 0 Size = 7 Player 1s types = 3 Actions = {a,b,c,d,e} S 1 = { (a), (a,b,c,d,e), (c) } S 1 = { (a,c), (a,b,c), (c,e) } Peak Criterion –Based on the size of types supports –The peak is the size of the maximum possible support –Decreasing criterion and increasing criterion

S. Ceppi, N. Gatti, and N. Basilico DEI, Politecnico di Milano We use an enumeration tree to order the supports where each node defines the size of all the types. (e.g. |S 1 |, |S 2.1 |,|S 2.2 |) Size = 7Types = 3Available Actions = 5 Step 1: Peak Criterion (2)

S. Ceppi, N. Gatti, and N. Basilico DEI, Politecnico di Milano Step 2: Pruning Techniques The problem of checking whether or not an action is strictly conditionally Bayesian dominated by another action can be formulated as a linear feasibility problem In our case, it can be formulated as a fractional knapsack problem and then solved in linear time in the number of variables Type 2.1 2, 7 9, 4 3, 5 2, 3 a b cd ω 2.1 = 0.3 Type 2.2 2, 7 9, 8 3, 5 1, 3 a b cd ω 2.2 = 0.7 Action a is strictly conditionally Bayesian dominated by action a if for every σ -i | S -i Player 1 Player 2 Given S -i = {S 2.1 = (c), S 2.2 = (d)} EU 1 (a) = ω 2.1 · 2 + ω 2.2 · 9 EU 1 (b) = ω 2.1 · 3 + ω 2.2 · 1 EU 1 (a) > EU 1 (b)

S. Ceppi, N. Gatti, and N. Basilico DEI, Politecnico di Milano Step 3: B-PNS Feasibility Problem (1) Linear feasibility problem used for checking if a joint support is of equilibrium

S. Ceppi, N. Gatti, and N. Basilico DEI, Politecnico di Milano Step 3: B-PNS Feasibility Problem (2) The problem with support S = { (a), (c,d), (d) } is infeasible The problem with support S = { (a,b), (c), (c,d) } is feasible: –the probabilities of the actions are: player 1: p(a) = p(b)= type 1 player 2: p(c) = 1 p(d) = 0 type 2 player 2: p(c) = p(d) = Type 2.1 2, 7 9, 4 3, 5 2, 3 a b cd ω 2.1 = 0.3 Type 2.2 2, 7 9, 8 3, 5 1, 3 a b cd ω 2.2 = 0.7 Player 2 Player 1

S. Ceppi, N. Gatti, and N. Basilico DEI, Politecnico di Milano Experimental Evaluation We developed a tool based on GAMUT to generate Bayesian games We compared computational time in: –Different configurations of B-PNS –PNS and B-PNS –B-PNS, B-SGC, and B-LC

S. Ceppi, N. Gatti, and N. Basilico DEI, Politecnico di Milano Experimental Results

S. Ceppi, N. Gatti, and N. Basilico DEI, Politecnico di Milano Conclusions We focus on the computation of equilibria in Bayesian games This class of game is important since most strategic real-world situations can be modeled as a Bayesian game Computing Nash equilibria in complete-information games is inefficient when the game is Bayesian We extend the algorithms used for the computation of Nash equilibria for the Bayesian games We focus on B-PNS We experimentally evaluate the Bayesian algorithms

S. Ceppi, N. Gatti, and N. Basilico DEI, Politecnico di Milano Future Contributions Improvement of support enumeration methods using algorithms based on local search techniques –Non-Stochastic –Stochastic Application to open problems