Commitment to Correlated Strategies Vince Conitzer and Dima Korzhyk.

Slides:

Advertisements

Similar presentations

Preprocessing Techniques for Computing Nash Equilibria Vincent Conitzer Duke University Based on: Conitzer and Sandholm. A Generalized Strategy Eliminability.

Advertisements

CPS Extensive-form games Vincent Conitzer

CPS Bayesian games and their use in auctions Vincent Conitzer

Game Theory Assignment For all of these games, P1 chooses between the columns, and P2 chooses between the rows.

1 University of Southern California Keep the Adversary Guessing: Agent Security by Policy Randomization Praveen Paruchuri University of Southern California.

COMP 553: Algorithmic Game Theory Fall 2014 Yang Cai Lecture 21.

Game Theoretical Insights in Strategic Patrolling: Model and Analysis Nicola Gatti – DEI, Politecnico di Milano, Piazza Leonardo.

For any player i, a strategy weakly dominates another strategy if (With at least one S -i that gives a strict inequality) strictly dominates if where.

Nash Equilibria In Graphical Games On Trees Edith Elkind Leslie Ann Goldberg Paul Goldberg.

Playing Games for Security: An Efficient Exact Algorithm for Solving Bayesian Stackelberg Games Praveen Paruchuri, Jonathan P. Pearce, Sarit Kraus Catherine.

Course: Applications of Information Theory to Computer Science CSG195, Fall 2008 CCIS Department, Northeastern University Dimitrios Kanoulas.

Algorithms for solving two- player normal form games Tuomas Sandholm Carnegie Mellon University Computer Science Department.

Computing equilibria of security games using linear integer programming Dima Korzhyk

General-sum games You could still play a minimax strategy in general- sum games –I.e., pretend that the opponent is only trying to hurt you But this is.

Extensive-form games. Extensive-form games with perfect information Player 1 Player 2 Player 1 2, 45, 33, 2 1, 00, 5 Players do not move simultaneously.

Vincent Conitzer CPS Normal-form games Vincent Conitzer

by Vincent Conitzer of Duke

An Introduction to Game Theory Part II: Mixed and Correlated Strategies Bernhard Nebel.

This Wednesday Dan Bryce will be hosting Mike Goodrich from BYU. Mike is going to give a talk during :30 to 12:20 in Main 117 on his work on UAVs.

1 Computing Nash Equilibrium Presenter: Yishay Mansour.

AWESOME: A General Multiagent Learning Algorithm that Converges in Self- Play and Learns a Best Response Against Stationary Opponents Vincent Conitzer.

1 Algorithms for Computing Approximate Nash Equilibria Vangelis Markakis Athens University of Economics and Business.

Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Minimax strategies, Nash equilibria, correlated equilibria Vincent Conitzer

CPS 173 Security games Vincent Conitzer

Computing Equilibria Christos H. Papadimitriou UC Berkeley “christos”

Game representations, solution concepts and complexity Tuomas Sandholm Computer Science Department Carnegie Mellon University.

A quantum protocol for sampling correlated equilibria unconditionally and without a mediator Iordanis Kerenidis, LIAFA, Univ Paris 7, and CNRS Shengyu.

The Computational Complexity of Finding Nash Equilibria Edith Elkind Intelligence, Agents, Multimedia group (IAM) School of Electronics and CS U. of Southampton.

Nash Equilibria In Graphical Games On Trees Revisited Edith Elkind Leslie Ann Goldberg Paul Goldberg (University of Warwick) (To appear in ACM EC’06)

CPS LP and IP in Game theory (Normal-form Games, Nash Equilibria and Stackelberg Games) Joshua Letchford.

1 What is Game Theory About? r Analysis of situations where conflict of interests is present r Goal is to prescribe how conflicts can be resolved 2 2 r.

Algorithms for solving two-player normal form games

CPS 570: Artificial Intelligence Game Theory Instructor: Vincent Conitzer.

CPS Utility theory, normal-form games

1 Algorithms for Computing Approximate Nash Equilibria Vangelis Markakis Athens University of Economics and Business.

Game representations, game-theoretic solution concepts, and complexity Tuomas Sandholm Computer Science Department Carnegie Mellon University.

A useful reduction (SAT -> game)

Game theory basics A Game describes situations of strategic interaction, where the payoff for one agent depends on its own actions as well as on the actions.

Yuan Deng Vincent Conitzer Duke University

Keep the Adversary Guessing: Agent Security by Policy Randomization

Bayesian games and mechanism design

Hello everyone, welcome to this talk about~~~

Bayesian games and their use in auctions

C. Kiekintveld et al., AAMAS 2009 (USC) Presented by Neal Gupta

A Game Theoretic Study of Attack and Defense in Cyber-Physical Systems

Non-additive Security Games

A useful reduction (SAT -> game)

Lecture 6: Other Game Models and Solution Concepts

CPS 570: Artificial Intelligence Game Theory

Communication Complexity as a Lower Bound for Learning in Games

Vincent Conitzer CPS Repeated games Vincent Conitzer

Commitment to Correlated Strategies

Vincent Conitzer Normal-form games Vincent Conitzer

CPS Extensive-form games

Vincent Conitzer Repeated games Vincent Conitzer

Lecture 20 Linear Program Duality

Vincent Conitzer Extensive-form games Vincent Conitzer

CPS 173 Extensive-form games

Instructor: Vincent Conitzer

Introduction to Game Theory

Game theory review for agent design tutorial

Normal Form (Matrix) Games

Information, Incentives, and Mechanism Design

Introduction to Game Theory

A Technique for Reducing Normal Form Games to Compute a Nash Equilibrium Vincent Conitzer and Tuomas Sandholm Carnegie Mellon University, Computer Science.

Vincent Conitzer CPS Repeated games Vincent Conitzer

CPS Bayesian games and their use in auctions

Presentation transcript:

Commitment to Correlated Strategies Vince Conitzer and Dima Korzhyk

Games in Normal Form 1, 13, 0 0, 02, 1 Player 1 Player 2 U D LR A strategy profile is a pair of strategies (pure or randomized) An outcome of the game is an entry in the matrix A player’s strategy is a distribution over the player’s actions

Nash equilibrium 1, 13, 0 0, 02, 1 Player 1 Player 2 U D LR An NE is a strategy profile in which no player has an incentive to deviate.

Computing a Nash Equilibrium Iterated dominance works in this case 1, 13, 0 0, 02, 1 Player 1 Player 2 U D LR Generally, there is no known polytime algorithm [PPAD-completeness: Daskalakis, Goldberg & Papadimitriou ‘06; Chen & Deng ’06; NP-hardness of NE with certain properties: Gilboa & Zemel ’89; Conitzer & Sandholm ‘08] Dominated

Stackelberg model Suppose the row player (the leader) can commit to a strategy 1, 13, 0 0, 02, 1 Leader Follower U D LR The leader benefits from commitment!

Commitment to a mixed strategy Suppose the leader commits to (2/3 Down, 1/3 Up) 1, 13, 0 0, 02, 1 Leader Follower U D LR Commitment to a mixed strategy benefits the leader even more The optimal strategy to commit to is (50%-eps, 50%+eps) Can be computed in polytime [von Stengel & Zamir ‘10, Conitzer & Sandholm ‘06] 1/3 2/3

Applications of the Stackelberg model Resource allocation for airport security [Pita et al., AI Magazine ‘09] Scheduling of federal air marshals [Tsai et al., AAMAS ‘09] GUARDS system for TSA resource allocation [Pita et al., AAMAS ’11] Photo AP Photo STL airport

LP1: Computing a Stackelberg strategy Objective: leader’s utility Subject to the follower’s rationality [von Stengel and Zamir ‘10, Conitzer and Sandholm ‘06]

New idea: Commitment to a correlated strategy The leader draws from a distribution over the outcomes 1, 13, 0 0, 02, 1 U D L R 20%40%10% 30% The follower only gets to know the column The follower should have no incentive to deviate We will look for a correlated strategy that maximizes the leader’s utility Leader Follower

Equivalence to Stackelberg Proposition 1. There exists an optimal correlated strategy to commit to in which the follower always gets the same recommendation.

Proof of Proposition 1 U C D L M R Break the correlated strategy into two components:

LP2 for computing an optimal correlated strategy to commit to We can use this LP to compute an optimal Stackelberg strategy! Objective – leader’s utility Follower’s rationality

Experimental evaluation The single LP actually runs faster than LP1, MIP for many game classes (on 50x50 games, using CPLEX, GAMUT games [Nudelman et al. ‘04]): Downside: the single LP uses more memory. BidirectionalLEG CovariantGameDispersionGame GrabTheDollar MinimumEffortMajorityVoting LocationGame GuessTwoThirds PolymatrixGame RandomGame TravelersDilemma UniformLEGWarOfAttrition RandomLEG RandomGraphical RandomZeroSum

Correlated equilibrium A 3 rd party proposes a distribution over the outcomes Now, both players should have no incentive to deviate Player 2’s rationality Player 1’s rationality

Stackelberg vs Correlated Equilibrium Corollary 1. The leader’s Stackelberg utility is at least as high as the leader’s utility in any correlated equilibrium of the game. [von Stengel and Zamir ‘10]

Commitment to correlated strategies for n>2 players A generalization of LP2 With n>2 players, the optimal correlated strategy to commit to may not be a product distribution P= product distribution D= degenerate distribution

Example: Commitment to a correlated strategy with 3 players Utilities 21 50% Optimal correlated strategy to commit to: Leader Different from Stackelberg and CE!

Advantages of commitment to a correlated equilibrium Same as Stackelberg for n=2 players Well defined for n>2 players (unlike the Stackelberg model) Easy to compute for any number of players

Overview of contributions A single LP for Stackelberg (for 2 players) Easy proof of relationship between Stackelberg and correlated equilibrium (for 2 players) Model for commitment to a correlated strategy (for n≥2 players) LP for commitment to a correlated strategy (for n≥2 players)

Thank you!