COLOR TEST COLOR TEST. Dueling Algorithms N ICOLE I MMORLICA, N ORTHWESTERN U NIVERSITY WITH A. T AUMAN K ALAI, B. L UCIER, A. M OITRA, A. P OSTLEWAITE,

Slides:



Advertisements
Similar presentations
Submodular Set Function Maximization via the Multilinear Relaxation & Dependent Rounding Chandra Chekuri Univ. of Illinois, Urbana-Champaign.
Advertisements

Shortest Vector In A Lattice is NP-Hard to approximate
Totally Unimodular Matrices
Bilinear Games: Polynomial Time Algorithms for Rank Based Subclasses Ruta Mehta Indian Institute of Technology, Bombay Joint work with Jugal Garg and Albert.
Lecturer: Moni Naor Algorithmic Game Theory Uri Feige Robi Krauthgamer Moni Naor Lecture 8: Regret Minimization.
COMP 553: Algorithmic Game Theory Fall 2014 Yang Cai Lecture 21.
Game Theoretical Insights in Strategic Patrolling: Model and Analysis Nicola Gatti – DEI, Politecnico di Milano, Piazza Leonardo.
MIT and James Orlin © Game Theory 2-person 0-sum (or constant sum) game theory 2-person game theory (e.g., prisoner’s dilemma)
Study Group Randomized Algorithms 21 st June 03. Topics Covered Game Tree Evaluation –its expected run time is better than the worst- case complexity.
Shengyu Zhang The Chinese University of Hong Kong.
10/19/2004TCSS435A Isabelle Bichindaritz1 Game and Tree Searching.
Part 3: The Minimax Theorem
Interchanging distance and capacity in probabilistic mappings Uriel Feige Weizmann Institute.
10/11/2001Random walks and spectral segmentation1 CSE 291 Fall 2001 Marina Meila and Jianbo Shi: Learning Segmentation by Random Walks/A Random Walks View.
An Introduction to Game Theory Part II: Mixed and Correlated Strategies Bernhard Nebel.
1 Optimization problems such as MAXSAT, MIN NODE COVER, MAX INDEPENDENT SET, MAX CLIQUE, MIN SET COVER, TSP, KNAPSACK, BINPACKING do not have a polynomial.
Dynamic Programming1. 2 Outline and Reading Matrix Chain-Product (§5.3.1) The General Technique (§5.3.2) 0-1 Knapsack Problem (§5.3.3)
1 Computing Nash Equilibrium Presenter: Yishay Mansour.
Arithmetic Hardness vs. Randomness Valentine Kabanets SFU.
An Introduction to Game Theory Part III: Strictly Competitive Games Bernhard Nebel.
No Free Lunch (NFL) Theorem Many slides are based on a presentation of Y.C. Ho Presentation by Kristian Nolde.
CSC401 – Analysis of Algorithms Lecture Notes 12 Dynamic Programming
1 Algorithms for Computing Approximate Nash Equilibria Vangelis Markakis Athens University of Economics and Business.
Games & Adversarial Search Chapter 6 Section 1 – 4.
Game Theory.
Experts Learning and The Minimax Theorem for Zero-Sum Games Maria Florina Balcan December 8th 2011.
Minimax strategies, Nash equilibria, correlated equilibria Vincent Conitzer
Game Playing State-of-the-Art  Checkers: Chinook ended 40-year-reign of human world champion Marion Tinsley in Used an endgame database defining.
Approximation Algorithms for Stochastic Combinatorial Optimization Part I: Multistage problems Anupam Gupta Carnegie Mellon University.
The Multiplicative Weights Update Method Based on Arora, Hazan & Kale (2005) Mashor Housh Oded Cats Advanced simulation methods Prof. Rubinstein.
Game Playing.
A quantum protocol for sampling correlated equilibria unconditionally and without a mediator Iordanis Kerenidis, LIAFA, Univ Paris 7, and CNRS Shengyu.
Approximation Algorithms for NP-hard Combinatorial Problems Magnús M. Halldórsson Reykjavik University
1 Introduction to Approximation Algorithms. 2 NP-completeness Do your best then.
Agents that can play multi-player games. Recall: Single-player, fully-observable, deterministic game agents An agent that plays Peg Solitaire involves.
Standard and Extended Form Games A Lesson in Multiagent System Based on Jose Vidal’s book Fundamentals of Multiagent Systems Henry Hexmoor, SIUC.
Q 5-2 a. E = Efficiency score wi = Weight applied to i ’s input and output resources by the composite hospital.
Yang Cai Oct 08, An overview of today’s class Basic LP Formulation for Multiple Bidders Succinct LP: Reduced Form of an Auction The Structure of.
Game theory & Linear Programming Steve Gu Mar 28, 2008.
CSC401: Analysis of Algorithms CSC401 – Analysis of Algorithms Chapter Dynamic Programming Objectives: Present the Dynamic Programming paradigm.
Issues on the border of economics and computation נושאים בגבול כלכלה וחישוב Speaker: Dr. Michael Schapira Topic: Dynamics in Games (Part III) (Some slides.
Ásbjörn H Kristbjörnsson1 The complexity of Finding Nash Equilibria Ásbjörn H Kristbjörnsson Algorithms, Logic and Complexity.
Part 3 Linear Programming
1 What is Game Theory About? r Analysis of situations where conflict of interests is present r Goal is to prescribe how conflicts can be resolved 2 2 r.
Linear Programming Maximize Subject to Worst case polynomial time algorithms for linear programming 1.The ellipsoid algorithm (Khachian, 1979) 2.Interior.
Implicit Hitting Set Problems Richard M. Karp Erick Moreno Centeno DIMACS 20 th Anniversary.
Lecture.6. Table of Contents Lp –rounding Dual Fitting LP-Duality.
C&O 355 Lecture 24 N. Harvey TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AA A A A A A A A A.
Dynamic Programming1. 2 Outline and Reading Matrix Chain-Product (§5.3.1) The General Technique (§5.3.2) 0-1 Knapsack Problem (§5.3.3)
Vasilis Syrgkanis Cornell University
Deterministic Algorithms for Submodular Maximization Problems Moran Feldman The Open University of Israel Joint work with Niv Buchbinder.
1 Algorithms for Computing Approximate Nash Equilibria Vangelis Markakis Athens University of Economics and Business.
5.1.Static Games of Incomplete Information
MiniMax Principle in Game Theory Slides Made by Senjuti Basu Roy.
Econ 805 Advanced Micro Theory 1 Dan Quint Fall 2009 Lecture 1 A Quick Review of Game Theory and, in particular, Bayesian Games.
Approximation algorithms
Information Complexity Lower Bounds
Game Theory Just last week:
Non-additive Security Games
David Kauchak CS52 – Spring 2016
Games & Adversarial Search
Artificial Intelligence
Structured Models for Multi-Agent Interactions
Artificial Intelligence
Lecture 20 Linear Program Duality
Lecture 15: Least Square Regression Metric Embeddings
MinMax Principle in Game Theory – continued….
Normal Form (Matrix) Games
Presentation transcript:

COLOR TEST COLOR TEST

Dueling Algorithms N ICOLE I MMORLICA, N ORTHWESTERN U NIVERSITY WITH A. T AUMAN K ALAI, B. L UCIER, A. M OITRA, A. P OSTLEWAITE, AND M. T ENNENHOLTZ

Social Contexts Normal-form games: Players choose strategies to maximize expected von Neumann-Morgenstern utility. Social context games [AKT’08]: Players choose strategies to achieve particular social status among peers.

Social Contexts Ranking games [BFHS’08]: Players choose strategies to achieve particular payoff rank among peers.

Two-Player Ranking Games G Alice Bob RG payoff of Alice: 1Alice beats Bob in G Alice and Bob play game: ½Alice ties Bob in G 0Alice loses to Bob in G

Implicit Representations Succinct games [FIKU’08]: Payoff matrix represented by boolean circuit. NE hard to solve or approximate. Blotto games [B’21, GW’50, R’06, H’08]: Distribute armies to battlefields.

Implicit Representations Optimization duels [this work]: Underlying game is optimization problem. Goal is to optimize better than opponent.

Ranking Duel A search engine is an algorithm that inputs set Ω = {1, 2, …, n} of items probabilities p 1 + … + p n = 1 of each and outputs a permutation π of Ω. Monopolist objective: minimize E i ~ p [π(i)].

Ranking Duel Competitive objective: Let the expected score of a ranking π versus a ranking π’ be Pr i ~ p [ π(i) < π’(i) ] + (½) Pr i ~ p [ π(i) = π’(i) ]. Then objective is to output a π that maximizes expected score given algorithm of opponent.

Optimizing a Search Engine User searches for object drawn according to known probability dist.

Search: pretty shape 1. (27%) 2. (22%) 3. (19%) 4. (16%) 5. (09%) 6. (07%)

Choosing a Search Engine 1.Search for “pretty shape”. 2.See which search engine ranks my favorite shape higher. 3.Thereafter, use that one.

Search: pretty shape 1. (27%) 2. (22%) 3. (19%) 4. (16%) 5. (09%) 6. (07%) Search: pretty shape 6. (27%) 1. (22%) 2. (19%) 3. (16%) 4. (09%) 5. (07%)

Questions Can we efficiently compute an equilibrium of a ranking duel? How poorly does greedy perform in a competitive setting? What consequences does the duel have for the searcher?

Optimization Problems as Duels RankingBinary SearchRouting Parking CompressionHiring Start Finish ? ? ? ? ? ? ?

Duel Framework Finite feasible set X of strategies. Prob. distribution p over states of nature Ω. Objective cost c: Ω × X R. Monopolist: choose x to minimize E ω ~ p [c ω (x)].

Duel Framework 1.Players select strategies x, x’ from X. 2.Nature selects state ω from Ω according to p. 3.Payoffs v(x,x’), (1-v(x,x’)) are realized. 1 if c ω (x) < c ω (x’) 0 if c ω (x) > c ω (x’) ½ if c ω (x) = c ω (x’) v(x,x’) = E ω ~ p

Results: Computation An LP-based technique to compute exact equilibria, A low-regret learning technique to compute approximate equilibria, … and a demonstration of these techniques in our sample settings

Computing Exact Equilibria Formulate game as bilinear duel: 1.Efficiently map strategies to points X in R n. 2.Define constraints describing K=convex-hull(X). 3.Define payoff matrix M that computes values. 4.Maps points in K back to strategies in original setting.

Bilinear Duels If feasible strategies X are points in R n, and payoff v(x, x’) is x t Mx’ for some M in R nxn, then max v,x v s.t. x t Mx’ ≥ v for all x’ in X x is in K (=convex-hull(X)) Exponential, but equivalent poly-sized LP.

Ranking Duel Formulate game as bilinear duel: 1.Efficiently map strategies to points X in R n. X = set of permutation matrices (entry x ij indicates item i placed in position j) 2.Define constraints describing K=convex-hull(X). K = set of doubly stochastic matrices (entry y ij = prob. item i placed in position j)

Ranking Duel Formulate game as bilinear duel: 4.Design “rounding alg.” that maps points in K back to strategies in original setting. Birkhoff–von Neumann Theorem: Can efficiently construct permutation basis for doubly stochastic matrix (e.g., via matching).

Ranking Duel Formulate game as bilinear duel: 3.Define payoff matrix M that computes values. E p,y,y’ [v(x,x’)] = ∑ i p(i) ( ½ Pr y,y’ [x i = x’ i ] + Pr y,y’ [x i > x’ i ]) = ∑ i p(i) (∑ i y ij ( ½ y’ ij + ∑ k>j y’ ik )) which is bilinear in y,y’ and so can be written y t My’.

Ranking Duel Result: Can reduce computation time to poly(n) versus poly(n!) with standard LP approach. Technique also applies to hiring duel and binary search duel.

Compression Duel data Goal: smaller compression (i.e., lower depth in tree). (each with prob. p(.))

Classical Algorithm

Compression Duel Formulate game as bilinear duel: 1.Efficiently map strategies to points X in R n. X = subset of zero-one matrices* (entry x ij indicates item i placed at depth j) 2.Define constraints describing K=convex-hull(X). K = subset of row-stochastic matrices* (entry y ij = prob. item i placed at depth j) * Must correspond to depth profile of some binary tree!

Compression Duel Formulate game as bilinear duel: 3.Define payoff matrix M that computes values. E p,y,y’ [v(x,x’)] = ∑ i p(i) (∑ i y ij ( ½ y’ ij + ∑ k>j y’ ik )) which is bilinear in y,y’ and so can be written y t My’.

Compression Duel Bilinear Form: max v,x v s.t. x t Mx’ ≥ v for all x’ in X x is in K (=convex-hull(X)) Problems: 1. How to round points in K back to a random binary tree with right depth profile? 2. How to succinctly express constraints describing K?

Approximate Minimax Defn. For any ε > 0, an approximate minimax strategy guarantees payoff not worse than best possible value minus ε. Defn. For any ε > 0, an approximate best response has payoff not worse than payoff of best response minus ε.

Best-Response Oracle Idea. Use approximate best-response oracle to get approximate minimax strategies. 1. Low-regret learning: if x 1,…,x T and x’ 1,…,x’ T have low regret, then ave. is approx minimax. 2. Follow expected leader: on round t+1, play best-response to x 1,…,x t to get low-regret.

Compression Best-Response Given lists of items with values and weights, pick one from each list with max total value and total weight at most one. Multiple-choice Knapsack:

Compression Best-Response Depth: 1234

Compression Best-Response (each with prob. p(.)) x’ in K For j from 1..n, list of depth j: v( ) = Pr[win at depth j | x’ ] w( ) = 2 -j … Kraft inequality

Other Duels 1.Hiring duel: constraints defining Euclidean subspace correspond to hiring probabilities. 2.Binary search duel: similar to hiring duel, but constraints defining Euclidean subspace more complex (must correspond to search trees). 3.Racing duel: seems computationally hard, even though single-player problem easy.

Conclusion Every optimization problem has a duel. Classic solutions (and all deterministic algorithms) can usually be badly beaten. Duel can be easier or harder to solve, and can lead to inefficiencies. OPEN QUESTION: effect of duel on the solution to the optimization problem?