Adaptively Learning Tolls to Induce Target Flows Aaron Roth Joint work with Jon Ullman and Steven Wu.

Slides:



Advertisements
Similar presentations
Testing Linear Pricing Algorithms for use in Ascending Combinatorial Auctions (A5) Giro Cavallo David Johnson Emrah Kostem.
Advertisements

NON - zero sum games.
Price Of Anarchy: Routing
Bilinear Games: Polynomial Time Algorithms for Rank Based Subclasses Ruta Mehta Indian Institute of Technology, Bombay Joint work with Jugal Garg and Albert.
Fast Convergence of Selfish Re-Routing Eyal Even-Dar, Tel-Aviv University Yishay Mansour, Tel-Aviv University.
Congestion Games with Player- Specific Payoff Functions Igal Milchtaich, Department of Mathematics, The Hebrew University of Jerusalem, 1993 Presentation.
Joint Strategy Fictitious Play Sherwin Doroudi. “Adapted” from J. R. Marden, G. Arslan, J. S. Shamma, “Joint strategy fictitious play with inertia for.
Game Theory and Computer Networks: a useful combination? Christos Samaras, COMNET Group, DUTH.
Calibrated Learning and Correlated Equilibrium By: Dean Foster and Rakesh Vohra Presented by: Jason Sorensen.
6.853: Topics in Algorithmic Game Theory Fall 2011 Constantinos Daskalakis Lecture 13.
Course: Price of Anarchy Professor: Michal Feldman Student: Iddan Golomb 26/02/2014 Non-Atomic Selfish Routing.
How Bad is Selfish Routing? By Tim Roughgarden Eva Tardos Presented by Alex Kogan.
Regret Minimization and the Price of Total Anarchy Paper by A. Blum, M. Hajiaghayi, K. Ligett, A.Roth Presented by Michael Wunder.
Learning in games Vincent Conitzer
Noam Nisan, Michael Schapira, Gregory Valiant, and Aviv Zohar.
Sogang University ICC Lab Using Game Theory to Analyze Wireless Ad Hoc networks.
Mechanism Design without Money Lecture 4 1. Price of Anarchy simplest example G is given Route 1 unit from A to B, through AB,AXB OPT– route ½ on AB and.
Non-cooperative Game Theory Notes by Alberto Bressan.
Algoritmi per Sistemi Distribuiti Strategici
Convergent Learning in Unknown Graphical Games Dr Archie Chapman, Dr David Leslie, Dr Alex Rogers and Prof Nick Jennings School of Mathematics, University.
Topics in Stochastic Networks Performance Scaling and Algorithmic Challenges.
1 Best-Reply Mechanisms Noam Nisan, Michael Schapira and Aviv Zohar.
The Cat and The Mouse -- The Case of Mobile Sensors and Targets David K. Y. Yau Lab for Advanced Network Systems Dept of Computer Science Purdue University.
Competitive Routing in Multi-User Communication Networks Presentation By: Yuval Lifshitz In Seminar: Computational Issues in Game Theory (2002/3) By: Prof.
Autonomous Target Assignment: A Game Theoretical Formulation Gurdal Arslan & Jeff Shamma Mechanical and Aerospace Engineering UCLA AFOSR / MURI.
The Price Of Stability for Network Design with Fair Cost Allocation Elliot Anshelevich, Anirban Dasgupta, Jon Kleinberg, Eva Tardos, Tom Wexler, Tim Roughgarden.
XYZ 6/18/2015 MIT Brain and Cognitive Sciences Convergence Analysis of Reinforcement Learning Agents Srinivas Turaga th March, 2004.
How Bad is Selfish Routing? Tim Roughgarden & Eva Tardos Presented by Wonhong Nam
Online Oblivious Routing Nikhil Bansal, Avrim Blum, Shuchi Chawla & Adam Meyerson Carnegie Mellon University 6/7/2003.
Near-Optimal Network Design with Selfish Agents By Elliot Anshelevich, Anirban Dasgupta, Eva Tardos, Tom Wexler STOC’03 Presented by Mustafa Suleyman CIFTCI.
Convergence Time to Nash Equilibria in Load Balancing Eyal Even-Dar, Tel-Aviv University Alex Kesselman, Tel-Aviv University Yishay Mansour, Tel-Aviv University.
EE 685 presentation Optimization Flow Control, I: Basic Algorithm and Convergence By Steven Low and David Lapsley Asynchronous Distributed Algorithm Proof.
Algorithmic Issues in Non- cooperative (i.e., strategic) Distributed Systems.
Algorithms and Economics of Networks Abraham Flaxman and Vahab Mirrokni, Microsoft Research.
Graphical Games Kjartan A. Jónsson. Nash equilibrium Nash equilibrium Nash equilibrium N players playing a dominant strategy is a Nash equilibrium N players.
Multipath Protocol for Delay-Sensitive Traffic Jennifer Rexford Princeton University Joint work with Umar Javed, Martin Suchara, and Jiayue He
1 On the Agenda(s) of Research on Multi-Agent Learning by Yoav Shoham and Rob Powers and Trond Grenager Learning against opponents with bounded memory.
On Self Adaptive Routing in Dynamic Environments -- A probabilistic routing scheme Haiyong Xie, Lili Qiu, Yang Richard Yang and Yin Yale, MR and.
Inefficiency of equilibria, and potential games Computational game theory Spring 2008 Michal Feldman.
1 Issues on the border of economics and computation נושאים בגבול כלכלה וחישוב Congestion Games, Potential Games and Price of Anarchy Liad Blumrosen ©
CPS Learning in games Vincent Conitzer
Online Oblivious Routing Nikhil Bansal, Avrim Blum, Shuchi Chawla & Adam Meyerson Carnegie Mellon University 6/7/2003.
online convex optimization (with partial information)
A Framework for Distributed Model Predictive Control
A quantum protocol for sampling correlated equilibria unconditionally and without a mediator Iordanis Kerenidis, LIAFA, Univ Paris 7, and CNRS Shengyu.
Adaptive CSMA under the SINR Model: Fast convergence using the Bethe Approximation Krishna Jagannathan IIT Madras (Joint work with) Peruru Subrahmanya.
Small clique detection and approximate Nash equilibria Danny Vilenchik UCLA Joint work with Lorenz Minder.
The Effectiveness of Stackelberg strategies and Tolls for Network Congestion Games Chaitanya Swamy University of Waterloo.
Introduction to Algorithms By Mr. Venkatadri. M. Two Phases of Programming A typical programming task can be divided into two phases: Problem solving.
Game Theory: introduction and applications to computer networks Game Theory: introduction and applications to computer networks Introduction Giovanni Neglia.
Connections between Learning Theory, Game Theory, and Optimization Maria Florina (Nina) Balcan Lecture 14, October 7 th 2010.
Equilibria in Network Games: At the Edge of Analytics and Complexity Rachel Kranton Duke University Research Issues at the Interface of Computer Science.
EE 685 presentation Optimization Flow Control, I: Basic Algorithm and Convergence By Steven Low and David Lapsley.
1 Intrinsic Robustness of the Price of Anarchy Tim Roughgarden Stanford University.
Information Theory for Mobile Ad-Hoc Networks (ITMANET): The FLoWS Project Competitive Scheduling in Wireless Networks with Correlated Channel State Ozan.
Designing Games for Distributed Optimization Na Li and Jason R. Marden IEEE Journal of Selected Topics in Signal Processing, Vol. 7, No. 2, pp ,
Vasilis Syrgkanis Cornell University
Alpcan, T., and T. Basar (2004) “A game theoretic analysis of intrusion detection in access control systems” Proceedings of 43 rd IEEE Conference on Decision.
Presented by: Dardan Xhymshiti Spring 2016:. Authors: Publication:  ICDM 2015 Type:  Research Paper 2 Michael ShekelyamGregor JosseMatthias Schubert.
Finding Dense and Connected Subgraphs in Dual Networks
Introduction of BP & TRW-S
Structured Models for Multi-Agent Interactions
Presented By Aaron Roth
Game Theory in Wireless and Communication Networks: Theory, Models, and Applications Lecture 10 Stochastic Game Zhu Han, Dusit Niyato, Walid Saad, and.
≠ Particle-based Variational Inference for Continuous Systems
Multiagent Systems Repeated Games © Manfred Huber 2018.
Javad Ghaderi, Tianxiong Ji and R. Srikant
Vincent Conitzer Learning in games Vincent Conitzer
Vincent Conitzer CPS Learning in games Vincent Conitzer
Normal Form (Matrix) Games
Presentation transcript:

Adaptively Learning Tolls to Induce Target Flows Aaron Roth Joint work with Jon Ullman and Steven Wu

Non-Atomic Congestion Games A graph representing a road network

Non-Atomic Congestion Games

Equilibrium Flows

Routing games are potential games

Manipulating equilibrium flow (classic problem)

A Natural Problem [Bhaskar, Ligett, Schulman, Swamy FOCS 14]

The [BLSS] solution in a nutshell

Very neat! (Read the paper!) A couple of limitations: Latency functions must have a simple, known form: only unknowns are a small number of coefficients. Latency functions must be convex Heavy machinery Have to run Ellipsoid. Computationally intensive. Centralized.

Some desiderata Remove assumptions on latency functions No known parametric form Not necessarily convex Not necessarily Lipschitz Make update elementary Ideally decentralized.

Proposed algorithm: Tatonnemont.

Reverse engineering why it should work View the interaction as repeated play of a game between a toll player and a flow player. What game are they playing? Flow player’s strategies are feasible flows, toll player’s strategies are tolls in some bounded range What are their cost functions? How are they playing it?

Behavior of the flow player is clear.

Behavior of the toll player?

So algorithm is consistent with:

Questions Does the equilibrium of this game correspond to the target flow? Does this repeated play converge (quickly) to equilibrium?

Questions

Does this repeated play converge (quickly) to equilibrium? It does in zero sum games! [FreundSchapire96?] Actual strategy of GD player, empirical average of BR player.

So it converges in a zero sum game.

Strategic Equivalence Adding a strategy-independent term to a player’s cost function does not change that player’s best response function And so doesn’t change the equilibria of the game…

So it converges in a zero sum game.

So the dynamics converge!

Do approximate min-max tolls guarantee the approximate target flow?

Upshot

Questions Exact convergence without assumptions on latency functions? Extensions to other games? Seem to crucially use the fact that equilibrium is the solution to a convex optimization problem…

Questions Exact convergence without assumptions on latency functions? Extensions to other games? Seem to crucially use the fact that equilibrium is the solution to a convex optimization problem…