Optimizing Scrip Systems: Efficiency, Crashes, Hoarders, and Altruists By Ian A. Kash, Eric J. Friedman, Joseph Y. Halpern Presentation by Avner May 12/10/08.

Slides:



Advertisements
Similar presentations
QoS-based Management of Multiple Shared Resources in Dynamic Real-Time Systems Klaus Ecker, Frank Drews School of EECS, Ohio University, Athens, OH {ecker,
Advertisements

6.853: Topics in Algorithmic Game Theory Fall 2011 Constantinos Daskalakis Lecture 16.
6.896: Topics in Algorithmic Game Theory Lecture 20 Yang Cai.
Nash’s Theorem Theorem (Nash, 1951): Every finite game (finite number of players, finite number of pure strategies) has at least one mixed-strategy Nash.
1 University of Southern California Keep the Adversary Guessing: Agent Security by Policy Randomization Praveen Paruchuri University of Southern California.
Analysis of Algorithms
An Approximate Truthful Mechanism for Combinatorial Auctions An Internet Mathematics paper by Aaron Archer, Christos Papadimitriou, Kunal Talwar and Éva.
Evolution and Repeated Games D. Fudenberg (Harvard) E. Maskin (IAS, Princeton)
Macroeconomics Chapter 111 Inflation, Money Growth, and Interest Rates C h a p t e r 1 1.
Congestion Games with Player- Specific Payoff Functions Igal Milchtaich, Department of Mathematics, The Hebrew University of Jerusalem, 1993 Presentation.
EC941 - Game Theory Lecture 7 Prof. Francesco Squintani
Negotiation A Lesson in Multiagent System Based on Jose Vidal’s book Fundamentals of Multiagent Systems Henry Hexmoor SIUC.
Regret Minimization and the Price of Total Anarchy Paper by A. Blum, M. Hajiaghayi, K. Ligett, A.Roth Presented by Michael Wunder.
1 12. Principles of Parameter Estimation The purpose of this lecture is to illustrate the usefulness of the various concepts introduced and studied in.
Imperfect commitment Santiago Truffa. Agenda 1.“Authority and communication in Organizations” Wouter Dessein, RES “Contracting for information.
The Evolution of Conventions H. Peyton Young Presented by Na Li and Cory Pender.
Dynamic Games of Complete Information.. Repeated games Best understood class of dynamic games Past play cannot influence feasible actions or payoff functions.
General Equilibrium and Efficiency. General Equilibrium Analysis is the study of the simultaneous determination of prices and quantities in all relevant.
6.896: Topics in Algorithmic Game Theory Lecture 15 Constantinos Daskalakis.
Certainty Equivalent and Stochastic Preferences June 2006 FUR 2006, Rome Pavlo Blavatskyy Wolfgang Köhler IEW, University of Zürich.
An Introduction to Markov Decision Processes Sarah Hickmott
Planning under Uncertainty
Game-Theoretic Approaches to Multi-Agent Systems Bernhard Nebel.
Reducing Costly Information Acquisition in Auctions Kate Larson, University of Waterloo Presented by David Thompson, University of British Columbia July.
Good Research Questions. A paradigm consists of – a set of fundamental theoretical assumptions that the members of the scientific community accept as.
6/2/2001 Cooperative Agent Systems: Artificial Agents Play the Ultimatum Game Steven O. Kimbrough Presented at FMEC 2001, Oslo Joint work with Fang Zhong.
Beyond selfish routing: Network Formation Games. Network Formation Games NFGs model the various ways in which selfish agents might create/use networks.
1 Computing Nash Equilibrium Presenter: Yishay Mansour.
Nov 2003Group Meeting #2 Distributed Optimization of Power Allocation in Interference Channel Raul Etkin, Abhay Parekh, and David Tse Spectrum Sharing.
Job Market Signaling (Spence model)
AWESOME: A General Multiagent Learning Algorithm that Converges in Self- Play and Learns a Best Response Against Stationary Opponents Vincent Conitzer.
Correlated-Q Learning and Cyclic Equilibria in Markov games Haoqi Zhang.
No Free Lunch (NFL) Theorem Many slides are based on a presentation of Y.C. Ho Presentation by Kristian Nolde.
Distributed Rational Decision Making Sections By Tibor Moldovan.
On Bounded Rationality and Computational Complexity Christos Papadimitriou and Mihallis Yannakakis.
Competitive Analysis of Incentive Compatible On-Line Auctions Ron Lavi and Noam Nisan SISL/IST, Cal-Tech Hebrew University.
Yang Cai Sep 15, An overview of today’s class Myerson’s Lemma (cont’d) Application of Myerson’s Lemma Revelation Principle Intro to Revenue Maximization.
Experts and Boosting Algorithms. Experts: Motivation Given a set of experts –No prior information –No consistent behavior –Goal: Predict as the best expert.
Network Formation Games. Netwok Formation Games NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models:
Exchange Rate “Fundamentals” FIN 40500: International Finance.
Constant Price of Anarchy in Network Creation Games via Public Service Advertising Presented by Sepehr Assadi Based on a paper by Erik D. Demaine and Morteza.
Minimax strategies, Nash equilibria, correlated equilibria Vincent Conitzer
Learning dynamics,genetic algorithms,and corporate takeovers Thomas H. Noe,Lynn Pi.
Allerton 2011 September 28 Mathias Humbert, Mohammad Hossein Manshaei, and Jean-Pierre Hubaux EPFL - Laboratory for Communications and Applications (LCA1)
Efficiency Loss in a Network Resource Allocation Game Paper by: Ramesh Johari, John N. Tsitsiklis [ Informs] Presented by: Gayatree Ganu.
Mechanisms for Making Crowds Truthful Andrew Mao, Sergiy Nesterko.
The Multiplicative Weights Update Method Based on Arora, Hazan & Kale (2005) Mashor Housh Oded Cats Advanced simulation methods Prof. Rubinstein.
1 Efficiency and Nash Equilibria in a Scrip System for P2P Networks Eric J. Friedman Joseph Y. Halpern Ian Kash.
Standard and Extended Form Games A Lesson in Multiagent System Based on Jose Vidal’s book Fundamentals of Multiagent Systems Henry Hexmoor, SIUC.
Moshe Tennenholtz, Aviv Zohar Learning Equilibria in Repeated Congestion Games.
CS584 - Software Multiagent Systems Lecture 12 Distributed constraint optimization II: Incomplete algorithms and recent theoretical results.
May 8, Manipulating Scrip Systems: Sybils and Collusion Ian Kash Cornell University Joint work with Eric Friedman and Joe Halpern.
Cs286r Victor Chan Scrip Systems Victor Chan. CS286 Victor Chan Agenda  Scrip Systems  Peer to Peer Systems  Scrip Systems for P2P Networks  Adobe.
Approximating Market Equilibria Kamal Jain, Microsoft Research Mohammad Mahdian, MIT Amin Saberi, Georgia Tech.
A Study of Central Auction Based Wholesale Electricity Markets S. Ceppi and N. Gatti.
Incentives for Sharing in Peer-to-Peer Networks By Philippe Golle, Kevin Leyton-Brown, Ilya Mironov, Mark Lillibridge.
Information Theory for Mobile Ad-Hoc Networks (ITMANET): The FLoWS Project Competitive Scheduling in Wireless Networks with Correlated Channel State Ozan.
Beyond selfish routing: Network Games. Network Games NGs model the various ways in which selfish agents strategically interact in using a network They.
1 What is Game Theory About? r Analysis of situations where conflict of interests is present r Goal is to prescribe how conflicts can be resolved 2 2 r.
Ch 10.6: Other Heat Conduction Problems
6.853: Topics in Algorithmic Game Theory Fall 2011 Constantinos Daskalakis Lecture 22.
The Evolution of “An Experiment on Exit in Duopoly”
Negotiating Socially Optimal Allocations of Resources U. Endriss, N. Maudet, F. Sadri, and F. Toni Presented by: Marcus Shea.
Pricing of Competing Products BI Solutions December
Network Formation Games. NFGs model distinct ways in which selfish agents might create and evaluate networks We’ll see two models: Global Connection Game.
A useful reduction (SAT -> game)
A useful reduction (SAT -> game)
Presented By Aaron Roth
Authors: Oren Rigbi Damian Goren
Optimizing Scrip Systems: Efficiency, Crashes, Hoarders, and Altruists
Presentation transcript:

Optimizing Scrip Systems: Efficiency, Crashes, Hoarders, and Altruists By Ian A. Kash, Eric J. Friedman, Joseph Y. Halpern Presentation by Avner May 12/10/08

Overview 1) Introduction 2) Review of Earlier Model 3) Basic Results about Equilibrium in System 4) Examining Distribution of Money in System 5) Optimizing System Performance Through Money Supply 6) Effects of Altruists and Hoarders on System 7) Conclusion

Intro: Brief History NGO’s issue their own currency  These are known as “scrip” Examples  In company towns where government currency was scarce  To prevent robbery in DC buses  Online Systems: Everquest, Second Life Creates market for exchange of goods/services that would be impractical or undesirable with real currency

Introduction Motivating Example: Capital Hill Baby Sitting Co- op  Effects of too much or too little money on effectiveness of system  Optimal Money Supply?  Note: fixed price Homogeneous vs. Heterogeneous Population Altruists and Hoarders  Gnutella: 1% users satisfy 50% of requests  Effect on System?

Model Very similar to previous model Agent Types:  Tuple: t = (α t, β t, γ t, δ t, ρ t ) in T (finite set)  α t : Cost of satisfying request  β t : Probability that agent can satisfy request  γ t : Utility an agent gains for having a request satisfied  δ t : Rate at which agent discounts utility  ρ t : Represents an agent’s relative request rate Agent population entirely determined by (T,f,n)  T: finite set of types  f: vector of f t, fraction of agents of type t  n: # of agents

Model (cont.) Infinite number of rounds Each round:  An agent is picked to request service (prob. proportional to ρ t )  If chosen agent has money, every other agent is able to satisfy request with certain probability (β t : independent of round). Else round ends  If at least one agent is willing and able to satisfy request, transaction occurs Requester (type t) gets γ t utils, pays $1 (normalized scrip cost)  Volunteer (type t’) pays cost of α t’ utils, receives $ n rounds per unit time (time b/w rounds is 1/n)

Model (cont.) Payoff Heterogeneity  Population of types T that differ only in α, γ, δ (utility variables)  Don’t differ in probability of being able to satisfy request (β), or in frequency of requests (ρ) The results in this paper only explicitly apply to “payoff heterogeneous” populations  Authors don’t believe it would fundamentally change results to extend it to an arbitrary heterogeneous population Recap: This paper extends the results from previous paper to payoff heterogeneous populations

Discussion: Model Any thoughts? Are the assumptions too strong? Payoff-Heterogeneity Assumption? Agent types remain constant over time? Choice of volunteer is independent over time? Threshold Strategies?

Theoretical Results Many of the results in this paper are direct analogs of the theorems we saw in the previous paper Theorem 3.1  General idea: Given a payoff-heterogeneous system, it will approach the max-entropy monetary distribution  More specifically, if there are enough agents in the system, after sufficient time, it will be arbitrarily close to the max-entropy distributions with arbitrarily high probability  Note: This is assuming a given distribution of strategies, and the final monetary distribution maximizes entropy subject to the constraints of the strategy distribution

Theoretical Results (cont) Theorem 3.2  Given a payoff-heterogeneous system, if each agent of type t plays a threshold strategy, then every agent has a best-response which is a threshold strategy Lemma 3.2  The best-response function for agents of type t is non- decreasing Note: input to best-response function is vector k of threshold values for agents of each type Intuition: If all agents increase their threshold value, an agent will earn money less often while wanting to spend money at least as often. Thus, they will increase their threshold value.

Theoretical Results (cont) Theorem 3.3  Given a payoff-heterogeneous system, their exists a non-trivial Nash equilibrium where all agents of type t play the same threshold strategy S k. Sketch of proof  By Lemma 3.2, for each type, BR t (k) is non-decreasing (k is vector)  Thus, BR(k) is non-decreasing function  So, Tarski’s fixed point theorem guarantees the existence of a greatest and least fixed point Least fixed point: trivial equilibrium (all types play S 0 ) Greatest fixed point: Non-trivial equilibrium Tarski’s Fixed Point Theorem  Given any monotone increasing function on a complete lattice L (f: L  L), their exist a greatest and a least fixed point.

Theoretical Results (cont) What is a fixed point?  Vector k s.t. BR(k) = k  This means that given k, the best-response strategy profile is precisely k  Thus, this is an equilibrium Computing the fixed point efficiently  Best-reply dynamics  Begin with strategy profile (∞, ∞, …, ∞)  Iteratively compute best-reply strategy profile BR(BR(….(BR((∞, ∞, …, ∞))  This process converges to greatest fixed point Note: The greatest fixed point might not be the one which maximizes social welfare

Theoretical Results (cont) Using Best-Reply DynamicsEXAMPLE

Identifying User Strategies In section 3 of paper, used max-entropy to get explicit formula for monetary distribution given strategies of agents (fraction of agents playing each strategy) π  M Now, we would like to try to determine the strategies of the agents given a certain monetary distribution π  M

Identifying User Strategies (cont) Given a monetary distribution M, an “explanation” of M to be a distribution π over strategies such that the maximum entropy distribution which results from π is M. (i.e., π “explains” M if π  M) Lemma 4.1  If M is a fully supported distribution of money with finite support, then there exist an infinite number of explanations of M The paper explains an algorithm for finding the “minimal explanation,” that in which fewest # of strategies are played. From minimal explanation, can further make inferences about fraction of population of each type, and strategy they are playing

Discussion At this point, the authors motivate why the minimal explanation is useful They think that there will be clusters of agents of similar types, and within a cluster agents will play similar strategies. Does this make sense? What if small differences in type make large differences in strategy? Can people be expected to compute their best response functions accurately? What if there are not clusters, but rather a uniform distribution of agents of different types?

Optimizing the Money Supply In sections 3 & 4, we assumed money supply was fixed, and considered how money is distributed among agents of different types We will now examine what happens to the distribution of money when the money supply is altered We want to determine the money supply which will optimize the performance of the system (maximize social welfare)

Optimizing the Money Supply Result: Increasing the amount of money improves performance up to the point where the system crashes. Thus, optimizing the system is simply a matter of finding the point at which the system would experience monetary crash

Optimizing Money Supply But how do we find this optimal point? Observation: We would like to find money supply which minimizes fraction M 0 of people without any money  Why?  Expected utility gain in each round (1-M 0 )(γ t -α t ) * PROB(at least one agent volunteers)  Expected total utility (assuming homogeneous populations) (1-M 0 )(γ t -α t )/(1-δ t )  Analysis is very similar for heterogeneous population

Optimizing Money Supply Theorem 5.1  Increasing average money decreases fraction of people with no money (assuming there is no monetary crash)  Thus, as long as non-trivial equilibria exist, the more money the better Corollary 5.1  There exists a finite threshold m*, such that: If average money/person < m*, then non-trivial equilibria exist If average money/person > m*, then system crashes (no non- trivial equilibrium exists)

Optimizing Money Supply But how do we find m*??? Given that you know T and f, you can use techniques from section 3 (best-reply dynamics) to see if non-trivial equilibria exists Binary search: Given T,f. Vary m  m: average money/person But how do we know T,f?  You can use results from section 4 to solve for T,f. Reasonable?

Altruists and Hoarders Altruists: Provide services at no cost  Presumably gain some utility by satisfying a request Hoarders: Always volunteer to satisfy requests, but never make requests  Can be modeled as playing S ∞ Intuition  Having altruists is like increasing money supply  Having hoarders is like decreasing money supply

Altruists and Hoarders Theorem 6.1  Increasing fraction of agents which are altruists increases social welfare up to the point where the system crashes Theorem 6.2  Increasing fraction of agents which are hoarders decreases social welfare System designer can see actions of these agents and modify money supply accordingly to prevent sub-optimal outcomes Additional property: Hoarders stabilize system by helping to prevent monetary crash  Theoretically, they are always willing to work for $1, regardless of how depreciated the dollar is

Discussion Floating prices? Auctions with scrips? Other types of irrational behavior? Would these theoretical results hold-up well in real-world scrip systems? What next? Any other thoughts