Mechanism design for computationally limited agents Tuomas Sandholm Computer Science Department Carnegie Mellon University.

Slides:

Advertisements

Similar presentations

Combinatorial Auction

Advertisements

6.896: Topics in Algorithmic Game Theory Lecture 20 Yang Cai.

Approximating optimal combinatorial auctions for complements using restricted welfare maximization Pingzhong Tang and Tuomas Sandholm Computer Science.

CPS Bayesian games and their use in auctions Vincent Conitzer

This Segment: Computational game theory Lecture 1: Game representations, solution concepts and complexity Tuomas Sandholm Computer Science Department Carnegie.

Combinatorial auctions Vincent Conitzer v( ) = $500 v( ) = $700.

Multi-item auctions with identical items limited supply: M items (M smaller than number of bidders, n). Three possible bidder types: –Unit-demand bidders.

Game Theory in Wireless and Communication Networks: Theory, Models, and Applications Lecture 6 Auction Theory Zhu Han, Dusit Niyato, Walid Saad, Tamer.

Auction Theory Class 3 – optimal auctions 1. Optimal auctions Usually the term optimal auctions stands for revenue maximization. What is maximal revenue?

Complexity of manipulating elections with few candidates Vincent Conitzer and Tuomas Sandholm Carnegie Mellon University Computer Science Department.

1 Regret-based Incremental Partial Revelation Mechanism Design Nathanaël Hyafil, Craig Boutilier AAAI 2006 Department of Computer Science University of.

Seminar In Game Theory Algorithms, TAU, Agenda  Introduction  Computational Complexity  Incentive Compatible Mechanism  LP Relaxation & Walrasian.

Algorithmic Applications of Game Theory Lecture 8 1.

Reducing Costly Information Acquisition in Auctions Kate Larson, University of Waterloo Presented by David Thompson, University of British Columbia July.

Lecture 1 - Introduction 1.  Introduction to Game Theory  Basic Game Theory Examples  Strategic Games  More Game Theory Examples  Equilibrium  Mixed.

Distributed Multiagent Resource Allocation In Diminishing Marginal Return Domains Yoram Bachrach(Hebew University) Jeffrey S. Rosenschein (Hebrew University)

Limitations of VCG-Based Mechanisms Shahar Dobzinski Joint work with Noam Nisan.

Agent Technology for e-Commerce Chapter 10: Mechanism Design Maria Fasli

An Algorithm for Automatically Designing Deterministic Mechanisms without Payments Vincent Conitzer and Tuomas Sandholm Computer Science Department Carnegie.

Computational Criticisms of the Revelation Principle Vincent Conitzer, Tuomas Sandholm AMEC V.

Workshop on Auction Theory and Practice Carnegie Mellon University 1 Strategic Information Acquisition in Auctions Kate Larson Carnegie Mellon University.

Sequences of Take-It-or-Leave-it Offers: Near-Optimal Auctions Without Full Valuation Revelation Tuomas Sandholm and Andrew Gilpin Carnegie Mellon University.

Auctioning one item PART 3 Tuomas Sandholm Computer Science Department Carnegie Mellon University.

Distributed Rational Decision Making Sections By Tibor Moldovan.

Costly valuation computation/information acquisition in auctions: Strategy, counterspeculation, and deliberation equilibrium Tuomas Sandholm Computer Science.

Complexity of Mechanism Design Vincent Conitzer and Tuomas Sandholm Carnegie Mellon University Computer Science Department.

Automated Mechanism Design: Complexity Results Stemming From the Single-Agent Setting Vincent Conitzer and Tuomas Sandholm Computer Science Department.

Reshef Meir School of Computer Science and Engineering Hebrew University, Jerusalem, Israel Joint work with Maria Polukarov, Jeffery S. Rosenschein and.

Competitive Analysis of Incentive Compatible On-Line Auctions Ron Lavi and Noam Nisan SISL/IST, Cal-Tech Hebrew University.

Multi-item auctions & exchanges (multiple distinguishable items for sale) Tuomas Sandholm Carnegie Mellon University.

This Week’s Topics  Review Class Concepts -Sequential Games -Simultaneous Games -Bertrand Trap -Auctions  Review Homework  Practice Problems.

Arbitrage in Combinatorial Exchanges Andrew Gilpin and Tuomas Sandholm Carnegie Mellon University Computer Science Department.

Collusion and the use of false names Vincent Conitzer

Introduction complexity has been suggested as a means of precluding strategic behavior. Previous studies have shown that some voting protocols are hard.

Auction Theory Class 2 – Revenue equivalence 1. This class: revenue Revenue in auctions – Connection to order statistics The revelation principle The.

Yang Cai Sep 8, An overview of the class Broad View: Mechanism Design and Auctions First Price Auction Second Price/Vickrey Auction Case Study:

CPS 173 Mechanism design Vincent Conitzer

Multi-Unit Auctions with Budget Limits Shahar Dobzinski, Ron Lavi, and Noam Nisan.

Sequences of Take-It-or-Leave-it Offers: Near-Optimal Auctions Without Full Valuation Revelation Tuomas Sandholm and Andrew Gilpin Carnegie Mellon University.

Combinatorial Auctions By: Shai Roitman

6.853: Topics in Algorithmic Game Theory Fall 2011 Constantinos Daskalakis Lecture 21.

Mechanism Design CS 886 Electronic Market Design University of Waterloo.

Preference elicitation Communicational Burden by Nisan, Segal, Lahaie and Parkes October 27th, 2004 Jella Pfeiffer.

Auction Theory תכנון מכרזים ומכירות פומביות Topic 7 – VCG mechanisms 1.

Automated Design of Multistage Mechanisms Tuomas Sandholm (Carnegie Mellon) Vincent Conitzer (Carnegie Mellon) Craig Boutilier (Toronto)

Mechanism design for computationally limited agents (previous slide deck discussed the case where valuation determination was complex) Tuomas Sandholm.

Game-theoretic analysis tools Tuomas Sandholm Professor Computer Science Department Carnegie Mellon University.

Slide 1 of 16 Noam Nisan The Power and Limitations of Item Price Combinatorial Auctions Noam Nisan Hebrew University, Jerusalem.

Mechanism design. Goal of mechanism design Implementing a social choice function f(u 1, …, u |A| ) using a game Center = “auctioneer” does not know the.

Automated Mechanism Design Tuomas Sandholm Presented by Dimitri Mostinski November 17, 2004.

1 Mechanism Design for Computationally Bounded Agents Kate Larson Carnegie Mellon University Thesis Proposal April 8, 2002.

Mechanism Design II CS 886:Electronic Market Design Sept 27, 2004.

Mechanism design (strategic “voting”) Tuomas Sandholm Professor Computer Science Department Carnegie Mellon University.

6.853: Topics in Algorithmic Game Theory Fall 2011 Constantinos Daskalakis Lecture 22.

Definition and Complexity of Some Basic Metareasoning Problems Vincent Conitzer and Tuomas Sandholm Computer Science Department Carnegie Mellon University.

Some overarching themes on electronic marketplaces Tuomas Sandholm Computer Science Department Carnegie Mellon University.

Mechanism design for computationally limited agents (previous slide deck discussed the case where valuation determination was complex) Tuomas Sandholm.

False-name Bids “The effect of false-name bids in combinatorial

Tuomas Sandholm Computer Science Department Carnegie Mellon University

Mechanism design for computationally limited agents (last lecture discussed the case where valuation determination was complex) Tuomas Sandholm Computer.

CPS Mechanism design Michael Albert and Vincent Conitzer

Failures of the VCG Mechanism in Combinatorial Auctions and Exchanges

Tuomas Sandholm Computer Science Department Carnegie Mellon University

Tuomas Sandholm Computer Science Department Carnegie Mellon University

Tuomas Sandholm Computer Science Department Carnegie Mellon University

Vincent Conitzer Mechanism design Vincent Conitzer

Vincent Conitzer CPS 173 Mechanism design Vincent Conitzer

Preference elicitation/ iterative mechanisms

CPS Preference elicitation/ iterative mechanisms

Information, Incentives, and Mechanism Design

Presentation transcript:

Mechanism design for computationally limited agents Tuomas Sandholm Computer Science Department Carnegie Mellon University

Outline Part I: Limited deliberation to determine valuations: A study of common auction mechanisms Part II: Limited deliberation to determine valuations: Designing new mechanisms Part III: Other ideas for mechanism design for computationally limited agents

Part I: Limited deliberation to determine valuations: A study of common auction mechanisms

Bidders may need to compute their valuations for (bundles of) goods In many applications, e.g. –Vehicle routing problem in transportation exchanges –Manufacturing scheduling problem in procurement Value of a bundle of items (tasks, resources, etc) = value of solution with those items - value of solution without them Information gathering fits the model as well

Software agents for auctions Software agents exist that bid on behalf of user We want to enable agents to not only bid in auctions, but also determine the valuations of the items Agents use computational resources to compute valuations Valuation determination can involve computing on NP- complete problems (scheduling, vehicle routing, etc.) Optimal solutions may not be possible to determine due to limitations in agents’ computational abilities (i.e. agents have bounded rationality)

Recall A bidder in an auction can pay cost c to find out his own valuation => Vickrey auction ceases to have a dominant strategy [Sandholm ICMAS-96, International J. of Electronic Commerce 2000] –Same model studied in “information acquisition in auctions” [Compte and Jehiel 01, Rezende 02, Rasmussen 03]

Bounded rationality Work in economics has largely focused on descriptive models Some models based on limited memory in repeated games [Papadimitriou, Rubinstein, …] Some AI work has focused on models that prescribe how computationally limited agents should behave [Horvitz; Russell & Wefald; Zilberstein & Russell; Sandholm & Lesser; Hansen & Zilberstein, …] –Simplifying assumptions Myopic deliberation control Asymptotic notions of bounded optimality Conditioning on performance but not path of an algorithm Simplifications can work well in single agent settings, but any deviation from full normativity can be catastrophic in multiagent settings Incorporate deliberation (computing) actions into agents’ strategies => deliberation equilibrium

Normative control of deliberation In our setting agents have –Limited computing, or –Costly computing Agents must decide how to use their limited resources in an efficient manner Agents have anytime algorithms and use performance profiles to control their deliberation

Anytime algorithms can be used to approximate valuations Solution improves over time Can usually “solve” much larger problem instances than complete algorithms can Allow trading off computing time against quality –Decision is not just which bundles to evaluate, but how carefully Examples –Iterative refinement algorithms: Local search, simulated annealing –Search algorithms: Depth first search, branch and bound

Performance profiles of anytime algorithms Statistical performance profiles characterize the quality of an algorithm’s output as a function of computing time There are different ways of representing performance profiles –Earlier methods were not normative: they do not capture all the possible ways an agent can control its deliberation Can be satisfactory in single agent settings, but catastrophic in multiagent systems

Performance profiles Computing time Solution quality Deterministic performance profile Solution quality Variance introduced by different problem instances Computing time [Horvitz 87, 89, Dean & Boddy 89] Optimum

Ignores conditioning on the path Table-based representation of uncertainty in performance profiles Computing time Solution quality [Zilberstein & Russell IJCAI-91, AIJ-96] Conditioning on solution quality so far [Hansen & Zilberstein AAAI-96]

Performance profile tree [Larson & Sandholm TARK-01] Normative –Allows conditioning on path of solution quality –Allows conditioning on path of other solution features –Allows conditioning on problem instance features (different trees to be used for different classes) Constructed from statistics on earlier runs A P(B|A) B 5 C P(C|A) Solution quality

Performance profile tree… Can be augmented to model –Randomized algorithms –Agent not knowing which algorithms others are using –Agent having uncertainty about others’ problem instances Agent can emulate different scenarios of others p(0) p(1) Random node Value node Our results hold in this augmented setting

Roles of computing Computing by an agent –Improves the solution to the agent’s own problem(s) –Reduces uncertainty as to what future computing steps will yield –Improves the agent’s knowledge about others’ valuations –Improves the agent’s knowledge about what problems others may have computed on and what solutions others may have obtained Our results apply to different settings –Computing increases the valuation (reduces cost) –Computing refines the valuation estimate

Strategic computing [Larson & Sandholm] Good estimates of the other bidders’ valuations can allow an agent to tailor its bids to achieve higher utility Definition. Strong strategic computing: Agent uses some of its deliberation resources to compute on others’ problems Definition. Weak strategic computing: Agent uses information from others’ performance profiles How an agent should allocate its computation (based on results it has obtained so far) can depend on how others allocate their computation –Deliberation equilibrium

Theorems on strategic computing yes no Generalized Vickrey On which pair to allocate next computation step ? Multiple items for sale noEnglish (1 st price ascending) yes no Vickrey (2 nd price sealed bid) yes Dutch (1 st price descending) yes First price sealed-bidSingle item for sale Costly computing Limited computing Strategic computing ?Counter- speculation by rational agents ? Auction mechanism If performance profiles are deterministic, only weak strategic computing can occur  New normative deliberation control method uncovered a new phenomenon

Costly computing in English auctions Dominant strategy mechanism for rational bidders Thrm: If at most one performance profile is stochastic, no strong strategic computing occurs in equilibrium Thrm: If at least two performance profiles are stochastic, strong strategic computing can occur in equilibrium –Despite the fact that agents learn about others’ valuations by waiting and observing others’ bids –Passing & restarting computation during the auction is allowed –Proof. Consider an auction with two bidders: Agent 1 can compute for free Agent 2 incurs cost 1 for each computing step

Performance profiles of the proof Agent 1’s problemAgent 2’s problem p(high 1 ) 1-p(high 1 ) p(high 2 ) 1-p(high 2 ) high 1 low 1 high 2 low 2 low 2 < low 1 < high 2 < high Since computing one step on 2’s problem does not yield any information, we can treat computing for two steps on 2’s problem atomically

Proof continued… Agent 1 has a dominant strategy: –Compute only on own problem & increment bid whenever Agent 1 does not have the highest bid and Highest bid is lower than agent 1’s valuation Agent 2’s strategy: –CASE 1: bid 1 > low 1 Agent 2 knows that agent 1 has valuation high 1 Agent 2 cannot win, and thus has no incentive to compute or bid –CASE 2: bid 1 < low 2 Agent 2 continues to increment its own bid No need to compute since it knows that its valuation is at least low 2 –CASE 3: low 1  bid 1  low 2 Agent 2’s strategy depends on the performance profiles

Decision problem of agent 2 in CASE 3 Withdraw Bid Compute on 2’s problem Compute on 1’s problem high 1 low 1 high 1 low 1 high 2 Compute on 2’s low 2 Decision node for agent 2 Chance node for agent 1’s performance profile Chance node for agent 2’s performance profile Bid 0 0 high 1 low 1 high 2 -low 1 -2 low 2 -low 1 -2 high 2 -low Withdraw high 2 low 2 Withdraw -2 high 2 -low 1 -2 high 2 low 2 Compute on 2’s high 2 low 2 Bid high 2 -high 1 -3 low 2 -high 1 -3 high 2 -low Withdraw Bid high 2 low 2 Withdraw Compute on 1’s high 1 low 1 -2 high 2 -low 1 -3 low 2 -low 1 -3 Bid Compute on 1’s high 1 low high 2 -low 1 -3 Agent 2’s utility low 2 < low 1 < high 2 < high 1

Under what conditions does strong strategic computing occur? Probability that agent 1 will have its high valuation Probability that agent 2 will have its high valuation

Other variants we have solved Agents cannot pass on computing during the auction & continue computing later during the auction –Can make a difference in English auctions with costly computing, but strong strategic computing is still possible in equilibrium Agents can/cannot compute after the auction 2-agent bargaining (again with performance profile trees) –Larson, K. and Sandholm, T Bargaining with Limited Computation: Deliberation Equilibrium. Artificial Intelligence, 132(2), Bargaining with Limited Computation: Deliberation Equilibrium. –Larson, K. and Sandholm, T An Alternating Offers Bargaining Model for Computationally Limited Agents. In Proceedings of the First International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS), Bologna, Italy, July.An Alternating Offers Bargaining Model for Computationally Limited Agents.

Conclusions Software agents participating in auctions may need to compute valuations under computational limitations –This adds other possibilities to the agents’ strategies Modeled computing normatively as part of each agent’s strategy –Deliberation equilibrium –Showed under which auction mechanisms and which models of bounded rationality strategic computing can/cannot occur Deliberation resources may be used strategically –Strong strategic vs. weak strategic computing –Deep interaction between incentives and computing Dominant strategy mechanisms can become strategy-prone Even English auction with costly computing

The future ? In many B2B settings, automated bidders can compute valuations dynamically faster than humans Some future research directions –Using our deliberation control method in systems Manufacturing planning, networks, … –New (market) mechanisms Game-theoretically engineered to work well under (different) models of bounded rationality Our results show that even the most common mechanism design principles (e.g., revelation principle) cease to hold Our normative deliberation control method = basis for new design principles ?

Part II: Limited deliberation to determine valuations: Designing new mechanisms [Larson & Sandholm AAMAS-05]

Mechanism desiderata Preference formation-independent –Mechanism should not be involved in agents’ preference formation process (otherwise revelation principle applies trivially) Deliberation-proof –In equilibrium, no agent should have incentive to strategically deliberate Non-misleading –In equilibrium, no agent should follow a strategy that causes others to believe that its true preferences are impossible Proof sketch. Given any outcome function it is always possible to construct an example where agents are best off knowing the valuation of another agent

Indirect/multi-step mechanisms provide information to agents –Example: Ascending auction Bidders Information At price p there are k bidders remaining in the auction

Is it possible to satisfy the three desiderata via a multi-stage mechanism? Thm: There does not exist any strategy-dependent, preference-formation independent mechanism that is both –deliberation proof, and –non-misleading Proof sketch. Look at information sets in the game induced by the indirect mechanism –Case 1: Game does not provide enough information to stop strategic-deliberation (ascending auction) –Case 2: Game does provide enough information BUT agents’ play a signaling game Pooling equilibria (misleading)

Future work Overcoming the impossibility result by relaxing the properties –Encourage strategic deliberation Incentives for agents to share information? –Relax preference-formation independent property Mechanism guides deliberation

Part III: Other ideas for mechanism design for computationally limited agents

Recall from last lecture With computationally limited agents, a non- truthful mechanism can be better than a truth-promoting one –[Conitzer & Sandholm: “Computational Criticisms of the Revelation Principle”, 2003]

2 nd -chance mechanism [in paper “Computationally Feasible VCG Mechanisms” by Nisan & Ronen, EC-00]Computationally Feasible VCG Mechanisms (Interesting unrelated fact: Any VCG mechanism that is maximal in range is incentive compatible) Observation: only way an agent can improve its utility in a VCG mechanism where an approximation algorithm is used is by helping the algorithm find a higher-welfare allocation Second-chance mechanism: let each agent i submit a valuation fn v i and an appeal fn l i : V->V. Mechanism (using alg k) computes k(v), k(l i (v)), k(l 2 (v)), … and picks the among those the allocation that maximizes welfare. Pricing based on unappealed v.

Work based on the assumption that agents can only solve problems that are worst-case polynomial time Bartholdi, Tovey, and Trick The computational difficulty of manipulating an election, Social Choice and Welfare, Bartholdi and Orlin. Single Transferable Vote Resists Strategic Voting, Social Choice and Welfare, Nisan and Ronen Computationally Feasible VCG Mechanisms, EC-00. O’Connell and Stearns Polynomial Time Mechanisms for Collective Decision Making, SUNYA-CS-00-1 Conitzer, V. and Sandholm, T Complexity of Manipulating Elections with Few Candidates. National Conference on Artificial Intelligence (AAAI).Complexity of Manipulating Elections with Few Candidates. Conitzer, V. and Sandholm, T Universal Voting Protocol Tweaks to Make Manipulation Hard. International Joint Conference on Artificial Intelligence (IJCAI).Universal Voting Protocol Tweaks to Make Manipulation Hard. Conitzer, V. and Sandholm, T How Many Candidates Are Needed to Make Elections Hard to Manipulate? Conference on Theoretical Aspects of Rationality and Knowledge (TARK).How Many Candidates Are Needed to Make Elections Hard to Manipulate? …