1 Agents – Background Vicki H. Allan. 2 An Agent in its Environment AGENT ENVIRONMENT Sensor Input action output.

Slides:



Advertisements
Similar presentations
Concepts of Game Theory I
Advertisements

(Single-item) auctions Vincent Conitzer v() = $5 v() = $3.
Chapter 17: Making Complex Decisions April 1, 2004.
CPS Bayesian games and their use in auctions Vincent Conitzer
Nash’s Theorem Theorem (Nash, 1951): Every finite game (finite number of players, finite number of pure strategies) has at least one mixed-strategy Nash.
Economics 100B u Instructor: Ted Bergstrom u T.A. Oddgeir Ottesen u Syllabus online at (Class pages) Or at
Computer-aided mechanism design Ye Fang, Swarat Chaudhuri, Moshe Vardi 1.
1 Game Theory. 2 Definitions Game theory -- formal way to analyze interactions among a group of rational agents behaving strategically Agents – players.
Auctions. Strategic Situation You are bidding for an object in an auction. The object has a value to you of $20. How much should you bid? Depends on auction.
Chapter 25: Auctions and Auction Markets 1 Auctions and Auction Markets.
An Approximate Truthful Mechanism for Combinatorial Auctions An Internet Mathematics paper by Aaron Archer, Christos Papadimitriou, Kunal Talwar and Éva.
Michael R. Baye, Managerial Economics and Business Strategy, 3e. ©The McGraw-Hill Companies, Inc., 1999 Managerial Economics & Business Strategy Chapter.
Multi-item auctions with identical items limited supply: M items (M smaller than number of bidders, n). Three possible bidder types: –Unit-demand bidders.
SIF8072 Distributed Artificial Intelligence and Intelligent Agents
Auction. Types of Auction  Open outcry English (ascending) auction Dutch (descending) auction  Sealed bid First-price Second-price (Vickrey)  Equivalence.
Private-value auctions: theory and experimental evidence (Part I) Nikos Nikiforakis The University of Melbourne.
Game theory (Sections )
6-1 LECTURE 6: MULTIAGENT INTERACTIONS An Introduction to MultiAgent Systems
Chapter 6 Game Theory © 2006 Thomson Learning/South-Western.
Game Theory 1. Game Theory and Mechanism Design Game theory to analyze strategic behavior: Given a strategic environment (a “game”), and an assumption.
Game theory (Sections ). Game theory Game theory deals with systems of interacting agents where the outcome for an agent depends on the actions.
What is game theory… Game theory studies settings where multiple parties (agents) each have –different preferences (utility functions), –different actions.
Game Theory The study of rational behavior among interdependent agents Agents have a common interest to make the pie as large as possible, but Agents have.
An Introduction to Game Theory Part I: Strategic Games
GAME THEORY.
SARANI SAHABHATTACHARYA, HSS ARNAB BHATTACHARYA, CSE 07 JAN, 2009 Game Theory and its Applications.
Chapter 6 © 2006 Thomson Learning/South-Western Game Theory.
1 Business System Analysis & Decision Making - Lecture 5 Zhangxi Lin ISQS 5340 July 2006.
Ecs289m Spring, 2008 Non-cooperative Games S. Felix Wu Computer Science Department University of California, Davis
Oligopoly The challenge of analyzing interdependent strategic decisions.
Lecture 4 on Individual Optimization Risk Aversion
Game-Theoretic Approaches to Multi-Agent Systems Bernhard Nebel.
Lecture 1 - Introduction 1.  Introduction to Game Theory  Basic Game Theory Examples  Strategic Games  More Game Theory Examples  Equilibrium  Mixed.
Review: Game theory Dominant strategy Nash equilibrium
Agent Technology for e-Commerce Chapter 10: Mechanism Design Maria Fasli
THE PROBLEM OF MULTIPLE EQUILIBRIA NE is not enough by itself and must be supplemented by some other consideration that selects the one equilibrium with.
Introduction to Game Theory and Behavior Networked Life CIS 112 Spring 2009 Prof. Michael Kearns.
AN INTRODUCTION TO PORTFOLIO MANAGEMENT
Strategic Game Theory for Managers. Explain What is the Game Theory Explain the Basic Elements of a Game Explain the Importance of Game Theory Explain.
Risk attitudes, normal-form games, dominance, iterated dominance Vincent Conitzer
Game Theory, Strategic Decision Making, and Behavioral Economics 11 Game Theory, Strategic Decision Making, and Behavioral Economics All men can see the.
CPS 173 Mechanism design Vincent Conitzer
Introduction 1 TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: AAA A.
Mechanism Design CS 886 Electronic Market Design University of Waterloo.
Standard and Extended Form Games A Lesson in Multiagent System Based on Jose Vidal’s book Fundamentals of Multiagent Systems Henry Hexmoor, SIUC.
Auction Theory תכנון מכרזים ומכירות פומביות Topic 7 – VCG mechanisms 1.
Games People Play. 12: Auctions Games People Play. Auctions In this section we shall learn How different types of auctions allocate goods How to buy.
McGraw-Hill/Irwin Copyright  2008 by The McGraw-Hill Companies, Inc. All rights reserved. GAME THEORY, STRATEGIC DECISION MAKING, AND BEHAVIORAL ECONOMICS.
THE “CLASSIC” 2 x 2 SIMULTANEOUS CHOICE GAMES Topic #4.
Auctions Shyam Sunder, Yale University Kozminski Academy Warsaw, June 22, 2013.
3.1.4 Types of Games. Strategic Behavior in Business and Econ Outline 3.1. What is a Game ? The elements of a Game The Rules of the Game:
Lecture 5 Introduction to Game theory. What is game theory? Game theory studies situations where players have strategic interactions; the payoff that.
1 What is Game Theory About? r Analysis of situations where conflict of interests is present r Goal is to prescribe how conflicts can be resolved 2 2 r.
Intermediate Microeconomics Game Theory and Oligopoly.
Mohsen Afsharchi Multiagent Interaction. What are Multiagent Systems?
Auctions serve the dual purpose of eliciting preferences and allocating resources between competing uses. A less fundamental but more practical reason.
Game theory (Sections )
CMSC 100 Multi-Agent Game Day Professor Marie desJardins Tuesday, November 20, 2012 Tue 11/20/12 1 Multi-Agent Game Day.
Auctions serve the dual purpose of eliciting preferences and allocating resources between competing uses. A less fundamental but more practical reason.
Strategic Game Theory for Managers. Explain What is the Game Theory Explain the Basic Elements of a Game Explain the Importance of Game Theory Explain.
Auctions Great Lakes Institute of Management, Chennai March 4, 2012.
Incomplete Information and Bayes-Nash Equilibrium.
Oligopoly and Game Theory Topic Students should be able to: Use simple game theory to illustrate the interdependence that exists in oligopolistic.
By: Donté Howell Game Theory in Sports. What is Game Theory? It is a tool used to analyze strategic behavior and trying to maximize his/her payoff of.
Games, Strategies, and Decision Making By Joseph Harrington, Jr. First Edition Chapter 4: Stable Play: Nash Equilibria in Discrete Games with Two or Three.
Advanced Subjects in GT Prepared by Rina Talisman Introduction Revenue Equivalence The Optimal Auction (Myerson 1981) Auctions.
Advanced Subjects in GT Outline of the tutorials Static Games of Complete Information Introduction to games Normal-form (strategic-form) representation.
Management support systems II
Information, Incentives, and Mechanism Design
Presentation transcript:

1 Agents – Background Vicki H. Allan

2 An Agent in its Environment AGENT ENVIRONMENT Sensor Input action output

3 “Agent enjoys the following properties: autonomy - agents operate without the direct intervention of humans or others, and have some kind of control over their actions and internal state; social ability - agents interact with other agents (and possibly humans) via some kind of agent-communication language; reactivity: agents perceive their environment and respond in a timely fashion to changes that occur in it; pro-activeness: agents do not simply act in response to their environment, they are able to exhibit goal-directed behaviour by taking initiative.” (Wooldridge and Jennings, 1995)

4 Agents Need for computer systems to act in our best interests “The issues addressed in Multiagent systems have profound implications for our understanding of ourselves.” Wooldridge Example – how do you make a decision about buying a car

5 Agent Environments not have complete control (influence only) (Ex: elevators in Old Main) deterministic vs. non-deterministic effect accessible (get complete state info) vs inaccessible environment (Ex. stock market) episodic (single episode, independent of others) vs. non-episodic (history sensitive) (Ex. grades in class)

6 Exercise There are three blue hats and two brown hats. The men are lined up such that one man can see the backs of the other two, the middle man can see the back of the front man, and the front man can’t see anybody. One of the five hats is placed on each man's head. The remaining two hats are hidden away. The men are asked what color of hat they are wearing. Time passes. Front man correctly guesses the color of his hat. What color was it, and how did he guess correctly?

7 Concept Everyone else is as smart as you

8 Game of Chicken Consider another type of encounter — the game of chicken: (Think of James Dean in Rebel without a Cause: swerving = coop, driving straight = defect.) Difference to prisoner’s dilemma: Mutual defection is most feared outcome.

9 Question: How do we communicate our desires to an agent? May be muddy: You want to graduate with a 4.0, have a job making $100K a year, have opportunities for growth, and have quality of life. If you can’t have it all, what is most valued?

10 Answer: Utilities Assume we have just two agents: Ag = {i, j} Agents are assumed to be self-interested: they have preferences over how the environment is Assume  {  1  2  …}is the set of “outcomes” that agents have preferences over We capture preferences by utility functions which map an outcome to a rational number. Utility functions lead to preference orderings over outcomes.

11 What is Utility? Utility is not money (but it is a useful analogy) Typical relationship between utility & money:

12 Dominant Strategies Recall that –Agents’ utilities depend on what strategies other agents are playing –Agents’ are expected utility maximizers A dominant strategy is a best-response for player i –They do not always exist –Inferior strategies are called dominated

13 Dominant Strategy Equilibrium A dominant strategy equilibrium is a strategy profile where the strategy for each player is dominant (so neither wants to change) Known as “DUH” strategy. Nice: Agents do not need to counter speculate (reciprocally reason about what others will do)!

14 Prisoners’ dilemma -10, -100, , 0-1, -1 Confess Don’t Confess Ned Kelly Two people are arrested for a crime. If neither suspect confesses, both get light sentence. If both confess, then they get sent to jail. If one confesses and the other does not, then the confessor gets no jail time and the other gets a heavy sentence.

15 Prisoners’ dilemma -10, -100, -30 Confess Don’t Confess Ned Kelly Kelly will confess. Same holds for Ned.

16 Prisoners’ dilemma -10, -10 Confess Don’t Confess Ned Kelly So the only outcome that involves each player choosing their dominant strategies is where they both confess. Solve by iterative elimination of dominant strategies

17 Example: Prisoner’s Dilemma Two people are arrested for a crime. If neither suspect confesses, both get light sentence. If both confess, then they get sent to jail. If one confesses and the other does not, then the confessor gets no jail time and the other gets a heavy sentence. (Actual numbers vary in different versions of the problem, but relative values are the same) -10,-10 0, ,0-1,-1 Confess Don’t Confess Dom. Str. Eq not pareto optimal Optimal Outcome Don’t Confess Pareto optimal

18 Example: Bach or Stravinsky A couple likes going to concerts together. One loves Bach but not Stravinsky. The other loves Stravinsky but not Bach. However, they prefer being together than being apart. 2,10,0 1,2 B BS S No dominant strategy equilibrium

19 Example: Paying for Bus fare Getting back to the Gatwick airport. Steve had planned to pay for all of us, but left to find son. Came for funds. Do I pay, or say my husband will? 0,0-25, ,-1000,0 Pay for 2 Pay for 4 Not Pay No dominant strategy equilibrium

20 Research Questions Can we apply game theory to solve seemingly unrelated problems? Ex: traffic control Ex: sharing Operating System resources

21 Exercise You participate in a game show in which prizes of varying values occur at equal frequency. Two of you win a prize. There are 10 types of prizes of varying values. Assume, a prize of type 10 is the best and a prize of type 1 is the worst. Without knowing the other’s prize, both asked if they want to exchange the prizes they were given. If both want to exchange, the two exchange prizes. What is your strategy?

22 Employee Monitoring Employees can work hard or shirk Salary: $100K unless caught shirking Cost of effort: $50K Managers can monitor or not Value of employee output: $200K Profit if employee doesn’t work: $0 Cost of monitoring: $10K

23 What is your strategy? Work hard? Shirk?

24 No equilibrium in pure strategies What do the players do? Employee Monitoring Manager MonitorNo Monitor Employee Work 50, 9050, 100 Shirk 0, , -100

25 Mixed Strategies Randomize – surprise the rival Mixed Strategy: Specifies that an actual move be chosen randomly from the set of pure strategies with some specific probabilities.

26 Research question What features does a good solution have?

27 Pareto Efficient Solutions: f represents possible solutions for two players U1U1 U2U2 f 1 f 2 f 4 f 3

28 Pareto Efficient Solutions U1U1 U2U2 f 1 f 2 f 4 f 3 f 2 Pareto dominates f 3

29 Auctions Dutch English First Price Sealed Bid Second Price Sealed Bid

30 Auction Parameters Goods can have –private value (Aunt Bessie’s Broach) –public/common value (oil field to oil companies) –correlated value (partially private, partially values of others): consider the resale value Winner pays –first price (highest bidder wins, pays highest price) –second price (to person who bids highest, but pay value of second price) Bids may be –open cry –sealed bid Bidding may be –one shot –ascending –descending

31 Dutch (Aalsmeer) flower auction

32

33 Research Questions How can we design an agent to function in the electronic marketplace? Give the new possibilities, made possible via an electronic auction, what “mechanisms” can be designed to elicit desirable properties?

34 How do you counter speculate? Consider a Dutch auction While you don’t know what the other’s valuation is, you know a range and guess at a distribution (uniform, normal, etc.) For example, suppose there is a single other bidder whose valuation lies in the range [a,b] with a uniform distribution. If your valuation of the item is v, what price should you bid? Thinking about this logically, if you bid above your valuation, you lose. If you bid lower than your valuation, you increase profit. If you bid very low, you lower the probability that you will ever get it.

35 What is expected profit (Dutch auction)? Try to maximize your expected profit. Expected profit (as a function of a specific bid) is the probability that you will win the bid times the amount of your profit at that price. Let p be the price you bid for an item. v be your valuation. [a,b] be the uniform range of others bid. The probability that you win the bid at this price is the fraction of the time that the other person bids lower than p. (p-a)/(b-a) The profit you make at p is v-p Expected profit as a function of p is the function = (v-p)*(p-a)/(b-a) + 0*(1- (p-a)/(b-a))

36 Finding maximum profit is a simple calculus problem Expected profit as a function of p is the function (v-p)*(p-a)/(b-a) Take the derivative with respect to p and set that value to zero. Where the slope is zero, is the maximum value. (as second derivative is negative) f(p) = 1/(b-a) * (vp -va -p 2 +pa) f’(p) = 1/(b-a) (v-2p+a) = 0 p=(a+v)/2 (half the distance between your bid and the min range value)

37 Ultimatum Bargaining with Incomplete Information

38 Ultimatum Bargaining with Incomplete Information Player 1 begins the game by drawing a chip from the bag. Inside the bag are 30 chips ranging in value from $1.00 to $ Both must agree to split the amount. Player 2 does not see the chip. Player 1 then makes an offer to Player 2. The offer can be any amount in the range from $0.00 up to the value of the chip. Player 2 can either accept or reject the offer. If accepted,Player 1 pays Player 2 the amount of the offer and keeps the rest. If rejected, both players get nothing.

39 Experimental Results Questions: 1)How much should Player 1 offer Player 2? 2)Does the amount of the offer depend on the size of the chip? 2) What should Player 2 do? Should Player 2 accept all offers or only offers above a specified amount? Explain.

40 Coalition Formation Tasks need the skills of several workers Tasks have various worth Agents have various costs How do you decide who works together? What do you pay each one?

41 Research Questions Computing the optimal coalition is NP- hard. How do you form good coalitions in an efficient manner? How do you form coalitions when the information is incomplete? How do you form coalitions in a dynamic environment – with agents entering/leaving?

42 Voting Mechanisms How do we make decisions that respond to various individuals preference funtions? Ex: selecting new faculty based on various different evaluations Want to decide what to serve for refreshments the last day of class. How do we decide?

43 Borda Paradox – remove loser, winner changes (notice, c is always ahead of removed item) a > b > c >d b > c > d >a c > d > a > b a > b > c > d b > c > d> a c >d > a >b a <b <c < d a=18, b=19, c=20, d=13 n a > b > c n b > c >a n c > a > b n a > b > c n b > c > a n c > a >b n a <b <c a=15,b=14, c=13 When loser is removed, next loser becomes winner!

44 Research Question Do individuals always act the way the theory says they should? If not, why not? Is the theory wrong?

45 Allais Paradox In 1953, Maurice Allais published a paper regarding a survey he had conducted in 1952, with a hypothetical game. Subjects "with good training in and knowledge of the theory of probability, so that they could be considered to behave rationally", routinely violated the expected utility axioms. The game itself and its results have now become famous as the "Allais Paradox".

46 The most famous structure is the following: Subjects are asked to choose between the following 2 gambles, i.e. which one they would like to participate in if they could: Gamble A: A 100% chance of receiving $1 million. Gamble B: A 10% chance of receiving $5 million, an 89% chance of receiving $1 million, and a 1% chance of receiving nothing. After they have made their choice, they are presented with another 2 gambles and asked to choose between them: Gamble C: An 11% chance of receiving $1 million, and an 89% chance of receiving nothing. Gamble D: A 10% chance of receiving $5 million, and a 90% chance of receiving nothing.

47 This experiment has been conducted many, many times, and most people invariably prefer A to B, and D to C. So why is this a paradox?.

48 The expected value of A is $1 million, while the expected value of B is $1.39 million. By preferring A to B, people are presumably maximizing expected utility, not expected value. By preferring A to B, we have the following expected utility relationship: u(1) > 0.1 * u(5) * u(1) * u(0), i.e * u(1) > 0.1 * u(5) * u(0) Adding 0.89 * u(0) to each side, we get: 0.11 * u(1) * u(0) > 0.1 * u(5) * u(0), implying that an expected utility maximizer consistent with the first choice must prefer C to D. The expected value of C is $110,000, while the expected value of D is $500,000, so if people were maximizing expected value, they should in fact prefer D to C. However, their choice in the first stage is inconsistent with their choice in the second stage, and herein lies the paradox.