CPS 570: Artificial Intelligence Decision theory

Slides:



Advertisements
Similar presentations
Economics of Information (ECON3016)
Advertisements

CPS Utility theory Vincent Conitzer
Utility theory U: O-> R (utility maps from outcomes to a real number) represents preferences over outcomes ~ means indifference We need a way to talk about.
Vincent Conitzer CPS Repeated games Vincent Conitzer
Utility Theory.
Strongly based on Slides by Vincent Conitzer of Duke
1 Decision Making and Utility Introduction –The expected value criterion may not be appropriate if the decision is a one-time opportunity with substantial.
Chapter 13 Risk Attitudes ..
Economics 202: Intermediate Microeconomic Theory 1.HW #5 on website. Due Tuesday.
1 Utility Theory. 2 Option 1: bet that pays $5,000,000 if a coin flipped comes up tails you get $0 if the coin comes up heads. Option 2: get $2,000,000.
Risk a situation in which there is a probability that an event will occur. People tend to prefer greater certainty and less risk.
Intro to Game Theory Revisiting the territory we have covered.
1 Modeling risk attitudes Objective: Develop tools to compare alternative courses of action with uncertain outcomes (lotteries or deals) A B $30 -$15 $100.

Frank Cowell : Risk RISK MICROECONOMICS Principles and Analysis Frank Cowell Almost essential Consumption and Uncertainty Almost essential Consumption.
Risk attitudes, normal-form games, dominance, iterated dominance Vincent Conitzer
Instructor: Vincent Conitzer
CPS 270: Artificial Intelligence Decision theory Instructor: Vincent Conitzer.
Reminder Midterm Mar 7 Project 2 deadline Mar 18 midnight in-class
CPS 270: Artificial Intelligence Game Theory Instructor: Vincent Conitzer.
Decision theory under uncertainty
© 2005 Pearson Education Canada Inc Chapter 17 Choice Making Under Uncertainty.
CPS Utility theory, normal-form games
CPS 570: Artificial Intelligence Markov decision processes, POMDPs
Chapter 11 – Introduction to Risk Analysis u Why do individuals, companies, and stockholders take risks?
On Investor Behavior Objective Define and discuss the concept of rational behavior.
DECISION MODELS. Decision models The types of decision models: – Decision making under certainty The future state of nature is assumed known. – Decision.
L12 Uncertainty. Model with real endowments 1. Labor Supply (Labor-Leisure Choice) 2. Intertemporal Choice (Consumption-Savings Choice) 3. Uncertainty.
Lecture 8 Social Regulation. Correction of market failures, not dealing with the natural monopoly problem Regulation of health, safety, environment, public.
Chapter 5 Understanding Risk
Managerial Economics Uncertainty
Vincent Conitzer CPS Utility theory Vincent Conitzer
Utilities and Decision Theory
The Economics of Information and Choice Under Uncertainty
12 Uncertainty.
CPS 570: Artificial Intelligence Markov decision processes, POMDPs
Vincent Conitzer CPS Repeated games Vincent Conitzer
Chapter 5 Theory of Consumer Behavior
Theory of Consumer Behavior
Chapter Five Understanding Risk.
L11 Uncertainty.
Chapter 5.
Rational Decisions and
Chapter Twelve Uncertainty.
Utility Theory Decision Theory.
Managerial Economics in a Global Economy
Fi8000 Valuation of Financial Assets
13. Acting under Uncertainty Wolfram Burgard and Bernhard Nebel
Vincent Conitzer Mechanism design Vincent Conitzer
Vincent Conitzer CPS 173 Utility theory Vincent Conitzer
Instructor: Vincent Conitzer
Vincent Conitzer CPS 173 Mechanism design Vincent Conitzer
Instructor: Vincent Conitzer
Chapter 5: Theory of Consumer Behavior
Vincent Conitzer Utility theory Vincent Conitzer
Vincent Conitzer Repeated games Vincent Conitzer
Chapter 12 Uncertainty.
Consumer Behavior and Utility Maximization
Behavioral Finance Economics 437.
Walter Nicholson Christopher Snyder
Vincent Conitzer Utility theory Vincent Conitzer
Behavioral Finance Economics 437.
Chapter 5: Theory of Consumer Behavior
Artificial Intelligence Decision theory
Utilities and Decision Theory
Utilities and Decision Theory
Buying and Selling: Uncertainty
Instructor: Vincent Conitzer
Making Simple Decisions
Vincent Conitzer CPS Repeated games Vincent Conitzer
Presentation transcript:

CPS 570: Artificial Intelligence Decision theory Instructor: Vincent Conitzer

Risk attitudes Which would you prefer? How about: A lottery ticket that pays out $10 with probability .5 and $0 otherwise, or A lottery ticket that pays out $3 with probability 1 How about: A lottery ticket that pays out $100,000,000 with probability .5 and $0 otherwise, or A lottery ticket that pays out $30,000,000 with probability 1 Usually, people do not simply go by expected value An agent is risk-neutral if she only cares about the expected value of the lottery ticket An agent is risk-averse if she always prefers the expected value of the lottery ticket to the lottery ticket Most people are like this An agent is risk-seeking if she always prefers the lottery ticket to the expected value of the lottery ticket

Decreasing marginal utility Typically, at some point, having an extra dollar does not make people much happier (decreasing marginal utility) utility buy a nicer car (utility = 3) buy a car (utility = 2) buy a bike (utility = 1) money $200 $1500 $5000

Maximizing expected utility buy a nicer car (utility = 3) buy a car (utility = 2) buy a bike (utility = 1) money $200 $1500 $5000 Lottery 1: get $1500 with probability 1 gives expected utility 2 Lottery 2: get $5000 with probability .4, $200 otherwise gives expected utility .4*3 + .6*1 = 1.8 (expected amount of money = .4*$5000 + .6*$200 = $2120 > $1500) So: maximizing expected utility is consistent with risk aversion

Different possible risk attitudes under expected utility maximization money Green has decreasing marginal utility → risk-averse Blue has constant marginal utility → risk-neutral Red has increasing marginal utility → risk-seeking Grey’s marginal utility is sometimes increasing, sometimes decreasing → neither risk-averse (everywhere) nor risk-seeking (everywhere)

Acting optimally over time Finite number of periods: Overall utility = sum of rewards in individual periods Infinite number of periods: … are we just going to add up the rewards over infinitely many periods? Always get infinity! (Limit of) average payoff: limn→∞Σ1≤t≤nr(t)/n Limit may not exist… Discounted payoff: Σtδt r(t) for some δ < 1 Interpretations of discounting: Interest rate r: δ= 1/(1+r) World ends with some probability 1 -δ Discounting is mathematically convenient