PGM 2003/04 Tirgul7 Foundations of Decision Theory (mostly from Pearl)

Slides:



Advertisements
Similar presentations
Utility theory U: O-> R (utility maps from outcomes to a real number) represents preferences over outcomes ~ means indifference We need a way to talk about.
Advertisements

Choice under Uncertainty. Introduction Many choices made by consumers take place under conditions of uncertainty Therefore involves an element of risk.
Utility Theory.
3. Basic Topics in Game Theory. Strategic Behavior in Business and Econ Outline 3.1 What is a Game ? The elements of a Game The Rules of the.
Decisions under Uncertainty
Chapter 8: Decision Analysis
1 Decision Making and Utility Introduction –The expected value criterion may not be appropriate if the decision is a one-time opportunity with substantial.
Chapter 18 Statistical Decision Theory Copyright © 2010 Pearson Education, Inc. Publishing as Prentice Hall Statistics for Business and Economics 7 th.
Chapter 21 Statistical Decision Theory
1 Decision Analysis - Part 2 Aslı Sencer Graduate Program in Business Information Systems.
Managerial Decision Modeling with Spreadsheets
3 Decision Analysis To accompany Quantitative Analysis for Management, Twelfth Edition, by Render, Stair, Hanna and Hale Power Point slides created by.
Risk Attitude Dr. Yan Liu
ISMT 161: Introduction to Operations Management
Decision Analysis Chapter 3
Agent Technology for e-Commerce Appendix A: Introduction to Decision Theory Maria Fasli
Utility Axioms Axiom: something obvious, cannot be proven Utility axioms (rules for clear thinking)
CHAPTER 14 Utility Axioms Paradoxes & Implications.
1 Utility Theory. 2 Option 1: bet that pays $5,000,000 if a coin flipped comes up tails you get $0 if the coin comes up heads. Option 2: get $2,000,000.
Judgment and Decision Making in Information Systems Utility Functions, Utility Elicitation, and Risk Attitudes Yuval Shahar, M.D., Ph.D.
1 Chapter 6 It is a truth very certain that when it is not in our power to determine what is true we ought to follow what is probable.— Descartes Decision.
Lecture 4 on Individual Optimization Risk Aversion

Cooperating Intelligent Systems Utility theory Chapter 16, AIMA.
Making Simple Decisions Copyright, 1996 © Dale Carnegie & Associates, Inc. Chapter 16.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 16-1 Chapter 16 Decision Making Statistics for Managers Using Microsoft.
Decision Making Under Uncertainty Russell and Norvig: ch 16 CMSC421 – Fall 2006.
Decision Trees and Utility Theory
Decision Making as Constrained Optimization Specification of Objective Function  Decision Rule Identification of Constraints.
Decision Analysis Chapter 3
EXPECTED UTILITY AND RISK AVERSION
CMSC 671 Fall 2003 Class #26 – Wednesday, November 26 Russell & Norvig 16.1 – 16.5 Some material borrowed from Jean-Claude Latombe and Daphne Koller by.
Decision Analysis Chapter 3
The Marriage Problem Finding an Optimal Stopping Procedure.
Decision-making under uncertainty. Introduction Definition of risk Attitudes toward risk Avoiding risk: Diversification Insurance.
Principles of Engineering System Design Dr T Asokan Decision Making in System Design.
Decision Analysis (cont)
Chapter 3 Decision Analysis.
Decision making Making decisions Optimal decisions Violations of rationality.
Reminder Midterm Mar 7 Project 2 deadline Mar 18 midnight in-class
Making Simple Decisions
1 Risk and Information Chapter Chapter Fifteen Overview 1.Introduction: Amazon.com 2.Describing Risky Outcome – Basic Tools Lotteries and Probabilities.
Quantitative Decision Techniques 13/04/2009 Decision Trees and Utility Theory.
Lecture 3 on Individual Optimization Uncertainty Up until now we have been treating bidders as expected wealth maximizers, and in that way treating their.
Decision theory under uncertainty
Making Simple Decisions Utility Theory MultiAttribute Utility Functions Decision Networks The Value of Information Summary.
Decision Theory McGraw-Hill/Irwin Copyright © 2012 by The McGraw-Hill Companies, Inc. All rights reserved.
Lecture 6 Decision Making.
Models for Strategic Marketing Decision Making. Market Entry Decisions To enter first or to wait Sources of First-Mover Advantages –Technological leadership.
Chapter 16: Making Simple Decision March 23, 2004.
Expected Value, Expected Utility & the Allais and Ellsberg Paradoxes
On Investor Behavior Objective Define and discuss the concept of rational behavior.
Making Simple Decisions Chapter 16 Some material borrowed from Jean-Claude Latombe and Daphne Koller by way of Marie desJadines,
1 Systems Analysis Methods Dr. Jerrell T. Stracener, SAE Fellow SMU EMIS 5300/7300 NTU SY-521-N NTU SY-521-N SMU EMIS 5300/7300 Utility Theory Applications.
Decision Making ECE457 Applied Artificial Intelligence Spring 2007 Lecture #10.
DADSS Lecture 11: Decision Analysis with Utility Elicitation and Use.
Chapter 5 Understanding Risk
Chapter 19 Decision Making
CHAPTER 1 FOUNDATIONS OF FINANCE I: EXPECTED UTILITY THEORY
19/11/1439 Decision Analysis.
Lecture 2 Hastie & Dawes: Changing Our Minds: Bayes’ Theorem. In Rational Choice in an UncertainWorld, 2nd ed., 2010, pp
CS 4/527: Artificial Intelligence
Chapter Five Understanding Risk.
Risk Chapter 11.
Rational Decisions and
13. Acting under Uncertainty Wolfram Burgard and Bernhard Nebel
Choices, Values and Frames
Utilities and Decision Theory
Making Simple Decisions
Presentation transcript:

PGM 2003/04 Tirgul7 Foundations of Decision Theory (mostly from Pearl)

Introduction I’m planning a party and having a hard time to decide whether to stage it indoors or outdoors. In despair, I summarize what I can say in a table: indoorsdry (0.7)=regret indoorswet (0.3)=relief outdoorsdry (0.7)=perfect outdoorswet (0.3)=disaster It is clear I prefer perfect to regret, but is it enough to make a decision? Probabilities quantify the likely- hood of events. We are looking for a measure which will quantify desirability and will be the basis of all decision making.

Utilities, consequences and MEU Given some evidence, we are given a set of actions that we can take. Each action has several consequences. For each consequence c we assign a utility (or desirability) measure U(c). The expected utility of the action is then: Following the Bayesian approach, we want to maximize this expected utilitiy (MEU)

Lotteries At each stage of the decision process, we are faced with a dilemma of which action to take. Each action has a payoff or a loss. Thus, much like gambling we are faced with a lottery situation. Lotteries are defined as pairs of consequences and probabilities L(C,P). C 1 = 1 month C 2 = 2 month C 3 = 3 month Project completion time lottery L p 1 =0.60 p 3 =0.10 p 2 =0.30

Lotteries (cont.) When trying to assess the desirability of a lottery, we encounter several problems:  What is a good calibration commodity? Is money stable enough? Does the commodity depend on time and circumstances?  Once a commodity is found, we need to be able to deduce complex lotteries from simpler ones (or deal with infinitely many possibilities).

Calibrating a Lottery Solution: calibrate a lottery against a standard best/worst lottery which is equivalent in desirability to ours (calibrate a lottery using it’s inherent uncertainty). L C 1 = 1 month C 2 = 2 month C 3 = 3 month LsLs C 1 = 1 week C 3 = 1 year p 1 =0.60 p 3 =0.10 p 2 =0.30 psps 1-p s

Axioms of Utility Theory We are still left with the problem of how to define our preference over different consequences. We want a preference pattern that will allow us to prefer one lottery over another in a consistent way. The following axioms define some constraints on what kind of preference we are allowed to use as a basis for making decisions. Axiom 1 - Orderability: A linear and transitive preference relation must exist between the prizes. In simpler words, an agent must know what he wants:

Axioms of Utility Theory (cont.) Axiom 2 - Continuity: If C 2 is in between C 1 and C 3 then we could compare C 2 to some lottery with only C 1 and C 3 : Axiom 3 - Substitutability: If two things look equivalent, they will also be equivalent in a larger context:

Axioms of Utility Theory (cont.) Axiom 4 - Monotonicity: We prefer a better prize with higher probability: Axiom 5 - Decomposability: there is no fun in gambling (just the outcome matters):

Constructing Utilities Theorem: If a preference pattern obeys axioms 1-5 then by specifying the utility measure of each consequence we can faithfully prefer one lottery to another using: This allows us to do just what we set out for: define desirability for elementary consequences and then use the MEU principle to evaluate complex lotteries.

Proof “Surprisingly”, the essence of the proof rests on exactly the essence of the axioms: the ability to reduce complex lotteries to simple ones and that compare these simple two prize lotteries to each other. The outline of the proof follows: Let’s us examine two lotteries L 1 and L 2 that we want to compare with a joint set of prizes C 1 …C n (ordered from best prize to worst using axiom 1). Using substitution (axiom 3) we can write both lotteries in terms of all prizes (with probability 0 where needed).  Since any prize is between C 1 and C n, using continuity (axiom 2): and we can now substitute (axiom 3) and get:

Proof (cont.) Now, applying decomposability, we can present our lotteries in the following equivalent way: Using monotonicity (axiom 4), since C 1 >C n, the preference between L 1 and L 2 is determined by the probability assigned to C 1. Thus: L1L1 C1C1 CnCn L2L2 C1C1 CnCn L1L1 C1C1 CnCn p1p1 pnpn ~ L1L1 p1p1 pnpn C1C1 CnCn u1u1 1-u 1 C1C1 CnCn unun 1-u n

Real-life Utilities Given all the axioms of decision theory, we are still left with the actual problem of assigning utilities to consequences. In fact, any utility we come up with is only as unique as a linear transformation of it. Can we use monetary values as utilities? Do you prefer $3,000,000 for sure or $4,000,000 with 80% chance? This just means that our utilities is not money directly. If, we assign a utlity of 12 to the case of adding $3,000,000 to our bank account and 10 to that of adding $4,000,000 than our choice makes sense.

Utility of Money St. Petersburg paradox (by Bernoulli): in the game you toss a coin until it comes up as heads. If the head appeared on the nth toss you get $2n. How much are you willing to pay to participate? The expected winnings are: if we the utility of money is logarithmic then:

Attitudes towards risk The shape of the utility curve describes a person’s attitude towards risk. A risk-averse person will prefer a sure thing to a gamble with the same payoff. That is: and the sum of p i x i is the expected monetary value (EMV) offered by the lottery. A risk averse curve will be convex: and $100 is the insurance premium people will be willing to pay $reward U U(lottery) $500 $400

Attitudes towards risk (cont.) A risk seeking person will prefer the lottery to certainty. In fact, state lotteries and casinos make a lot of money just by utilizing this attitude in all of us. Thinking above the above examples, we are all both risk averse and risk seeking. How can that be? The answer lies in that our utility function is not the same for all circumstances. A person with a $10,000,000 debt will take ridiculous risks to erase the debt because the situation is hopeless any ways. We feel strongly about “being lucky” in a Poker game but have a hard time seeing insurance claims as prizes in a game...

Rationality Finally, we hope that at least we are rational compared to some utility function (decision theory gives us freedom in choosing the function). Unfortunately we are all irrational: Test 1: Do you prefer a lottery A where you get $4,000,000 with 20% chance or B where you get $3,000,000 with 25% chance. Test 2: Do you prefer C where you get $4,000,000 with 80% chance or D where you get $3,000,000 for sure?

Decision Trees When we want to make optimal decisions, we need to plan. We will ultimately merge Bayesian networks and decisions into influence diagrams. Let’s start with a model which represent all possible scenarios: A decision tree has two types of nodes: decision and chance. The leaves of the tree carry the utility associated with the scenario at that leaf. We will find an optimal strategy by starting with the leaves and propagating towards the root:  a chance node receives the expected utility  a decision node is assigned the best utility (MEU) and the corresponding branch is marked.

Decision Trees: An example The buyer of a used car can decide to carry out two tests. t 1 at the cost of $50 and t 2 at the cost of $20. There are two candidate cars c 1 costs $1500 and it’s market value is $2000 but will cost $700 to repair if the car is bad. c 2 costs $1150 with a market value of $1400 but a repair cost of only $150. The buyer needs to buy a car and has time for one test. From experience, c 1 has 70% chance of being good while c 2 has 80% chance of being good (as an ex, verify that using no test, it is better to buy c 1 with EMV of $290). Test t 1 checks car c 1. If the car is good there is a 90% chance that the test will confirm. If it is bad, the test will discover it in 65% of the cases. Test t 2 checks car c 2. If will confirm good quality with 25% probability and discover flaws with 70% probability.

Decision Trees: An example (cont.) Plugging in the probabilities and propagating for optimal plan, we find that we should do t 1. If it passes we should bye c 1 and if it fails we should by c 2. Over all, we expect a utility of

Direction of Probabilities Often, we are more comfortable in specifying probabilities in other directions than the ones needed. This is because causality does not necessarily follow the order in which we have to make decisions. For example, it is more natural to assess what is the probability that a test will pass/fail based on the condition of the car. We then use Bayes rule to calculate the needed probabilities for the decision tree.