Rational Decisions and

Slides:



Advertisements
Similar presentations
Utility theory U: O-> R (utility maps from outcomes to a real number) represents preferences over outcomes ~ means indifference We need a way to talk about.
Advertisements

Mixed Strategies.
Decisions under Uncertainty
Confidence-Based Decision Making Climate Science and Response Strategies Discussion Grantham, February
Intelligent Agents Russell and Norvig: 2
1 Decision Making and Utility Introduction –The expected value criterion may not be appropriate if the decision is a one-time opportunity with substantial.
ECO290E: Game Theory Lecture 5 Mixed Strategy Equilibrium.
Agent Technology for e-Commerce Appendix A: Introduction to Decision Theory Maria Fasli
Utility Axioms Axiom: something obvious, cannot be proven Utility axioms (rules for clear thinking)
CHAPTER 14 Utility Axioms Paradoxes & Implications.

Making Simple Decisions Copyright, 1996 © Dale Carnegie & Associates, Inc. Chapter 16.
PSY 5018H: Math Models Hum Behavior, Prof. Paul Schrater, Spring 2005 Normative Decision Theory A prescriptive theory for how decisions should be made.
Decision Making Under Uncertainty Russell and Norvig: ch 16 CMSC421 – Fall 2006.
Lecture 3: Arrow-Debreu Economy
CMSC 671 Fall 2003 Class #26 – Wednesday, November 26 Russell & Norvig 16.1 – 16.5 Some material borrowed from Jean-Claude Latombe and Daphne Koller by.
MAKING COMPLEX DEClSlONS
PGM 2003/04 Tirgul7 Foundations of Decision Theory (mostly from Pearl)
CPS 270: Artificial Intelligence Decision theory Instructor: Vincent Conitzer.
Reminder Midterm Mar 7 Project 2 deadline Mar 18 midnight in-class
Economic evaluation of health programmes Department of Epidemiology, Biostatistics and Occupational Health Class no. 9: Cost-utility analysis – Part 2.
Making Simple Decisions
Axioms Let W be statements known to be true in a domain An axiom is a rule presumed to be true An axiomatic set is a collection of axioms Given an axiomatic.
Risk Management with Coherent Measures of Risk IPAM Conference on Financial Mathematics: Risk Management, Modeling and Numerical Methods January 2001.
CS 188: Artificial Intelligence Spring 2007 Lecture 21:Reinforcement Learning: I Utilities and Simple decisions 4/10/2007 Srini Narayanan – ICSI and UC.
© 2005 Pearson Education Canada Inc Chapter 17 Choice Making Under Uncertainty.
Making Simple Decisions Utility Theory MultiAttribute Utility Functions Decision Networks The Value of Information Summary.
Chapter 16: Making Simple Decision March 23, 2004.
Axiomatic Theory of Probabilistic Decision Making under Risk Pavlo R. Blavatskyy University of Zurich April 21st, 2007.
Decision Making Under Uncertainty CMSC 471 – Spring 2014 Class #12– Thursday, March 6 R&N, Chapters , material from Lise Getoor, Jean-Claude.
Decision Analytic Approaches for Evidence-Based Practice M8120 Fall 2001 Suzanne Bakken, RN, DNSc, FAAN School of Nursing & Department of Medical Informatics.
INLS 200 thursday, january 17.
Web-Mining Agents Agents and Rational Behavior Decision-Making under Uncertainty Simple Decisions Ralf Möller Universität zu Lübeck Institut für Informationssysteme.
Making Simple Decisions Chapter 16 Some material borrowed from Jean-Claude Latombe and Daphne Koller by way of Marie desJadines,
Decision Making ECE457 Applied Artificial Intelligence Spring 2007 Lecture #10.
Von Neumann-Morgenstern Lecture II. Utility and different views of risk Knightian – Frank Knight Risk – known probabilities of events Uncertainty – unknown.
Decision Trees Jennifer Tsay CS 157B February 4, 2010.
Vincent Conitzer CPS Utility theory Vincent Conitzer
Utilities and Decision Theory
Ralf Möller Universität zu Lübeck Institut für Informationssysteme
The Economics of Information and Choice Under Uncertainty
CHAPTER 1 FOUNDATIONS OF FINANCE I: EXPECTED UTILITY THEORY
ECE457 Applied Artificial Intelligence Fall 2007 Lecture #10
ECE457 Applied Artificial Intelligence Spring 2008 Lecture #10
L03 Utility.
Decision Making under Uncertainty and Risk
Games of pure conflict two person constant sum
CPS 570: Artificial Intelligence Decision theory
CS 4/527: Artificial Intelligence
L12 Uncertainty.
Lecture 3 Axioms & elicitation.
Chapter Three Preferences.
L11 Uncertainty.
Risk Chapter 11.
Multiagent Systems Game Theory © Manfred Huber 2018.
L03 Utility.
L03 Utility.
Utility Theory Decision Theory.
Chapter 3 Preferences.
Course Overview and Introduction
13. Acting under Uncertainty Wolfram Burgard and Bernhard Nebel
Instructor: Vincent Conitzer
EASTERN MEDITERRANEAN UNIVERSITY DEPARTMENT OF INDUSTRIAL ENGINEERING IENG314 OPERATIONS RESEARCH II SAMIR SAMEER ABUYOUSSEF
Behavioral Finance Economics 437.
Vincent Conitzer Utility theory Vincent Conitzer
Artificial Intelligence Decision theory
Utilities and Decision Theory
Utilities and Decision Theory
Normal Form (Matrix) Games
Making Simple Decisions
Presentation transcript:

Rational Decisions and Multiagent Systems Rational Decisions and Utility Theory © Manfred Huber 2018

Rationality Rationality describes a decision making criterion A rational agent always takes the action that leads to the “best” outcome for it In multiagent scenarios this implies Actions are not taken to harm other agents Actions are taken in order to maximize the agent’s performance criterion But: actions do not necessarily only consider aspects important to the individual agent © Manfred Huber 2018

Rationality Rationality requires to define “best” In multiagent systems what is “best” can be different for every agent What is “best” depends on the state of the agent What is “best” can be influenced by random effects and uncertainties © Manfred Huber 2018

Decision and Utility Theory Decision Theory deals with optimal decision making in single agent as well as multiagent systems A rational decision agent takes the action that leads to the highest expected payoff Utility Theory deals with the definition of “best”, “better”, and “payoff” Utility defines the expected “payoff” © Manfred Huber 2018

Utility Theory Utility: quantifies the degree of preference across different alternatives models the impact of uncertainty on preferences represents a mapping from an agent’s state (or situation) to the degree of happiness (expected future payoff) for being in this state © Manfred Huber 2018

Utility Theory The application of utility theory requires: All preferences of an agent have to be expressed with a single utility function independent of the complexity of the task, environment, and action set An agent’s decisions in the context of uncertainty are purely determined by the expected value of the utility © Manfred Huber 2018

Rational Preferences To make rational decisions we have to define rational preferences: Outcomes oi o1 is at least as desirable as o2 (the agent weakly prefers o1 to o2) the agent is indifferent between o1 and o2 ( and ) the agent strictly prefers o1 to o2 ( and ) © Manfred Huber 2018

Rational Preferences Preferences have to be expressible in the context of uncertainty: Lottery: specifies the probability of each possible outcome [p1 : o1, p2 : o2, …, pn : on] Lotteries represent uncertain outcomes © Manfred Huber 2018

Rational Preferences For preferences to be rational (more precisely for them to lead to rational decisions) they have to fulfill a number of axioms: Completeness Transitivity Substitutability Decomposability Monotonicity Continuity © Manfred Huber 2018

Preferences - Completeness All outcomes have to be comparable A preference relationship has to be defined for every pair of outcomes © Manfred Huber 2018

Preferences - Transitivity Preferences have to be transitive This implies that there can not be any circular preference relationships © Manfred Huber 2018

Preferences - Substitutability If an agent is indifferent between two outcomes then they can be substituted for each other without change in the preference Outcomes that are indifferent have to be indifferent in all contexts © Manfred Huber 2018

Preferences - Decomposability An agent has to be indifferent between two choices that lead to the same outcome The agent is indifferent between two lotteries that lead to the same outcome probabilities This implies that the agent has no preference for the mechanism that leads to an outcome but only for the outcome itself For example, a probabilistic outcome vs a probabilistic outcome where one choice includes a lottery The agent is indifferent between one thing and gambling as long as it leads to the same outcome © Manfred Huber 2018

Preferences - Monotonicity An agent prefers a larger chance at a better outcome to a smaller chance This also implies that the agent does not have any preference between deterministic and probabilistic outcomes © Manfred Huber 2018

Preferences - Continuity The preference relation of a lottery has to change continuously with the change of the outcome probabilities © Manfred Huber 2018

From Rational Preferences to Utility von Neumann and Morgenstern showed in 1944 that if preferences are rational (i.e. they obey the axioms), then there exists a scalar utility function that quantifies the preferences Proof idea: select best and worst outcomes and assign 0 and 1, find probabilities at which lotteries of these two are indifferent to the other outcomes and set the utilities correspondingly © Manfred Huber 2018

Utility Functions A Utility function allows to quantify preferences for decision making Rational decisions are simply the ones that lead to the largest value of the utility function © Manfred Huber 2018

Utility Functions There are an infinite number of utility functions for each set of rational preferences E.g.: Linear offsets and scaling of the utility function preserves preferences Utility can only be used to compare alternatives The absolute value of the utility is arbitrary The bounds on the values of the utility function are not necessary for rational decision making Utilities can be arbitrary values (as long as they are finite) © Manfred Huber 2018

From Russell and Norvig Utility vs. Money Money does not behave like a utility for humans Humans preferences react to “risk” as well as money For example, a probabilistic outcome vs a probabilistic outcome where one choice includes a lottery The agent is indifferent between one thing and gambling as long as it leads to the same outcome From Russell and Norvig © Manfred Huber 2018

Utility vs. Money That humans do not choose actions that maximize the expected amount of money implies either: Human preferences are not rational (i.e. People are irrational) Human preferences are not based solely on money (i.e. People are rational but money is not the utility function used) Artificial Intelligence usually prefers the second option For example, a probabilistic outcome vs a probabilistic outcome where one choice includes a lottery The agent is indifferent between one thing and gambling as long as it leads to the same outcome © Manfred Huber 2018