Ralf Möller Universität zu Lübeck Institut für Informationssysteme

Slides:



Advertisements
Similar presentations
Decision Making Under Uncertainty Russell and Norvig: ch 16, 17 CMSC 671 – Fall 2005 material from Lise Getoor, Jean-Claude Latombe, and Daphne Koller.
Advertisements

Utility theory U: O-> R (utility maps from outcomes to a real number) represents preferences over outcomes ~ means indifference We need a way to talk about.
Making Simple Decisions Chapter 16 Some material borrowed from Jean-Claude Latombe and Daphne Koller by way of Marie desJadines,
Making Simple Decisions
Utility Axioms Axiom: something obvious, cannot be proven Utility axioms (rules for clear thinking)
Judgment and Decision Making in Information Systems Utility Functions, Utility Elicitation, and Risk Attitudes Yuval Shahar, M.D., Ph.D.

Cooperating Intelligent Systems Utility theory Chapter 16, AIMA.
Making Simple Decisions Copyright, 1996 © Dale Carnegie & Associates, Inc. Chapter 16.
Decision Making Under Uncertainty Russell and Norvig: ch 16, 17 CMSC421 – Fall 2005.
Decision Making Under Uncertainty Russell and Norvig: ch 16 CMSC421 – Fall 2006.
1 Mutli-Attribute Decision Making Eliciting Weights Scott Matthews Courses: /
Decision Making Under Uncertainty Russell and Norvig: ch 16, 17 CMSC421 – Fall 2003 material from Jean-Claude Latombe, and Daphne Koller.
CMSC 671 Fall 2003 Class #26 – Wednesday, November 26 Russell & Norvig 16.1 – 16.5 Some material borrowed from Jean-Claude Latombe and Daphne Koller by.
GAMES AGAINST NATURE Topic #3. Games Against Nature In game theory, for reasons that will be explained later, the alternatives (e.g., LEFT and RIGHT)
PGM 2003/04 Tirgul7 Foundations of Decision Theory (mostly from Pearl)
Decision Analysis (cont)
Reminder Midterm Mar 7 Project 2 deadline Mar 18 midnight in-class
Making Simple Decisions
Decision & Risk Analysis Influence Diagrams, Decision Trees NOTE: Some materials for this presentation courtesy of Dr. Dan Maxwell Reference: Clemen &
Axioms Let W be statements known to be true in a domain An axiom is a rule presumed to be true An axiomatic set is a collection of axioms Given an axiomatic.
Making Simple Decisions Copyright, 1996 © Dale Carnegie & Associates, Inc. Chapter 16.
Chapter 13 February 19, Acting Under Uncertainty Rational Decision – Depends on the relative importance of the goals and the likelihood of.
Decision Making Under Uncertainty CMSC 671 – Fall 2010 R&N, Chapters , , material from Lise Getoor, Jean-Claude Latombe, and.
Making Simple Decisions Utility Theory MultiAttribute Utility Functions Decision Networks The Value of Information Summary.
Models for Strategic Marketing Decision Making. Market Entry Decisions To enter first or to wait Sources of First-Mover Advantages –Technological leadership.
Chapter 16: Making Simple Decision March 23, 2004.
Decision Theory Copyright © 2015 McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill.
Decision Making Under Uncertainty CMSC 471 – Spring 2014 Class #12– Thursday, March 6 R&N, Chapters , material from Lise Getoor, Jean-Claude.
Web-Mining Agents Agents and Rational Behavior Decision-Making under Uncertainty Simple Decisions Ralf Möller Universität zu Lübeck Institut für Informationssysteme.
Making Simple Decisions Chapter 16 Some material borrowed from Jean-Claude Latombe and Daphne Koller by way of Marie desJadines,
Chapter 16 March 25, Probability Theory: What an agent should believe based on the evidence Utility Theory: What the agent wants Decision Theory:
1 Automated Planning and Decision Making 2007 Automated Planning and Decision Making Prof. Ronen Brafman Various Subjects.
Decision Making ECE457 Applied Artificial Intelligence Spring 2007 Lecture #10.
Decision Making Under Uncertainty CMSC 471 – Fall 2011 Class #23-24 – Thursday, November 17 / Tuesday, November 22 R&N, Chapters , ,
Behavioral Finance Preferences Part I Feb 16 Behavioral Finance Economics 437.
The Theory of Rational Choice
Ralf Möller Universität zu Lübeck Institut für Informationssysteme
Chapter 5 Supplement Decision Theory.
CS 188: Artificial Intelligence Spring 2006
Utilities and Decision Theory
Nevin L. Zhang Room 3504, phone: ,
Maximum Expected Utility
Business Modeling Lecturer: Ing. Martina Hanová, PhD.
Making Simple Decisions
Prof. Dr. Holger Schlingloff 1,2 Dr. Esteban Pavese 1
ECE457 Applied Artificial Intelligence Fall 2007 Lecture #10
ECE457 Applied Artificial Intelligence Spring 2008 Lecture #10
Where are we in CS 440? Now leaving: sequential, deterministic reasoning Entering: probabilistic reasoning and machine learning.
Uncertainty Chapter 13.
Expectimax Lirong Xia. Expectimax Lirong Xia Project 2 MAX player: Pacman Question 1-3: Multiple MIN players: ghosts Extend classical minimax search.
CS 4/527: Artificial Intelligence
Decision Theory: Single Stage Decisions
Risk Chapter 11.
Representing Uncertainty
Rational Decisions and
Utility Theory Decision Theory.
Decision Theory: Single Stage Decisions
13. Acting under Uncertainty Wolfram Burgard and Bernhard Nebel
Class #21 – Monday, November 10
Evaluating and Choosing Preferred Projects
Making Simple Decisions
Web-Mining Agents Deep Relational Learning
Behavioral Finance Economics 437.
Utilities and Decision Theory
Utilities and Decision Theory
Decision Making.
Deciding Under Probabilistic Uncertainty
Zachary Blomme University of Nebraska, Lincoln Pattern Recognition
Making Simple Decisions
Presentation transcript:

Ralf Möller Universität zu Lübeck Institut für Informationssysteme Web-Mining Agents Agents and Rational Behavior Decision-Making under Uncertainty Simple Decisions Ralf Möller Universität zu Lübeck Institut für Informationssysteme

Literature Chapter 16 Material from Lise Getoor, Jean-Claude Latombe, Daphne Koller, and Stuart Russell

Decision Making Under Uncertainty Many environments have multiple possible outcomes Some of these outcomes may be good; others may be bad Some may be very likely; others unlikely What’s a poor agent going to do??

Non-Deterministic vs. Probabilistic Uncertainty ? b a c ? b a c {a(pa),b(pb),c(pc)} decision that maximizes expected utility value {a,b,c} decision that is best for worst case Non-deterministic model Probabilistic model ~ Adversarial search

Expected Utility Random variable X with n values x1,…,xn and distribution (p1,…,pn) E.g.: X is the state reached after doing an action A under uncertainty Function U of X E.g., U is the utility of a state The expected utility of A is EU[A=a] = Si=1,…,n p(xi|A=a)U(xi)

One State/One Action Example U(S0) = 100 x 0.2 + 50 x 0.7 + 70 x 0.1 = 20 + 35 + 7 = 62 s3 s2 s1 0.2 0.7 0.1 100 50 70

One State/Two Actions Example U1(S0) = 62 U2(S0) = 74 U(S0) = max{U1(S0),U2(S0)} = 74 s0 A1 A2 s3 s2 s1 s4 0.2 0.7 0.1 0.2 0.8 100 50 70 80

Introducing Action Costs U1(S0) = 62 – 5 = 57 U2(S0) = 74 – 25 = 49 U(S0) = max{U1(S0),U2(S0)} = 57 s0 A1 A2 -5 -25 s3 s2 s1 s4 0.2 0.7 0.1 0.2 0.8 100 50 70 80

MEU Principle AI is Solved!!! A rational agent should choose the action that maximizes agent’s expected utility This is the basis of the field of decision theory The MEU principle provides a normative criterion for rational choice of action AI is Solved!!!

Not quite… Must have complete model of: Actions Utilities States Even if you have a complete model, it might be computationally intractable In fact, a truly rational agent takes into account the utility of reasoning as well---bounded rationality Nevertheless, great progress has been made in this area recently, and we are able to solve much more complex decision-theoretic problems than ever before

We’ll look at Decision-Theoretic Planning Simple decision making (ch. 16) Sequential decision making (ch. 17)

Preferences A and B can be lotteries again: Prizes are special lotteries: [1, X; 0, not X]

Rational preferences

Rational preferences contd.

Axioms of Utility Theory Orderability ({<, >, ~} is JEPD) (A>B)  (A<B)  (A~B) Transitivity (A>B)  (B>C)  (A>C) Continuity A>B>C  p [p,A; 1-p,C] ~ B Substitutability A~B  [p,A; 1-p,C]~[p,B; 1-p,C] Monotonicity A>B  (p≥q  [p,A; 1-p,B] >~ [q,A; 1-q,B]) Decomposability [p,A; 1-p, [q,B; 1-q, C]] ~ [p,A; (1-p)q, B; (1-p)(1-q), C] Decomposability: There is no fun in gambling

And then there was utility

Utilities -and-continue -as-before

Utility scales U(pay $30...) = 0.999999

Money

Money Versus Utility Money <> Utility Expected Monetary Value More money is better, but not always in a linear relationship to the amount of money Expected Monetary Value Risk-averse – U(L) < U(SEMV(L)) Risk-seeking – U(L) > U(SEMV(L)) Risk-neutral – U(L) = U(SEMV(L))

Value Functions Provides a ranking of alternatives, but not a meaningful metric scale Also known as an “ordinal utility function”

Multiattribute Utility Theory A given state may have multiple utilities ...because of multiple evaluation criteria ...because of multiple agents (interested parties) with different utility functions

Strict dominance

Stochastic dominance Continuous random variable S1 stochastically dominates S2 iff t . P(S1 ≥ t) ≥ P(S2 ≥ t)

Stochastic Dominance

Stochastic Dominance

Stochastic Dominance

Preference structure: Deterministic month > vs. month > V(x1, ..., vn) = 𝚺i Vi(xi)

Preference structure: Stochastic

Decision Networks Extend BNs to handle actions and utilities Also called influence diagrams Use BN inference methods to solve Perform Value of Information calculations

Decision Networks cont. Chance nodes: random variables, as in BNs Decision nodes: actions that decision maker can take Utility/value nodes: the utility of the outcome state.

R&N example

Umbrella Network umbrella weather have umbrella forecast happiness take/don’t take P(rain) = 0.4 umbrella weather have umbrella forecast P(have|take) = 1.0 P(~have|~take)=1.0 happiness f w p(f|w) sunny rain 0.3 rainy rain 0.7 sunny no rain 0.8 rainy no rain 0.2 U(have,rain) = -25 U(have,~rain) = 0 U(~have, rain) = -100 U(~have, ~rain) = 100

Evaluating Decision Networks Set the evidence variables for current state For each possible value of the decision node: Set decision node to that value Calculate the posterior probability of the parent nodes of the utility node, using BN inference Calculate the resulting utility for action Return the action with the highest utility

Decision Making: Umbrella Network Should I take my umbrella?? take/don’t take P(rain) = 0.4 umbrella weather have umbrella forecast P(have|take) = 1.0 P(~have|~take)=1.0 happiness f w p(f|w) sunny rain 0.3 rainy rain 0.7 sunny no rain 0.8 rainy no rain 0.2 U(have,rain) = -25 U(have,~rain) = 0 U(~have, rain) = -100 U(~have, ~rain) = 100

Value of information

General formula

Properties of VPI

Qualitative behaviors

Information Gathering Agent Ask questions Request(Ej) in a reasonable order Avoid irrelevant questions Take into account imporance of piece of information j in relation to Cost(Ej)