Decision Making Under Uncertainty CMSC 471 – Spring 2014 Class #12– Thursday, March 6 R&N, Chapters 15.1-15.2.1, 16.1-16.3 material from Lise Getoor, Jean-Claude.

Slides:



Advertisements
Similar presentations
UNIT IV: UNCERTAIN KNOWLEDGE AND REASONING
Advertisements

Decision Making Under Uncertainty Russell and Norvig: ch 16, 17 CMSC 671 – Fall 2005 material from Lise Getoor, Jean-Claude Latombe, and Daphne Koller.
Making Simple Decisions Chapter 16 Some material borrowed from Jean-Claude Latombe and Daphne Koller by way of Marie desJadines,
Making Simple Decisions
PROBABILITY. Uncertainty  Let action A t = leave for airport t minutes before flight from Logan Airport  Will A t get me there on time ? Problems :
Uncertainty Everyday reasoning and decision making is based on uncertain evidence and inferences. Classical logic only allows conclusions to be strictly.
Dynamic Bayesian Networks (DBNs)
Modeling Uncertainty over time Time series of snapshot of the world “state” we are interested represented as a set of random variables (RVs) – Observable.
Lirong Xia Hidden Markov Models Tue, March 28, 2014.
Hidden Markov Models Reading: Russell and Norvig, Chapter 15, Sections
Introduction of Probabilistic Reasoning and Bayesian Networks
1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, Web site for the book & more.
1 Reasoning Under Uncertainty Over Time CS 486/686: Introduction to Artificial Intelligence Fall 2013.
Planning under Uncertainty
Probabilistic reasoning over time So far, we’ve mostly dealt with episodic environments –One exception: games with multiple moves In particular, the Bayesian.
… Hidden Markov Models Markov assumption: Transition model:
CS 547: Sensing and Planning in Robotics Gaurav S. Sukhatme Computer Science Robotic Embedded Systems Laboratory University of Southern California
Decision Making Under Uncertainty Russell and Norvig: ch 16, 17 CMSC421 – Fall 2005.
Part 2 of 3: Bayesian Network and Dynamic Bayesian Network.
1 Bayesian Reasoning Chapter 13 CMSC 471 Adapted from slides by Tim Finin and Marie desJardins.
Decision Making Under Uncertainty Russell and Norvig: ch 16 CMSC421 – Fall 2006.
Probabilistic Robotics Introduction Probabilities Bayes rule Bayes filters.
Representing Uncertainty CSE 473. © Daniel S. Weld 2 Many Techniques Developed Fuzzy Logic Certainty Factors Non-monotonic logic Probability Only one.
CPSC 422, Lecture 14Slide 1 Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 14 Feb, 4, 2015 Slide credit: some slides adapted from Stuart.
CS 188: Artificial Intelligence Fall 2009 Lecture 19: Hidden Markov Models 11/3/2009 Dan Klein – UC Berkeley.
Decision Making Under Uncertainty Russell and Norvig: ch 16, 17 CMSC421 – Fall 2003 material from Jean-Claude Latombe, and Daphne Koller.
1 Bayesian Networks Chapter ; 14.4 CS 63 Adapted from slides by Tim Finin and Marie desJardins. Some material borrowed from Lise Getoor.
CMSC 671 Fall 2003 Class #26 – Wednesday, November 26 Russell & Norvig 16.1 – 16.5 Some material borrowed from Jean-Claude Latombe and Daphne Koller by.
Visibility Graph. Voronoi Diagram Control is easy: stay equidistant away from closest obstacles.
1 Robot Environment Interaction Environment perception provides information about the environment’s state, and it tends to increase the robot’s knowledge.
Making Simple Decisions
CHAPTER 15 SECTION 1 – 2 Markov Models. Outline Probabilistic Inference Bayes Rule Markov Chains.
UIUC CS 498: Section EA Lecture #21 Reasoning in Artificial Intelligence Professor: Eyal Amir Fall Semester 2011 (Some slides from Kevin Murphy (UBC))
Chapter 13 February 19, Acting Under Uncertainty Rational Decision – Depends on the relative importance of the goals and the likelihood of.
CS 416 Artificial Intelligence Lecture 17 Reasoning over Time Chapter 15 Lecture 17 Reasoning over Time Chapter 15.
The famous “sprinkler” example (J. Pearl, Probabilistic Reasoning in Intelligent Systems, 1988)
Decision Making Under Uncertainty CMSC 671 – Fall 2010 R&N, Chapters , , material from Lise Getoor, Jean-Claude Latombe, and.
4 Proposed Research Projects SmartHome – Encouraging patients with mild cognitive disabilities to use digital memory notebook for activities of daily living.
CPSC 422, Lecture 15Slide 1 Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 15 Oct, 14, 2015.
Chapter 16: Making Simple Decision March 23, 2004.
Decision Theoretic Planning. Decisions Under Uncertainty  Some areas of AI (e.g., planning) focus on decision making in domains where the environment.
Decision Making Under Uncertainty CMSC 471 – Spring 2041 Class #25– Tuesday, April 29 R&N, material from Lise Getoor, Jean-Claude Latombe, and.
CPSC 422, Lecture 17Slide 1 Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 17 Oct, 19, 2015 Slide Sources D. Koller, Stanford CS - Probabilistic.
1 CMSC 671 Fall 2001 Class #20 – Thursday, November 8.
Probability and Time. Overview  Modelling Evolving Worlds with Dynamic Baysian Networks  Simplifying Assumptions Stationary Processes, Markov Assumption.
1 Chapter 17 2 nd Part Making Complex Decisions --- Decision-theoretic Agent Design Xin Lu 11/04/2002.
Stochastic Processes and Transition Probabilities D Nagesh Kumar, IISc Water Resources Planning and Management: M6L5 Stochastic Optimization.
Probabilistic Robotics Introduction Probabilities Bayes rule Bayes filters.
Probability and Time. Overview  Modelling Evolving Worlds with Dynamic Baysian Networks  Simplifying Assumptions Stationary Processes, Markov Assumption.
Web-Mining Agents Agents and Rational Behavior Decision-Making under Uncertainty Simple Decisions Ralf Möller Universität zu Lübeck Institut für Informationssysteme.
CSE 473 Uncertainty. © UW CSE AI Faculty 2 Many Techniques Developed Fuzzy Logic Certainty Factors Non-monotonic logic Probability Only one has stood.
Making Simple Decisions Chapter 16 Some material borrowed from Jean-Claude Latombe and Daphne Koller by way of Marie desJadines,
Probabilistic Robotics Probability Theory Basics Error Propagation Slides from Autonomous Robots (Siegwart and Nourbaksh), Chapter 5 Probabilistic Robotics.
Instructor: Eyal Amir Grad TAs: Wen Pu, Yonatan Bisk Undergrad TAs: Sam Johnson, Nikhil Johri CS 440 / ECE 448 Introduction to Artificial Intelligence.
Decision Making Under Uncertainty CMSC 471 – Fall 2011 Class #23-24 – Thursday, November 17 / Tuesday, November 22 R&N, Chapters , ,
Matching ® ® ® Global Map Local Map … … … obstacle Where am I on the global map?                                   
CS 541: Artificial Intelligence Lecture VIII: Temporal Probability Models.
CS 541: Artificial Intelligence Lecture VIII: Temporal Probability Models.
CS498-EA Reasoning in AI Lecture #23 Instructor: Eyal Amir Fall Semester 2011.
Ralf Möller Universität zu Lübeck Institut für Informationssysteme
Making Simple Decisions
Where are we in CS 440? Now leaving: sequential, deterministic reasoning Entering: probabilistic reasoning and machine learning.
Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 14
Probabilistic Reasoning over Time
Probabilistic Reasoning over Time
Rational Decisions and
13. Acting under Uncertainty Wolfram Burgard and Bernhard Nebel
Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 14
Making Simple Decisions
Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 14
Presentation transcript:

Decision Making Under Uncertainty CMSC 471 – Spring 2014 Class #12– Thursday, March 6 R&N, Chapters , material from Lise Getoor, Jean-Claude Latombe, and Daphne Koller 1

MODELING UNCERTAINTY OVER TIME 2

Temporal Probabilistic Agent environment agent ? sensors actuators t 1, t 2, t 3, … 3

Time and Uncertainty The world changes; we need to track and predict it Examples: diabetes management, traffic monitoring Basic idea: Copy state and evidence variables for each time step Model uncertainty in change over time, incorporating new observations as they arrive Reminiscent of situation calculus, but for stochastic domains X t – set of unobservable state variables at time t e.g., BloodSugar t, StomachContents t E t – set of evidence variables at time t e.g., MeasuredBloodSugar t, PulseRate t, FoodEaten t Assumes discrete time steps 4

States and Observations Process of change is viewed as series of snapshots, each describing the state of the world at a particular time Each time slice is represented by a set of random variables indexed by t: 1. the set of unobservable state variables X t 2. the set of observable evidence variables E t The observation at time t is E t = e t for some set of values e t The notation X a:b denotes the set of variables from X a to X b 5

Stationary Process/Markov Assumption Markov Assumption: X t depends on some previous X i s First-order Markov process: P(X t |X 0:t-1 ) = P(X t |X t-1 ) kth order: depends on previous k time steps Sensor Markov assumption: P(E t |X 0:t, E 0:t-1 ) = P(E t |X t ) That is: The agent’s observations depend only on the actual current state of the world Assume stationary process: Transition model P(X t |X t-1 ) and sensor model P(E t |X t ) are time-invariant (i.e., they are the same for all t) That is, the changes in the world state are governed by laws that do not themselves change over time 6

Complete Joint Distribution Given: Transition model: P(X t |X t-1 ) Sensor model: P(E t |X t ) Prior probability: P(X 0 ) Then we can specify the complete joint distribution of a sequence of states: 7

Example Rain t-1 Umbrella t-1 Rain t Umbrella t Rain t+1 Umbrella t+1 R t-1 P(R t |R t-1 ) TFTF RtRt P(U t |R t ) TFTF This should look a lot like a finite state automaton (since it is one...) 8

Inference Tasks Filtering or monitoring: P(X t |e 1,…,e t ) Compute the current belief state, given all evidence to date Prediction: P(X t+k |e 1,…,e t ) Compute the probability of a future state Smoothing: P(X k |e 1,…,e t ) Compute the probability of a past state (hindsight) Most likely explanation: arg max x1,..xt P(x 1,…,x t |e 1,…,e t ) Given a sequence of observations, find the sequence of states that is most likely to have generated those observations 9

Examples Filtering: What is the probability that it is raining today, given all of the umbrella observations up through today? Prediction: What is the probability that it will rain the day after tomorrow, given all of the umbrella observations up through today? Smoothing: What is the probability that it rained yesterday, given all of the umbrella observations through today? Most likely explanation: If the umbrella appeared the first three days but not on the fourth, what is the most likely weather sequence to produce these umbrella sightings? 10

Filtering We use recursive estimation to compute P(X t+1 | e 1:t+1 ) as a function of e t+1 and P(X t | e 1:t ) We can write this as follows: This leads to a recursive definition: f 1:t+1 =  FORWARD (f 1:t, e t+1 ) QUIZLET: What is α? 11

Filtering Example Rain t-1 Umbrella t-1 Rain t Umbrella t Rain t+1 Umbrella t+1 R t-1 P(R t |R t-1 ) TFTF RtRt P(U t |R t ) TFTF What is the probability of rain on Day 2, given a uniform prior of rain on Day 0, U 1 = true, and U 2 = true? 12

DECISION MAKING UNDER UNCERTAINTY

Decision Making Under Uncertainty Many environments have multiple possible outcomes Some of these outcomes may be good; others may be bad Some may be very likely; others unlikely What’s a poor agent to do?? 14

Non-Deterministic vs. Probabilistic Uncertainty ? bac {a,b,c}  decision that is best for worst case ? bac {a(p a ), b(p b ), c(p c )}  decision that maximizes expected utility value Non-deterministic modelProbabilistic model ~ Adversarial search 15

Expected Utility Random variable X with n values x 1,…,x n and distribution (p 1,…,p n ) E.g.: X is the state reached after doing an action A under uncertainty Function U of X E.g., U is the utility of a state The expected utility of A is EU[A] =  i=1,…,n p(x i |A)U(x i ) 16

s0s0 s3s3 s2s2 s1s1 A U(A1, S0) = 100 x x x 0.1 = = 62 One State/One Action Example 17

s0s0 s3s3 s2s2 s1s1 A A2 s4s U (A1, S0) = 62 U (A2, S0) = 74 U (S0) = max a {U(a,S0)} = 74 One State/Two Actions Example 18

s0s0 s3s3 s2s2 s1s1 A A2 s4s U (A1, S0) = 62 – 5 = 57 U (A2, S0) = 74 – 25 = 49 U (S0) = max a {U(a, S0)} = Introducing Action Costs 19

MEU Principle A rational agent should choose the action that maximizes agent’s expected utility This is the basis of the field of decision theory The MEU principle provides a normative criterion for rational choice of action 20

Not quite… Must have a complete model of: Actions Utilities States Even if you have a complete model, decision making is computationally intractable In fact, a truly rational agent takes into account the utility of reasoning as well (bounded rationality) Nevertheless, great progress has been made in this area recently, and we are able to solve much more complex decision-theoretic problems than ever before 21

Axioms of Utility Theory Orderability (A>B)  (A<B)  (A~B) Transitivity (A>B)  (B>C)  (A>C) Continuity A>B>C   p [p,A; 1-p,C] ~ B Substitutability A~B  [p,A; 1-p,C]~[p,B; 1-p,C] Monotonicity A>B  (p≥q  [p,A; 1-p,B] >~ [q,A; 1-q,B]) Decomposability [p,A; 1-p, [q,B; 1-q, C]] ~ [p,A; (1-p)q, B; (1-p)(1-q), C] 22

Money Versus Utility Money <> Utility More money is better, but not always in a linear relationship to the amount of money Expected Monetary Value Risk-averse: U(L) < U(S EMV(L) ) Risk-seeking: U(L) > U(S EMV(L) ) Risk-neutral: U(L) = U(S EMV(L) ) 23

Value Function Provides a ranking of alternatives, but not a meaningful metric scale Also known as an “ordinal utility function” Sometimes, only relative judgments (value functions) are necessary At other times, absolute judgments (utility functions) are required 24