A Framework for Planning in Continuous-time Stochastic Domains Håkan L. S. Younes Carnegie Mellon University David J. MuslinerReid G. Simmons Honeywell.

Slides:



Advertisements
Similar presentations
MaC Monitoring and Checking at Runtime (Continue) Presented By Usa Sammapun CIS 700 Oct 12, 2005.
Advertisements

CLASSICAL PLANNING What is planning ?  Planning is an AI approach to control  It is deliberation about actions  Key ideas  We have a model of the.
7 Chapter Hypothesis Testing with One Sample
Probabilistic Verification of Discrete Event Systems using Acceptance Sampling Håkan L. S. YounesReid G. Simmons Carnegie Mellon University.
PROBABILITY. Uncertainty  Let action A t = leave for airport t minutes before flight from Logan Airport  Will A t get me there on time ? Problems :
1 1 Slide IS 310 – Business Statistics IS 310 Business Statistics CSU Long Beach.
Statistical Probabilistic Model Checking Håkan L. S. Younes Carnegie Mellon University.
Ymer: A Statistical Model Checker Håkan L. S. Younes Carnegie Mellon University.
Probabilistic Verification of Discrete Event Systems Håkan L. S. Younes Reid G. Simmons (initial work performed at HTC, Summer 2001)
1 Statistical Inference H Plan: –Discuss statistical methods in simulations –Define concepts and terminology –Traditional approaches: u Hypothesis testing.
TADA Transition Aligned Domain Analysis T J. Benton and Kartik Talamadupula and Subbarao Kambhampati.
Probabilistic Verification of Discrete Event Systems Håkan L. S. Younes.
The Rare Glitch Project: Verification Tools for Embedded Systems Carnegie Mellon University Pittsburgh, PA Ed Clarke, David Garlan, Bruce Krogh, Reid Simmons,
Probabilistic Temporal Planning with Uncertain Durations Mausam Joint work with Daniel S. Weld University of Washington Seattle.
1 1 Slide © 2011 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole.
Policy Generation for Continuous-time Stochastic Domains with Concurrency Håkan L. S. YounesReid G. Simmons Carnegie Mellon University.
Overview of Statistical Hypothesis Testing: The z-Test
CHAPTER 10: Hypothesis Testing, One Population Mean or Proportion
Hypothesis Testing – Introduction
1 © Lecture note 3 Hypothesis Testing MAKE HYPOTHESIS ©
Chapter 8 Introduction to Hypothesis Testing
1 1 Slide © 2005 Thomson/South-Western Chapter 9, Part B Hypothesis Tests Population Proportion Population Proportion Hypothesis Testing and Decision Making.
Planning with Concurrency in Continuous-time Stochastic Domains (thesis proposal) Håkan L. S. Younes Thesis Committee: Reid G. Simmons (chair) Geoffrey.
Chapter 9 Testing a Claim
Planning and Verification for Stochastic Processes with Asynchronous Events Håkan L. S. Younes Carnegie Mellon University.
Software Reliability SEG3202 N. El Kadri.
Copyright © 2012 by Nelson Education Limited. Chapter 7 Hypothesis Testing I: The One-Sample Case 7-1.
1 1 Slide © 2008 Thomson South-Western. All Rights Reserved Slides by JOHN LOUCKS St. Edward’s University.
Planning and Execution with Phase Transitions Håkan L. S. Younes Carnegie Mellon University Follow-up paper to Younes & Simmons’ “Solving Generalized Semi-Markov.
Copyright © Cengage Learning. All rights reserved. 8 Introduction to Statistical Inferences.
Chapter 8 Introduction to Hypothesis Testing ©. Chapter 8 - Chapter Outcomes After studying the material in this chapter, you should be able to: 4 Formulate.
1 1 Slide © 2005 Thomson/South-Western Slides Prepared by JOHN S. LOUCKS St. Edward’s University Slides Prepared by JOHN S. LOUCKS St. Edward’s University.
5.1 Chapter 5 Inference in the Simple Regression Model In this chapter we study how to construct confidence intervals and how to conduct hypothesis tests.
Carnegie Mellon University
Extending PDDL to Model Stochastic Decision Processes Håkan L. S. Younes Carnegie Mellon University.
Decision-Theoretic Planning with Asynchronous Events Håkan L. S. Younes Carnegie Mellon University.
Probabilistic Verification of Discrete Event Systems using Acceptance Sampling Håkan L. S. Younes Carnegie Mellon University.
Learning Simio Chapter 10 Analyzing Input Data
Simulation Chapter 16 of Quantitative Methods for Business, by Anderson, Sweeney and Williams Read sections 16.1, 16.2, 16.3, 16.4, and Appendix 16.1.
Probabilistic Verification of Discrete Event Systems Håkan L. S. Younes.
A Formalism for Stochastic Decision Processes with Asynchronous Events Håkan L. S. YounesReid G. Simmons Carnegie Mellon University.
 Simulation enables the study of complex system.  Simulation is a good approach when analytic study of a system is not possible or very complex.  Informational,
1 Outline 1. Hypothesis Tests – Introduction 2. Technical vocabulary Null Hypothesis Alternative Hypothesis α (alpha) β (beta) 3. Hypothesis Tests – Format.
If we fail to reject the null when the null is false what type of error was made? Type II.
Håkan L. S. YounesDavid J. Musliner Carnegie Mellon UniversityHoneywell Laboratories Probabilistic Plan Verification through Acceptance Sampling.
Building Valid, Credible & Appropriately Detailed Simulation Models
+ Chapter 9 Testing a Claim 9.1Significance Tests: The Basics 9.2Tests about a Population Proportion 9.3Tests about a Population Mean.
Simulation Modeling.
Chapter 9: Testing a Claim
Statistical probabilistic model checking
Chapter 9: Testing a Claim
Chapter 9: Testing a Claim
Chapter 9: Testing a Claim
Hypothesis Testing – Introduction
Chapter 9: Hypothesis Testing
St. Edward’s University
YOU HAVE REACHED THE FINAL OBJECTIVE OF THE COURSE
Statistical Model-Checking of “Black-Box” Probabilistic Systems VESTA
Slides by JOHN LOUCKS St. Edward’s University.
Chapter 9: Testing a Claim
Chapter 9: Testing a Claim
Chapter 9: Testing a Claim
Statistical Probabilistic Model Checking
Chapter 9: Testing a Claim
Chapter 9: Testing a Claim
Chapter 9: Testing a Claim
Chapter 9: Testing a Claim
Chapter 9: Testing a Claim
Unit 5: Hypothesis Testing
Chapter 9: Testing a Claim
Presentation transcript:

A Framework for Planning in Continuous-time Stochastic Domains Håkan L. S. Younes Carnegie Mellon University David J. MuslinerReid G. Simmons Honeywell LaboratoriesCarnegie Mellon University

Introduction Policy generation for complex domains Uncertainty in outcome and timing of actions and events Time as a continuous quantity Concurrency Rich goal formalism Achievement, maintenance, prevention Deadlines

Motivating Example Deliver package from CMU to Honeywell CMU PIT Pittsburgh MSP Honeywell Minneapolis

Elements of Uncertainty Uncertain duration of flight and taxi ride Plane can get full without reservation Taxi might not be at airport when arriving in Minneapolis Package can get lost at airports

Modeling Uncertainty Associate a delay distribution F(t) with each action/event a F(t) is the cumulative distribution function for the delay from when a is enabled until it triggers drivingat airport arrive U(20,40) Pittsburgh Taxi

Concurrency Concurrent semi-Markov processes drivingat airport U(20,40) Pittsburgh Taxi at airportmoving arrive move return Exp(1/40) U(10,20) Minneapolis Taxi PT driving MT at airport PT driving MT moving t=0 t=24 MT move Generalized semi-Markov process

Rich Goal Formalism Goals specified as CSL formulae  ::= true | a |    |  | Pr ≥  (  )  ::=  U ≤t  | ◊ ≤t  | □ ≤t 

Goal for Motivating Example Probability at least 0.9 that the package reaches Honeywell within 300 minutes without getting lost on the way Pr ≥0.9 (  pkg lost U ≤300

Problem Specification Given: Complex domain model Stochastic discrete event system Initial state Probabilistic temporally extended goal CSL formula Wanted: Policy satisfying goal formula in initial state

Generate, Test and Debug [Simmons 88] Generate initial policy Test if policy is good Debug and repair policy good badrepeat

Generate Ways of generating initial policy Generate policy for relaxed problem Use existing policy for similar problem Start with null policy Start with random policy Not focus of this talk! Generate Test Debug

Test Use discrete event simulation to generate sample execution paths Use acceptance sampling to verify probabilistic CSL goal conditions Generate Test Debug

Analyze sample paths generated in test step to find reasons for failure Change policy to reflect outcome of failure analysis Generate Test Debug

More on Test Step Generate initial policy Test if policy is good Debug and repair policy Test

Error Due to Sampling Probability of false negative: ≤  Rejecting a good policy Probability of false positive: ≤  Accepting a bad policy (1−  )-soundness

Acceptance Sampling Hypothesis: Pr ≥  (  )

Performance of Test Actual probability of  holding Probability of accepting Pr ≥  (  ) as true  1 –  

Ideal Performance Actual probability of  holding Probability of accepting Pr ≥  (  ) as true  1 –   False negatives False positives

Realistic Performance  –  +  Indifference region Actual probability of  holding Probability of accepting Pr ≥  (  ) as true  1 –   False negatives False positives

Sequential Acceptance Sampling [Wald 45] Hypothesis: Pr ≥  (  ) True, false, or another sample?

Graphical Representation of Sequential Test Number of samples Number of positive samples

Graphical Representation of Sequential Test We can find an acceptance line and a rejection line given , , , and  Reject Accept Continue sampling Number of samples Number of positive samples A , , ,  (n) R , , ,  (n)

Graphical Representation of Sequential Test Reject hypothesis Reject Accept Continue sampling Number of samples Number of positive samples

Graphical Representation of Sequential Test Accept hypothesis Reject Accept Continue sampling Number of samples Number of positive samples

Anytime Policy Verification Find best acceptance and rejection lines after each sample in terms of  and  Number of samples Number of positive samples

Verification Example Initial policy for example problem CPU time (seconds) Error probability  =0.01

More on Debug Step Generate initial policy Test if policy is good Debug and repair policy Debug

Role of Negative Sample Paths Negative sample paths provide evidence on how policy can fail “Counter examples”

Generic Repair Procedure 1. Select some state along some negative sample path 2. Change the action planned for the selected state Need heuristics to make informed state/action choices

Scoring States Assign –1 to last state along negative sample path and propagate backwards Add over all negative sample paths s9s9 failures1s1 s5s5 s2s2 s1s1 s5s5 s3s3 −1 −− −2−2 −3−3 −4−4 −− −2−2 −3−3 s5s5 s5s5

Example Package gets lost at Minneapolis airport while waiting for the taxi Repair: store package until taxi arrives

Verification of Repaired Policy CPU time (seconds) Error probability  =0.01

Comparing Policies Use acceptance sampling: Pair samples from the verification of two policies Count pairs where policies differ Prefer first policy if probability is at least 0.5 of pairs where first policy is better

Summary Framework for dealing with complex stochastic domains Efficient sampling-based anytime verification of policies Initial work on debug and repair heuristics