Effective Approaches for Partial Satisfaction (Over-subscription) Planning Romeo Sanchez * Menkes van den Briel ** Subbarao Kambhampati * * Department.

Slides:



Advertisements
Similar presentations
BEST FIRST SEARCH - BeFS
Advertisements

Planning Module THREE: Planning, Production Systems,Expert Systems, Uncertainty Dr M M Awais.
Greedy best-first search Use the heuristic function to rank the nodes Search strategy –Expand node with lowest h-value Greedily trying to find the least-cost.
Planning Module THREE: Planning, Production Systems,Expert Systems, Uncertainty Dr M M Awais.
CLASSICAL PLANNING What is planning ?  Planning is an AI approach to control  It is deliberation about actions  Key ideas  We have a model of the.
Dana Nau: Lecture slides for Automated Planning Licensed under the Creative Commons Attribution-NonCommercial-ShareAlike License:
An LP-Based Heuristic for Optimal Planning Menkes van den Briel Department of Industrial Engineering Arizona State University
MBD and CSP Meir Kalech Partially based on slides of Jia You and Brian Williams.
Top 5 Worst Times For A Conference Talk 1.Last Day 2.Last Session of Last Day 3.Last Talk of Last Session of Last Day 4.Last Talk of Last Session of Last.
1 Graphplan José Luis Ambite * [* based in part on slides by Jim Blythe and Dan Weld]
~1~ Infocom’04 Mar. 10th On Finding Disjoint Paths in Single and Dual Link Cost Networks Chunming Qiao* LANDER, CSE Department SUNY at Buffalo *Collaborators:
Constraint Optimization Presentation by Nathan Stender Chapter 13 of Constraint Processing by Rina Dechter 3/25/20131Constraint Optimization.
Planning with Constraints Graphplan & SATplan Yongmei Shi.
5-1 Chapter 5 Tree Searching Strategies. 5-2 Satisfiability problem Tree representation of 8 assignments. If there are n variables x 1, x 2, …,x n, then.
Fluent Merging: A General Technique to Improve Reachability Heuristics and Factored Planning Menkes van den Briel Department of Industrial Engineering.
Planning CSE 473 Chapters 10.3 and 11. © D. Weld, D. Fox 2 Planning Given a logical description of the initial situation, a logical description of the.
Planning Copyright, 1996 © Dale Carnegie & Associates, Inc. Chapter 11.
Branch and Bound Searching Strategies
1 Integer Programming Approaches for Automated Planning Menkes van den Briel Department of Industrial Engineering Arizona State University
Over-subscription Planning with Numeric Goals J. Benton Computer Sci. & Eng. Dept. Arizona State University Tempe, AZ Minh Do Palo Alto Research Center.
Extending Graphplan to handle Resources Presenter: Pham Van Cuong Department of Computer Science New Mexico State University.
Finding Admissible Bounds for Over- subscribed Planning Problems J. Benton Menkes van den BrielSubbarao Kambhampati Arizona State University.
1 Planning. R. Dearden 2007/8 Exam Format  4 questions You must do all questions There is choice within some of the questions  Learning Outcomes: 1.Explain.
Planning Graph-based Heuristics for Cost-sensitive Temporal Planning Minh B. Do & Subbarao Kambhampati CSE Department, Arizona State University
A Hybrid Linear Programming and Relaxed Plan Heuristic for Partial Satisfaction Planning Problems J. Benton Menkes van den BrielSubbarao Kambhampati Arizona.
25 th Feb EBL example GP-CSP Other compilation schemes Setting up encodings.
Problem Solving and Search in AI Heuristic Search
Expressive and Efficient Frameworks for Partial Satisfaction Planning Subbarao Kambhampati Arizona State University (Proposal submitted for consideration.
Minh Do - PARC Planning with Goal Utility Dependencies J. Benton Department of Computer Science Arizona State University Tempe, AZ Subbarao.
1 Tree Searching Strategies. 2 The procedure of solving many problems may be represented by trees. Therefore the solving of these problems becomes a tree.
Reviving Integer Programming Approaches for AI Planning: A Branch-and-Cut Framework Thomas Vossen Leeds School of Business University of Colorado at Boulder.
Heuristics for Planning  Initial project advisors assigned—please contact them  Project 1 ready; due 2/14  Due to underwhelming interest, note-taking.
Branch and Bound Algorithm for Solving Integer Linear Programming
CS121 Heuristic Search Planning CSPs Adversarial Search Probabilistic Reasoning Probabilistic Belief Learning.
Local Search Techniques for Temporal Planning in LPG Paper by Gerevini, Serina, Saetti, Spinoni Presented by Alex.
RePOP: Reviving Partial Order Planning
Brute Force Search Depth-first or Breadth-first search
Classical Planning Chapter 10.
Boolean Satisfiability and SAT Solvers
Introduction to Job Shop Scheduling Problem Qianjun Xu Oct. 30, 2001.
Informed State Space Search Department of Computer Science & Engineering Indian Institute of Technology Kharagpur.
State-Space Searches. 2 State spaces A state space consists of A (possibly infinite) set of states The start state represents the initial problem Each.
CP Summer School Modelling for Constraint Programming Barbara Smith 2. Implied Constraints, Optimization, Dominance Rules.
Heuristics in Search-Space CSE 574 April 11, 2003 Dan Weld.
AI Lecture 17 Planning Noémie Elhadad (substituting for Prof. McKeown)
Introduction to Artificial Intelligence Class 1 Planning & Search Henry Kautz Winter 2007.
RePOP: Reviving Partial Order Planning XuanLong Nguyen & Subbarao Kambhampati Yochan Group:
Problem Reduction So far we have considered search strategies for OR graph. In OR graph, several arcs indicate a variety of ways in which the original.
Robust Planning using Constraint Satisfaction Techniques Daniel Buettner and Berthe Y. Choueiry Constraint Systems Laboratory Department of Computer Science.
Planning I: Total Order Planners Sections
ICS 353: Design and Analysis of Algorithms Backtracking King Fahd University of Petroleum & Minerals Information & Computer Science Department.
Searching a Linear Subspace Lecture VI. Deriving Subspaces There are several ways to derive the nullspace matrix (or kernel matrix). ◦ The methodology.
Dana Nau: Lecture slides for Automated Planning Licensed under the Creative Commons Attribution-NonCommercial-ShareAlike License:
Heuristic Search Planners. 2 USC INFORMATION SCIENCES INSTITUTE Planning as heuristic search Use standard search techniques, e.g. A*, best-first, hill-climbing.
1 Chapter 6 Planning-Graph Techniques. 2 Motivation A big source of inefficiency in search algorithms is the branching factor  the number of children.
RePOP: Reviving Partial Order Planning
Artificial Intelligence Problem solving by searching CSC 361
Minh Do, Terry Zimmerman, and Subbarao Kambhampati
Class #17 – Thursday, October 27
Graphplan/ SATPlan Chapter
Searching for Solutions
Class #19 – Monday, November 3
Chapter 6 Planning-Graph Techniques
Introducing Underestimates
Graphplan/ SATPlan Chapter
Graphplan/ SATPlan Chapter
The Rich/Knight Implementation
Tree Searching Strategies
[* based in part on slides by Jim Blythe and Dan Weld]
The Rich/Knight Implementation
Presentation transcript:

Effective Approaches for Partial Satisfaction (Over-subscription) Planning Romeo Sanchez * Menkes van den Briel ** Subbarao Kambhampati * * Department of Computer Science and Engineering ** Department of Industrial Engineering Arizona State University Tempe, Arizona

Outline  Background  Example  Approaches  Optiplan  Altaltps  Sapaps  Planning graph heuristics  Results

For all your demands, you could’ve bought me a better flash memory stick at least! In one day achieve the following 100 goals: RockData at WP 1, high-res pics at WP 2 & 3, …., SoilData at WP 100 No way I can achieve that many goals in one day It’s hard but here is the best I can do: Goal1, Goal5, Goal99 Given: Actions with costs, and goals with utilities, find a plan that has a highest {utility – cost} Previous Approaches:  Highest utility goal first  Estimating the set of most beneficial goals Background

 Complete satisfaction (traditional) planning  Goal state G is a list of conjunctions: G = g 1  g 2  …  g n  A plan that achieves n – 1 goal fluents is as good as a plan that achieves 0 goal fluents  Partial satisfaction planning (PSP)  Goal state G is a list of fluents: G = {g 1, g 2, …, g n }  Goal fluents might have utilities, actions might have costs, therefore achieving a partial plan might be more beneficial than the “null” plan.  Achieving all goal fluents might be impossible…  The goal state G may contain logically conflicting fluents  There might not be enough resources to achieve all fluents in G (:goal (and (pointing satellite1 moon) (pointing satellite1 mars) )) (:goal (and (have_rock rover1 waypoint1) (have_rock rover1 waypoint2) ))

PSP problems  PSP Net benefit:  Given a planning problem P = (F, A, I, G), and for each action a “cost” c a  0, and for each goal fluent f  G a “utility” u f  0, and a positive number k. Is there a finite sequence of actions  = (a 1, a 2, …, a n ) that starting from I leads to a state S that has net benefit  f  (S  G) u f –  a  c a  k. PLAN EXISTENCE PLAN LENGTH PSP GOAL LENGTH PSP GOAL PLAN COSTPSP UTILITY PSP UTILITY COST PSP NET BENEFIT

Example  Getting from Las Vegas (LV) to San Jose (SJ) C: action cost U(G): utility of goal G G1,G2,G3,G4: goals P = {travel(LV,DL), travel(DL,SJ), travel(SJ,SF)} achieves G1, G2, G3

Approaches  Optiplan  Integer programming based STRIPS planner  Solves the PSP problem by encoding it as an integer program  Altaltps  Heuristic regression planner  Solves the PSP problem through a goal selection heuristic  Sapaps  Heuristic forward state space planner  Solves the PSP problem using an anytime A* algorithm

Optiplan  Optiplan planning system:  Combines Graphplan (Blum & Furst, 1995) with State Change Encoding (Vossen et al., 1999)  As in the Blackbox planning system, Graphplan reduces the encoding size generated by Optiplan  Computes optimal plans for a given parallel length  Objective:   f  G U f (x_add f,n + x_preadd f,n + x_maintain f,n ) –  l  L  a  A C a y a,l  Sum of goal utilities – Sum of action cost

Optiplan and partial satisfaction Objective  0 / Minimize #actions Constraints  Fluent changes  Satisfy initial state  Satisfy goal  Fluent implications  Action implications  Total satisfaction planning: goal satisfaction is treated as a hard constraint Objective  Maximize net benefit  Goal utility – action cost Constraints  Fluent changes  Satisfy initial state  Fluent implications  Actions implications  Partial satisfaction planning: goal satisfaction is treated as a soft constraint

Graphplan based cost propagation

AltAlt ps  AltAlt planning system  Heuristic state-space search planner (Nguyen, Kambhampati & Sanchez, 2002)  Combines Graphplan (Blum & Furst, 1995) with heuristic state- space search techniques (Bonet, Loerincs & Geffner, 1997; Bonet Geffner, 1999; McDermott 1999)  AltAlt ps planning system  Total enumeration on 2 n goal subsets is too costly  Selects a promising subset of the top-level goals upfront  Searches for a plan using a regression state space search combined with cost-sensitive planning graph heuristics.

AltAlt ps cost propagation  Using a planning graph structure  Propositions in the initial state come for free (they have zero cost)  Other propositions have costs computed as follows:  Propagation procedures  Max-propagation  Sum-propagation l=0l=1l=2 h l (p) = Cost of proposition p at level l 0 if p  I h l (p) =min{h l-1 (p), cost(a) + C l (a)} if l > 0  otherwise C l (a) = max{h l-1 (q) : q  prec(a)} C l (a) =  q  prec(a) h l-1 (q) 44

AltAlt ps goal set selection  Main idea  Start with the original goal set G and an empty goal set G’  Iteratively add goals to G’ as long as the estimated NET BENEFIT increases  The cost of adding another goal g to G’ depends on the goals that are already in G’ G’ G’  g Cost for achieving G’ Residual cost for g Relaxed plan for G’ (R’p) Rp for G’  g biased to re-use actions in R’p

AltAlt ps cost-sensitive relaxed plan heuristic  General procedure  States are ranked during search using the relaxed plan heuristic and the propagated costs  The idea is to compute the cost of a relaxed plan Rp in terms of the costs of the actions composing it.  Heuristic value for S equal h(S) =  a  Rp cost(a) 1. Given a state S, remove the (sub)goal g from S that has highest h l (g) 2. Select the action that supports g with lowest cost ( cost(a) + C l (a) ) 3. Regress S over a to get S’ = S  prec(a) \ eff(a) 4. Stop when each proposition q  S is present in the initial state

Sapaps

 Nodes evaluation:  g(S) = U(S) – C(S)  h(S) = U(RP(S)) – C(RP(S))  Beneficial Node:  g(S) > 0 or U(S) > C(S)  Termination Node:  V S’: g(S) > f(S’) A*: f(S) = g(S) + h(S) A1: Navigate(X,Y)A2: SampleSoil(Y) A3: TakePicture A4: Navigate(Y,Z) A5: SampleRock g(S) = Util(HasSoilData) – Cost(A1,A2) h(S) = Util(Apply(A3,S)) – Cost(A3) Anytime A* Algorithm: Search through best beneficial nodes SAPA PS : a forward A* approach for PSP

Heuristic: Variation of SAPA’s Approach Heuristically extracting the least cost relaxed plan using cost-function Remove “unbeneficial” goals and related actions G1 G2 G3 A1 A2 A3 A4 → G1 G2 A1 A3 C(A1) + C(A2) > U(G3) SAPA PS : heuristic

Empirical results

Future work