Winning by being lazy: Hierarchy, Abstraction and Least- commitment in the new-age planning Subbarao Kambhampati Arizona State University rakaposhi.eas.asu.edu/yochan.html.

Slides:



Advertisements
Similar presentations
Learning to Improve the Quality of Plans Produced by Partial-order Planners M. Afzal Upal Intelligent Agents & Multiagent Systems Lab.
Advertisements

Practical Planning: Scheduling and Hierarchical Task Networks Chapter CS 63 Adapted from slides by Tim Finin and Marie desJardins.
CSE391 – 2005 NLP 1 Planning The Planning problem Planning with State-space search.
Planning
Planning Module THREE: Planning, Production Systems,Expert Systems, Uncertainty Dr M M Awais.
Planning Module THREE: Planning, Production Systems,Expert Systems, Uncertainty Dr M M Awais.
CLASSICAL PLANNING What is planning ?  Planning is an AI approach to control  It is deliberation about actions  Key ideas  We have a model of the.
Opmaker2: Efficient Action Schema Acquisition T.L.McCluskey, S.N.Cresswell, N. E. Richardson and M.M.West The University of Huddersfield,UK
PLANNING IN AI. Determine the set of steps that are necessary to achieve a goal Some steps might be conditional, i.e., they are only taken when a set.
Best-First Search: Agendas
Planning CSE 473 Chapters 10.3 and 11. © D. Weld, D. Fox 2 Planning Given a logical description of the initial situation, a logical description of the.
1 Chapter 16 Planning Methods. 2 Chapter 16 Contents (1) l STRIPS l STRIPS Implementation l Partial Order Planning l The Principle of Least Commitment.
4 th Nov, Oct 23 rd Happy Deepavali!. 10/23 SAT & CSP.
Planning Copyright, 1996 © Dale Carnegie & Associates, Inc. Chapter 11.
Planning II GraphPlan and Russell and Norvig: ch.12 CMSC421 – Fall 2006 based on material from Jim Blythe, JC Latombe, Marie desJardins and Daphne Koller.
Artificial Intelligence Chapter 11: Planning
MA2: Recent Advances in AI Planning: A Unified View Subbarao Kambhampati 3/11.
A: A Unified Brand-name-Free Introduction to Planning Subbarao Kambhampati.
Knowledge and Systems Research Group, University of Huddersfield B vs OCL: Comparing Specification Languages for Planning Domains Diane Kitchin, Lee McCluskey,
1 Planning. R. Dearden 2007/8 Exam Format  4 questions You must do all questions There is choice within some of the questions  Learning Outcomes: 1.Explain.
MA2: Recent Advances in AI Planning: A Unified View Subbarao Kambhampati 10/4: Knowledge-based Planning  Semester Project Proposals due.
A: A Unified Brand-name-Free Introduction to Planning Subbarao Kambhampati Jan 28 th My lab was hacked and the systems are being rebuilt.. Homepage is.
Handling non-determinism and incompleteness. Problems, Solutions, Success Measures: 3 orthogonal dimensions  Incompleteness in the initial state  Un.
Planning Copyright, 1996 © Dale Carnegie & Associates, Inc. Chapter 11.
Hierarchical GUI Test Case Generation Using Automated Planning Atif M. Memon, Student Member, IEEE, Martha E. Pollack, and Mary Lou Soffa, Member, IEEE.
1 BLACKBOX: A New Approach to the Application of Theorem Proving to Problem Solving Bart Selman Cornell University Joint work with Henry Kautz AT&T Labs.
MA2: Recent Advances in AI Planning: A Unified View Subbarao Kambhampati 2/7: Knowledge-based Planning.
Automated Planning and HTNs Planning – A brief intro Planning – A brief intro Classical Planning – The STRIPS Language Classical Planning – The STRIPS.
1 Planning Chapters 11 and 12 Thanks: Professor Dan Weld, University of Washington.
Planning II CSE 573. © Daniel S. Weld 2 Logistics Reading for Wed Ch 18 thru 18.3 Office Hours No Office Hour Today.
Classical Planning Chapter 10.
Domain-Independent Plan Adaptation Héctor Muñoz-Avila Department of Computer Science and Engineering Lehigh University USA.
(Classical) AI Planning. Some Examples Route search: Find a route between Lehigh University and the Naval Research Laboratory Project management: Construct.
Dana Nau: Lecture slides for Automated Planning Licensed under the Creative Commons Attribution-NonCommercial-ShareAlike License:
Homework 1 ( Written Portion )  Max : 75  Min : 38  Avg : 57.6  Median : 58 (77%)
Planning (Chapter 10)
Software Product Line Material based on slides and chapter by Linda M. Northrop, SEI.
Computing & Information Sciences Kansas State University Lecture 21 of 42 CIS 530 / 730 Artificial Intelligence Lecture 21 of 42 Planning: Graph Planning.
Planning (Chapter 10)
Dana Nau: Lecture slides for Automated Planning Licensed under the Creative Commons Attribution-NonCommercial-ShareAlike License:
1 Chapter 16 Planning Methods. 2 Chapter 16 Contents (1) l STRIPS l STRIPS Implementation l Partial Order Planning l The Principle of Least Commitment.
Automated Planning Dr. Héctor Muñoz-Avila. What is Planning? Classical Definition Domain Independent: symbolic descriptions of the problems and the domain.
AI Lecture 17 Planning Noémie Elhadad (substituting for Prof. McKeown)
© 2006 Pearson Addison-Wesley. All rights reserved 2-1 Chapter 2 Principles of Programming & Software Engineering.
Problem Reduction So far we have considered search strategies for OR graph. In OR graph, several arcs indicate a variety of ways in which the original.
Intro to Planning Or, how to represent the planning problem in logic.
Dana Nau: Lecture slides for Automated Planning Licensed under the Creative Commons Attribution-NonCommercial-ShareAlike License:
(Classical) AI Planning. General-Purpose Planning: State & Goals Initial state: (on A Table) (on C A) (on B Table) (clear B) (clear C) Goals: (on C Table)
Robust Planning using Constraint Satisfaction Techniques Daniel Buettner and Berthe Y. Choueiry Constraint Systems Laboratory Department of Computer Science.
1 CMSC 471 Fall 2004 Class #21 – Thursday, November 11.
1 CMSC 471 Fall 2002 Class #24 – Wednesday, November 20.
Planning I: Total Order Planners Sections
Graphplan CSE 574 April 4, 2003 Dan Weld. Schedule BASICS Intro Graphplan SATplan State-space Refinement SPEEDUP EBL & DDB Heuristic Gen TEMPORAL Partial-O.
Dana Nau: Lecture slides for Automated Planning Licensed under the Creative Commons Attribution-NonCommercial-ShareAlike License:
ADVANCED PLANNING TECHNIQUES Dr. Adam Anthony Lecture 22.
Dana Nau: Lecture slides for Automated Planning Licensed under the Creative Commons Attribution-NonCommercial-ShareAlike License:
Heuristic Search Planners. 2 USC INFORMATION SCIENCES INSTITUTE Planning as heuristic search Use standard search techniques, e.g. A*, best-first, hill-climbing.
EBL & DDB for Graphplan (P lanning Graph as Dynamic CSP: Exploiting EBL&DDB and other CSP Techniques in Graphplan) Subbarao Kambhampati Arizona State University.
SNS College of Engineering Department of Computer Science and Engineering AI Planning Presented By S.Yamuna AP/CSE 5/23/2018 AI.
Planning as Search State Space Plan Space Algorihtm Progression
Planning (Chapter 10)
Review for the Midterm Exam
Class #17 – Thursday, October 27
Graphplan/ SATPlan Chapter
CIS 488/588 Bruce R. Maxim UM-Dearborn
Class #19 – Monday, November 3
Class #20 – Wednesday, November 5
Graphplan/ SATPlan Chapter
Graphplan/ SATPlan Chapter
Class #17 – Tuesday, October 30
Presentation transcript:

Winning by being lazy: Hierarchy, Abstraction and Least- commitment in the new-age planning Subbarao Kambhampati Arizona State University rakaposhi.eas.asu.edu/yochan.html Invited Talk at NIPS-98 Workshop on Reinforcement Learning

Reinforcing “small chunk at a time” behavior --Theory of inverted reinforcement

Objective A quick overview of the ideas of abstraction, hierarchy, reuse and least- commitment in planning –With special emphasis on new-age planners Share some (hopefully portable) lessons...

Overview Planning -- Then and Now Abstraction/hierarchy in planning –Detail Abstraction Least Commitment –Task decomposition –Experiential abstraction (reuse/replay) Lessons

Planning: The problem GStates are modeled in terms of (binary) state-variables (factored rep.) -- Complete initial state, partial goal state GActions are modeled as state transformation functions -- Syntax: ADL language (Pednault) GPlans are sequences of actions Load(o 1 ) In(o 1 ) At(o 1,l 1 ), At(R,l 1 ) At(R,E) Fly() At(R,M), ¬At(R,E)  x In(x)  At(x,M) & ¬At(x, E) Unload(o 1 ) In(o 1 ) ¬In(o 1 ) Earth At(A,E), At(B,E),At(R,E) At(A,M),At(B,M) ¬In(A), ¬In(B) Effects Preconditions Appolo 13

Refinement Planning: The idea Narrowing sets of action sequences to progress towards sets of solutions Partial plans Refinements Remove non-solutions All action sequences All Solutions  P P’ All Sol P P’ All Seq. Refine [AIMAG-97]

Existing Refinement Strategies PSR 0 1:Unload(A)   2:Fly() 3:Unload(A) 0  At(A,M)  ¬At(A,M) 0 1: Unload(A)  2: Load(A) 0  2: Load(B) 0  2: Fly() 0  1: Unload(A) At(A,E) At(B,E) At(R,E) progression Regression 1: Transport(A) 0  At(A,E) At(A,M) 1: Load(A) 0  At(A,E) At(A,M) 2: Fly() 3: Unload(A) HTN Extend Prefix Extend Suffix Add in the middle Decompose State-Space Plan-Space

Search in the space of conjunctive partial plans –Disjunction split into the search space –Solution extraction is trivial Examples: –STRIPS & Prodigy –SNLP & UCPOP –NONLIN & SIPE Search in the space of disjunctive partial plans –Disjunction handled explicitly –Solution extraction is non- trivial CSP/SAT methods Examples: –Graphplan –SATPLAN Conjunctive plannersDisjunctive planners [AIMag-97;IJCAI-97] Then Now

1: Load(A) 2 : Load(B) 0  3 : Fly(R) or Refining disjunctive plans 1: Load(A) 0 1: Load(B) 0 1: Fly(R) 0 2: Load(B) 2: Load(A) 1: Load(A) 0 1: Load(B) 0 2: Unload(A,E) 2: Unload(B,E) 1: Load(A) 0 1: Load(B) 0 2: Fly(R)        Refining conjunctive plans

Detail Abstraction Idea –Abstract some details of the problem or actions. –Solve the abstracted version. –Extend the solution to the detailed version Precondition Abstraction –Work on satisfying important preconditions first Importance judged by: –Length of plans for subgoals [ABSTRIPS, PABLO] –Inter-goal relations [ALPINE] –Distribution-based [HighPoint] –Strong abstractions (with downward refinement property) are rare –Effectiveness is planner-dependent Clashes with other heuristics such as “most constrained first”

Abstracting Resources (Teasing apart Planning and Scheduling) Most planners thrash by addressing planning and scheduling considerations together –Eg. Blocks world, with multiple robot hands Idea: Abstract resources away during planning –Plan assuming infinite resources –Do a post-planning resource allocation phase –Re-plan if needed (with Biplav Srivastava)

Least Commitment (Detail Postponement) Postpone commitments unless forced –Big idea in conjunctive refinement planning Partial-order planners: UCPOP, SNLP –Interacts with precondition abstraction –Becomes a non-issue in disjunctive planning There is very little commitment to begin with Encodings based on partial order planning can actually be worse off [Mali, 98] –Exception: Variablized (“lifted”) representations PSR 0 1:Unload(A)   2:Fly() 3:Unload(A) 0  At(A,M)  ¬At(A,M)

Task Decomposition (HTN) Planning Domain model contains non-primitive actions, and schemas for reducing them Reduction schemas are given by the designer –Can be seen as encoding user-intent Two notions of completeness: –Schema completeness (Partial Hierarchicalization) –Planner completeness

Modeling Action Reduction

Dual views of HTN planning Capturing hierarchical structure of the domain –Motivates top-down planning Start with abstract plans, and reduce them Many technical headaches –Respecting user-intent, maintaining systematicity and minimality [AAAI-98] Phantomization, filters, promiscuity, downward- unlinearizability.. Capturing expert advice about desirable solutions –Motivates bottom-up planning Ensure that each partial plan being considered is “legal” with respect to the reduction schemas Connection to efficiency is not obvious Relative advantages are still unclear... [Barrett, 97]

HTN planning in the new-age The ideas of top-down and bottom-up HTN planning can be ported to disjunctive planners [AIPS-98] –Abstract actions can be seen as disjunctive constraints Add constraints to the SAT/CSP encodings of the planning problem to ensure that: –Abstract actions are related to primitive actions through the reduction schemas [Top-down version] OR –Each primitive actions must be part of some task reduction schema [Bottom-up version] Puzzle: How can increasing encoding sizes lead to efficient planning? –New constraints support simplification of the original constraints [with Amol Mali]

HTNSAT: Some results [with Mali, AIPS-98] 40x speedup

Experiential Abstraction: Macrops, Reuse, Replay Structures being reused –Opaque vs. Modifiable –Solution vs. Solving process (derivation) Acquisition of structures to be reused –Human given vs. Automatically acquired Mechanics of reuse –Phased vs. simultaneous Costs –Storage & Retrieval costs; Solution quality

Case-study: DerSNLP Modifiable derivational traces were reused Traces were automatically acquired during problem solving –Analyze the interactions among the parts of a plan, and store plans for non-interacting subgoals separately Reduces retrieval cost –Use of EBL failure analysis to detect interactions All relevant trace fragments were retrieved and replayed before the control is given to from-scratch planner –Extension failures are traced to individual replayed traces, and their storage indices are modified appropriately Improves retrieval accuracy (with Ihrig, JAIR 97)

DerSNLP: Results Performance with increased Training % Solvability with increased traning (JAIR, 97) Library Size 5 3 1

Reuse in Disjunctive Planning Harder to make a disjunctive planner commit to extending a specific plan first Options: –Support opaque macros along with primitive actions –Modify the problem/domain specification so the old plan’s constraints will be respected in any solution –MAX-SAT formulations of reuse problem [with Amol Mali]

Reachability/Relevance minimizations Reachability analysis –Analyze which actions cannot be executed together and which propositions cannot be made together at particular time steps Graphplan mutual exclusions Domain invariants Relevance analysis –Analyze which actions are relevant and must occur together Greedy Regression (RIFO) –Operator Graphs Inseperability constraints Explicate which parts of a disjunctive structure cannot be part of a solution (focusing)

General Lessons Dual views: Detail reduction vs. Expert advice –Detail reduction => hierarchical solving with promise of improved efficiency –Expert advice implies further constraints on the solutions Strong abstractions are rare –Must take abstractions as advice that can be overridden The interaction between abstraction and search mechanism Emphasis on automatic generation of abstractions –Need to consider utility issues Emphasis on satisficing solutions –Few quantitative guarantees on solution quality