An LP-Based Heuristic for Optimal Planning Menkes van den Briel Department of Industrial Engineering Arizona State University

Presentation transcript:

An LP-Based Heuristic for Optimal Planning
Menkes van den Briel, Department of Industrial Engineering, Arizona State University
Subbarao Kambhampati, Department of Computer Science, Arizona State University
Thomas Vossen, Leeds School of Business, University of Colorado at Boulder
J. Benton, Department of Computer Science, Arizona State University

What is automated planning?
– Initial state $s_0 \in S$
– Goal $s_* \in S$
– Action $a = \langle \text{pre}, \text{post}, \text{prevail} \rangle$
– Plan $P = \langle a_1, \ldots, a_n \rangle$
[Figure: initial and goal configurations over the locations loc1 and loc2]

Motivation
Why heuristics?
– Heuristic state-space search has been very successful in solving automated planning problems
Why optimal planning?
– Real-world planning applications require optimal or near-optimal solutions
– The difference between a (near-)optimal solution and a feasible solution may be the difference between winning or losing the interest of an investor or strategic partner

LP-based heuristic
– Relax the ordering of the actions
– Set up an integer programming formulation
– Solve the LP relaxation and use the objective function value as an admissible distance estimate
– Strengthen the formulation by adding valid inequalities

Action selection formulation
Represent the planning problem as a set of loosely coupled network flow problems
– Each state variable defines one network flow problem
– Nodes correspond to the state variable values
– Arcs correspond to state variable transitions
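The mapping from state variables to networks can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: the `Action` record and `build_dtgs` helper are hypothetical names, and the action list is an assumed encoding of the simple logistics example (package values 1, 2, T; truck values 1, 2).

```python
from collections import defaultdict
from typing import NamedTuple


class Action(NamedTuple):
    """Hypothetical action record with pre, post and prevail conditions."""
    name: str
    pre: dict      # state variable -> value before the action (changed by it)
    post: dict     # state variable -> value after the action
    prevail: dict  # state variable -> value required but left unchanged


def build_dtgs(actions):
    """Build one domain transition graph (DTG) per state variable.

    Nodes are state variable values; arcs are (from, to, action name).
    """
    dtgs = defaultdict(lambda: {"nodes": set(), "arcs": set()})
    for a in actions:
        for var, old in a.pre.items():
            new = a.post[var]
            dtgs[var]["nodes"].update((old, new))
            dtgs[var]["arcs"].add((old, new, a.name))
        for var, val in a.prevail.items():
            # Prevail conditions add a node but no transition arc.
            dtgs[var]["nodes"].add(val)
    return dict(dtgs)


# Assumed encoding of the simple logistics example.
ACTIONS = [
    Action("Load(p1,t1,l1)", {"Package1": "1"}, {"Package1": "T"}, {"Truck1": "1"}),
    Action("Load(p1,t1,l2)", {"Package1": "2"}, {"Package1": "T"}, {"Truck1": "2"}),
    Action("Unload(p1,t1,l1)", {"Package1": "T"}, {"Package1": "1"}, {"Truck1": "1"}),
    Action("Unload(p1,t1,l2)", {"Package1": "T"}, {"Package1": "2"}, {"Truck1": "2"}),
    Action("Drive(l1,l2)", {"Truck1": "1"}, {"Truck1": "2"}, {}),
    Action("Drive(l2,l1)", {"Truck1": "2"}, {"Truck1": "1"}, {}),
]

dtgs = build_dtgs(ACTIONS)
print(sorted(dtgs["Package1"]["nodes"]))
print(sorted(dtgs["Truck1"]["arcs"]))
```

Each variable gets its own small graph, and the only coupling between graphs is through actions that appear in more than one of them (as a transition in one and a prevail condition in another).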

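Simple logistics example
[Figure: a truck and a package over the locations loc1 and loc2, with two domain transition graphs. DTG Package1 has nodes 1, 2, T and arcs Load(p1,t1,l1), Load(p1,t1,l2), Unload(p1,t1,l1), Unload(p1,t1,l2). DTG Truck1 has nodes 1, 2 and arcs Drive(l1,l2), Drive(l2,l1); Load(p1,t1,l1) and Unload(p1,t1,l1) are repeated as additional labels.]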

Action selection formulation
Variables
– $x_a \in \mathbb{Z}_+$ for $a \in A$; $x_a$ is equal to the number of times action $a$ is executed
Objective function
– $\min \sum_{a \in A} x_a$
Constraints, for all $c \in C$, $f \in V_c$
\[
\sum_{e \in V_c^+(f):\, a \in A_c^E(e)} x_a \;-\; \sum_{e \in V_c^-(f):\, b \in A_c^E(e)} x_b \;\ge\;
\begin{cases}
1 & \text{if } f \ne s_0[c],\ f = s_*[c] \\
-1 & \text{if } f = s_0[c],\ f \ne s_*[c] \\
0 & \text{otherwise}
\end{cases}
\]
\[
x_a \;\le\; M \sum_{e \in V_c^+(f):\, b \in A_c^E(e)} x_b \qquad \text{for all } f \ne s_0[c],\ a \in A_c^V(f)
\]
No time indices
No upper bound


Simple logistics example
Feasible plan
– $x_{\mathrm{Drive(l2,l1)}} = 1$
– $x_{\mathrm{Load(p1,t1,l1)}} = 1$
– $x_{\mathrm{Drive(l1,l2)}} = 1$
– $x_{\mathrm{Unload(p1,t1,l2)}} = 1$
Objective value: 4
Plan: Drive(l2,l1), Load(p1,t1,l1), Drive(l1,l2), Unload(p1,t1,l2)
[Figure: DTG Package1 and DTG Truck1, as in the earlier logistics slide]

Simple logistics example
LP solution
– $x_{\mathrm{Load(p1,t1,l1)}} = 1$
– $x_{\mathrm{Unload(p1,t1,l2)}} = 1$
– $x_{\mathrm{Drive(l2,l1)}} = 1/M$
Objective value: $2 + 1/M$
Actions: Drive(l2,l1), Load(p1,t1,l1), Unload(p1,t1,l2), ……
[Figure: DTG Package1 and DTG Truck1, as in the earlier logistics slide]
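The gap between the feasible plan (value 4) and the LP solution (value 2 + 1/M) can be checked by hand, or with a small script like the one below. It is a sketch under stated assumptions: the flow constraints are read as "inflow minus outflow at least the right-hand side", the initial state (package at 1, truck at 2) and goal (package at 2, no goal for the truck) are inferred from the feasible plan, M = 100 is an arbitrary choice, and `violated` is a hypothetical helper name.

```python
from fractions import Fraction

# DTG arcs per state variable: action name -> (from value, to value).
TRANSITIONS = {
    "Package1": {
        "Load(p1,t1,l1)": ("1", "T"),
        "Load(p1,t1,l2)": ("2", "T"),
        "Unload(p1,t1,l1)": ("T", "1"),
        "Unload(p1,t1,l2)": ("T", "2"),
    },
    "Truck1": {
        "Drive(l1,l2)": ("1", "2"),
        "Drive(l2,l1)": ("2", "1"),
    },
}
# Prevail conditions: state variable -> {action name: required value}.
PREVAIL = {
    "Truck1": {
        "Load(p1,t1,l1)": "1",
        "Unload(p1,t1,l1)": "1",
        "Load(p1,t1,l2)": "2",
        "Unload(p1,t1,l2)": "2",
    }
}
INIT = {"Package1": "1", "Truck1": "2"}  # assumed initial state
GOAL = {"Package1": "2"}                 # assumed goal; the truck has none


def violated(x, big_m):
    """Return descriptions of the constraints that solution x violates."""
    x = {a: Fraction(v) for a, v in x.items()}

    def val(a):
        return x.get(a, Fraction(0))

    problems = []
    for var, arcs in TRANSITIONS.items():
        values = sorted({v for pair in arcs.values() for v in pair})
        for f in values:
            inflow = sum(val(a) for a, (_, to) in arcs.items() if to == f)
            outflow = sum(val(a) for a, (frm, _) in arcs.items() if frm == f)
            is_init = INIT[var] == f
            is_goal = GOAL.get(var) == f
            rhs = 1 if (is_goal and not is_init) else -1 if (is_init and not is_goal) else 0
            if inflow - outflow < rhs:  # flow constraint, read as ">="
                problems.append(f"flow {var}={f}")
            if not is_init:  # prevail constraint: x_a <= M * inflow(f)
                for a, needed in PREVAIL.get(var, {}).items():
                    if needed == f and val(a) > big_m * inflow:
                        problems.append(f"prevail {a} on {var}={f}")
    return problems


M = 100  # arbitrary big-M value for the illustration
plan = {"Drive(l2,l1)": 1, "Load(p1,t1,l1)": 1, "Drive(l1,l2)": 1, "Unload(p1,t1,l2)": 1}
lp = {"Load(p1,t1,l1)": 1, "Unload(p1,t1,l2)": 1, "Drive(l2,l1)": Fraction(1, M)}
no_drive = {"Load(p1,t1,l1)": 1, "Unload(p1,t1,l2)": 1}

print(violated(plan, M), sum(plan.values()))
print(violated(lp, M), sum(lp.values()))
print(violated(no_drive, M))
```

Under these assumptions both the plan and the fractional solution satisfy every constraint, while dropping the drive action entirely violates the prevail constraint for Load(p1,t1,l1). The fractional drive of 1/M is just enough to satisfy that big-M constraint, which is why the bound is weak before strengthening.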

Preliminary results

Strengthening techniques
Composition of state variables (i.e. fluent merging)
– Given the domain transition graphs (DTGs) of two state variables $c_1$, $c_2$, the composition of $DTG_{c_1}$ and $DTG_{c_2}$ is the domain transition graph $DTG_{c_1 \| c_2} = (V_{c_1 \| c_2}, E_{c_1 \| c_2})$ where
– $V_{c_1 \| c_2} = V_{c_1} \times V_{c_2}$
– $((f_1, g_1), (f_2, g_2)) \in E_{c_1 \| c_2}$ if $f_1, f_2 \in V_{c_1}$, $g_1, g_2 \in V_{c_2}$ and there exists an action $a \in A$ such that one of the following conditions holds:
  – $\text{pre}[c_1] = f_1$, $\text{post}[c_1] = f_2$, and $\text{pre}[c_2] = g_1$, $\text{post}[c_2] = g_2$
  – $\text{pre}[c_1] = f_1$, $\text{post}[c_1] = f_2$, and $\text{prevail}[c_2] = g_1$, $g_1 = g_2$
  – $\text{pre}[c_1] = f_1$, $\text{post}[c_1] = f_2$, and $g_1 = g_2$
The term composition is also used in model checking to define the parallel composition or the synchronized product of automata [Cassandras & Lafortune, 1999]
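The composition rule can be sketched as follows. This is an illustrative reading of the definition, not the authors' code: `compose` is a hypothetical helper, the three conditions are applied symmetrically to both variables (an assumption), and the action table is an assumed encoding of the simple logistics example in which a prevail condition is written as a pair with equal values.

```python
from itertools import product

# Assumed encoding: action -> {state variable: (value before, value after)}.
# A pair with equal values stands for a prevail condition.
ACTIONS = {
    "Drive(l1,l2)": {"Truck1": ("1", "2")},
    "Drive(l2,l1)": {"Truck1": ("2", "1")},
    "Load(p1,t1,l1)": {"Package1": ("1", "T"), "Truck1": ("1", "1")},
    "Load(p1,t1,l2)": {"Package1": ("2", "T"), "Truck1": ("2", "2")},
    "Unload(p1,t1,l1)": {"Package1": ("T", "1"), "Truck1": ("1", "1")},
    "Unload(p1,t1,l2)": {"Package1": ("T", "2"), "Truck1": ("2", "2")},
}
DOMAINS = {"Truck1": ["1", "2"], "Package1": ["1", "2", "T"]}


def compose(actions, domains, c1, c2):
    """Return (nodes, arcs) of the composed DTG of state variables c1 and c2."""
    nodes = set(product(domains[c1], domains[c2]))
    arcs = set()
    for name, effects in actions.items():
        e1, e2 = effects.get(c1), effects.get(c2)
        # Skip actions that change neither variable.
        if not any(e is not None and e[0] != e[1] for e in (e1, e2)):
            continue
        # A variable the action does not mention keeps whatever value it has.
        pairs1 = [e1] if e1 else [(v, v) for v in domains[c1]]
        pairs2 = [e2] if e2 else [(v, v) for v in domains[c2]]
        for (f1, f2), (g1, g2) in product(pairs1, pairs2):
            arcs.add(((f1, g1), (f2, g2), name))
    return nodes, arcs


nodes, arcs = compose(ACTIONS, DOMAINS, "Truck1", "Package1")
print(len(nodes), len(arcs))
for arc in sorted(arcs):
    print(arc)
```

With this encoding, each drive action yields one arc per package value, and each load or unload action yields a single arc because its prevail condition pins the truck value.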

Example
Two DTGs and their composition
– Small in-arcs denote the initial state
– Double circles denote the goal
[Figure: DTG c1 with nodes f1, f2, f3; DTG c2 with nodes g1, g2; and the composition DTG c1||c2 over the node pairs (f_i, g_j). Arcs are labeled a, b, c, d.]

Simple logistics example
[Figure: DTG Truck1 || Package1 with nodes (1,1), (1,T), (2,T), (2,2), (1,2), (2,1). Arcs are labeled Drive(l1,l2) and Drive(l2,l1) (three of each), Load(p1,t1,l1), Load(p1,t1,l2), Unload(p1,t1,l1), Unload(p1,t1,l2).]

Simple logistics example
LP solution on DTG Truck1 || Package1
– $x_{\mathrm{Drive(l2,l1)}} = 1$
– $x_{\mathrm{Load(p1,t1,l1)}} = 1$
– $x_{\mathrm{Drive(l1,l2)}} = 1$
– $x_{\mathrm{Unload(p1,t1,l2)}} = 1$
Objective value: 4
Actions: Drive(l2,l1), Load(p1,t1,l1), Drive(l1,l2), Unload(p1,t1,l2)
[Figure: DTG Truck1 || Package1 with nodes (1,1), (1,T), (2,T), (2,2), (1,2), (2,1)]

Another example
Two DTGs and their composition
– Solution to the individual state variables (arcs labeled a and b in DTG c1 and DTG c2)
– Solution to the individual state variables represented in the composed state variable: violates balance of flow constraints
– Adding new balance of flow constraints strengthens the formulation (arcs labeled a, b, c, d, e in DTG c1||c2)
[Figure: DTG c1 with nodes f1, f2, f3; DTG c2 with nodes g1, g2, g3; and the composition DTG c1||c2 over the nine node pairs (f_i, g_j)]

Identifying mergeable fluents
When should we create a composition of two or more state variables?
– Look at the causal graph
– Look at the actions that introduce dependencies in the causal graph
[Figure: causal graph over Person 1, Person 2, Airplane 1, Airplane 2, Fuel 1, Fuel 2, shown next to a version in which Airplane 1 is grouped with Fuel1 and Airplane 2 with Fuel2]
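One way to make the causal-graph inspection concrete is sketched below. Everything here is an assumption for illustration: the action names and values are hypothetical, an edge (c, c') is added when some action mentions c and changes c', and "variables changed by the same action" is used as a simple stand-in for the merging criterion, which the slides do not spell out.

```python
from itertools import combinations

# Hypothetical actions: state variable -> (value before, value after).
# A pair with equal values stands for a prevail condition.
ACTIONS = {
    "board(person1,airplane1,c1)": {"Person1": ("c1", "in1"), "Airplane1": ("c1", "c1")},
    "debark(person1,airplane1,c2)": {"Person1": ("in1", "c2"), "Airplane1": ("c2", "c2")},
    "fly(airplane1,c1,c2)": {"Airplane1": ("c1", "c2"), "Fuel1": ("full", "low")},
    "refuel(airplane1)": {"Fuel1": ("low", "full")},
}


def causal_graph(actions):
    """Edges (src, dst): some action mentions src and changes dst."""
    edges = set()
    for effects in actions.values():
        changed = [v for v, (before, after) in effects.items() if before != after]
        for src in effects:
            for dst in changed:
                if src != dst:
                    edges.add((src, dst))
    return edges


def co_changed_pairs(actions):
    """Pairs of state variables that a single action changes together."""
    pairs = set()
    for effects in actions.values():
        changed = sorted(v for v, (before, after) in effects.items() if before != after)
        pairs.update(combinations(changed, 2))
    return pairs


edges = causal_graph(ACTIONS)
candidates = co_changed_pairs(ACTIONS)
print(sorted(edges))
print(sorted(candidates))
```

In this toy instance the only pair changed by a common action is the airplane's location and its fuel level, which matches the grouping of Airplane 1 with Fuel1 in the figure.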

Experimental setup
Objective
– Minimize number of actions
Domains
– Selected domains from the International Planning Competition: Logistics, Freecell, Driverlog, Zenotravel, TPP, Blocksworld
Resources
– 2.67 GHz Linux machine
– 1 GB memory
– 15 minutes runtime
– CPLEX 10.0

Experimental setup
Distance estimates
– LP: Action selection formulation with strengthening
– LP–: Action selection formulation without strengthening
– Lplan: Step-based integer programming formulation by Lplan [Bylander, 1997]
– h+: Optimal relaxed plan when the delete effects are ignored
– hFF: Inadmissible but efficient relaxed plan heuristic by FF [Hoffmann and Nebel, 2001]
– Optimal: Optimal distance estimate given by Satplanner using the –opt flag [Rintanen, Heljanko, and Niemelä, 2005]

Experimental results

Distance estimates from the initial state to the goal (highlighted values equal the optimal distance)

Experimental results
Heuristic calculation time
[Figure: one panel per domain: Logistics, Freecell, Driverlog, Zenotravel, TPP, Blocks]

Conclusions and future work
An LP-based heuristic that respects delete effects but ignores action ordering shows very promising results
– Finds the optimal distance estimate in several problem instances
– Can be used to calculate admissible distance estimates for various optimization problems in planning
– Ongoing work successfully incorporated our LP-based heuristic in a search algorithm that solves oversubscription planning
Interesting directions for future work
– Apply fluent merging more aggressively
– Extend the formulation into a complete planning system
