Don Perugini, Dennis Jarvis, Shane Reschke, Don Gossink Decision Automation Group Command and Control Division, DSTO Distributed Deliberative Planning.

Slides:

Advertisements

Similar presentations

MULTI-AGENT BASED SCHEDULING D. Ouelhadj ASAP (Automated Scheduling Optimisation and Planning) Research Group School of Computer Science and IT University.

Advertisements

Heuristic Search techniques

FIPA Interaction Protocol. Request Interaction Protocol Summary –Request Interaction Protocol allows one agent to request another to perform some action.

Resource Management §A resource can be a logical, such as a shared file, or physical, such as a CPU (a node of the distributed system). One of the functions.

Problem Solving by Searching Copyright, 1996 © Dale Carnegie & Associates, Inc. Chapter 3 Spring 2007.

Jaringan Komputer Lanjut Packet Switching Network.

Some questions o What are the appropriate control philosophies for Complex Manufacturing systems? Why????Holonic Manufacturing system o Is Object -Oriented.

Negotiating a stable distribution of the payoff among agents may prove challenging. The issue of coalition formation has been investigated extensively,

Techniques for Dealing with Hard Problems Backtrack: –Systematically enumerates all potential solutions by continually trying to extend a partial solution.

Negotiation A Lesson in Multiagent System Based on Jose Vidal’s book Fundamentals of Multiagent Systems Henry Hexmoor SIUC.

Overview UML Extensions for Agents UML UML Agent UML (AUML) Agent UML (AUML) Agent Interaction Protocols Agent Interaction Protocols Richer Role Specification.

Sogang University ICC Lab Using Game Theory to Analyze Wireless Ad Hoc networks.

An Approach to Evaluate Data Trustworthiness Based on Data Provenance Department of Computer Science Purdue University.

LightFlood: An Optimal Flooding Scheme for File Search in Unstructured P2P Systems Song Jiang, Lei Guo, and Xiaodong Zhang College of William and Mary.

Planning under Uncertainty

MokSAF: Agent-based Team Assistance for Time Critical Tasks Katia Sycara The Robotics Institute

Effective Coordination of Multiple Intelligent Agents for Command and Control The Robotics Institute Carnegie Mellon University PI: Katia Sycara

Analyzing the tradeoffs between breakup and cloning in the context of organizational self-design By Sachin Kamboj.

Resource Management – a Solution for Providing QoS over IP Tudor Dumitraş, Frances Jen-Fung Ning and Humayun Latif.

MAE 552 – Heuristic Optimization Lecture 27 April 3, 2002

Design of Multi-Agent Systems Teacher Bart Verheij Student assistants Albert Hankel Elske van der Vaart Web site

Chapter 13: Process Specifications Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley, 2005.

Ant Colonies As Logistic Processes Optimizers

Distributed Rational Decision Making Sections By Tibor Moldovan.

A Coordination Model for Distributed Personnel Planning Patrick De Causmaecker, Peter Demeester, Greet Vanden Berghe, Bart Verbeke

HEURISTIC SEARCH. Luger: Artificial Intelligence, 5 th edition. © Pearson Education Limited, 2005 Portion of the state space for tic-tac-toe.

Distributed Scheduling. What is Distributed Scheduling? Scheduling: –A resource allocation problem –Often very complex set of constraints –Tied directly.

Distributed Constraint Optimization * some slides courtesy of P. Modi

Project Management Basics

Chapter 13: Process Specifications Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley, 2005.

Manufacturing Control system. Manufacturing Control - Managing and controlling the physical activities in the factory aiming to execute the manufacturing.

Adapting Environment-Mediated Self-Organizing Emergent Systems by Exception Rules Holger Kasinger, Bernhard Bauer, Jörg Denzinger and Tom Holvoet.

COGNITIVE RADIO FOR NEXT-GENERATION WIRELESS NETWORKS: AN APPROACH TO OPPORTUNISTIC CHANNEL SELECTION IN IEEE BASED WIRELESS MESH Dusit Niyato,

Managing Social Influences through Argumentation-Based Negotiation Present by Yi Luo.

Engineering Law-Governed Approaches How to reuse, extend and compose interaction specifications Gustavo Carvalho, Carlos Lucena

Protocol Architectures. Simple Protocol Architecture Not an actual architecture, but a model for how they work Similar to “pseudocode,” used for teaching.

A Framework for Distributed Model Predictive Control

Topology aggregation and Multi-constraint QoS routing Presented by Almas Ansari.

Transit price negotiation: repeated game approach Sogea 23 Mai 2007 Nancy, France D.Barth, J.Cohen, L.Echabbi and C.Hamlaoui

Travis Steel. Objectives What is the Agent Paradigm? What is Agent-Oriented Design and how is it different than OO? When to apply AOD techniques? When.

ARTIFICIAL INTELLIGENCE [INTELLIGENT AGENTS PARADIGM] Professor Janis Grundspenkis Riga Technical University Faculty of Computer Science and Information.

Communications Skills (ELE 205)

Parallel and Distributed Simulation Memory Management & Other Optimistic Protocols.

A Faster Approximation Scheme for Timing Driven Minimum Cost Layer Assignment Shiyan Hu*, Zhuo Li**, and Charles J. Alpert** *Dept of ECE, Michigan Technological.

Title: Diagnosing a team of agents: Scaling up Written by: Meir Kalech and Gal A. Kaminka Presented by: Reymes Madrazo-Rivera.

Algorithms and data structures Protected by

Communications Skills (ELE 205) Dr. Ahmad Dagamseh Dr. Ahmad Dagamseh.

GAME PLAYING 1. There were two reasons that games appeared to be a good domain in which to explore machine intelligence: 1.They provide a structured task.

Chapter 6 CASE Tools Software Engineering Chapter 6-- CASE TOOLS

Presented by: Sandeep Chittal Minimum-Effort Driven Dynamic Faceted Search in Structured Databases Authors: Senjuti Basu Roy, Haidong Wang, Gautam Das,

Systems design for scheduling: Open Tools Patrick De Causmaecker, Peter Demeester, Greet Vanden Berghe and Bart Verbeke KaHo Sint-Lieven, Gent, Belgium.

Heuristic Methods for the Single- Machine Problem Chapter 4 Elements of Sequencing and Scheduling by Kenneth R. Baker Byung-Hyun Ha R2.

Wireless sensor and actor networks: research challenges

Unit – I Presentation. Unit – 1 (Introduction to Software Project management) Definition:-  Software project management is the art and science of planning.

Antidio Viguria Ann Krueger A Nonblocking Quorum Consensus Protocol for Replicated Data Divyakant Agrawal and Arthur J. Bernstein Paper Presentation: Dependable.

Knowledge-Driven Wireless Networks Design Cognitive Radios requires New Networking Solution  Knowledge-driven Networking (goes beyond “cognitive networking”,

Biointelligence Lab School of Computer Sci. & Eng. Seoul National University Artificial Intelligence Chapter 8 Uninformed Search.

PGDM/ / II Trimester/E-Business. What is supply chain management?  Supply chain management is the coordination of entities, activities, information.

Wireless sensor and actor networks: research challenges Ian. F. Akyildiz, Ismail H. Kasimoglu

CHaRy Software Synthesis for Hard Real-Time Systems

Supervised Time Series Pattern Discovery through Local Importance

Announcements Homework 3 due today (grace period through Friday)

Market-based Dynamic Task Allocation in Mobile Surveillance Systems

On the Use of Service Level Agreements in AssessGrid

Dynamic Management of Food Redistribution for 412 Food Rescue

UNINFORMED SEARCH -BFS -DFS -DFIS - Bidirectional

CMSC 471 Fall 2011 Class #4 Tue 9/13/11 Uninformed Search

Software Agent.

Presentation transcript:

Don Perugini, Dennis Jarvis, Shane Reschke, Don Gossink Decision Automation Group Command and Control Division, DSTO Distributed Deliberative Planning with Partial Observability: Heuristic Approaches

Outline Motivation Legal Agreement Protocol (LAP) Deliberative Planning & Partial Observability Heuristics Results

Characteristics of Military Operations Decentralised: cooperation among distributed autonomous organisations make their own self-interested decisions (not controlled) may keep information/capabilities private Dynamic organisation’s capabilities, information & goals may change the environment in which they interact may change Open organisations with indeterminate capabilities may come and go at any time Agreements formation of “legal” agreements for services/capabilities contract law to establish commitment and agreements

Legal Agreement Protocol (LAP) LAP facilitates cooperation and coordination among organisations (or agents) Enables planning, task allocation and agreements among agents in a decentralised, dynamic and open environment Extension of the Contract Net Protocol (CNP) Comprises an iterative interaction process: Customer agents extract, match and negotiate capabilities from supplier agents Distributed assembly of capabilities (e.g. using A*) Adapt via updating, withdrawing & backtracking mechanisms (not discussed)

LAP Components Messaging component Describes the sequence of messages (speech acts & semantics) and events that can occur at various stages of the protocol Reasoning component Drives the protocol (messaging component) e.g. when to offer, update, backtrack, negotiate, etc. Highly domain dependent Require heuristics to facilitate effective reasoning and planning within the complex environments that LAP is applied

LAP Messaging Component

LAP Example (Distributed A*) Search Tree Customer: Goal: T init = Expected costs: E init = Current path cost: k = 0 Deadline for responses = 5 mins T init Suppliers A* search, branches evaluated by: f = g + h g = current path cost h = expected cost to achieve remaining tasks h must be an underestimate to guarantee optimality

LAP Example (Distributed A*) Search Tree Customer p 3 =, 12 T init Suppliers p 2 =, 10 p 1 =, 8 p 1 f = 15 p 3 f = 19 p 2 f = 18

LAP Example (Distributed A*) Search Tree Customer T init Suppliers Next best proposal: p 4 =, 15 p 1 offered p 1 f = 15 p 3 f = 19 p 2 f = 18 Invite to offer p 1 p 4 f = 17

LAP Example (Distributed A*) Customer: Tasks left to achieve: T 1 = T init \p 1 = E 1 =, k = 8 (cost of p 1 ) Suppliers T1T1 Search Tree T init p 1 f = 15 p 3 f = 19 p 2 f = 18 p 4 f = 17

LAP Example (Distributed A*) Customer Suppliers T1T1 Search Tree T init p 1 f = 15 p 3 f = 19 p 2 f = 18 p 4 f = 17 p 5 =, 7p 6 =, 20 p 6 f = 28 p 5 f = 22 Backtrack

LAP Example (Distributed A*) Customer Suppliers T1T1 Search Tree T init p 1 f = 15 p 3 f = 19 p 2 f = 18 p 4 f = 17 Reject offer p 1 Invite to offer p 4 p 6 f = 28 p 5 f = 22 Backtrack

Partial Observability With centralised search approaches, the expected cost e i for each sub-task can be determined naïvely using Requires visibility of all proposals In a decentralised environment, and with LAP, the customer does not have access to all other agent’s capabilities (proposals) Makes finding e i difficult Solution: the expected cost e i is determined dynamically during planning as the customer receives information about other agents’ capabilities

Heuristic Approaches Commence with e i = 0 1.Minimum cost heuristic e i is the minimum cost observed so far 2.Alpha factor on difference, limited Increase e i slowly to prevent over-estimation s is a newly observed sub-task cost If s < e i, set e i = s, use minimum cost heuristic Otherwise, e i = e i + , where  = e i – s 3.Average over all sub-tasks e i is the average over all observed sub-task costs 4.Average of current average e i = (e i + s)/2, where s is a newly observed sub-task cost

Experiments Used set partitioning problem datasets Set of tasks T = {1, 2, …, m} need to be achieved using a set of package propoals B = {B 1, B 2, …, B n }, where B j =, p j  T is a set of achieving capabilities at cost c j Aim: achieve all sub-tasks in T once, at minimum cost 90 scenarios 18 datasets 1, 2, 5, 10, 100 suppliers  values of 0.2, 0.4, 0.6, 0.8 Evaluated on: solution quality, number of nodes traversed, number of branches received

Results Average cost heuristics find a solution with less branches & nodes (i.e. less time and communication) than other heuristics, but at the cost of the quality of solution Minimum cost heuristic is 280% worse than the worst alpha factor heuristic (  = 0.8), but the reduction in effort was only 25% for nodes and 22% for branches Heuristic Quality (% above optimal) Nodes Normalized Branches Normalized# Opt Minimum Cost 2.7    Alpha Factor (  =0.2) 0.07    Alpha Factor (  =0.4) 0.36    Alpha Factor (  =0.6) 0.51    Alpha Factor (  =0.8) 0.71    Average Cost 22.5    Average of Average 30.9   

Results Number of branches increases with the number of suppliers since more suppliers can submit proposals for each task announcement Number of nodes traversed decreases (more efficient search) as suppliers increase for all but minimum cost heuristic due to increase in submitted proposals Alpha factor: expected cost increases quickly to the min cost at the start of the search Average: stable, accurate and larger expected cost Min cost: minimum cost of many proposals is lower than the minimum cost of a few proposals

Conclusion Investigated four heuristics to dynamically determine the expected cost during planning using LAP in the presence of partial observability Heuristics have tradeoffs: quality of solution vs effort required to search (nodes & branches) Average cost heuristics required less effort, but at the cost of the quality of solution The quality by using the alpha factor heuristic is much greater than the minimum cost heuristic, with little extra effort Number of supplier influences search effort Number of branches increase with the number of suppliers Number of nodes traversed decreased with all heuristics except the minimum cost heuristic

QUESTIONS? Acknowledgements Peter Smet, Dale Lambert, Jason Scholz