Decentralised Coordination of Continuously Valued Control Parameters using the Max-Sum Algorithm Ruben Stranders, Alessandro Farinelli, Alex Rogers, Nick.

Decentralised Coordination of Continuously Valued Control Parameters using the Max-Sum Algorithm Ruben Stranders, Alessandro Farinelli, Alex Rogers, Nick Jennings School of Electronics and Computer Science University of Southampton {rs06r, af2, acr, nrj}@ecs.soton.ac.uk

2 This presentation focuses on the use of Max-Sum in coordination problems with continuous parameters From Discrete to Continuous Max-Sum for Decentralised Coordination Empirical Evaluation

3 Max-Sum is a powerful algorithm for solving DCOPs Complete Algorithms DPOP OptAPO ADOPT Communication Cost Iterative Algorithms Best Response (BR) Distributed Stochastic Algorithm (DSA) Fictitious Play (FP) Max-Sum Algorithm Optimality

Max-Sum solves the social welfare maximisation problem in a decentralised way Agents

Max-Sum solves the social welfare maximisation problem in a decentralised way Control Parameters

Max-Sum solves the social welfare maximisation problem in a decentralised way Utility Functions

Max-Sum solves the social welfare maximisation problem in a decentralised way Localised Interaction

Max-Sum solves the social welfare maximisation problem in a decentralised way Agents Social welfare:

The input for the Max-Sum algorithm is a graphical representation of the problem: a Factor Graph Variable nodes Function nodes Agent 1 Agent 2 Agent 3

Max-Sum solves the social welfare maximisation problem by message passing Variable nodes Function nodes Agent 1 Agent 2 Agent 3

Max-Sum solves the social welfare maximisation problem by message passing From variable i to function j From function j to variable i

Until now, Max-Sum was only defined for discretely valued variables Graph Colouring

However, many problems are inherently continuous. Heading and Velocity Unattended Ground Sensor Activation Time Autonomous Ground Robot Thermostat Preferred Room Temperature

So, we extended the Max-Sum algorithm to operate in continuous action spaces Discrete Continuous

We focussed on utility functions that are Continuous Piecewise Linear Functions (CPLFs)

“Continuous” Graph Colouring

A CPLF is defined by a domain partitioning followed by value assignment

To make Max-Sum work on CPLFs, we need to define key two operations on them From variable i to function j From function j to variable i

To make Max-Sum work on CPLFs, we need to define key two operations on them From variable i to function j From function j to variable i 1.Addition of two CPLFs

To make Max-Sum work on CPLFs, we need to define key two operations on them From variable i to function j From function j to variable i 2. Marginal Maximisation to a single variable

Addition of two CPLFs involves merging their domains, and then summing their values

1. Merge domains

Addition of two CPLFs involves merging their domains, and then summing their values

2. Sum Values

Marginal maximisation is the operation of finding the maximum value of a function, if we fix all but one variable From function j to variable i:

Marginal maximisation involves finding the maximum value of a function, if we fix all but one variable

Example: bivariate function:

Marginal maximisation involves the projection of a CLPF on a 2-D plane, and upper envelope extraction Project onto axis

Marginal maximisation involves the projection of a CLPF on a 2-D plane, and upper envelope extraction Project onto axis Result of projection

Marginal maximisation involves the projection of a CLPF on a 2-D plane, and upper envelope extraction Extract Upper Envelope

We empirically evaluated this algorithm in a wide- area surveillance scenario Dense deployment of sensors to detect activity within an urban environment. Unattended Ground Sensor

Sensors adapt their duty cycles to maximise event detection by coordinating with overlapping sensors time duty cycle Discrete time duty cycle time duty cycle Discretised time

Sensors adapt their duty cycles to maximise event detection by coordinating with overlapping sensors time duty cycle DiscreteContinuous time duty cycle time duty cycle time duty cycle time duty cycle time duty cycle

Continuous Max-Sum outperforms Discrete Max- Sum by up to 10% Discretisation Solution Quality (as fraction of optimal) Average Solution Quality over 25 Iterations

Total Message Size Continuous Max-Sum leads to more effective use of communication resources than Discrete Max-Sum Discretisation Total number of values exchanged between agents

In conclusion, we have shown that Continuous Max-Sum is more effective than Discrete Max-Sum 1. No artificial discretisation time

In conclusion, we have shown that Continuous Max-Sum is more effective than Discrete Max-Sum 1. No artificial discretisation 2. Better solutions time Solution Quality

In conclusion, we have shown that Continuous Max-Sum is more effective than Discrete Max-Sum time 1. No artificial discretisation 2. Better solutions 3. Effective communication Solution Quality Message Size

For future work, we wish to extend the algorithm to arbitrary continuous functions For example, using Gaussian Processes

In conclusion, we have shown that Continuous Max-Sum is more effective than Discrete Max-Sum time 1. No artificial discretisation 2. Better solutions 3. Effective communication Solution Quality Message Size Questions?

Decentralised Coordination of Continuously Valued Control Parameters using the Max-Sum Algorithm Ruben Stranders, Alessandro Farinelli, Alex Rogers, Nick.

Similar presentations

Presentation on theme: "Decentralised Coordination of Continuously Valued Control Parameters using the Max-Sum Algorithm Ruben Stranders, Alessandro Farinelli, Alex Rogers, Nick."— Presentation transcript:

Similar presentations

About project

Feedback

Log in

Auth with social network:

Decentralised Coordination of Continuously Valued Control Parameters using the Max-Sum Algorithm Ruben Stranders, Alessandro Farinelli, Alex Rogers, Nick.

Similar presentations

Presentation on theme: "Decentralised Coordination of Continuously Valued Control Parameters using the Max-Sum Algorithm Ruben Stranders, Alessandro Farinelli, Alex Rogers, Nick."— Presentation transcript:

Similar presentations

About project

Feedback