Distributed Constraint Optimization: Approximate Algorithms Alessandro Farinelli.

Slides:

Advertisements

Similar presentations

Bayesian Belief Propagation

Advertisements

Hadi Goudarzi and Massoud Pedram

Adopt Algorithm for Distributed Constraint Optimization

Towards a Theoretic Understanding of DCEE Scott Alfeld, Matthew E

1 University of Southern California Keep the Adversary Guessing: Agent Security by Policy Randomization Praveen Paruchuri University of Southern California.

Modeling Maze Navigation Consider the case of a stationary robot and a mobile robot moving towards a goal in a maze. We can model the utility of sharing.

Coverage by Directional Sensors Jing Ai and Alhussein A. Abouzeid Dept. of Electrical, Computer and Systems Engineering Rensselaer Polytechnic Institute.

Randomized Sensing in Adversarial Environments Andreas Krause Joint work with Daniel Golovin and Alex Roper International Joint Conference on Artificial.

Section 3: Appendix BP as an Optimization Algorithm 1.

ANDREW MAO, STACY WONG Regrets and Kidneys. Intro to Online Stochastic Optimization Data revealed over time Distribution of future events is known Under.

Loopy Belief Propagation a summary. What is inference? Given: –Observabled variables Y –Hidden variables X –Some model of P(X,Y) We want to make some.

1 Distributed Adaptive Sampling, Forwarding, and Routing Algorithms for Wireless Visual Sensor Networks Johnsen Kho, Long Tran-Thanh, Alex Rogers, Nicholas.

Planning under Uncertainty

Decentralised Coordination of Mobile Sensors using the Max-Sum Algorithm Ruben Stranders, Alex Rogers, Nick Jennings School of Electronics and Computer.

Decentralised Coordination of Mobile Sensors using the Max-Sum Algorithm School of Electronics and Computer Science University of Southampton {rs06r2,

Distributed Message Passing for Large Scale Graphical Models Alexander Schwing Tamir Hazan Marc Pollefeys Raquel Urtasun CVPR2011.

A Decentralised Coordination Algorithm for Maximising Sensor Coverage in Large Sensor Networks Ruben Stranders, Alex Rogers and Nicholas R. Jennings School.

Date:2011/06/08 吳昕澧 BOA: The Bayesian Optimization Algorithm.

DCOPs Meet the Real World: Exploring Unknown Reward Matrices with Applications to Mobile Sensor Networks Manish Jain Matthew E. Taylor Makoto Yokoo MilindTambe.

Decentralised Coordination of Continuously Valued Control Parameters using the Max-Sum Algorithm Ruben Stranders, Alessandro Farinelli, Alex Rogers, Nick.

19 th November 2008 Agent-Based Decentralised Control of Complex Distributed Systems Alex Rogers School of Electronics and Computer Science University.

Recent Development on Elimination Ordering Group 1.

Agent-Based Coordination of Sensor Networks Alex Rogers School of Electronics and Computer Science University of Southampton

Message Passing Algorithms for Optimization

Ant Colony Optimization Optimisation Methods. Overview.

Online Data Gathering for Maximizing Network Lifetime in Sensor Networks IEEE transactions on Mobile Computing Weifa Liang, YuZhen Liu.

A Multi-Agent Learning Approach to Online Distributed Resource Allocation Chongjie Zhang Victor Lesser Prashant Shenoy Computer Science Department University.

Impact of Problem Centralization on Distributed Constraint Optimization Algorithms John P. Davin and Pragnesh Jay Modi Carnegie Mellon University School.

A Decentralised Coordination Algorithm for Mobile Sensors School of Electronics and Computer Science University of Southampton {rs06r2, fmdf08r, acr,

The Role of Specialization in LDPC Codes Jeremy Thorpe Pizza Meeting Talk 2/12/03.

Distributed Constraint Optimization * some slides courtesy of P. Modi

Decentralised Coordination through Local Message Passing Alex Rogers School of Electronics and Computer Science University of Southampton

Metaheuristics The idea: search the solution space directly. No math models, only a set of algorithmic steps, iterative method. Find a feasible solution.

Decentralised Coordination of Mobile Sensors School of Electronics and Computer Science University of Southampton Ruben Stranders,

Distributed Real-Time systems 1 By: Mahdi Sadeghizadeh Website: Sadeghizadeh.ir Advanced Computer Networks.

A1A1 A4A4 A2A2 A3A3 Context-Specific Multiagent Coordination and Planning with Factored MDPs Carlos Guestrin Shobha Venkataraman Daphne Koller Stanford.

Fast Spectrum Allocation in Coordinated Dynamic Spectrum Access Based Cellular Networks Anand Prabhu Subramanian*, Himanshu Gupta*,

Conference Paper by: Bikramjit Banerjee University of Southern Mississippi From the Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence.

Distributed Constraint Optimization Michal Jakob Agent Technology Center, Dept. of Computer Science and Engineering, FEE, Czech Technical University A4M33MAS.

Energy Efficient Routing and Self-Configuring Networks Stephen B. Wicker Bart Selman Terrence L. Fine Carla Gomes Bhaskar KrishnamachariDepartment of CS.

Introduction to variable selection I Qi Yu. 2 Problems due to poor variable selection: Input dimension is too large; the curse of dimensionality problem.

ANTs PI meeting, May 29-31, 2002Washington University / DCMP1 Flexible Methods for Multi-agent Distributed Resource Allocation by Exploiting Phase Transitions.

Network Aware Resource Allocation in Distributed Clouds.

COLLABORATIVE SPECTRUM MANAGEMENT FOR RELIABILITY AND SCALABILITY Heather Zheng Dept. of Computer Science University of California, Santa Barbara.

Adaptive CSMA under the SINR Model: Fast convergence using the Bethe Approximation Krishna Jagannathan IIT Madras (Joint work with) Peruru Subrahmanya.

Optimal Power Control, Rate Adaptation and Scheduling for UWB-Based Wireless Networked Control Systems Sinem Coleri Ergen (joint with Yalcin Sadi) Wireless.

Swarm Intelligence 虞台文.

Introduction to Job Shop Scheduling Problem Qianjun Xu Oct. 30, 2001.

Message-Passing for Wireless Scheduling: an Experimental Study Paolo Giaccone (Politecnico di Torino) Devavrat Shah (MIT) ICCCN 2010 – Zurich August 2.

ANTs PI Meeting, Nov. 29, 2000W. Zhang, Washington University1 Flexible Methods for Multi-agent distributed resource Allocation by Exploiting Phase Transitions.

Kevin Ross, UCSC, September Service Network Engineering Resource Allocation and Optimization Kevin Ross Information Systems & Technology Management.

Self‐Organising Sensors for Wide Area Surveillance using the Max‐Sum Algorithm Alex Rogers and Nick Jennings School of Electronics and Computer Science.

Scheduling policies for real- time embedded systems.

1 Exploring Custom Instruction Synthesis for Application-Specific Instruction Set Processors with Multiple Design Objectives Lin, Hai Fei, Yunsi ACM/IEEE.

CS584 - Software Multiagent Systems Lecture 12 Distributed constraint optimization II: Incomplete algorithms and recent theoretical results.

MURI: Integrated Fusion, Performance Prediction, and Sensor Management for Automatic Target Exploitation 1 Dynamic Sensor Resource Management for ATE MURI.

1 Short Term Scheduling. 2  Planning horizon is short  Multiple unique jobs (tasks) with varying processing times and due dates  Multiple unique jobs.

Mobile Agent Migration Problem Yingyue Xu. Energy efficiency requirement of sensor networks Mobile agent computing paradigm Data fusion, distributed processing.

CS270 Project Overview Maximum Planar Subgraph Danyel Fisher Jason Hong Greg Lawrence Jimmy Lin.

CS 484 Load Balancing. Goal: All processors working all the time Efficiency of 1 Distribute the load (work) to meet the goal Two types of load balancing.

Distributed Constraint Optimization Problem for Decentralized Decision Making Optimization in Multi-Agent Systems.

DISTIN: Distributed Inference and Optimization in WSNs A Message-Passing Perspective SCOM Team

Smart Sleeping Policies for Wireless Sensor Networks Venu Veeravalli ECE Department & Coordinated Science Lab University of Illinois at Urbana-Champaign.

Distributed cooperation and coordination using the Max-Sum algorithm

Algorithmic and Domain Centralization in Distributed Constraint Optimization Problems John P. Davin Carnegie Mellon University June 27, 2005 Committee:

1 Chapter 5 Branch-and-bound Framework and Its Applications.

Networked Distributed POMDPs: DCOP-Inspired Distributed POMDPs

Multi-Agent Exploration

The story of distributed constraint optimization in LA: Relaxed

Expectation-Maximization & Belief Propagation

Presentation transcript:

Distributed Constraint Optimization: Approximate Algorithms Alessandro Farinelli

Approximate Algorithms: outline No guarantees – DSA-1, MGM-1 (exchange individual assignments) – Max-Sum (exchange functions) Off-Line guarantees – K-optimality and extensions On-Line Guarantees – Bounded max-sum

Why Approximate Algorithms Motivations – Often optimality in practical applications is not achievable – Fast good enough solutions are all we can have Example – Graph coloring – Medium size problem (about 20 nodes, three colors per node) – Number of states to visit for optimal solution in the worst case 3^20 = 3 billions of states Key problem – Provides guarantees on solution quality

Exemplar Application: Surveillance Event Detection – Vehicles passing on a road Energy Constraints – Sense/Sleep modes – Recharge when sleeping Coordination – Activity can be detected by single sensor – Roads have different traffic loads Aim – Focus on road with more traffic load time duty cycle Heavy traffic road small road Good Schedule Bad Schedule

Surveillance demo Joint work with: A. Rogers, N. R. Jennings

Guarantees on solution quality Key Concept: bound the optimal solution – Assume a maximization problem – optimal solution, a solution – – percentage of optimality [0,1] The higher the better – approximation ratio >= 1 The lower the better – is the bound

Types of Guarantees Generality Accuracy Accuracy: high alpha Generality: less use of instance specific knowledge MGM-1, DSA-1, Max-Sum Bounded Max- Sum DaCSA K-optimality T-optimality Region Opt. On-Line Off-Line No guarantees

Centralized Local Greedy approaches Greedy local search – Start from random solution – Do local changes if global solution improves – Local: change the value of a subset of variables, usually one

Centralized Local Greedy approaches Problems – Local minima – Standard solutions: RandomWalk, Simulated Annealing -2

Distributed Local Greedy approaches Local knowledge Parallel execution: – A greedy local move might be harmful/useless – Need coordination

Distributed Stochastic Algorithm Greedy local search with activation probability to mitigate issues with parallel executions DSA-1: change value of one variable at time Initialize agents with a random assignment and communicate values to neighbors Each agent: – Generates a random number and execute only if rnd less than activation probability – When executing changes value maximizing local gain – Communicate possible variable change to neighbors

DSA-1: Execution Example P = 1/ rnd > ¼ ?

DSA-1: discussion Extremely “cheap” (computation/communication) Good performance in various domains – e.g. target tracking [Fitzpatrick Meertens 03, Zhang et al. 03], – Shows an anytime property (not guaranteed) – Benchmarking technique for coordination Problems – Activation probablity must be tuned [Zhang et al. 03] – No general rule, hard to characterise results across domains

Maximum Gain Message (MGM-1) Coordinate to decide who is going to move – Compute and exchange possible gains – Agent with maximum (positive) gain executes Analysis [Maheswaran et al. 04] – Empirically, similar to DSA – More communication (but still linear) – No Threshold to set – Guaranteed to be monotonic (Anytime behavior)

MGM-1: Example G = -2G = 0G = 2 G = 0

Local greedy approaches Exchange local values for variables – Similar to search based methods (e.g. ADOPT) Consider only local information when maximizing – Values of neighbors Anytime behaviors Could result in very bad solutions

GDL based approaches Generalized Distributive Law – Unifying framework for inference in Graphical models – Builds on basic mathematical properties of semi-rings – Widely used in Info theory, Statistical physics, Probabilistic models Max-sum – DCOP settings: maximise social welfare

Max-sum Agents iteratively computes local functions that depend only on the variable they control X1 X4X3 X2 Shared constraint All incoming messages except x2 All incoming messages Choose arg max

Max-Sum on acyclic graphs Max-sum Optimal on acyclic graphs – Different branches are independent – Each agent can build a correct estimation of its contribution to the global problem (z functions) Message equations very similar to Util messages in DPOP – Sum messages from children and shared constraint – Maximize out agent variable – GDL generalizes DPOP [Vinyals et al. 2010a] X1 X4 X3 X2

Max-sum Performance Good performance on loopy networks [Farinelli et al. 08] – When it converges very good results Interesting results when only one cycle [Weiss 00] – We could remove cycle but pay an exponential price (see DPOP)

Max-Sum for low power devices Low overhead – Msgs number/size Asynchronous computation – Agents take decisions whenever new messages arrive Robust to message loss

Max-sum on hardware Chipcon CC2431 –Low-power devices, 8 KByte RAM Joint work with: L. Teacy, N. J. Grabham, P. Padhy, A. Rogers, N. R. Jennings

Max-Sum for UAVs Ack: F. Delle Fave, A. Rogers, N.R. Jennings and ACFR

Quality guarantees for approx. techniques Key area of research Address trade-off between guarantees and computational effort Particularly important for many real world applications – Critical (e.g. Search and rescue) – Constrained resource (e.g. Embedded devices) – Dynamic settings

Off-Line guarantees Generality Accuracy Characterise solution quality without running the algorithm MGM-1, DSA-1, Max-Sum Bounded Max- Sum DaCSA K-optimality T-optimality Region Opt. On-Line Off-Line No guarantees

K-Optimality framework Given a characterization of solution gives bound on solution quality [Pearce and Tambe 07] Characterization of solution: k-optimal K-optimal solution: – Corresponding value of the objective function can not be improved by changing the assignment of k or less variables.

K-Optimal solutions optimal ?No3-optimal ?Yes

Bounds for K-Optimality For any DCOP with non-negative rewards [Pearce and Tambe 07] K-optimal solution Number of agents Maximum arity of constraints Binary Network (m=2):

K-Optimality Discussion Need algorithms for computing k-optimal solutions – DSA-1, MGM-1 k=1; DSA-2, MGM-2 k=2 [Maheswaran et al. 04] – DALO for generic k (and t-optimality) [Kiekintveld et al. 10] The higher k the more complex the computation (exponential) Percentage of Optimal: The higher k the better The higher the number of agents the worst

Trade-off between generality and solution quality K-optimality based on worst case analysis assuming more knowledge gives much better bounds Knowledge on structure [Pearce and Tambe 07]

Trade-off between generality and solution quality Knowledge on reward [Bowring et al. 08] Beta: ratio of least minimum reward to the maximum

Off-Line Guarantees: Region Optimality k-optimality: use size as a criterion for optimality t-optimality: use distance to a central agent in the constraint graph Region Optimality: define regions based on general criteria (e.g. S-size bounded distance) [Vinyals et al 11] Ack: Meritxell Vinyals 3-size regions x0x1x2x3 x0x0 x1x1 x2x2 x3x3 x0x0 x1x1 x2x2 x3x3 x0x0 x1x1 x2x2 x3x3 x0x0 x1x1 x2x2 x3x3 1-distance regions x0x1x2x3 C regions

Size-Bounded Distance Region optimality can explore new regions: s-size bounded distance One region per agent, largest t- distance group whose size is less than s S-Size-bounded distance – C-DALO extension of DALO for general regions – Can provides better bounds and keep under control size and number of regions 3-size bounded distance x0x0 x1x1 x2x2 x3x3 x0x0 x1x1 x2x2 x3x3 x0x0 x1x1 x2x2 x3x3 x0x0 x1x1 x2x2 x3x3 t=1 t=0 t=1 t=0

Max-Sum and Region Optimality Can use region optimality to provide bounds for Max- sum [Vinyals et al 10b] Upon convergence Max-Sum is optimal on SLT regions of the graph [Weiss 00] Single Loops and Trees (SLT): all groups of agents whose vertex induced subgraph contains at most one cycle x1x1 x0x0 x2x2 x3x3 x0x0 x1x1 x2x2 x3x3 x0x0 x1x1 x2x2 x3x3 x0x0 x1x1 x2x2 x3x3 x0x0 x1x1 x2x2 x3x3

Bounds for Max-Sum Complete: same as 3-size optimality bipartite 2D grids

Variable Disjoint Cycles Very high quality guarantees if smallest cycle is large

On-Line guarantees Generality Accuracy Characterise solution quality after/while running the algorithm MGM-1, DSA-1, Max-Sum Bounded Max- Sum DaCSA K-optimality T-optimality Region Opt. On-Line Off-Line No guarantees

Build Spanning tree Bounded max-sum Aim: Remove cycles from constraint graph avoiding exponential computation/communication [Rogers et al. 11] Avoid pseudo trees/junction trees Run Max-Sum Compute Bound X1X2X3 Optimal state on tree X4 0100

Computing the weights Compute a weight for each edge – maximum possible impact of the constraint on the solution – Maximum Spanning Tree – Remove edges and modify constraints – W = sum of weights of removed edges

Results (Random Binary Network) Optimal Approx. Lower Bound Upper Bound Bound is significant – Approx. ratio is typically 1.23 (81 %) Comparison with k-optimal with knowledge on reward structure Much more accurate less general

Discussion Discussion with other data-dependent techniques – BnB-ADOPT [Yeoh et al 09] Fix an error bound and execute until the error bound is met Worst case computation remains exponential – ADPOP [Petcu and Faltings 05b] Can fix message size (and thus computation) or error bound and leave the other parameter free Divide and coordinate [Vinyals et al 10] – Divide problems among agents and negotiate agreement by exchanging utility – Provides anytime quality guarantees

Future Challenges for DCOP Handle Uncertainty – E[DPOP] [Leaute et al. 09] – Distributed Coordination of Exploration and Exploitation [Taylor et al. 11] Handle Dynamism – S-DPOP [Petcu and Faltings 05] – Robustness in DCR [Lass et al. 09] – Fast-Max-Sum [MacArhur et al. 09]

Challenging domains: Robotics Cooperative exploration, Surveillance, Patrolling, etc. Main Challenges – Reward is unknown/uncertain – Structure of the problem changes over time – High probability of failures Messages, robots, etc. Work in this direction – [Stranders et al. 09] – [Taylor et al. 11]

Challenging domains: Energy Intelligent Building control, power distribution configuration, electric vehicle management, etc. Main challenges – Large scale, dynamic system – Decomposing agents’ interactions – Interaction with end-users Work in this direction – [Kumar et al. 09] – [Kambooj et al. 11]

References I [Kumar et al. 09] Distributed Constraint Optimization with Structured Resource Constraints, AAMAS 09 [Kamboj at al. 09] Deploying Power Grid-Integrated Electric Vehicles as a Multi-Agent System, AAMAS 11 [Leaute et al. 09] E[DPOP]: Distributed Constraint Optimization under Stochastic Uncertainty using Collaborative Sampling, DCR 09 [Taylor et al. 11] Distributed On-line Multi-Agent Optimization Under Uncertainty: Balancing Exploration and Exploitation, Advances in Complex Systems [Stranders at al 09] Decentralised Coordination of Mobile Sensors Using the Max-Sum Algorithm, AAAI 09 [Petcu and Faltings 05] S-DPOP: Superstabilizing, Fault-containing Multiagent Combinatorial Optimization, AAAI 05 [Lass et al. 09] Robust Distributed Constraint Reasoning, DCR 09 [Vinyals et al. 10] Divide and Coordinate: solving DCOPs by agreement. AAMAS 10 [Petcu and Faltings 05b] A-DPOP: Approximations in Distributed Optimization, CP 2005 [Yeoh et al. 09] Trading off solution quality for faster computation in DCOP search algorithms, IJCAI 09 [Vinyals et al 10a] Constructing a unifying theory of dynamic programming DCOP algorithms via the Generalized Distributive Law, JAAMAS [Rogers et al. 11] Bounded approximate decentralised coordination via the max-sum algorithm, Artificial Intelligence 2011.

References II [Yeoh et al. 09] Trading off solution quality for faster computation in DCOP search algorithms, IJCAI 09 [Vinyals et al 10b] Worst-case bounds on the quality of max-product fixed-points, NIPS 10 [Vinyals et al 11] Quality guarantees for region optimal algorithms, AAMAS 11 [Maheswaran et al. 04] Distributed Algorithms for DCOP: A Graphical Game-Based Approach, PDCS [Kiekintveld et al. 10] Asynchronous Algorithms for Approximate Distributed Constraint Optimization with Quality Bounds, AAMAS 10 [Pearce and Tambe 07] Quality Guarantees on k-Optimal Solutions for Distributed Constraint Optimization Problems, IJCAI 07 [Bowring et al. 08] On K-Optimal Distributed Constraint Optimization Algorithms: New Bounds and Algorithms, AAMAS 08 [Farinelli et al. 08] Decentralised coordination of low-power embedded devices using the max-sum algorithm, AAMAS 08 [Weiss 00] Correctness of local probability propagation in graphical models with loops, Neural Computation [Fitzpatrick and Meertens 03] Distributed Coordination through Anarchic Optimization, Distributed Sensor Networks: a multiagent perspective. [Zhang et al. 03] A Comparative Study of Distributed Constraint algorithms, Distributed Sensor Networks: a multiagent perspective. [MacArthur et al 09 ] Efficient, Superstabilizing Decentralised Optimisation for Dynamic Task Allocation Environments, OptMAS workshop 2009.