Multi-Agent Based Search vs. Local Search and Backtrack Search for Solving Tight CSPs: A Practical Case Study Hui Zou and Berthe Y. Choueiry Constraint Systems Laboratory Department of Computer Science and Engineering University of Nebraska-Lincoln
Introduction Search algorithms: systematic or local-repair Complex, real-world optimization problems –Systematic search thrashes –Local search gets stuck in ‘local optima’ –Remedies: random walk, breakout, restart strategies, etc. Multi-agent-based search [Liu & al. AIJ 02] –provides a new way to solve such problems –Advantages & shortcomings examined via a practical application
Background - GTA Graduate Teaching Assistants (GTA) problem: In a semester, given –a set of courses –a set of graduate teaching assistants –a set of constraints that specify allowable assignments Find a consistent and satisfactory assignment of GTAs to courses Detailed modeling in [Glaubius & Choueiry ECAI 02 WS on Modeling] Types of constraints: unary, binary, non-binary – Each course has a load, which indicates the weight of the course – Each GTA has a (hiring) capacity, which limits the maximum load the GTA can take
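The load/capacity constraint above can be sketched as a simple check. This is a minimal illustration, not the authors' model: the function name `capacity_ok`, the dictionary layout, and the sample data are all assumptions made for the example.

```python
def capacity_ok(assignment, load, capacity):
    """Return True if no GTA's total assigned load exceeds its capacity.

    assignment: dict course -> GTA (partial assignments are allowed)
    load:       dict course -> load (weight) of the course
    capacity:   dict GTA -> maximum total load the GTA can take
    """
    used = {}
    for course, gta in assignment.items():
        used[gta] = used.get(gta, 0.0) + load[course]
    return all(total <= capacity[gta] for gta, total in used.items())


# Hypothetical data, for illustration only.
load = {"CS101": 0.5, "CS201": 0.5, "CS301": 1.0}
capacity = {"alice": 1.0, "bob": 0.5}

print(capacity_ok({"CS101": "alice", "CS201": "alice"}, load, capacity))  # True
print(capacity_ok({"CS301": "bob"}, load, capacity))                      # False
```

Because the constraint involves all courses assigned to one GTA, it is non-binary, which is one reason the slide lists non-binary constraints separately.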
Background - GTA (cont’) Problem size: Columns: Data set, Mark, # variables, Domain size, Problem size. Spring2001b B × O ×10 97 Fall2001b B × O ×10 99 Fall2002 B ×10 47 O ×10 44 Spring2003 B ×10 84 O ×10 82 B – boosted to make problem solvable O – original, not necessarily solvable In practice, this problem is tight, even over-constrained Our goal: ensure GTA support to as many courses as possible
Background - GTA (cont’) Optimization criteria : 1.Maximize the number of courses covered 2.Maximize the geometric average of the assignments wrt the GTAs’ preference values (between 0 and 5). Problem : –Constraints are hard, must be met –Maximal consistent partial-assignment problem (MPA-CSP?) –Not a MAX-CSP (which maximizes #constraints satisfied)
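The two criteria can be illustrated with a short sketch. It assumes, as the numbering on the slide suggests but does not state outright, that criterion 1 takes priority over criterion 2; the names `geometric_mean` and `solution_key` are hypothetical.

```python
import math


def geometric_mean(preferences):
    """Geometric average of the preference values of the assignments made."""
    if not preferences:
        raise ValueError("at least one assignment is required")
    return math.prod(preferences) ** (1.0 / len(preferences))


def solution_key(preferences):
    """Sort key for a partial solution (assumed lexicographic order):
    first the number of courses covered, then the geometric average."""
    return (len(preferences), geometric_mean(preferences))


# Hypothetical preference values (between 0 and 5), one per covered course.
few_but_happy = [5, 5]
more_courses = [3, 3, 3]
best = max([few_but_happy, more_courses], key=solution_key)
print(best)  # the solution covering three courses
```

Note that a single preference of 0 drives the geometric average to 0, so this measure penalizes any strongly disliked assignment more than an arithmetic average would.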
Background - MAS for CSPs Multi-Agent System: agents interact & cooperate in order to achieve a set of goals – Agents: autonomous (perceive & act), goal-directed, can communicate – Interaction protocols: governing communications among agents – Environment: where agents live & act ERA [Liu & al. AIJ 2002] – Environment, Reactive rules, and Agents – A multi-agent approach to solving a general CSP – Transitions between states when agents move
Background - ERA’s components Environment: an n×m two-dimensional array –n: the number of variables (agents) –m: the maximum domain size, |D_max| –e(i, j).value: domain value of agent i at position j –e(i, j).violation: violation value of agent i at position j – Zero position: where e(i, j).violation=0 When all agents are in zero position, we have a complete solution ERA=Environment + Reactive rules + Agents Example: [figure not reproduced]
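The environment can be sketched as a list of rows, one per agent, where each cell holds a domain value and its violation count. This is an illustrative reading of the slide, not the implementation of [Liu & al. AIJ 2002]; `Cell`, `build_environment`, and `queens_conflict` are hypothetical names, and the 4-queens instance is used only because it is small.

```python
from dataclasses import dataclass


@dataclass
class Cell:
    value: int      # e(i, j).value: domain value of agent i at position j
    violation: int  # e(i, j).violation: constraints violated at position j


def build_environment(domains, positions, conflicts):
    """Build e(i, j) given each agent's domain and current position index.

    conflicts(i, vi, k, vk) returns True when variable i = vi and
    variable k = vk violate a (binary) constraint.
    """
    env = []
    for i, dom in enumerate(domains):
        row = []
        for v in dom:
            count = sum(
                1
                for k in range(len(domains))
                if k != i and conflicts(i, v, k, domains[k][positions[k]])
            )
            row.append(Cell(v, count))
        env.append(row)
    return env


def queens_conflict(i, vi, k, vk):
    # Queens i and k (one per row) attack each other on a column or diagonal.
    return vi == vk or abs(vi - vk) == abs(i - k)


domains = [list(range(4)) for _ in range(4)]
positions = [1, 3, 0, 2]  # a known 4-queens solution
env = build_environment(domains, positions, queens_conflict)
in_zero = all(env[i][positions[i]].violation == 0 for i in range(4))
print(in_zero)  # True: every agent is in a zero position
```

Rows may have different lengths when domains differ in size; m is then the length of the longest row.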
Background - ERA’s components Reactive rules : –Least-move: choose a position with the min. violation value –Better-move: choose a position with a smaller violation value –Random-move: randomly choose a position Combinations of these basic rules form different behaviors. ERA=Environment + Reactive rules + Agents
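The three basic rules can be sketched as functions over one agent's row of violation values. This is one possible reading of the slide: the random tie-breaking in `least_move`, the "sample one position and move only if it is better" form of `better_move`, and the probability-based combination in `combined_move` are assumptions, not details given here.

```python
import random


def least_move(violations, current, rng=random):
    """Move to a position with the minimum violation value (ties at random)."""
    best = min(violations)
    return rng.choice([j for j, v in enumerate(violations) if v == best])


def better_move(violations, current, rng=random):
    """Sample one position; move there only if its violation is smaller."""
    j = rng.randrange(len(violations))
    return j if violations[j] < violations[current] else current


def random_move(violations, current, rng=random):
    """Move to a position chosen uniformly at random."""
    return rng.randrange(len(violations))


def combined_move(violations, current, p_random=0.1, rng=random):
    """Hypothetical behavior: mostly least-move, with occasional random-move."""
    if rng.random() < p_random:
        return random_move(violations, current, rng)
    return least_move(violations, current, rng)


row = [2, 0, 1]  # violation values of one agent's positions
print(least_move(row, current=0))  # 1, the only position with violation 0
```

Least-move is the greediest and most expensive rule because it scans the whole row; better-move inspects a single position, and random-move provides the perturbation needed to leave a local optimum.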
Background - ERA’s components Agents: a variable is represented by an agent ERA=Environment + Reactive rules + Agents At each state, an agent chooses a position to move to, following the reactive rules. The agents keep moving until all have reached zero position, or a certain time period has elapsed. [Figure panels: all agents in zero position; some agents in zero position] Assignments are made only for agents in zero position
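The loop described above can be sketched on n-queens. This is a simplified illustration under stated assumptions, not the algorithm of [Liu & al. AIJ 2002]: agents move one after another within a sweep, an agent already in a zero position stays put, each move is a least-move with an occasional random-move, and the time limit is a sweep count. All names (`violation`, `era_queens`, `p_random`, `max_sweeps`) are hypothetical.

```python
import random


def violation(i, v, positions):
    """Number of other queens attacked if queen i (row i) sits in column v."""
    return sum(
        1
        for k, pk in enumerate(positions)
        if k != i and (pk == v or abs(pk - v) == abs(k - i))
    )


def era_queens(n, max_sweeps=1000, p_random=0.1, seed=0):
    rng = random.Random(seed)
    positions = [rng.randrange(n) for _ in range(n)]
    for _ in range(max_sweeps):
        # Stop as soon as all agents are in a zero position.
        if all(violation(i, positions[i], positions) == 0 for i in range(n)):
            break
        for i in range(n):
            if violation(i, positions[i], positions) == 0:
                continue  # assumption: a satisfied agent does not move
            if rng.random() < p_random:
                positions[i] = rng.randrange(n)  # random-move
            else:
                row = [violation(i, v, positions) for v in range(n)]
                best = min(row)
                positions[i] = rng.choice(
                    [v for v, c in enumerate(row) if c == best]
                )  # least-move
    # Assignments are reported only for agents in a zero position.
    zero_agents = [
        i for i in range(n) if violation(i, positions[i], positions) == 0
    ]
    return positions, zero_agents


positions, zero_agents = era_queens(4)
print(positions, zero_agents)
```

When the sweep limit is reached before all agents are satisfied, `zero_agents` gives the partial assignment, which mirrors the last point on the slide.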
Background - ERA vs local search ERA operates by local repairs, how different is it from local search? ERA –Each agent has an evaluation function –At each state, any agent moves wherever it desires to move Control is localized: Each agent is in pursuit of its own happiness Local search with min-conflict –One evaluation function for the whole state (cost), summarizes the quality of the state –At each state, few agents are allowed to move (most unhappy ones) Control is centralized: towards one common good
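For contrast, a centralized min-conflict search can be sketched as follows. A single cost function scores the whole state, and at each step only one of the most unhappy variables is repaired. This is an illustrative sketch on n-queens, not the LS implementation used in the study; the function names and the step limit are assumptions.

```python
import random


def conflicts(i, v, positions):
    """Number of other queens attacked if queen i (row i) sits in column v."""
    return sum(
        1
        for k, pk in enumerate(positions)
        if k != i and (pk == v or abs(pk - v) == abs(k - i))
    )


def total_cost(positions):
    """One evaluation function for the whole state: attacking pairs."""
    n = len(positions)
    # Each attacking pair is counted once from each side, hence the halving.
    return sum(conflicts(i, positions[i], positions) for i in range(n)) // 2


def min_conflicts(n, max_steps=10000, seed=0):
    rng = random.Random(seed)
    positions = [rng.randrange(n) for _ in range(n)]
    for _ in range(max_steps):
        if total_cost(positions) == 0:
            break
        # Centralized control: only a most-violated variable may move.
        scores = [conflicts(i, positions[i], positions) for i in range(n)]
        worst = max(scores)
        i = rng.choice([k for k, s in enumerate(scores) if s == worst])
        row = [conflicts(i, v, positions) for v in range(n)]
        best = min(row)
        positions[i] = rng.choice([v for v, c in enumerate(row) if c == best])
    return positions, total_cost(positions)


positions, cost = min_conflicts(4)
print(positions, cost)
```

The structural difference from the agent-based loop is in who decides: here a global controller selects the variable to repair, whereas in ERA every agent evaluates its own row and moves on its own initiative.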
Background - Example (ERA) 4-queen problem [Figure: board states for Init; Eval (agent Q1); Eval (agent Q2); Eval (agent Q3); Move (agent Q3); Move (agent Q4)]
Background - Example (ERA vs. Local search) ERA – any agent can kick any other agent from its position Local search with min-conflict – cannot repair a variable without violating a previously repaired variable
Empirical study - In general Apply ERA to the GTA assignment problem: 0. (Test & understand the behavior of ERA) 1. Compare performance of: –ERA: FrBLR –LS: hill-climbing, min-conflict & random walk –BT: B&B-like, many orderings (heuristic, random) 2. Observe behavior of ERA on solvable vs. unsolvable problems 3. Observe behavior of individual agents in ERA 4. Identify a limitation of ERA: deadlock phenomenon Data: 8 instances of the GTA assignment problem
Empirical study 1- Performance comparison Data sets and solvability (B: boosted, O: original): Spring2001b B √, O ×; Fall2001b B √, O √; Fall2002 B √, O ×; Spring2003 B √, O √. Instance columns: Original/Boosted, Solvable?, # GTAs, # Courses, Total capacity (C), Total load (L), Ratio = C / L. Result columns for each of Systematic Search (BT), Local Search (LS), and Multi-agent Search (ERA): Unassigned Courses, Solution Quality, Unused GTAs, Available Resource, CC (×10^8). Observations: - Only ERA finds complete solutions to all solvable instances - On unsolvable problems, ERA leaves too many unused GTAs - LS and BT exhibit similar behaviors
Empirical study 2- Solvable vs unsolvable [Figures: ERA performance on solvable problems; ERA performance on unsolvable problems; both plot the number of agents in zero-position per iteration] Observation: - ERA behavior differs on solvable vs. unsolvable instances
Empirical study 3- Behavior of individual agents Motion of agents (variable, stable, constant) on solvable vs. unsolvable instances. Observations: – Variable: none on solvable instances, most on unsolvable instances – Stable: a few – Constant: most on solvable instances, none on unsolvable instances
Empirical study 4- Deadlock – Each circle corresponds to a given GTA – Each square represents an agent – A blank square indicates that an agent is in a zero-position – Squares with the same color indicate agents involved in a deadlock Observation: ERA is not able to avoid deadlocks, which degrades the solution on unsolvable CSPs.
Discussion Criteria (goal, actions): control schema, undoing assignments, conflict resolution. (+ advantages, – shortcomings) ERA: Control schema: Local; + Escapes local optima; – May yield instability. Undoing assignments: √; + Flexible; + Solves tight CSPs. Conflict resolution: Non-committal; – Deadlock; – Shorter solutions. LS: Control schema: Global; + Stable behavior; – Liable to local optima. Undoing assignments: ×; + Quickly stabilizes; – Fails to solve tight CSPs even with randomness & restart strategies. Conflict resolution: Heuristic; + Longer solutions; – Problem-dependent. BT: Control schema: Systematic; + Stable behavior; – Thrashes. Undoing assignments: ~; + Quickly stabilizes; – Fails to solve tight CSPs even with backtracking & restart strategies.
Dealing with the deadlock Possible approaches: – Direct communications, negotiation mechanisms – Hybrids of search: global control, conflict resolution Experiments: – Enhancing ERA with global control: do not accept a move that deteriorates the global goal; this leads to local-search-like behavior (i.e., local optima) – ERA with conflict resolution: add dummy resources; ERA then finds a complete solution when LS and BT fail; after the dummy assignments are removed, the solutions are still better
Future research directions – Test approach using other search techniques – BT search: Randomized, credit-based – Other local repair: squeaky-wheel method – Market-based techniques, etc. – Validate conclusions on other CSPs – random instances, real-world problems – Try search-hybridization techniques References: R. Glaubius and B.Y. Choueiry. Constraint Modeling and Reformulation in the Context of Academic Task Assignment. In Workshop Modeling and Solving Problems with Constraints, ECAI 2002. J. Liu, H. Jing, and Y.Y. Tang. Multi-Agent Oriented Constraint Satisfaction. Artificial Intelligence, 136, 2002.
Questions