Towards Equilibrium Transfer in Markov Games 胡裕靖
Outline: Background; Preliminary Ideas; Some Results
Background
Multi-agent Reinforcement Learning. Single-agent RL: Mountain Car, path finding. RL in multi-agent tasks: robot soccer, IKEA furniture robot.
Markov Games: agents take joint actions; the setting extends from one agent to more than one.
Equilibrium-based MARL: some equilibrium solution concepts from game theory can be adopted.
Our Previous Work. Equilibrium-based MARL: "Multi-agent reinforcement learning with meta equilibrium" []; "Multi-agent reinforcement learning by negotiation with unshared value functions" []. Both focus on combining MARL with equilibrium solution concepts. Problematic issues: computing an equilibrium is complicated and time-consuming (a new complexity class: TFNP! []); for tasks with many agents, equilibrium-based MARL algorithms may take too much time. How can the learning process of equilibrium-based MARL be accelerated?
Transfer Learning in RL. Reuse learnt knowledge (instance/policy/value function/model/…) to accelerate learning. References: Matthew E. Taylor, Peter Stone. Transfer learning for reinforcement learning domains. Journal of Machine Learning Research. Alessandra Lazaric. Transfer in reinforcement learning: a framework and a survey. Reinforcement Learning, Springer.
Transfer Learning in Markov Games? What to transfer: instance/policy/value function/model/… Why not transfer between the normal-form games within a Markov game? Two settings: inter-task transfer and inner-task transfer.
Inner-task Transfer. Transfer equilibria between similar normal-form games during learning in a Markov game: reuse the equilibria computed in previous games to reduce learning time. Key problems: Which games are similar? (For example, the games that occur on different visits to a state.) How to transfer an equilibrium?
Preliminary Ideas
Game Similarity. Games with the same action space? Games with different action spaces? Similarity as payoff distance? Equilibrium-based similarity or equilibrium-independent similarity? Reference: Drew Fudenberg and David M. Kreps. A theory of learning, experimentation and equilibrium in games.
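One of the candidates above, similarity as payoff distance, can be sketched as follows. This is only an illustration, not a measure taken from the talk: the function name `payoff_distance`, the choice of the maximum absolute difference, and the list-of-arrays game representation are all assumptions.

```python
import numpy as np

def payoff_distance(game_a, game_b):
    """Largest absolute payoff difference over all agents and joint actions.

    Each game is a list of payoff arrays, one per agent, where each array
    has one axis per agent (hypothetical representation). Returns None when
    the numbers of agents or the action spaces differ, because this simple
    distance is undefined in that case.
    """
    if len(game_a) != len(game_b):
        return None
    dist = 0.0
    for pa, pb in zip(game_a, game_b):
        pa = np.asarray(pa, dtype=float)
        pb = np.asarray(pb, dtype=float)
        if pa.shape != pb.shape:
            return None
        dist = max(dist, float(np.max(np.abs(pa - pb))))
    return dist
```

A distance of this kind is equilibrium-independent: it needs no equilibrium computation, but it only applies to games with the same action space.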
Game Similarity: a weird cycle. Equilibrium-based similarity requires finding the equilibria of both games before computing their similarity, so equilibrium transfer then seems senseless!
Our Idea. Transfer equilibria between games that are thought to be similar. Evaluate how much loss the transfer incurs. Transfer is acceptable when the loss is small. The two games differ in only one item.
Problem Definition: Approximate Nash equilibrium
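For reference, the standard definition of an approximate Nash equilibrium is given below. The notation is an assumption and may differ from the slide's: $n$ agents, action sets $A_i$, expected payoff functions $u_i$, and a (possibly mixed) strategy profile $\sigma = (\sigma_i, \sigma_{-i})$.

```latex
% \sigma is an \epsilon-approximate Nash equilibrium (\epsilon \ge 0) if no
% agent can gain more than \epsilon by deviating unilaterally:
\[
  u_i(\sigma_i, \sigma_{-i}) \;\ge\; u_i(a_i, \sigma_{-i}) - \epsilon
  \qquad \text{for all } i \in \{1, \dots, n\},\; a_i \in A_i .
\]
% One natural way to quantify the loss of reusing a profile \sigma^{*} in a
% game with payoffs u'_i is the smallest such \epsilon:
\[
  \epsilon(\sigma^{*}) \;=\; \max_{i} \, \max_{a_i \in A_i}
  \bigl[\, u'_i(a_i, \sigma^{*}_{-i}) - u'_i(\sigma^{*}) \,\bigr] .
\]
```

With $\epsilon = 0$ the first condition reduces to the exact Nash equilibrium. Reading the transfer loss as the second quantity is an interpretation, not something the source states.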
A Naïve Transfer Method: Direct Transfer
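A minimal sketch of what direct transfer could look like for two-player games, assuming the loss is measured as the largest unilateral deviation gain (the smallest ε for which the reused profile is an ε-approximate Nash equilibrium). The names `transfer_loss`, `direct_transfer`, `threshold`, and the `solve` callback are hypothetical and not taken from the talk.

```python
import numpy as np

def transfer_loss(payoff_row, payoff_col, x, y):
    """Largest gain either player gets by deviating unilaterally from (x, y).

    payoff_row[i, j] and payoff_col[i, j] are the payoffs when the row
    player picks action i and the column player picks action j.
    x and y are mixed strategies (probability vectors).
    A value of 0 means (x, y) is an exact Nash equilibrium of this game.
    """
    A = np.asarray(payoff_row, dtype=float)
    B = np.asarray(payoff_col, dtype=float)
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    # Checking pure deviations is enough: expected payoff is linear in a
    # player's own mixed strategy, so the best response value is attained
    # at a pure action.
    row_gain = np.max(A @ y) - x @ A @ y
    col_gain = np.max(x @ B) - x @ B @ y
    return float(max(row_gain, col_gain))

def direct_transfer(payoff_row, payoff_col, old_equilibrium, threshold, solve):
    """Reuse old_equilibrium in the new game if the loss is small enough.

    solve is a caller-supplied equilibrium solver (hypothetical interface:
    solve(payoff_row, payoff_col) -> (x, y)); it is called only when the
    transfer is rejected. Returns (strategy_profile, transferred_flag).
    """
    x, y = old_equilibrium
    if transfer_loss(payoff_row, payoff_col, x, y) <= threshold:
        return (x, y), True
    return solve(payoff_row, payoff_col), False
```

The saving comes from the fact that evaluating the loss takes only a few matrix-vector products, while the solver is the expensive step that the transfer tries to avoid.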
Some Results
Future Work. Some open problems: Other transfer methods? Only Nash equilibrium? Equilibrium-finding algorithms. Transfer between games with different action spaces. Transfer between games with different numbers of agents. Game abstraction.
Thanks!