Shaping multi-agent systems with gradient reinforcement learning
作者:Olivier Buffet, Alain Dutech, François Charpillet
摘要
An original reinforcement learning (RL) methodology is proposed for the design of multi-agent systems. In the realistic setting of situated agents with local perception, the task of automatically building a coordinated system is of crucial importance. To that end, we design simple reactive agents in a decentralized way as independent learners. But to cope with the difficulties inherent to RL used in that framework, we have developed an incremental learning algorithm where agents face a sequence of progressively more complex tasks. We illustrate this general framework by computer experiments where agents have to coordinate to reach a global goal.
论文关键词:Reinforcement learning, Multi-agent systems, Partially observable Markov decision processes, Shaping, Policy-gradient
论文评审过程:
论文官网地址:https://doi.org/10.1007/s10458-006-9010-5