Self-improving reactive agents based on reinforcement learning, planning and teaching
作者:Long-Ji Lin
摘要
To date, reinforcement learning has mostly been studied solving simple learning tasks. Reinforcement learning methods that have been studied so far typically converge slowly. The purpose of this work is thus two-fold: 1) to investigate the utility of reinforcement learning in solving much more complicated learning tasks than previously studied, and 2) to investigate methods that will speed up reinforcement learning.
论文关键词:Reinforcement learning, planning, teaching, connectionist networks
论文评审过程:
论文官网地址:https://doi.org/10.1007/BF00992699