Implementing Temporal-Difference Learning with the Scaled Conjugate Gradient Algorithm

Authors: Tasos Falas, Andreas Stafylopatis

Abstract

This paper investigates the use of the scaled conjugate gradient (SCG) algorithm in temporal-difference (TD) learning for time series prediction. After examining the theoretical background of the algorithm and of the learning methodology, and how the two can be combined, special emphasis is placed on implementation details. Simple time series (linear, sinusoidal, etc.) as well as more complex ones drawn from real data are used to examine the behavior of this novel combination of learning algorithm and methodology. Preliminary experimental results indicate that the implementation presented in this paper does work, but the performance of TD learning with the SCG algorithm, in terms of learning speed and generalization ability, is not as good as expected, at least on the representative problems examined. An attempt to explain these results is presented.

Keywords: reinforcement learning, scaled conjugate gradient, time series prediction, temporal-difference learning

Paper URL: https://doi.org/10.1007/s11063-005-1384-x