Nonmonotone BFGS-trained recurrent neural networks for temporal sequence processing
作者:
Highlights:
•
摘要
In this paper we propose a nonmonotone approach to recurrent neural networks training for temporal sequence processing applications. This approach allows learning performance to deteriorate in some iterations, nevertheless the network’s performance is improved over time. A self-scaling BFGS is equipped with an adaptive nonmonotone technique that employs approximations of the Lipschitz constant and is tested on a set of sequence processing problems. Simulation results show that the proposed algorithm outperforms the BFGS as well as other methods previously applied to these sequences, providing an effective modification that is capable of training recurrent networks of various architectures.
论文关键词:Recurrent neural networks,Quasi-Newton methods,BFGS updates,Nonmonotone methods,Second-order training algorithms,Temporal sequence
论文评审过程:Available online 21 December 2010.
论文官网地址:https://doi.org/10.1016/j.amc.2010.12.012