Selective Recurrent Neural Network

Author: Branko Šter

Abstract

It is known that recurrent neural networks may have difficulties remembering data over long time lags. To overcome this problem, we propose an extended architecture of recurrent neural networks, which is able to deal with long time lags between relevant input signals. A register of latches at the input layer of the network is applied to bypass irrelevant input information and to propagate relevant inputs. The latches are implemented with differentiable multiplexers, thus enabling the derivatives to be propagated through the network. The relevance of input vectors is learned concurrently with the weights of the network using a gradient-based algorithm.
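The latch idea can be illustrated with a minimal sketch. The abstract does not give the exact equations, so everything below is an assumption made for illustration: the latch is modeled as a soft two-way multiplexer, z_t = s_t * x_t + (1 - s_t) * z_{t-1}, and the select signal s_t is a sigmoid of a linear function of the input. The names `latch_step`, `run_latch`, `w_sel`, and `b_sel` are hypothetical and do not come from the paper.

```python
import numpy as np

def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))

def latch_step(x_t, z_prev, w_sel, b_sel):
    """One step of a soft latch (assumed form, not taken from the paper).

    s_t near 1 passes the new input through; s_t near 0 keeps the stored value.
    Both branches are differentiable, so gradients can reach w_sel and b_sel.
    """
    s_t = sigmoid(np.dot(w_sel, x_t) + b_sel)   # scalar relevance of x_t
    z_t = s_t * x_t + (1.0 - s_t) * z_prev      # differentiable multiplexer
    return z_t, s_t

def run_latch(xs, w_sel, b_sel):
    """Apply the latch along a sequence xs of shape (T, n_inputs)."""
    z = np.zeros(xs.shape[1])
    outputs = []
    for x_t in xs:
        z, _ = latch_step(x_t, z, w_sel, b_sel)
        outputs.append(z)
    return np.stack(outputs)

# Hand-picked (not learned) select weights: an input counts as relevant
# only when its first component is close to 1.
xs = np.array([[1.0, 0.5],    # relevant: gets latched
               [0.0, 0.9],    # irrelevant: bypassed
               [0.0, 0.1]])   # irrelevant: bypassed
w_sel = np.array([20.0, 0.0])
b_sel = -10.0
held = run_latch(xs, w_sel, b_sel)
```

In this sketch the latch output, rather than the raw input, would feed the recurrent network, so the first input vector stays available across the two irrelevant steps. In the paper the relevance is learned jointly with the network weights; here the select weights are fixed by hand only to show the mechanism.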

Keywords: Recurrent neural networks, Temporal processing, Long-term dependencies, Latch, Multiplexer, Finite state automata

Paper URL: https://doi.org/10.1007/s11063-012-9259-4