Speech recognition with hierarchical recurrent neural networks

作者:

Highlights:

摘要

A hierarchical recurrent neural network (HRNN) for speech recognition is presented. The HRNN is trained by a generalized probabilistic descent (GPD) algorithm. Consequently, the difficulty of empirically selecting an appropriate target function for training RNNs can be avoided. Results obtained in this study indicate the proposed HRNN has the advantages of being capable of absorbing the temporal variation of speech patterns as well as possessing effective discrimination capabilities. The scaling problem of RNNs is also greatly reduced. Additionally, a realization of the system using initial/final sub-syllable models for isolated Mandarin syllable recognition is also undertaken for verifying its effectiveness. The effectiveness of the proposed HRNN is confirmed by the experimental results.

论文关键词:Speech recognition,Hierarchical,Recurrent neural networks,Generalized probabilistic descent,Discriminative training

论文评审过程:Received 26 October 1993, Revised 29 October 1994, Accepted 30 November 1994, Available online 7 June 2001.

论文官网地址:https://doi.org/10.1016/0031-3203(94)00145-C