Contribution of recurrent connectionist language models in improving LSTM-based Arabic text recognition in videos

作者:

Highlights:

• Different recurrent connectionist language models to improve LSTM-based Arabic text recognition in videos.

• Efficient joint decoding paradigm using language model and LSTM responses.

• Additional decoding hyper-parameters, extensively evaluated, that improve recognition results and optimize running time.

• Significant recognition improvement by integrating connectionist language models that outperform n-grams contribution.

• Final Arabic OCR system that significantly outperforms commercial OCR engine.

摘要

Highlights•Different recurrent connectionist language models to improve LSTM-based Arabic text recognition in videos.•Efficient joint decoding paradigm using language model and LSTM responses.•Additional decoding hyper-parameters, extensively evaluated, that improve recognition results and optimize running time.•Significant recognition improvement by integrating connectionist language models that outperform n-grams contribution.•Final Arabic OCR system that significantly outperforms commercial OCR engine.

论文关键词:Connectionist language model,Arabic video OCR,Decoding,RNN,LSTM,Convolutional Neural Network

论文评审过程:Received 11 March 2016, Revised 12 November 2016, Accepted 15 November 2016, Available online 16 November 2016, Version of Record 28 November 2016.

论文官网地址:https://doi.org/10.1016/j.patcog.2016.11.011