Generating Text Sequence Images for Recognition

作者:Yanxiang Gong, Linjie Deng, Zheng Ma, Mei Xie

摘要

Recently, methods based on deep learning have dominated the field of text recognition. With a large number of training data, most of them can achieve the state-of-the-art performances. However, it is hard to harvest and label sufficient text sequence images from the real scenes. To mitigate this issue, several methods to synthesize text sequence images were proposed, yet they usually need complicated preceding or follow-up steps. In this work, we present a method which is able to generate infinite training data without any auxiliary pre/post-process. We tackle the generation task as an image-to-image translation one and utilize conditional adversarial networks to produce realistic text sequence images in the light of the semantic ones. Some evaluation metrics are involved to assess our method and the results demonstrate that the caliber of the data is satisfactory. The code and dataset will be publicly available soon.

论文关键词:Image generation, Text sequence images, Training data, Text recognition

论文评审过程:

论文官网地址:https://doi.org/10.1007/s11063-019-10166-x