An image-based automatic Arabic translation system

Authors:

Abstract

In this paper, we present a system that automatically translates Arabic text embedded in images into English. The system consists of three components: text detection from images, character recognition, and machine translation. We formulate text detection as a binary classification problem and apply gradient boosting trees (GBT), a support vector machine (SVM), and location-based prior knowledge to improve the F1 score of text detection from 78.95% to 87.05%. The detected text images are processed by off-the-shelf optical character recognition (OCR) software. We employ an error correction model to post-process the noisy OCR output, and apply a bigram language model to reduce word segmentation errors. The translation module is tailored for hand-held devices with a compact data structure. The experimental results show substantial improvements in both word recognition accuracy and translation quality. For instance, in the experiment on the Arabic Transparent font, the BLEU score increases from 18.70 to 33.47 with the use of the error correction module.
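
The abstract does not spell out how the bigram language model is used against word segmentation errors, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a toy English corpus standing in for Arabic text, add-one smoothing, and exhaustive enumeration of candidate splits; the function names (`train_bigram`, `log_prob`, `segmentations`, `best_segmentation`) are hypothetical. Spaces in the OCR output are treated as unreliable, every split into known words is scored, and the most probable one is kept.

```python
import math
from collections import Counter


def train_bigram(sentences):
    """Count unigrams and bigrams, with <s> marking the sentence start."""
    unigrams, bigrams = Counter(), Counter()
    for sentence in sentences:
        words = ["<s>"] + sentence.split()
        unigrams.update(words)
        bigrams.update(zip(words, words[1:]))
    return unigrams, bigrams


def log_prob(words, unigrams, bigrams):
    """Add-one smoothed bigram log-probability of a word sequence."""
    vocab_size = len(unigrams)
    total, prev = 0.0, "<s>"
    for word in words:
        total += math.log((bigrams[(prev, word)] + 1) / (unigrams[prev] + vocab_size))
        prev = word
    return total


def segmentations(text, vocab):
    """Yield every way to split `text` into words drawn from `vocab`."""
    if not text:
        yield []
        return
    for end in range(1, len(text) + 1):
        head = text[:end]
        if head in vocab:
            for rest in segmentations(text[end:], vocab):
                yield [head] + rest


def best_segmentation(text, unigrams, bigrams):
    """Ignore the spaces in the OCR output and pick the most probable split."""
    vocab = set(unigrams) - {"<s>"}
    candidates = list(segmentations(text.replace(" ", ""), vocab))
    if not candidates:
        # No split into known words exists: keep the OCR output as it is.
        return text.split()
    return max(candidates, key=lambda ws: log_prob(ws, unigrams, bigrams))


# Toy corpus (an assumption for illustration only).
corpus = ["a stable is big", "a stable is old", "as big as a table"]
unigrams, bigrams = train_bigram(corpus)
print(best_segmentation("as table isbig", unigrams, bigrams))
# prints ['a', 'stable', 'is', 'big']
```

Both "a stable is big" and "as table is big" are valid splits of the input; the bigram counts favor the first because "a stable" and "stable is" occur in the training text. Exhaustive enumeration is exponential in the worst case, so a practical system would more likely use dynamic programming over split points.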

Keywords: Text detection, Image classification, OCR, Error correction

Article history: Received 1 June 2008, Revised 11 September 2008, Accepted 18 October 2008, Available online 14 November 2008.

Official paper URL: https://doi.org/10.1016/j.patcog.2008.10.031