Text detection and recognition in images and video frames

Abstract

This paper presents a new method for detecting and recognizing text in complex images and video frames. Text detection follows a two-step approach: a fast text localization step, which also enables text size normalization, is combined with a robust machine-learning text verification step applied to background-independent features. Text recognition, applied to the detected text lines, is addressed by a text segmentation step followed by a traditional OCR algorithm within a multi-hypothesis framework that relies on multiple segments, language modeling, and OCR statistics. Experiments conducted on large databases of real broadcast documents demonstrate the validity of our approach.
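The multi-hypothesis idea in the abstract can be illustrated with a minimal sketch. It is not the paper's algorithm: it assumes that each segmentation of a text line yields one OCR string with a log-confidence, and it ranks the candidates by adding a weighted language-model score. The character-bigram model, the add-one smoothing, the `vocab_size` constant, and the names `train_bigram`, `lm_log_prob`, and `select_hypothesis` are all hypothetical choices made for illustration.

```python
import math
from collections import Counter


def train_bigram(corpus):
    """Count character bigrams and their left-context characters in a toy corpus."""
    bigrams, unigrams = Counter(), Counter()
    for word in corpus:
        padded = "^" + word + "$"  # "^" and "$" mark word start and end
        for a, b in zip(padded, padded[1:]):
            bigrams[(a, b)] += 1
            unigrams[a] += 1
    return bigrams, unigrams


def lm_log_prob(text, bigrams, unigrams, vocab_size=64):
    """Add-one smoothed character-bigram log-probability (assumed toy model)."""
    padded = "^" + text + "$"
    total = 0.0
    for a, b in zip(padded, padded[1:]):
        total += math.log((bigrams[(a, b)] + 1) / (unigrams[a] + vocab_size))
    return total


def select_hypothesis(hypotheses, bigrams, unigrams, lm_weight=1.0):
    """Pick the (text, ocr_log_conf) pair with the highest combined score."""
    return max(
        hypotheses,
        key=lambda h: h[1] + lm_weight * lm_log_prob(h[0], bigrams, unigrams),
    )


# Hypothetical OCR outputs for one text line, one per segmentation.
bigrams, unigrams = train_bigram(["news", "new", "next", "net"])
candidates = [("news", -1.0), ("nevvs", -0.9), ("ncws", -1.1)]
best_text, best_conf = select_hypothesis(candidates, bigrams, unigrams)
```

In this toy setup the OCR confidence alone slightly favors the misread string, while the language-model term shifts the choice toward the plausible word. The weight `lm_weight` is an assumed tuning parameter that balances the two sources of evidence.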

Keywords: Text localization, Text segmentation, Text recognition, SVM, MRF, Video OCR

Article history: Received 30 December 2002, Accepted 20 June 2003, Available online 4 October 2003.

Official paper URL: https://doi.org/10.1016/j.patcog.2003.06.001