Text detection and recognition in images and video frames

Abstract

This paper presents a new method for detecting and recognizing text in complex images and video frames. Text detection uses a two-step approach: a fast text localization step, which also enables text size normalization, is followed by a machine learning text verification step applied to background-independent features. Text recognition, applied to the detected text lines, consists of a text segmentation step followed by a traditional OCR algorithm, both run within a multi-hypotheses framework that relies on multiple segments, language modeling, and OCR statistics. Experiments conducted on large databases of real broadcast documents demonstrate the validity of our approach.
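
The multi-hypotheses idea in the abstract can be illustrated with a minimal sketch. This is not the paper's algorithm: the abstract only states that several segmentations are combined with language modeling and OCR statistics. The function names, the character-bigram model, the `default` penalty, and the `lm_weight` parameter below are all assumptions made for illustration.

```python
def language_score(text, bigram_logprob, default=-5.0):
    """Toy language model: sum of character-bigram log-probabilities.

    Bigrams missing from the table receive the (assumed) penalty `default`.
    """
    return sum(
        bigram_logprob.get(text[i:i + 2], default)
        for i in range(len(text) - 1)
    )


def select_hypothesis(hypotheses, bigram_logprob, lm_weight=1.0):
    """Pick the best recognized string among several hypotheses.

    hypotheses: list of (text, ocr_log_confidence) pairs, one per
    segmentation of the same text line (hypothetical input format).
    The score is the OCR log-confidence plus a weighted language score.
    """
    def combined(hypothesis):
        text, ocr_log_confidence = hypothesis
        return ocr_log_confidence + lm_weight * language_score(text, bigram_logprob)

    return max(hypotheses, key=combined)[0]


# Two segmentations of the same line yield two OCR outputs.
bigrams = {"ne": -1.0, "ew": -1.0, "ws": -1.0}
candidates = [("news", -2.0), ("nevvs", -1.5)]
best = select_hypothesis(candidates, bigrams)
```

In this toy case the second candidate has the higher OCR confidence, but its unlikely character bigrams lower its combined score, so the first candidate is selected.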

Keywords: Text localization, Text segmentation, Text recognition, SVM, MRF, Video OCR

Article history: Received 30 December 2002, Accepted 20 June 2003, Available online 4 October 2003.

Official paper URL: https://doi.org/10.1016/j.patcog.2003.06.001