Text detection and recognition in images and video frames
Authors:
Abstract
This paper presents a new method for detecting and recognizing text in complex images and video frames. Text detection follows a two-step approach: a fast text localization step, which also enables text size normalization, is combined with a more robust machine learning text verification step applied to background-independent features. Text recognition, applied to the detected text lines, consists of a text segmentation step followed by a traditional OCR algorithm, both embedded in a multi-hypothesis framework that relies on multiple segmentations, language modeling and OCR statistics. Experiments conducted on large databases of real broadcast documents demonstrate the validity of our approach.
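The multi-hypothesis recognition idea described in the abstract can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration: the `Hypothesis` class, the `select_best` and `recognize_line` functions, the log-domain scores, and the weighted-sum combination rule are hypothetical and are not taken from the paper, which does not specify its scoring formula in the abstract. The segmenters, OCR engine and language model are passed in as plain callables so that the sketch stays self-contained.

```python
from dataclasses import dataclass
from typing import Any, Callable, Sequence, Tuple


@dataclass
class Hypothesis:
    """One candidate reading of a text line (hypothetical structure)."""
    text: str
    ocr_score: float  # assumed log-domain OCR confidence, higher is better
    lm_score: float   # assumed log-domain language model score, higher is better


def select_best(hypotheses: Sequence[Hypothesis], lm_weight: float = 0.5) -> Hypothesis:
    """Pick the hypothesis with the highest combined score.

    The weighted sum is an illustrative assumption, not the paper's rule.
    """
    if not hypotheses:
        raise ValueError("at least one hypothesis is required")
    return max(hypotheses, key=lambda h: h.ocr_score + lm_weight * h.lm_score)


def recognize_line(
    line_image: Any,
    segmenters: Sequence[Callable[[Any], Any]],
    ocr: Callable[[Any], Tuple[str, float]],
    language_model: Callable[[str], float],
    lm_weight: float = 0.5,
) -> Hypothesis:
    """Run OCR on several segmentations of one text line and keep the best."""
    hypotheses = []
    for segment in segmenters:
        segmented = segment(line_image)   # one segmentation hypothesis
        text, ocr_score = ocr(segmented)  # OCR output and its confidence
        hypotheses.append(Hypothesis(text, ocr_score, language_model(text)))
    return select_best(hypotheses, lm_weight)
```

In this sketch, each segmenter yields one candidate segmentation of the same text line, so a segmentation that produces a slightly lower OCR confidence can still win if the language model finds its output far more plausible.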
Keywords: Text localization, Text segmentation, Text recognition, SVM, MRF, Video OCR
Article history: Received 30 December 2002, Accepted 20 June 2003, Available online 4 October 2003.
Official paper URL: https://doi.org/10.1016/j.patcog.2003.06.001