Automatic identification and skew estimation of text lines in real scene images

作者:

Highlights:

摘要

A method for the automatic localization of text embedded in complex images is proposed. It permits to detect the spatial position and the skew of the text lines which are present in the scene and to return a binary representation of each text line. Strengths of the algorithm are independence of text skew and of presence of connected text. After a preprocessing step the input image is segmented in order to obtain a set of connected components which represent the basic objects of the algorithm. Several heuristics are proposed to characterize text objects which depend both on the geometrical features of single components and on the geometrical and spatial relations among components. According to these heuristics several components are discarded and the retained ones are grouped into text lines candidates by means of a divisive hierachical clustering procedure. In the experimental session we describe the application of the algorithm to the extraction of text lines from the images of 100 book covers. Results about skew estimation are also reported.

论文关键词:Text localization,Skew estimation,Image processing,Connected components analysis,Clustering,OCR

论文评审过程:Received 17 November 1997, Revised 19 June 1998, Available online 7 June 2001.

论文官网地址:https://doi.org/10.1016/S0031-3203(98)00108-3