AUTOMATIC TEXT LOCATION IN IMAGES AND VIDEO FRAMES

Abstract

Textual data is very important in a number of applications such as image database indexing and document understanding. The goal of automatic text location without character recognition capabilities is to extract image regions that contain only text. These regions can then be either fed to an optical character recognition module or highlighted for a user. Text location is a very difficult problem because the characters in text can vary in font, size, spacing, alignment, orientation, color and texture. Further, characters are often embedded in a complex background in the image. We propose a new text location algorithm that is suitable for a number of applications, including conversion of newspaper advertisements from paper documents to their electronic versions, World Wide Web search, color image indexing and video indexing. In many of these applications, it is not necessary to extract all the text, so we emphasize extracting important text with large size and high contrast. Our algorithm is very fast and has been shown to be successful in extracting important text in a large number of test images.

Keywords: Automatic text location, Web search, Image database, Video indexing, Multivalued image decomposition, Connected component analysis

Article history: Received 22 January 1998, Revised 23 April 1998, Available online 7 June 2001.

Official paper URL: https://doi.org/10.1016/S0031-3203(98)00067-3