Document segmentation using polynomial spline wavelets

Abstract

Over the past decade, wavelet transforms have been widely used as effective tools for texture segmentation. Segmentation of document images, which usually contain three types of texture information (text, picture, and background), can be regarded as a special case of texture segmentation. B-spline wavelets possess desirable properties, such as compact support and good localization in both time and frequency, that make them effective for texture analysis. Based on the observation that text textures produce rapidly changing and relatively regularly distributed edges in the wavelet transform domain, an efficient document segmentation algorithm is designed using cubic B-spline wavelets. After features are estimated at the outputs of the high-frequency bands of the spline wavelet transform, three-means or two-means classification groups pixels with similar characteristics. We examine and evaluate how decomposition levels, frequency bands, and wavelet functions each contribute to the segmentation results. Further performance analysis reveals the advantages of the proposed method.
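
The pipeline summarized in the abstract (high-frequency band, feature estimation, k-means classification) can be illustrated with a minimal sketch. This is not the paper's implementation, and several parts are assumptions made for illustration: the high-frequency band is approximated by subtracting a cubic B-spline smoothing (kernel [1, 4, 6, 4, 1]/16) from the image, rather than by the paper's actual spline wavelet filter bank; the feature is taken to be the local mean absolute value of that band over a 9-pixel window; and the function names `detail_band`, `local_energy`, `kmeans_1d`, and `segment` are hypothetical.

```python
import numpy as np

# Cubic B-spline smoothing kernel (assumed stand-in for the paper's filters).
B3 = np.array([1.0, 4.0, 6.0, 4.0, 1.0]) / 16.0


def smooth_separable(img, kernel):
    """Convolve rows, then columns, with a 1-D kernel using reflect padding."""
    pad = len(kernel) // 2
    out = np.pad(img, pad, mode="reflect")
    out = np.apply_along_axis(lambda r: np.convolve(r, kernel, mode="valid"), 1, out)
    out = np.apply_along_axis(lambda c: np.convolve(c, kernel, mode="valid"), 0, out)
    return out


def detail_band(img):
    """High-frequency residual: image minus its cubic B-spline smoothing."""
    return img - smooth_separable(img, B3)


def local_energy(band, size=9):
    """Feature estimate: mean absolute band response in a size x size window."""
    box = np.ones(size) / size
    return smooth_separable(np.abs(band), box)


def kmeans_1d(values, k, iters=50):
    """Plain k-means on scalar features with deterministic quantile init."""
    centers = np.quantile(values, np.linspace(0.0, 1.0, k))
    for _ in range(iters):
        labels = np.argmin(np.abs(values[:, None] - centers[None, :]), axis=1)
        for j in range(k):
            if np.any(labels == j):
                centers[j] = values[labels == j].mean()
    labels = np.argmin(np.abs(values[:, None] - centers[None, :]), axis=1)
    return labels, centers


def segment(img, k=2):
    """Label each pixel 0..k-1, ordered by increasing high-frequency energy."""
    feat = local_energy(detail_band(np.asarray(img, dtype=float)))
    labels, centers = kmeans_1d(feat.ravel(), k)
    rank = np.empty(k, dtype=int)
    rank[np.argsort(centers)] = np.arange(k)  # relabel clusters by center value
    return rank[labels].reshape(feat.shape)
```

Calling `segment(page, k=3)` on a grayscale page array would correspond to the three-means case (text, picture, background), and `k=2` to the two-means case. In this sketch the highest label marks the regions with the densest edges, which under the abstract's observation would tend to be text.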

Keywords: Document analysis, k-means classification, Polynomial spline, Segmentation, Wavelet transform

Article history: Received 3 August 1999, Revised 8 August 2000, Accepted 4 October 2000, Available online 30 August 2001.

Official paper URL: https://doi.org/10.1016/S0031-3203(00)00160-6