A multi-scale framework for adaptive binarization of degraded document images

作者:

Highlights:

摘要

In this work, a multi-scale binarization framework is introduced, which can be used along with any adaptive threshold-based binarization method. This framework is able to improve the binarization results and to restore weak connections and strokes, especially in the case of degraded historical documents. This is achieved thanks to localized nature of the framework on the spatial domain. The framework requires several binarizations on different scales, which is addressed by introduction of fast grid-based models. This enables us to explore high scales which are usually unreachable to the traditional approaches. In order to expand our set of adaptive methods, an adaptive modification of Otsu's method, called AdOtsu, is introduced. In addition, in order to restore document images suffering from bleed-through degradation, we combine the framework with recursive adaptive methods. The framework shows promising performance in subjective and objective evaluations performed on available datasets.

论文关键词:Document image processing,Binarization,Adaptive methods,Multi-scale framework

论文评审过程:Received 17 July 2009, Revised 11 November 2009, Accepted 30 December 2009, Available online 11 January 2010.

论文官网地址:https://doi.org/10.1016/j.patcog.2009.12.024