Neuro semantic thresholding using OCR software for high precision OCR applications

作者:

Highlights:

摘要

This paper describes a novel approach to binarization techniques. It presents a way of obtaining a threshold that depends both on the image and the final application using a semantic description of the histogram and a neural network. The intended applications of this technique are high precision OCR algorithms over a limited number of document types.The input image histogram is smoothed and its derivative is found. Using a polygonal version of the derivative and the smoothed histogram, a new description of the histogram is calculated. Using this description and a training set, a general neural network is capable of obtaining an optimum threshold for our application.

论文关键词:Thresholding,Binarization,GRNN,Semantic description,OCR

论文评审过程:Received 7 March 2008, Revised 5 May 2009, Accepted 18 September 2009, Available online 29 September 2009.

论文官网地址:https://doi.org/10.1016/j.imavis.2009.09.011