Context in word recognition

作者:

Highlights:

摘要

Relatively low character error rates can often lead to prohibitive levels of word error rates. This paper examines several techniques for integrating an independent contextual postprocessor (CPP) into a full classification system. Using positional binary n-grams the CPP can correct many errors directly. In those cases where the correction process leads to ambiguity, the CPP can direct additional processing. Experimental results demonstrate that almost all of the derived improvement results from CPP-directed reclassification. This only requires that the CPP have the classifier likelihood fed forward to it. Therefore, a standardized CPP can be built independently of the rest of the classification system. An initial 45% word error rate is reduced to about a 2% word error rate and a 1% reject rate. Presence of a dictionary allows these figures to be reduced even further.

论文关键词:Context,Contextual postprocessor,Character recognition,Positional binary n-grams,Error detection,Error correction

论文评审过程:Received 25 October 1974, Revised 19 May 1975, Available online 16 May 2003.

论文官网地址:https://doi.org/10.1016/0031-3203(76)90027-3