Phrase-based correction model for improving handwriting recognition accuracies

作者:

Highlights:

摘要

We propose a method for increasing word recognition accuracies by correcting the output of a handwriting recognition system. We treat the handwriting recognizer as a black box, such that there is no access to its internals. This enables us to keep our algorithm general and independent of any particular system. We use a novel method for correcting the output based on a “phrase-based” system in contrast to traditional source-channel models. We report the accuracies of two in-house handwritten word recognizers before and after the correction. We achieve highly encouraging results for a large synthetically generated dataset. We also report results for a commercially available OCR on real data.

论文关键词:Post-processing,Noisy channel,Handwriting recognition,Error correction,Viterbi decoding

论文评审过程:Received 6 August 2008, Revised 9 December 2008, Accepted 21 December 2008, Available online 6 January 2009.

论文官网地址:https://doi.org/10.1016/j.patcog.2008.12.014