Handwritten word-spotting using hidden Markov models and universal vocabularies

作者:

Highlights:

摘要

Handwritten word-spotting is traditionally viewed as an image matching task between one or multiple query word-images and a set of candidate word-images in a database. This is a typical instance of the query-by-example paradigm. In this article, we introduce a statistical framework for the word-spotting problem which employs hidden Markov models (HMMs) to model keywords and a Gaussian mixture model (GMM) for score normalization. We explore the use of two types of HMMs for the word modeling part: continuous HMMs (C-HMMs) and semi-continuous HMMs (SC-HMMs), i.e. HMMs with a shared set of Gaussians. We show on a challenging multi-writer corpus that the proposed statistical framework is always superior to a traditional matching system which uses dynamic time warping (DTW) for word-image distance computation. A very important finding is that the SC-HMM is superior when labeled training data is scarce—as low as one sample per keyword—thanks to the prior information which can be incorporated in the shared set of Gaussians.

论文关键词:Word-spotting,Hidden Markov model,Score normalization,Universal vocabulary,Handwriting recognition

论文评审过程:Received 10 June 2008, Revised 5 December 2008, Accepted 2 February 2009, Available online 20 February 2009.

论文官网地址:https://doi.org/10.1016/j.patcog.2009.02.005