Towards an omnilingual word retrieval system for ancient manuscripts

作者:

Highlights:

摘要

In this article, we introduce the first method that allows the indexation of ancient manuscripts of any language and alphabet. We describe a word retrieval engine inspired by recent word-spotting advances on ancient manuscripts. Our approach does not need any layout segmentation and makes use of features fitted to any type of alphabet (Latin, Arabic, Chinese, etc.) and writing. The engine is tested on numerous documents and in several use-cases.

论文关键词:Document indexing,Word-spotting,Word retrieval,Ancient documents,Segmentation-free,Omnilingual

论文评审过程:Received 22 April 2008, Revised 12 December 2008, Accepted 16 January 2009, Available online 3 February 2009.

论文官网地址:https://doi.org/10.1016/j.patcog.2009.01.026