Large-scale document image retrieval and classification with runlength histograms and binary embeddings
作者:
Highlights:
•
摘要
We present a new document image descriptor based on multi-scale runlength histograms. This descriptor does not rely on layout analysis and can be computed efficiently. We show how this descriptor can achieve state-of-the-art results on two very different public datasets in classification and retrieval tasks. Moreover, we show how we can compress and binarize these descriptors to make them suitable for large-scale applications. We can achieve state-of-the-art results in classification using binary descriptors of as few as 16–64 bits.
论文关键词:Visual document descriptor,Compression,Large-scale,Retrieval,Classification
论文评审过程:Received 16 January 2012, Revised 26 November 2012, Accepted 10 December 2012, Available online 19 December 2012.
论文官网地址:https://doi.org/10.1016/j.patcog.2012.12.004