Content-based image retrieval and semantic automatic image annotation based on the weighted average of triangular histograms using support vector machine

Authors: Zahid Mehmood, Toqeer Mahmood, Muhammad Arshad Javid

Abstract

In recent years, the rapid growth of multimedia content has made content-based image retrieval (CBIR) a challenging research problem. The content-based attributes of an image are associated with the position of objects and regions within the image. Adding these content-based attributes to image retrieval enhances its performance. In the last few years, the bag-of-visual-words (BoVW) based image representation model has gained attention and significantly improved the efficiency and effectiveness of CBIR. In the BoVW-based image representation model, an image is represented as an order-less histogram of visual words, ignoring the spatial attributes. In this paper, we present a novel image representation based on the weighted average of triangular histograms (WATH) of visual words. The proposed approach adds the spatial contents of the image to the inverted index of the BoVW model, and reduces both the overfitting problem at larger dictionary sizes and the semantic gap between high-level image semantics and low-level image features. The qualitative and quantitative analyses conducted on three image benchmarks demonstrate the effectiveness of the proposed WATH-based approach.
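To make the "order-less histogram" limitation concrete, the following is a minimal sketch, not the authors' implementation. It assumes keypoints have already been quantized to visual-word ids, and it assumes, purely for illustration, that the image is split into two triangles along its main diagonal. The abstract does not say how WATH forms or weights its triangles, so no weighting step is shown. All function names here are hypothetical.

```python
from collections import Counter


def bovw_histogram(word_ids, vocab_size):
    """Order-less, L1-normalized histogram of visual-word ids."""
    total = len(word_ids)
    if total == 0:
        return [0.0] * vocab_size
    counts = Counter(word_ids)
    return [counts.get(w, 0) / total for w in range(vocab_size)]


def triangular_histograms(keypoints, word_ids, width, height, vocab_size):
    """One histogram per triangle.

    Assumption for illustration only: the diagonal from (0, 0) to
    (width, height) splits the image into an upper and a lower triangle.
    """
    upper, lower = [], []
    for (x, y), word in zip(keypoints, word_ids):
        # A point lies on or below the diagonal when y / height >= x / width.
        if y * width >= x * height:
            lower.append(word)
        else:
            upper.append(word)
    return (bovw_histogram(upper, vocab_size),
            bovw_histogram(lower, vocab_size))


# Two toy 4x4 "images" with the same visual words at swapped positions.
points = [(3, 1), (1, 3)]
words_a = [0, 1]
words_b = [1, 0]

global_a = bovw_histogram(words_a, vocab_size=2)
global_b = bovw_histogram(words_b, vocab_size=2)
tri_a = triangular_histograms(points, words_a, 4, 4, vocab_size=2)
tri_b = triangular_histograms(points, words_b, 4, 4, vocab_size=2)

print(global_a == global_b)  # the global histograms cannot tell the images apart
print(tri_a == tri_b)        # the per-triangle histograms can
```

In this toy case the global BoVW histograms are identical while the per-triangle histograms differ, which is the kind of spatial information the abstract says the plain BoVW model ignores.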

Keywords: Content-based image retrieval, Bag-of-visual-words, Support vector machine, Dense SIFT, Image classification

Paper URL: https://doi.org/10.1007/s10489-017-0957-5