FRWCAE: joint faster-RCNN and Wasserstein convolutional auto-encoder for instance retrieval

作者:Yi-yang Zhang, Yong Feng, Da-jiang Liu, Jia-xing Shang, Bao-hua Qiang

摘要

Based on the powerful feature extraction capability of deep convolutional neural networks, image-level retrieval methods have achieved superior performance compared to the hand-crafted features and indexing algorithms. However, people tend to focus on foreground objects of interest in images. Locating objects accurately and using object-level features for retrieval become the essential tasks of instance search. In this work, we propose a novel instance retrieval method FRWACE, which combines the Faster R-CNN framework for object-level feature extraction with a brand-new Wasserstein Convolutional Auto-encoder for dimensionality reduction. In addition, we propose a considerate category-first spatial re-rank strategy to improve instance-level retrieval accuracy. Extensive experiments on four large datasets Oxford 5K, Paris 6K, Oxford 105K and Paris 106K show that our approach has achieved significant performance compared to the state-of-the-arts.

论文关键词:Instance-level retrieval, Convolutional auto-encoder, Wasserstein distance, Dimensionality reduction

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10489-019-01625-y