Semantic ranking structure preserving for cross-modal retrieval
作者:Hui Liu, Yong Feng, Mingliang Zhou, Baohua Qiang
摘要
Cross-modal retrieval not only needs to eliminate the heterogeneity of modalities, but also needs to constrain the return order of retrieval results. Accordingly, we propose a novel common representation space learning method, called Semantic Ranking Structure Preserving (SRSP) for Cross-modal Retrieval in this paper. First, the dependency relationship between labels is used to minimize the discriminative loss of multi-modal data and mine potential relationships between samples to get richer semantic information in the common space. Second, we constrain the correlation ranking of representations in common space, so as to break the modal gap and promote the multi-modal correlation learning. The comprehensive experimental comparison results show that our algorithm substantially enhances the performance and consistently outperforms very recent algorithms in terms of widely used cross-modal benchmark datasets.
论文关键词:Cross-modal retrieval, Common space learning, Graph convolutional, Semantic structure preserving
论文评审过程:
论文官网地址:https://doi.org/10.1007/s10489-020-01930-x