Self-Attention based fine-grained cross-media hybrid network

作者：

Highlights：

• Propose a common attention space learning method for different feature spaces.

• Jointly learn the relative position encoding of local features and the spatial relationship between features.

• Achieve state-of-the-art performance on fine-grained cross-media retrieval task.

摘要

•Propose a common attention space learning method for different feature spaces.•Jointly learn the relative position encoding of local features and the spatial relationship between features.•Achieve state-of-the-art performance on fine-grained cross-media retrieval task.

论文关键词：Fine-Grained,Cross-Media,Retrieval,Attention

论文评审过程：Received 3 October 2021, Revised 20 April 2022, Accepted 24 April 2022, Available online 26 April 2022, Version of Record 22 May 2022.

论文官网地址：https://doi.org/10.1016/j.patcog.2022.108748

原文链接
谷歌学术
必应学术
百度学术