Self-Attention based fine-grained cross-media hybrid network

作者:

Highlights:

• Propose a common attention space learning method for different feature spaces.

• Jointly learn the relative position encoding of local features and the spatial relationship between features.

• Achieve state-of-the-art performance on fine-grained cross-media retrieval task.

摘要

•Propose a common attention space learning method for different feature spaces.•Jointly learn the relative position encoding of local features and the spatial relationship between features.•Achieve state-of-the-art performance on fine-grained cross-media retrieval task.

论文关键词:Fine-Grained,Cross-Media,Retrieval,Attention

论文评审过程:Received 3 October 2021, Revised 20 April 2022, Accepted 24 April 2022, Available online 26 April 2022, Version of Record 22 May 2022.

论文官网地址:https://doi.org/10.1016/j.patcog.2022.108748