Improving image captioning with Pyramid Attention and SC-GAN

作者:

Highlights:

• We proposed a variant of self-attention to extract the global features of the image.

• The visual and semantic relationship of different objects are integrated into our local-relation attention.

• We use a new self-critical training method to apply GAN to image captioning.

摘要

Highlights•We proposed a variant of self-attention to extract the global features of the image.•The visual and semantic relationship of different objects are integrated into our local-relation attention.•We use a new self-critical training method to apply GAN to image captioning.

论文关键词:Image captioning,Pyramid Attention network,Self-critical training,Reinforcement learning,Generative adversarial network,Sequence-level learning

论文评审过程:Received 18 September 2021, Revised 17 November 2021, Accepted 18 November 2021, Available online 30 November 2021, Version of Record 10 December 2021.

论文官网地址:https://doi.org/10.1016/j.imavis.2021.104340