Past is important: Improved image captioning by looking back in time
作者:
Highlights:
• In the paper, we first propose a Visual Reserved model that enables previous visual context to be considered for the current sequence reasoning.
• Next, a Attentional-Fluctuation Supervised model is also proposed in reinforcement learning structure.
摘要
•In the paper, we first propose a Visual Reserved model that enables previous visual context to be considered for the current sequence reasoning.•Next, a Attentional-Fluctuation Supervised model is also proposed in reinforcement learning structure.
论文关键词:Image captioning,Reinforcement learning,Visual attention
论文评审过程:Received 27 January 2020, Revised 29 December 2020, Accepted 24 January 2021, Available online 10 February 2021, Version of Record 26 February 2021.
论文官网地址:https://doi.org/10.1016/j.image.2021.116183