Human-Centric Image Captioning
作者:
Highlights:
• We propose a new task of Human-Centric Image Captioning and build a dataset - HC-COCO.
• We introduce the Human-Centric Feature Hierarchization to hierarchize image features more explicitly for human-centric captioning by incorporating human body part information.
• We propose a novel three-branch architecture for the separate information flow control and optimization, which helps generating more detailed captions for human activities.
• Our proposed method achieves state-of-the-art performance on HC-COCO, outperforming the previous state of the art by a clear margin.
摘要
•We propose a new task of Human-Centric Image Captioning and build a dataset - HC-COCO.•We introduce the Human-Centric Feature Hierarchization to hierarchize image features more explicitly for human-centric captioning by incorporating human body part information.•We propose a novel three-branch architecture for the separate information flow control and optimization, which helps generating more detailed captions for human activities.•Our proposed method achieves state-of-the-art performance on HC-COCO, outperforming the previous state of the art by a clear margin.
论文关键词:Human-centric,Image captioning,Feature hierarchization
论文评审过程:Received 9 February 2021, Revised 1 January 2022, Accepted 21 January 2022, Available online 22 January 2022, Version of Record 6 February 2022.
论文官网地址:https://doi.org/10.1016/j.patcog.2022.108545