Top-down and bottom-up attentional multiple instance learning for still image action recognition
作者:
Highlights:
• An end-to-end approach that discovers action-related image parts.
• Creates pixelimage-level attention masks.
• Image-level fine-grained action maps are used to localize the action related cues.
• Extensive evaluations on four different still image action recognition datasets.
• Spatial and top-down attention can help identify action-related regions and pixels.
摘要
•An end-to-end approach that discovers action-related image parts.•Creates pixelimage-level attention masks.•Image-level fine-grained action maps are used to localize the action related cues.•Extensive evaluations on four different still image action recognition datasets.•Spatial and top-down attention can help identify action-related regions and pixels.
论文关键词:Still image action recognition,Multiple instance learning,Attention
论文评审过程:Received 11 November 2020, Revised 25 January 2022, Accepted 11 February 2022, Available online 22 February 2022, Version of Record 17 March 2022.
论文官网地址:https://doi.org/10.1016/j.image.2022.116664