Video action detection by learning graph-based spatio-temporal interactions
Authors:
Highlights:
• Video action detection is addressed using spatio-temporal graphs.
• A single graph can handle spatial and temporal relationships.
• Improvements over robust backbones and state-of-the-art results are presented.
• Improvements are obtained without backbone finetuning, learning only interactions.
Keywords: Video understanding, Action detection, Graph learning
Review history: Received 29 June 2020, Revised 1 February 2021, Accepted 24 February 2021, Available online 27 February 2021, Version of Record 5 March 2021.
Paper URL: https://doi.org/10.1016/j.cviu.2021.103187