Video action detection by learning graph-based spatio-temporal interactions

作者：

Highlights：

• Video action detection is addressed using spatio-temporal graphs.

• A single graph can handle spatial and temporal relationships.

• Improvements over robust backbones and state-of-the-art results are presented.

• Improvements are obtained without backbone finetuning, learning only interactions.

摘要

•Video action detection is addressed using spatio-temporal graphs.•A single graph can handle spatial and temporal relationships.•Improvements over robust backbones and state-of-the-art results are presented.•Improvements are obtained without backbone finetuning, learning only interactions.

论文关键词：Video understanding,Action detection,Graph learning

论文评审过程：Received 29 June 2020, Revised 1 February 2021, Accepted 24 February 2021, Available online 27 February 2021, Version of Record 5 March 2021.

论文官网地址：https://doi.org/10.1016/j.cviu.2021.103187

原文链接
谷歌学术
必应学术
百度学术