Weakly-supervised action localization via embedding-modeling iterative optimization
作者:
Highlights:
• An iterative optimizing mechanism is designed for effective action recognition with trimmed-untrimmed parallel modeling.
• A shared subspace embedding is proposed with generative adversarial networks for coherent knowledge transfer.
• A two-stage self-attentive representation learning workflow is developed to capture the fine-grained frame-level relevance.
• Extensive experiments are conducted on two challenging untrimmed video datasets with superior results.
摘要
•An iterative optimizing mechanism is designed for effective action recognition with trimmed-untrimmed parallel modeling.•A shared subspace embedding is proposed with generative adversarial networks for coherent knowledge transfer.•A two-stage self-attentive representation learning workflow is developed to capture the fine-grained frame-level relevance.•Extensive experiments are conducted on two challenging untrimmed video datasets with superior results.
论文关键词:Action recognition,Temporal action localization,Attention mechanism,Generative adversarial networks,Subspace embedding
论文评审过程:Received 9 October 2019, Revised 24 September 2020, Accepted 14 December 2020, Available online 16 January 2021, Version of Record 21 January 2021.
论文官网地址:https://doi.org/10.1016/j.patcog.2021.107831