Human action recognition toward massive-scale sport sceneries based on deep multi-model feature fusion
作者:
Highlights:
• We fuse visual feature, audio signal and skeleton posture into a hybrid feature representation.
• We convert the audio information to spectrogram image.
• We conduct a comparative study to compare the performance of different feature fusion.
摘要
•We fuse visual feature, audio signal and skeleton posture into a hybrid feature representation.•We convert the audio information to spectrogram image.•We conduct a comparative study to compare the performance of different feature fusion.
论文关键词:Human action recognition,Multi-model feature fusion,Sport scenes,Multi-class SVM,Human behavior
论文评审过程:Received 17 September 2019, Revised 10 January 2020, Accepted 20 January 2020, Available online 23 January 2020, Version of Record 3 March 2020.
论文官网地址:https://doi.org/10.1016/j.image.2020.115802