Convolutional Neural Networks and Long Short-Term Memory for skeleton-based human activity and hand gesture recognition

作者:

Highlights:

• Combination of a Convolutional Neural Network (CNN) and a Long Short-Term Memory (LSTM) recurrent network for skeleton-based human activity and hand gesture recognition.

• Two-stage training strategy which firstly focuses on the CNN training and, secondly, adjusts the full method CNN+LSTM.

• A method for data augmentation in the context of spatiotemporal 3D data sequences.

• An exhaustive experimental study on publicly available data benchmarks with respect to the state-of-the-art most representative methods.

• Comparison among different CPU and GPU platforms.

摘要

•Combination of a Convolutional Neural Network (CNN) and a Long Short-Term Memory (LSTM) recurrent network for skeleton-based human activity and hand gesture recognition.•Two-stage training strategy which firstly focuses on the CNN training and, secondly, adjusts the full method CNN+LSTM.•A method for data augmentation in the context of spatiotemporal 3D data sequences.•An exhaustive experimental study on publicly available data benchmarks with respect to the state-of-the-art most representative methods.•Comparison among different CPU and GPU platforms.

论文关键词:Deep learning,Convolutional Neural Network,Recurrent neural network,Long Short-Term Memory,Human activity recognition,Hand gesture recognition,Real-time

论文评审过程:Received 2 December 2016, Revised 6 October 2017, Accepted 24 October 2017, Available online 21 December 2017, Version of Record 21 December 2017.

论文官网地址:https://doi.org/10.1016/j.patcog.2017.10.033