Hand sign language recognition using multi-view hand skeleton

作者:

Highlights:

• We propose a model for hand sign recognition using SSD, 3DCNN, and LSTM from RGB.

• We propose a dataset including 10′000 RGB sign videos.

• We build hand skeleton using multi-view projection of 3D hand keypoints.

• Our model outperforms state-of-the-art models on NYU and First-Person datasets.

• We apply 3DCNN on stacked inputs to get discriminant local spatio-temporal features.

摘要

•We propose a model for hand sign recognition using SSD, 3DCNN, and LSTM from RGB.•We propose a dataset including 10′000 RGB sign videos.•We build hand skeleton using multi-view projection of 3D hand keypoints.•Our model outperforms state-of-the-art models on NYU and First-Person datasets.•We apply 3DCNN on stacked inputs to get discriminant local spatio-temporal features.

论文关键词:Multi-view hand skeleton,Hand sign language recognition,3DCNN,Hand pose estimation,RGB video,Hand action recognition

论文评审过程:Received 18 September 2019, Revised 26 December 2019, Accepted 21 February 2020, Available online 22 February 2020, Version of Record 11 March 2020.

论文官网地址:https://doi.org/10.1016/j.eswa.2020.113336