Illumination invariants in deep video expression recognition
作者:
Highlights:
• highlights
• We develop a scale invariant architecture for generating illumination invariant deep motion features.
• We report state of the art results for video gesture recognition using spatio-temporal convolutional neural networks.
• We introduce an improved topology and protocol for semi-supervised learning, where the number of labeled data points is only a fraction of the entire dataset.
摘要
highlights•We develop a scale invariant architecture for generating illumination invariant deep motion features.•We report state of the art results for video gesture recognition using spatio-temporal convolutional neural networks.•We introduce an improved topology and protocol for semi-supervised learning, where the number of labeled data points is only a fraction of the entire dataset.
论文关键词:Deep learning,Expression recognition,Video classification,Neural nets,Machine learning
论文评审过程:Received 23 May 2017, Revised 19 September 2017, Accepted 15 October 2017, Available online 20 October 2017, Version of Record 21 December 2017.
论文官网地址:https://doi.org/10.1016/j.patcog.2017.10.017