Learning spatial-temporal deformable networks for unconstrained face alignment and tracking in videos
作者:
Highlights:
• In our approach, we propose a differential transformer module, which typically learns the deformable offsets to adaptively augment the receptive field. By doing this, we achieve shape-informative and robust feature representation versus conventional fixed-length filters for high-performance face alignment.
• Extensively, we carefully develop a temporal relational reasoning module integrated in our DHGN. Consequently, it infers the temporal offsets to capture the meaningful ordinal relationship among frames, which reinforces the robustness of our model to temporal variations over time steps.
• Experimentally, our DHGN consistently outperforms most existing face alignment methods on 300-W challengingset and COFW including large variations. Moreover, our proposed T-DHGN achieves smoothing performance in the difficult face tracking dataset, i.e., 300-VW Category Three.
摘要
•In our approach, we propose a differential transformer module, which typically learns the deformable offsets to adaptively augment the receptive field. By doing this, we achieve shape-informative and robust feature representation versus conventional fixed-length filters for high-performance face alignment.•Extensively, we carefully develop a temporal relational reasoning module integrated in our DHGN. Consequently, it infers the temporal offsets to capture the meaningful ordinal relationship among frames, which reinforces the robustness of our model to temporal variations over time steps.•Experimentally, our DHGN consistently outperforms most existing face alignment methods on 300-W challengingset and COFW including large variations. Moreover, our proposed T-DHGN achieves smoothing performance in the difficult face tracking dataset, i.e., 300-VW Category Three.
论文关键词:Face alignment,Face tracking,Spatial transformer,Relational reasoning,Video analysis,Biometrics
论文评审过程:Received 30 May 2019, Revised 13 March 2020, Accepted 29 March 2020, Available online 1 June 2020, Version of Record 23 July 2020.
论文官网地址:https://doi.org/10.1016/j.patcog.2020.107354