A weight initialization based on the linear product structure for neural networks
作者:
Highlights:
• We consider the neural network training from a nonlinear computation point of view.
• A new linear product structure initialization strategy has been developed for training neural networks.
• Theoretical analysis shows that the LPS initialization yields a low probability of dying ReLU.
摘要
•We consider the neural network training from a nonlinear computation point of view.•A new linear product structure initialization strategy has been developed for training neural networks.•Theoretical analysis shows that the LPS initialization yields a low probability of dying ReLU.
论文关键词:Weight initialization,Linear product structure,Neural networks,Nonlinear computation
论文评审过程:Received 16 March 2021, Revised 25 September 2021, Accepted 29 September 2021, Available online 19 October 2021, Version of Record 19 October 2021.
论文官网地址:https://doi.org/10.1016/j.amc.2021.126722