Stochastic batch size for adaptive regularization in deep network optimization
作者:
Highlights:
• Adaptive regularization for deep network optimization via parameter-wise batch size.
• The stochastic batch size reflects local and global properties of each parameter.
• Beneficial for practical studies where the number of training examples is small.
摘要
•Adaptive regularization for deep network optimization via parameter-wise batch size.•The stochastic batch size reflects local and global properties of each parameter.•Beneficial for practical studies where the number of training examples is small.
论文关键词:Deep network optimization,Adaptive regularization,Stochastic gradient descent,Adaptive mini-batch size
论文评审过程:Received 4 August 2020, Revised 11 November 2021, Accepted 3 May 2022, Available online 5 May 2022, Version of Record 10 May 2022.
论文官网地址:https://doi.org/10.1016/j.patcog.2022.108776