Achieving adversarial robustness via sparsity

Authors: Ningyi Liao, Shufan Wang, Liyao Xiang, Nanyang Ye, Shuo Shao, Pengzhi Chu

Abstract

Network pruning is known to produce compact models without much accuracy degradation. However, how the pruning process affects a network’s robustness, and the mechanism behind that effect, remain unresolved. In this work, we theoretically prove that the sparsity of network weights is closely associated with model robustness. Through experiments on a variety of adversarial pruning methods, image-classification models and datasets, we find that weight sparsity does not hurt robustness but improves it, and that both weight inheritance from the lottery ticket and adversarial training improve model robustness under network pruning. Based on these findings, we propose a novel adversarial training method called inverse weights inheritance, which imposes a sparse weight distribution on a large network by inheriting weights from a small network, thereby improving the robustness of the large network.
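
As a rough illustration of the idea of imposing a sparse weight distribution on a large network by inheriting weights from a small one, the following is a minimal sketch only. It is one possible reading of the abstract, not the authors' implementation. The helper names (`magnitude_mask`, `inherit_weights`, `sparsity_of`), the use of magnitude pruning to obtain the small network, and the choice to start pruned positions at zero are all assumptions made for illustration. Adversarial training is omitted entirely.

```python
import random


def magnitude_mask(weights, sparsity):
    """Return a 0/1 mask that zeroes the smallest-magnitude fraction of entries."""
    rows, cols = len(weights), len(weights[0])
    # Rank every position by the absolute value of its weight, smallest first.
    order = sorted(
        ((i, j) for i in range(rows) for j in range(cols)),
        key=lambda ij: abs(weights[ij[0]][ij[1]]),
    )
    n_prune = int(rows * cols * sparsity)
    mask = [[1] * cols for _ in range(rows)]
    for i, j in order[:n_prune]:
        mask[i][j] = 0
    return mask


def inherit_weights(small_weights, mask):
    """Initialise a full-size layer from the small network (assumption: pruned slots start at zero)."""
    return [
        [w if m else 0.0 for w, m in zip(w_row, m_row)]
        for w_row, m_row in zip(small_weights, mask)
    ]


def sparsity_of(weights):
    """Fraction of entries that are exactly zero."""
    total = sum(len(row) for row in weights)
    zeros = sum(1 for row in weights for w in row if w == 0.0)
    return zeros / total


# Hypothetical 4x5 dense layer standing in for the large network.
rng = random.Random(0)
large = [[rng.gauss(0.0, 1.0) for _ in range(5)] for _ in range(4)]

# Assumption: the small network is obtained by magnitude pruning the large one.
mask = magnitude_mask(large, sparsity=0.75)
small = [[w * m for w, m in zip(w_row, m_row)] for w_row, m_row in zip(large, mask)]

# The large network starts from the small network's sparse weights.
inherited = inherit_weights(small, mask)
print(sparsity_of(inherited))
```

In a real setting the small network would be trained before its weights are inherited, and the large network would then continue training from this sparse starting point.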

Keywords: Adversarial learning, Neural network pruning, Robustness, Sparsity

Paper URL: https://doi.org/10.1007/s10994-021-06049-9