AutoPruner: An end-to-end trainable filter pruning method for efficient deep model inference
作者:
Highlights:
• Filter selection and model fine-tuning are integrated into a single end-to-end trainable framework.
• Adaptive compression ratio and multi-layer compression.
• Good generalization ability.
摘要
•Filter selection and model fine-tuning are integrated into a single end-to-end trainable framework.•Adaptive compression ratio and multi-layer compression.•Good generalization ability.
论文关键词:Neural network pruning,Model compression,CNN acceleration
论文评审过程:Received 7 September 2019, Revised 15 May 2020, Accepted 16 May 2020, Available online 25 May 2020, Version of Record 15 June 2020.
论文官网地址:https://doi.org/10.1016/j.patcog.2020.107461