Efficient semantic segmentation with pyramidal fusion
作者:
Highlights:
• We present pyramidal fusion: a principled approach for dense recognition based on resolution pyramids.
• Our pyramidal model outperforms all previous semantic segmentation approaches aiming at real-time operation.
• It achieves 76.4% mIoU on Cityscapes test by processing 2MPx images at 34 Hz on GTX 1080 Ti.
• Our model acts as an ensemble of shallow models with a large effective receptive field.
摘要
•We present pyramidal fusion: a principled approach for dense recognition based on resolution pyramids.•Our pyramidal model outperforms all previous semantic segmentation approaches aiming at real-time operation.•It achieves 76.4% mIoU on Cityscapes test by processing 2MPx images at 34 Hz on GTX 1080 Ti.•Our model acts as an ensemble of shallow models with a large effective receptive field.
论文关键词:Semantic segmentation,Real-time inference,Shared resolution pyramid,Computer vision,Deep learning
论文评审过程:Received 28 July 2019, Revised 20 July 2020, Accepted 19 August 2020, Available online 20 August 2020, Version of Record 1 November 2020.
论文官网地址:https://doi.org/10.1016/j.patcog.2020.107611