Semantic segmentation using stride spatial pyramid pooling and dual attention decoder
作者:
Highlights:
• We propose an SSPP structure to capture multiscale semantic information.
• Attention mechanism is applied to bridge the information gap in segmentation networks.
• We propose a new decoder to make full use of the low- and high-level feature maps.
• Auxiliary loss is applied to make the network easier to train.
• Our method attains state-of-the-art performance on PASCAL VOC 2012, Cityscapes and COCO-Stuff.
摘要
•We propose an SSPP structure to capture multiscale semantic information.•Attention mechanism is applied to bridge the information gap in segmentation networks.•We propose a new decoder to make full use of the low- and high-level feature maps.•Auxiliary loss is applied to make the network easier to train.•Our method attains state-of-the-art performance on PASCAL VOC 2012, Cityscapes and COCO-Stuff.
论文关键词:Semantic segmentation,Convolutional neural networks,Pyramid pooling,Attention mechanism
论文评审过程:Received 7 February 2019, Revised 24 May 2020, Accepted 12 June 2020, Available online 13 June 2020, Version of Record 18 June 2020.
论文官网地址:https://doi.org/10.1016/j.patcog.2020.107498