Distance transform regression for spatially-aware deep semantic segmentation

作者:

Highlights:

摘要

Understanding visual scenes relies more and more on dense pixel-wise classification obtained via deep fully convolutional neural networks. However, due to the nature of the networks, predictions often suffer from blurry boundaries and ill-segmented shapes, fueling the need for post-processing. This work introduces a new semantic segmentation regularization based on the regression of a distance transform. After computing the distance transform on the label masks, we train a FCN in a multi-task setting in both discrete and continuous spaces by learning jointly classification and distance regression. This requires almost no modification of the network structure and adds a very low overhead to the training process. Learning to approximate the distance transform back-propagates spatial cues that implicitly regularizes the segmentation. We validate this technique with several architectures on various datasets, and we show significant improvements compared to competitive baselines.

论文关键词:

论文评审过程:Received 28 September 2018, Revised 23 August 2019, Accepted 26 August 2019, Available online 6 September 2019, Version of Record 1 November 2019.

论文官网地址:https://doi.org/10.1016/j.cviu.2019.102809