Distance transform regression for spatially-aware deep semantic segmentation

作者：

Highlights：

•

摘要

Understanding visual scenes relies more and more on dense pixel-wise classification obtained via deep fully convolutional neural networks. However, due to the nature of the networks, predictions often suffer from blurry boundaries and ill-segmented shapes, fueling the need for post-processing. This work introduces a new semantic segmentation regularization based on the regression of a distance transform. After computing the distance transform on the label masks, we train a FCN in a multi-task setting in both discrete and continuous spaces by learning jointly classification and distance regression. This requires almost no modification of the network structure and adds a very low overhead to the training process. Learning to approximate the distance transform back-propagates spatial cues that implicitly regularizes the segmentation. We validate this technique with several architectures on various datasets, and we show significant improvements compared to competitive baselines.

论文关键词：

论文评审过程：Received 28 September 2018, Revised 23 August 2019, Accepted 26 August 2019, Available online 6 September 2019, Version of Record 1 November 2019.

论文官网地址：https://doi.org/10.1016/j.cviu.2019.102809