Semantic-aware scene recognition

作者：

Highlights：

• A novel approach for scene recognition based on an end-to-end multi-modal CNN that combines image and context information by means of an attention module.

• Context information, in the shape of semantic segmentation, is used to gate features extracted from an RGB image.

• The gating process reinforces the learning of indicative scene content and enhances scene disambiguation.

• The proposed approach outperforms state-of-the-art performance while significantly reducing the number of network parameters.

摘要

•A novel approach for scene recognition based on an end-to-end multi-modal CNN that combines image and context information by means of an attention module.•Context information, in the shape of semantic segmentation, is used to gate features extracted from an RGB image.•The gating process reinforces the learning of indicative scene content and enhances scene disambiguation.•The proposed approach outperforms state-of-the-art performance while significantly reducing the number of network parameters.

论文关键词：Scene recognition,Deep learning,Convolutional neural networks,Semantic segmentation

论文评审过程：Received 10 September 2019, Revised 22 January 2020, Accepted 1 February 2020, Available online 1 February 2020, Version of Record 6 February 2020.

论文官网地址：https://doi.org/10.1016/j.patcog.2020.107256