End-to-End Trained Sparse Coding Network with Spatial Pyramid Pooling for Image Classification

作者:Boheng Chen, Yige Wang, Gang Wei, Jie Li, Biyun Ma

摘要

Spatial pyramid matching using sparse coding (ScSPM) has become an efficient method and a benchmark in image classification. However, since it is unsupervised, the trained dictionary may be suboptimal. To further improve classification accuracy, in this paper we propose a sparse coding network with spatial pyramid pooling based on the end-to-end deep learning approach. In our new system, the minimization problem in sparse coding can be modeled as a feed-forward neural network and image features can be extracted by the deep convolutional network. By minimizing the final classifier loss using the end-to-end deep learning method, the sparse coding network can be trained in a supervised way. Our proposed model is tested on three image databases and in terms of classification accuracy, it significantly outperforms ScSPM. Compared with other image classification approaches based on deep learning, it can also achieve a noticeable improvement.

论文关键词:Sparse coding network, Spatial pyramid pooling, Image classification, Deep convolutional network

论文评审过程:

论文官网地址:https://doi.org/10.1007/s11063-018-9967-5