End-to-end environmental sound classification using a 1D convolutional neural network

作者:

Highlights:

• A model based on deep learning for environmental sound classification is proposed.

• Model provides the trade-off between the audio length, accuracy and parameter space.

• Model omits the signal processing modules and learns representations from audio.

• Model has a small parameter space compared to other architectures in the literature.

• The frequency and magnitude responses of some of the learned filters are analyzed.

摘要

•A model based on deep learning for environmental sound classification is proposed.•Model provides the trade-off between the audio length, accuracy and parameter space.•Model omits the signal processing modules and learns representations from audio.•Model has a small parameter space compared to other architectures in the literature.•The frequency and magnitude responses of some of the learned filters are analyzed.

论文关键词:Convolutional neural network,Environmental sound classification,Deep learning,Gammatone filterbank

论文评审过程:Received 10 April 2019, Revised 1 June 2019, Accepted 19 June 2019, Available online 20 June 2019, Version of Record 28 June 2019.

论文官网地址:https://doi.org/10.1016/j.eswa.2019.06.040