Learning representations of sound using trainable COPE feature extractors

作者:

Highlights:

• We introduce trainable COPE feature extractors for sound representation learning.

• A COPE feature extractor is trained using a single prototype sound of interest.

• We propose a method for audio event detection with robustness to varying SNR.

• We experiment on four data sets: MIVIA audio, MIVIA roads, ESC-10 and TU Dortmund.

• We achieve better results than existing methods for audio event detection.

摘要

•We introduce trainable COPE feature extractors for sound representation learning.•A COPE feature extractor is trained using a single prototype sound of interest.•We propose a method for audio event detection with robustness to varying SNR.•We experiment on four data sets: MIVIA audio, MIVIA roads, ESC-10 and TU Dortmund.•We achieve better results than existing methods for audio event detection.

论文关键词:Audio analysis,Event detection,Peaks of energy,Representation learning,Trainable feature extractors

论文评审过程:Received 27 July 2017, Revised 26 February 2019, Accepted 21 March 2019, Available online 21 March 2019, Version of Record 27 March 2019.

论文官网地址:https://doi.org/10.1016/j.patcog.2019.03.016