Bootstrap learning for accurate onset detection

作者：Ning Hu, Roger B. Dannenberg

摘要

Supervised learning models have been applied to create good onset detection systems for musical audio signals. However, this always requires a large set of labeled training examples, and hand-labeling is quite tedious and time consuming. In this paper, we present a bootstrap learning approach to train an accurate note onset detection model. Audio alignment techniques are first used to find the correspondence between a symbolic music representation (such as MIDI data) and an acoustic recording. This alignment provides an initial estimate of note boundaries which can be used to train an onset detector. Once trained, the detector can be used to refine the initial set of note boundaries and training can be repeated. This iterative training process eliminates the need for hand-labeled audio. Tests show that this training method can improve an onset detector initially trained on synthetic data.

论文关键词：Bootstrap learning, Onset detection, Audio-to-score alignment

论文评审过程：

论文官网地址：https://doi.org/10.1007/s10994-006-8458-5