Using duration models to reduce fragmentation in audio segmentation

作者:Samer Abdallah, Mark Sandler, Christophe Rhodes, Michael Casey

摘要

We investigate explicit segment duration models in addressing the problem of fragmentation in musical audio segmentation. The resulting probabilistic models are optimised using Markov Chain Monte Carlo methods; in particular, we introduce a modification to Wolff’s algorithm to make it applicable to a segment classification model with an arbitrary duration prior. We apply this to a collection of pop songs, and show experimentally that the generated segmentations suffer much less from fragmentation than those produced by segmentation algorithms based on clustering, and are closer to an expert listener’s annotations, as evaluated by two different performance measures.

论文关键词:Segmentation, Duration prior, MCMC, Gibbs sampling, Wolff algorithm

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10994-006-0586-4