Aligning music audio with symbolic scores using a hybrid graphical model

作者:Christopher Raphael

摘要

We present a new method for establishing an alignment between a polyphonic musical score and a corresponding sampled audio performance. The method uses a graphical model containing both latent discrete variables, corresponding to score position, as well as a latent continuous tempo process. We use a simple data model based only on the pitch content of the audio signal. The data interpretation is defined to be the most likely configuration of the hidden variables, given the data, and we develop computational methodology to identify or approximate this configuration using a variant of dynamic programming involving parametrically represented continuous variables. Experiments are presented on a 55-minute hand-marked orchestral test set.

论文关键词:Graphical models, Score matching, Music, Score following

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10994-006-8415-3