A hybrid graphical model for rhythmic parsing

作者:

摘要

A method is presented for the rhythmic parsing problem: Given a sequence of observed musical note onset times, we simultaneously estimate the corresponding notated rhythm and tempo process. A graphical model is developed that represents the evolution of tempo and rhythm and relates these hidden quantities to an observable performance. The rhythm variables are discrete and the tempo and observation variables are continuous. We show how to compute the globally most likely configuration of the tempo and rhythm variables given an observation of note onset times. Experiments are presented on both MIDI data and a data set derived from an audio signal. A generalization to computing MAP estimates for arbitrary conditional Gaussian distributions is outlined.

论文关键词:Conditional Gaussian distribution,Rhythmic parsing,MAP estimate,Dynamic programming,Music recognition,Hybrid graphical model

论文评审过程:Received 4 October 2001, Available online 12 March 2002.

论文官网地址:https://doi.org/10.1016/S0004-3702(02)00192-3