Pitch-Dependent Identification of Musical Instrument Sounds
Authors: Tetsuro Kitahara, Masataka Goto, Hiroshi G. Okuno
Abstract
This paper describes a musical instrument identification method that takes into account the pitch dependency of musical instrument timbres. The difficulty in musical instrument identification lies in this pitch dependency: the acoustic features of most musical instruments vary with the pitch (fundamental frequency, F0). To cope with this difficulty, we propose an F0-dependent multivariate normal distribution, in which each element of the mean vector is represented by a function of F0. Our method first extracts 129 features (e.g., the spectral centroid, the gradient of the straight line approximating the power envelope) from a musical instrument sound and then reduces the dimensionality of the feature space to 18 dimensions. In the 18-dimensional feature space, it calculates an F0-dependent mean function and an F0-normalized covariance, and finally applies the Bayes decision rule. Experimental results of identifying 6,247 solo tones of 19 musical instruments show that the proposed method improved the recognition rate from 75.73% to 79.73%.
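The classification step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the mean function is a least-squares polynomial in F0 (the degree is a hypothetical parameter), reads "F0-normalized covariance" as the covariance of the features after the fitted mean function is subtracted, and assumes feature extraction and dimensionality reduction have already been done. All function names are illustrative.

```python
import numpy as np


def fit_f0_dependent_gaussian(features, f0, degree=3):
    """Fit one instrument model from training tones.

    features: (n_tones, n_dims) reduced feature vectors (assumed given).
    f0:       (n_tones,) pitch of each tone, e.g. log-frequency (assumption).
    Returns (coeffs, cov): polynomial coefficients of the mean function,
    shape (degree + 1, n_dims), and the covariance of the residuals.
    """
    # One least-squares polynomial per feature dimension: mean_i(f0).
    coeffs = np.polyfit(f0, features, degree)
    # Subtract the F0-dependent mean so the covariance no longer reflects
    # the pitch trend (assumed reading of "F0-normalized").
    residuals = features - np.vander(f0, degree + 1) @ coeffs
    cov = np.cov(residuals, rowvar=False)
    return coeffs, cov


def log_likelihood(x, f0, coeffs, cov):
    """Log density of feature vector x under N(mean(f0), cov)."""
    mean = (np.vander(np.atleast_1d(f0), coeffs.shape[0]) @ coeffs)[0]
    diff = x - mean
    _, logdet = np.linalg.slogdet(cov)
    mahalanobis = diff @ np.linalg.solve(cov, diff)
    return -0.5 * (x.shape[0] * np.log(2.0 * np.pi) + logdet + mahalanobis)


def classify(x, f0, models, priors):
    """Bayes decision rule: pick the instrument with the largest
    log-likelihood plus log-prior. models maps name -> (coeffs, cov)."""
    scores = {
        name: log_likelihood(x, f0, *models[name]) + np.log(priors[name])
        for name in models
    }
    return max(scores, key=scores.get)
```

In use, one model would be fitted per instrument from its training tones, and an unknown tone with a known or estimated F0 would be passed to `classify` together with the fitted models and class priors.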
Keywords: musical instrument identification, pitch dependency, fundamental frequency, automatic music transcription, computational auditory scene analysis
Paper URL: https://doi.org/10.1007/s10489-005-4612-1