Combining classifier decisions for robust speaker identification
作者:
Highlights:
•
摘要
In this work, we combine the decisions of two classifiers as an alternative means of improving the performance of a speaker recognition system in adverse environments. The difference between these classifiers is in their feature-sets. One system is based on the popular mel-frequency cepstral coefficients (MFCC) and the other on the new parametric feature-sets (PFS) algorithm. The feature-vectors both have mel-scale spectral warping and are computed in the cepstral domain but the feature-sets differs in the use of spectral filters and compressions. The performance of the classifier is not much different in recognition rates terms but they are complementary. This shows that there is information that is not captured in the popular mel-frequency cepstral coefficients (MFCC), and the parametric feature-sets (PFS) is able to add further information for improved performance. Several ways of combining these classifiers gives significant improvements in a speaker identification task using a very large telephone degraded NTIMIT database.
论文关键词:Speaker identification,Parametric feature sets,Multiple classifier systems,Gaussian mixture model
论文评审过程:Received 23 June 2005, Revised 17 August 2005, Available online 10 October 2005.
论文官网地址:https://doi.org/10.1016/j.patcog.2005.08.004