Speaker identification using hybrid Karhunen–Loeve transform and Gaussian mixture model approach
作者:
Highlights:
•
摘要
This paper proposes a classification scheme that incorporates Karhunen–Loeve transform (KLT) and Gaussian mixture model (GMM) for text-independent speaker identification. Our results show that the combination is beneficial to both classification accuracy and computational cost. For a database with 500 Mandarin speakers, it is demonstrated that accuracy improvement of up to 4% and computational cost saving of 10 times compared to those of the conventional GMM model can be achieved.
论文关键词:Karhunen–Loeve transform,Bhattacharyya distance,Gaussian mixture models,Speaker identification,Mel frequency cepstral coefficients
论文评审过程:Received 28 July 2003, Accepted 26 August 2003, Available online 13 November 2003.
论文官网地址:https://doi.org/10.1016/j.patcog.2003.08.013