The infinite Student's t-factor mixture analyzer for robust clustering and classification

作者:

Highlights:

摘要

Recently, the Student's t-factor mixture analyzer (tFMA) has been proposed. Compared with the mixture of Student's t-factor analyzers (MtFA), the tFMA has better performance when processing high-dimensional data. Moreover, the factors estimated by the tFMA can be visualized in a low-dimensional latent space, which is not shared by the MtFA. However, as the tFMA belongs to finite mixtures and the related parameter estimation method is based on the maximum likelihood criterion, it could not automatically determine the appropriate model complexity according to the observed data, leading to overfitting. In this paper, we propose an infinite Student's t-factor mixture analyzer (itFMA) to handle this issue. The itFMA is based on the nonparametric Bayesian statistics which assumes infinite number of mixing components in advance, and automatically determines the proper number of components after observing the high-dimensional data. Moreover, we derive an efficient variational inference algorithm for the itFMA. The proposed itFMA and the related variational inference algorithm are used to cluster and classify high-dimensional data. Experimental results of some applications show that the itFMA has good generalization capacity, offering a more robust and powerful performance than other competing approaches.

论文关键词:Infinite Student's t-factor mixture analyzer,Nonparametric Bayesian statistics,Variational inference,Clustering,Classification

论文评审过程:Received 10 November 2011, Revised 25 April 2012, Accepted 9 May 2012, Available online 18 May 2012.

论文官网地址:https://doi.org/10.1016/j.patcog.2012.05.003