Time series clustering with ARMA mixtures

作者:

Highlights:

摘要

Clustering problems are central to many knowledge discovery and data mining tasks. However, most existing clustering methods can only work with fixed-dimensional representations of data patterns. In this paper, we study the clustering of data patterns that are represented as sequences or time series possibly of different lengths. We propose a model-based approach to this problem using mixtures of autoregressive moving average (ARMA) models. We derive an expectation-maximization (EM) algorithm for learning the mixing coefficients as well as the parameters of the component models. To address the model selection problem, we use the Bayesian information criterion (BIC) to determine the number of clusters in the data. Experiments are conducted on a number of simulated and real datasets. Results from the experiments show that our method compares favorably with other methods proposed previously by others for similar time series clustering tasks.

论文关键词:ARMA model,EM algorithm,Mixture model,Model-based clustering,Time series analysis

论文评审过程:Received 22 May 2003, Accepted 16 December 2003, Available online 15 April 2004.

论文官网地址:https://doi.org/10.1016/j.patcog.2003.12.018