Towards better time series prediction with model-independent, low-dispersion clusters of contextual subsequence embeddings

作者:

Highlights:

摘要

Many applications in financial investment management, environmental pollution reduction, energy resource scheduling, and consumer sales promotion need to forecast future values in multivariate tie series. Despite extensive research efforts laid on the task for decades from the perspective of prediction models, this paper explores boost the prediction performance with model-independent, low-dispersion clusters. The intuition is that appropriately splitting training data could lower the prediction uncertainty, resulting in better accuracy. In the proposed multi-horizon time series prediction framework (TSP-LDC), autoencoders generate contextual embeddings that capture variables’ nonlinearity and temporal dynamics. A discriminative clustering process on the contextual embeddings produces homogeneous clusters, intending to bring better performance of prediction models. Each cluster has a corresponding prediction model responsible for making inferences for subsequences belonging to that cluster, where the discriminative probabilities of the subsequence indicate its membership. Extensive experiments demonstrate that TSP-LDC improves the performance of popular approaches of time series prediction on various datasets, proving the effectiveness of the proposed approaches.

论文关键词:Multivariate time series prediction,Autoencoders,Contextual embeddings,Discriminative clustering

论文评审过程:Received 17 May 2021, Revised 30 September 2021, Accepted 21 October 2021, Available online 26 October 2021, Version of Record 2 November 2021.

论文官网地址:https://doi.org/10.1016/j.knosys.2021.107641