Finding time series discord based on bit representation clustering

作者:

Highlights:

摘要

The problem of finding time series discord has attracted much attention recently due to its numerous applications and several algorithms have been suggested. However, most of them suffer from high computation cost and cannot satisfy the requirement of real applications. In this paper, we propose a novel discord discovery algorithm BitClusterDiscord which is based on bit representation clustering. Firstly, we use PAA (Piecewise Aggregate Approximation) bit serialization to segment time series, so as to capture the main variation characteristic of time series and avoid the influence of noise. Secondly, we present an improved K-Medoids clustering algorithm to merge several patterns with similar variation behaviors into a common cluster. Finally, based on bit representation clustering, we design two pruning strategies and propose an effective algorithm for time series discord discovery. Extensive experiments have demonstrated that the proposed approach can not only effectively find discord of time series, but also greatly improve the computational efficiency.

论文关键词:Time series data mining,Discord discovery,Clustering,Pruning,Dimensionality reduction

论文评审过程:Received 31 January 2013, Revised 9 September 2013, Accepted 12 September 2013, Available online 28 September 2013.

论文官网地址:https://doi.org/10.1016/j.knosys.2013.09.015