A time series representation model for accurate and fast similarity detection
作者:
Highlights:
•
摘要
Similarity search and detection is a central problem in time series data processing and management. Most approaches to this problem have been developed around the notion of dynamic time warping, whereas several dimensionality reduction techniques have been proposed to improve the efficiency of similarity searches. Due to the continuous increasing of sources of time series data and the cruciality of real-world applications that use such data, we believe there is a challenging demand for supporting similarity detection in time series in a both accurate and fast way. Our proposal is to define a concise yet feature-rich representation of time series, on which the dynamic time warping can be applied for effective and efficient similarity detection of time series. We present the Derivative time series Segment Approximation (DSA) representation model, which originally features derivative estimation, segmentation and segment approximation to provide both high sensitivity in capturing the main trends of time series and data compression. We extensively compare DSA with state-of-the-art similarity methods and dimensionality reduction techniques in clustering and classification frameworks. Experimental evidence from effectiveness and efficiency tests on various datasets shows that DSA is well-suited to support both accurate and fast similarity detection.
论文关键词:Time series data,Representation models,Similarity detection,Dimensionality reduction,Clustering,Classification
论文评审过程:Received 9 November 2007, Revised 28 February 2009, Accepted 27 March 2009, Available online 10 April 2009.
论文官网地址:https://doi.org/10.1016/j.patcog.2009.03.030