The interaction of normalisation and clustering in sub-domain definition for multi-source transfer learning based time series anomaly detection

作者:

Highlights:

摘要

This paper examines how data normalisation and clustering interact in the definition of sub-domains within multi-source transfer learning systems for time series anomaly detection. The paper introduces a distinction between (i) clustering as a primary/direct method for anomaly detection, and (ii) clustering as a method for identifying sub-domains within the source or target datasets. Reporting the results of three sets of experiments, we find that normalisation after feature extraction and before clustering results in the best performance for anomaly detection. Interestingly, we find that in the multi-source transfer learning scenario clustering on the target dataset and identifying subdomains in the target data can result in improved model performance, as compared to identifying sub-domains through defining clusters using the multi-source dataset.

论文关键词:Anomaly detection,Transfer learning,Time series analysis,Cloud infrastructure

论文评审过程:Received 16 March 2022, Revised 11 September 2022, Accepted 12 September 2022, Available online 17 September 2022, Version of Record 30 September 2022.

论文官网地址:https://doi.org/10.1016/j.knosys.2022.109894