Statistical Drift Detection Ensemble for batch processing of data streams

作者:

Highlights:

摘要

Among the difficulties being considered in data stream processing, a particularly interesting one is the phenomenon of concept drift. Methods of concept drift detection are frequently used to eliminate the negative impact on the quality of classification in the environment of evolving concepts. This article proposes Statistical Drift Detection Ensemble (sdde), a novel method of concept drift detection. The method uses drift magnitude and conditioned marginal covariate drift measures, analyzed by an ensemble of detectors, whose members focus on random subspaces of the stream’s features. The proposed detector was compared with state-of-the-art methods on both synthetic data streams and the semi-synthetic streams generated based on the real-world concepts. A series of computer experiments and a statistical analysis of the results, both for the classification accuracy and Drift Detection errors were carried out and confirmed the effectiveness of the proposed method.

论文关键词:00-01,99-00,Data streams,Concept drift,Drift detection,Statistical drift detection,Classification

论文评审过程:Received 24 March 2022, Revised 2 June 2022, Accepted 3 July 2022, Available online 14 July 2022, Version of Record 21 July 2022.

论文官网地址:https://doi.org/10.1016/j.knosys.2022.109380