An online Bayesian approach to change-point detection for categorical data

作者:

Highlights:

摘要

Change-point detection for categorical data has wide applications in many fields. Existing methods either are distribution-free, not utilizing categorical information sufficiently, or have limited performance when there exists “rare events” (events that occur with low frequency). In this paper, we propose a Bayesian change-point detection model for categorical data based on Dirichlet-multinomial mixtures. Because of the introduction of prior information, our method performs well for the existence of “rare events”. An online parameter estimation procedure and an online detection strategy are then designed to adapt to data streams. Monte Carlo simulations discuss the power of the proposed method and show advantages compared with existing algorithms. Applications in biomedical research, document analysis, health news case study and location monitoring indicate practical values of our method.

论文关键词:Bayes factor,Change-point detection,Dirichlet-multinomial mixtures,Online strategy

论文评审过程:Received 24 March 2019, Revised 3 December 2019, Accepted 17 March 2020, Available online 29 March 2020, Version of Record 16 April 2020.

论文官网地址:https://doi.org/10.1016/j.knosys.2020.105792