Churn models for prepaid customers in the cellular telecommunication industry using large data marts

作者:

Highlights:

摘要

In this article, we test the usefulness of the popular data mining models to predict churn of the clients of the Polish cellular telecommunication company. When comparing to previous studies on this topic, our research is novel in the following areas: (1) we deal with prepaid clients (previous studies dealt with postpaid clients) who are far more likely to churn, are less stable and much less is known about them (no application, demographical or personal data), (2) we have 1381 potential variables derived from the clients’ usage (previous studies dealt with data with at least tens of variables) and (3) we test the stability of models across time for all the percentiles of the lift curve – our test sample is collected six months after the estimation of the model. The main finding from our research is that linear models, especially logistic regression, are a very good choice when modelling churn of the prepaid clients. Decision trees are unstable in high percentiles of the lift curve, and we do not recommend their usage.

论文关键词:Churn prediction,Retention,Wireless,Cellular,CRM

论文评审过程:Available online 27 November 2009.

论文官网地址:https://doi.org/10.1016/j.eswa.2009.11.083