Cost-sensitive learning for defect escalation

作者:

Highlights:

摘要

While most software defects (i.e., bugs) are corrected and tested as part of the prolonged software development cycle, enterprise software venders often have to release software products before all reported defects are corrected, due to deadlines and limited resources. A small number of these reported defects will be escalated by customers whose businesses are seriously impacted. Escalated defects must be resolved immediately and individually by the software vendors at a very high cost. The total costs can be even greater, including loss of reputation, satisfaction, loyalty, and repeat revenue. In this paper, we develop a Software defecT Escalation Prediction (STEP) system to mine historical defect report data and predict the escalation risk of current defect reports for maximum net profit. More specifically, we first describe a simple and general framework to convert the maximum net profit problem to cost-sensitive learning. We then apply and compare four well-known cost-sensitive learning approaches for STEP. Our experiments suggest that cost-sensitive decision trees (CSTree) is the best methods for producing the highest positive net profit.

论文关键词:Software defect escalation prediction,Cost-sensitive learning,Data mining,Defect escalation,Machine learning

论文评审过程:Received 2 September 2013, Revised 12 April 2014, Accepted 21 April 2014, Available online 30 April 2014.

论文官网地址:https://doi.org/10.1016/j.knosys.2014.04.033