SPAN: A self-paced association augmentation and node embedding-based model for software bug classification and assignment

作者:

Highlights:

摘要

Effective bug classification and assignment to relevant developers improves the efficiency of software management. However, textually dependent approaches produce inconsistent results on varying datasets, while approaches that depend upon multi-source data can produce dataset conflicts and inaccuracy. Accordingly, we introduce a model based on Self-Paced Association augmentation and Node embedding (SPAN), which uses an effective combination of textually dependent and independent bug categorization to produce consistent results, followed by a bug assignment mechanism to prevent conflicts. To this end, we present a novel unified classifier and assignment model that exploits the connections between nodes in the Software Bug Report Network (SBRNet) to identify the target features. The model is capable of accurately categorizing bugs in a self-paced manner with association augmentation. Finally, we present an approach that assigns the most appropriate developer for bug resolution through SBRNet node information embedding. Our deep two-step self-paced solution is capable of categorizing software bugs with improved accuracy, while still utilizing fewer features. Results reveal that our model is more effective (up to 98% classification accuracy and 96% for bug assignment) when compared to its counterparts.

论文关键词:Bug classification,Bug triage,Association augmentation,Graph embedding,Bug assignment,Bug report analysis

论文评审过程:Received 25 March 2021, Revised 5 November 2021, Accepted 7 November 2021, Available online 26 November 2021, Version of Record 11 December 2021.

论文官网地址:https://doi.org/10.1016/j.knosys.2021.107711