Correlation-based software search by leveraging software term database
作者:Zhixing Li, Gang Yin, Tao Wang, Yang Zhang, Yue Yu, Huaimin Wang
摘要
Internet-scale open source software (OSS) production in various communities generates abundant reusable resources for software developers. However, finding the desired and mature software with keyword queries from a considerable number of candidates, especially for the fresher, is a significant challenge because current search services often fail to understand the semantics of user queries. In this paper, we construct a software term database (STDB) by analyzing tagging data in Stack Overflow and propose a correlation-based software search (CBSS) approach that performs correlation retrieval based on the term relevance obtained from STDB. In addition, we design a novel ranking method to optimize the initial retrieval result. We explore four research questions in four experiments, respectively, to evaluate the effectiveness of the STDB and investigate the performance of the CBSS. The experiment results show that the proposed CBSS can effectively respond to keyword-based software searches and significantly outperforms other existing search services at finding mature software.
论文关键词:software retrieval, software term database, open source software
论文评审过程:
论文官网地址:https://doi.org/10.1007/s11704-017-6573-z