A novel Fuzzy-PSO term weighting automatic query expansion approach using combined semantic filtering

作者:

Highlights:

摘要

Information Retrieval system retrieves relevant documents from large datasets. Automatic Query Expansion (AQE) is one of the approaches to enhance IR performance by adding additional terms to original query. The selection of suitable additional terms for AQE is a crucial task. Term weighting method is one of the ways to deal with such a problem. This paper presents a new term weighting based AQE approach to retrieve more relevant documents from data corpus. The proposed approach comprises of three major steps. First step determines the optimal weights of different IR evidences for different terms using Particle Swarm Optimization (PSO). Fuzzy logic technique is used to improve performance of PSO by controlling inertia and acceleration coefficients during the optimization. Co-occurrence score is introduced as new IR evidence in the proposed approach. Second step is focused on removal of noisy terms by using new combined semantic filtering method. Third step reweights the terms using Rocchio method. The proposed approach is compared with recently developed automatic query expansion approaches in terms of performance measures such as precision, recall, F-measure and MAP (Mean Average Precision). Three benchmark datasets CACM, CISI and TREC-3 are used to verify the results. The proposed approach is found better than other approaches according to results obtained for these benchmark datasets.

论文关键词:Automatic query expansion,Term weighting schemes,Co-occurrence score,Fuzzy logic,Particle swarm optimization,Term frequency,Inverse document frequency

论文评审过程:Received 11 March 2017, Revised 31 August 2017, Accepted 2 September 2017, Available online 6 September 2017, Version of Record 4 October 2017.

论文官网地址:https://doi.org/10.1016/j.knosys.2017.09.004