Retrieval parameter optimization using genetic algorithms

Authors:

Abstract

This paper describes our experiments on automatic parameter optimization for the Japanese monolingual retrieval task. Unlike regression approaches, we optimized parameters completely independently of the retrieval models, which enables the optimized parameter set to illustrate the characteristics of the target test collections. We adopted genetic algorithms as the optimization tool and cross-validated with four test collections, namely the CLIR-J-J collections for NTCIR-3 to NTCIR-6. The most difficult retrieval parameters to optimize are the feedback parameters, because there are no principles for calibrating them. Our approach optimized the feedback parameters and the basic scoring parameters at the same time. Using test sets and validation sets, we achieved effectiveness levels comparable with very strong baselines, i.e., the best-performing NTCIR official runs.
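To make the idea of jointly optimizing scoring and feedback parameters with a genetic algorithm concrete, the following is a minimal sketch and not the configuration used in the paper. The parameter names (`k1`, `b`, `fb_docs`, `fb_terms`), their ranges, the genetic operators (tournament selection, uniform crossover, Gaussian mutation, elitism), and the synthetic `toy_fitness` function are all assumptions made for illustration; in a real experiment the fitness would come from running the retrieval system on a training collection and computing an effectiveness measure.

```python
import random

# Hypothetical parameter ranges; names and bounds are illustrative only.
BOUNDS = {
    "k1": (0.0, 3.0),          # basic scoring parameter (BM25-style, assumed)
    "b": (0.0, 1.0),           # basic scoring parameter (BM25-style, assumed)
    "fb_docs": (1.0, 50.0),    # feedback parameter: number of feedback documents
    "fb_terms": (1.0, 100.0),  # feedback parameter: number of expansion terms
}
NAMES = list(BOUNDS)


def toy_fitness(params):
    """Synthetic stand-in for an effectiveness score on a training collection.

    It peaks at an arbitrary target point; it does not model real retrieval.
    """
    target = {"k1": 1.2, "b": 0.75, "fb_docs": 10.0, "fb_terms": 30.0}
    error = 0.0
    for name in NAMES:
        lo, hi = BOUNDS[name]
        error += ((params[name] - target[name]) / (hi - lo)) ** 2
    return 1.0 / (1.0 + error)


def random_individual(rng):
    # One candidate parameter set, drawn uniformly within the bounds.
    return {n: rng.uniform(*BOUNDS[n]) for n in NAMES}


def tournament(population, scores, rng, size=3):
    # Pick the fittest of `size` randomly chosen individuals.
    contenders = rng.sample(range(len(population)), size)
    return population[max(contenders, key=lambda i: scores[i])]


def crossover(a, b, rng):
    # Uniform crossover: each parameter is inherited from either parent.
    return {n: (a[n] if rng.random() < 0.5 else b[n]) for n in NAMES}


def mutate(ind, rng, rate=0.2, scale=0.1):
    # Gaussian perturbation, clipped to the allowed range; returns a copy.
    child = dict(ind)
    for n in NAMES:
        if rng.random() < rate:
            lo, hi = BOUNDS[n]
            step = rng.gauss(0.0, scale * (hi - lo))
            child[n] = min(hi, max(lo, child[n] + step))
    return child


def optimize(fitness, generations=40, pop_size=30, seed=0):
    rng = random.Random(seed)
    population = [random_individual(rng) for _ in range(pop_size)]
    history = []  # best fitness seen in each generation
    for _ in range(generations):
        scores = [fitness(ind) for ind in population]
        best_i = max(range(pop_size), key=lambda i: scores[i])
        history.append(scores[best_i])
        next_pop = [population[best_i]]  # elitism: keep the best unchanged
        while len(next_pop) < pop_size:
            p1 = tournament(population, scores, rng)
            p2 = tournament(population, scores, rng)
            next_pop.append(mutate(crossover(p1, p2, rng), rng))
        population = next_pop
    scores = [fitness(ind) for ind in population]
    best_i = max(range(pop_size), key=lambda i: scores[i])
    history.append(scores[best_i])
    return population[best_i], scores[best_i], history


if __name__ == "__main__":
    best, score, _ = optimize(toy_fitness)
    print(best, score)
```

Because all four parameters live in one individual, the scoring and feedback settings are searched together rather than tuned one after another. In a cross-validation setting like the one described above, the fitness would presumably be computed on the training collections only, with the resulting parameter set then evaluated on a held-out collection.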

Keywords: Information retrieval, Test collections, Parameter optimization, Genetic algorithm

Review history: Received 14 March 2008, Revised 6 February 2009, Accepted 28 April 2009, Available online 16 June 2009.

Official paper URL: https://doi.org/10.1016/j.ipm.2009.04.008