Automatic ranking of retrieval models using retrievability measure

作者:Shariq Bashir, Andreas Rauber

摘要

Analyzing retrieval model performance using retrievability (maximizing findability of documents) has recently evolved as an important measurement for recall-oriented retrieval applications. Most of the work in this domain is either focused on analyzing retrieval model bias or proposing different retrieval strategies for increasing documents retrievability. However, little is known about the relationship between retrievability and other information retrieval effectiveness measures such as precision, recall, MAP and others. In this study, we analyze the relationship between retrievability and effectiveness measures. Our experiments on TREC chemical retrieval track dataset reveal that these two independent goals of information retrieval, maximizing retrievability of documents and maximizing effectiveness of retrieval models are quite related to each other. This correlation provides an attractive alternative for evaluating, ranking or optimizing retrieval models’ effectiveness on a given corpus without requiring any ground truth available (relevance judgments).

论文关键词:Retrieval models evaluation, Retrieval bias analysis, Automatic ranking of retrieval models, Genetic programming

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10115-014-0759-6