A simple probabilistic model for the relevance assessment of documents

作者:

Highlights:

摘要

When assessing the relevance of documents, different jurors usually do not completely agree. A simple model is set up to take this fact into account by assuming that the relevance assigned by the juror is a random variable. It leads to some interesting conclusions: The worst possible method to assess the relevance is a mere bisection into relevant and irrelevant. Even an ideal system cannot consistently find all relevant documents and only those, which is empirically well known. The retrieval system should also assign a measure of relevance rather than divide the set of all documents only into those found and those not found; in particular, Boolean operations should be supplemented by a ranking algorithm.

论文关键词:

论文评审过程:Received 6 December 1974, Available online 13 July 2002.

论文官网地址:https://doi.org/10.1016/0306-4573(75)90034-5