A simple probabilistic model for the relevance assessment of documents

作者：

Highlights：

•

摘要

When assessing the relevance of documents, different jurors usually do not completely agree. A simple model is set up to take this fact into account by assuming that the relevance assigned by the juror is a random variable. It leads to some interesting conclusions: The worst possible method to assess the relevance is a mere bisection into relevant and irrelevant. Even an ideal system cannot consistently find all relevant documents and only those, which is empirically well known. The retrieval system should also assign a measure of relevance rather than divide the set of all documents only into those found and those not found; in particular, Boolean operations should be supplemented by a ranking algorithm.

论文关键词：

论文评审过程：Received 6 December 1974, Available online 13 July 2002.

论文官网地址：https://doi.org/10.1016/0306-4573(75)90034-5