Ranking documents in thesaurus-based boolean retrieval systems

作者:

Highlights:

摘要

In this paper we investigate document ranking methods in thesaurus-based boolean retrieval systems, and propose a new thesaurus-based ranking algorithm called the Extended Relevance (E-Relevance) algorithm. The E-Relevance algorithm integrates the extended boolean model and the thesaurus-based relevance algorithm. Since the E-Relevance algorithm has all the desirable properties of the extended boolean model, it avoids the various problems of previous thesaurus-based ranking algorithms. The E-Relevance algorithm also ranks documents effectively by using term dependence information from the thesaurus. We have shown through performance comparison that the proposed algorithm achieves higher retrieval effectiveness than the others proposed earlier.

论文关键词:Information retrieval,Boolean retrieval system,Ranking algorithm,Thesaurus

论文评审过程:Received 18 February 1992, Accepted 30 November 1992, Available online 19 July 2002.

论文官网地址:https://doi.org/10.1016/0306-4573(94)90025-6