An improved algorithm for the calculation of exact term discrimination values

作者:

Highlights:

摘要

The term discrimination model provides a means of evaluating indexing terms in automatic document retrieval systems. This article describes an efficient algorithm for the calculation of term discrimination values that may be used when the interdocument similarity measure used is the cosine coefficient and when the document representatives have been weighted using one particular term-weighting scheme. The algorithm has an expected running time proportional to Nn2 for a collection of N documents, each of which has been assigned an average of n terms.

论文关键词:

论文评审过程:Received 24 March 1987, Accepted 8 May 1987, Available online 13 July 2002.

论文官网地址:https://doi.org/10.1016/0306-4573(88)90073-8