A fast procedure for the calculation of similarity coefficients in automatic classification

作者:

Highlights:

摘要

A fast algorithm is described for comparing the lists of terms representing documents in automatic classification experiments. The speed of the procedure arises from the fact that all of the non-zero-valued coefficients for a given document are identified together, using an inverted file to the terms in the document collection. The complexity and running time of the algorithm are compared with previously described procedures.

论文关键词:

论文评审过程:Received 8 October 1980, Available online 13 July 2002.

论文官网地址:https://doi.org/10.1016/0306-4573(81)90026-1