Determining information retrieval and filtering performance without experimentation

作者:

Highlights:

摘要

The performance of an information retrieval or text and media filtering system may be determined through analytic methods as well as by traditional simulation or experimental methods. These analytic methods can provide precise statements about expected performance. They can thus determine which of two similarly performing systems is superior. For both a single query term and for a multiple query term retrieval model, a method for comparing the performance of different probabilistic retrieval methods is developed. This method may be used in computing the average search length for a query, given only knowledge of database parameter values. Predictive models for inverse document frequency, binary independence, and relevance feedback based retrieval and filtering are described. Simulations illustrate how the single term model performs and sample performance predictions are given for single term and multiple term problems.

论文关键词:

论文评审过程:Received 8 November 1993, Accepted 22 November 1994, Available online 21 February 2000.

论文官网地址:https://doi.org/10.1016/0306-4573(95)00072-O