Full text retrieval based on syntactic similarities

Authors:

Highlights:

Abstract

Natural language queries are a basic requirement for modern information retrieval systems. Retrieval in text information systems can therefore be based on text comparison using syntactic traces. The syntactic trace of a text is the set of all overlapping n-grams of that text, so retrieval is done by comparing n-gram sets. We define a syntactic similarity function consisting of a direct and an indirect factor. Results from the prototype TRIGIR, which is based on trigrams, show that the presented theory is useful for retrieving information in natural language information systems.
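
As an illustration of the syntactic trace idea, the following is a minimal sketch and not the TRIGIR implementation. It assumes character-level n-grams, lowercasing, and whitespace collapsing, none of which the abstract specifies. The abstract does not define the direct and indirect factors, so the hypothetical `overlap_similarity` function below substitutes a plain Dice coefficient over the two n-gram sets as a stand-in.

```python
def syntactic_trace(text: str, n: int = 3) -> set[str]:
    """Return the set of all overlapping character n-grams of `text`.

    Assumption: the text is lowercased and runs of whitespace are
    collapsed to one space before n-grams are extracted.
    """
    normalized = " ".join(text.lower().split())
    return {normalized[i:i + n] for i in range(len(normalized) - n + 1)}


def overlap_similarity(query: str, document: str, n: int = 3) -> float:
    """Dice coefficient of two syntactic traces.

    This is a stand-in measure, not the paper's similarity function
    with direct and indirect factors.
    """
    q = syntactic_trace(query, n)
    d = syntactic_trace(document, n)
    if not q or not d:
        # Texts shorter than n characters have an empty trace.
        return 0.0
    return 2 * len(q & d) / (len(q) + len(d))


if __name__ == "__main__":
    print(sorted(syntactic_trace("retrieval")))
    print(overlap_similarity("text retrieval", "full text retrieval"))
```

Because the comparison works on overlapping n-grams rather than whole words, a query with a small spelling variation still shares most of its trace with the matching document text.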

Keywords:

Review history: Received 17 September 1986, Revised 27 April 1987, Available online 10 June 2003.

Official URL: https://doi.org/10.1016/0306-4379(88)90027-0