Using cause-effect relations in text to improve information retrieval precision

作者:

Highlights:

摘要

This study attempted to use semantic relations expressed in text, in particular cause-effect relations, to improve information retrieval effectiveness. The study investigated whether the information obtained by matching cause-effect relations expressed in documents with the cause-effect relations expressed in users’ queries can be used to improve document retrieval results, in comparison to using just keyword matching without considering relations.An automatic method for identifying and extracting cause-effect information in Wall Street Journal text was developed. Causal relation matching was found to yield a small but significant improvement in retrieval results when the weights used for combining the scores from different types of matching were customized for each query. Causal relation matching did not perform better than word proximity matching (i.e. matching pairs of causally related words in the query with pairs of words that co-occur within document sentences), but the best results were obtained when causal relation matching was combined with word proximity matching. The best kind of causal relation matching was found to be one in which one member of the causal relation (either the cause or the effect) was represented as a wildcard that could match with any word.

论文关键词:Information retrieval,Semantic relations,Cause-effect relations,Logistic regression

论文评审过程:Received 6 February 1999, Accepted 18 February 2000, Available online 6 December 2000.

论文官网地址:https://doi.org/10.1016/S0306-4573(00)00022-4