Probabilistic information retrieval model for a dependency structured indexing system

作者:

Highlights:

摘要

Most previous information retrieval (IR) models assume that terms of queries and documents are statistically independent from each other. However, conditional independence assumption is obviously and openly understood to be wrong, so we present a new method of incorporating term dependence into a probabilistic retrieval model by adapting a dependency structured indexing system using a dependency parse tree and Chow Expansion to compensate the weakness of the assumption. In this paper, we describe a theoretic process to apply the Chow Expansion to the general probabilistic models and the state-of-the-art 2-Poisson model. Through experiments on document collections in English and Korean, we demonstrate that the incorporation of term dependences using Chow Expansion contributes to the improvement of performance in probabilistic IR systems.

论文关键词:Information retrieval,Term dependence,Chow Expansion,Dependency parse tree,Probabilistic model,2-Poisson model

论文评审过程:Received 8 May 2003, Accepted 10 November 2003, Available online 19 December 2003.

论文官网地址:https://doi.org/10.1016/j.ipm.2003.11.001