A new dual wing harmonium model for document retrieval

作者:

Highlights:

摘要

A new dual wing harmonium model that integrates term frequency features and term connection features into a low dimensional semantic space without increase of computation load is proposed for the application of document retrieval. Terms and vectorized graph connectionists are extracted from the graph representation of document by employing weighted feature extraction method. We then develop a new dual wing harmonium model projecting these multiple features into low dimensional latent topics with different probability distributions assumption. Contrastive divergence algorithm is used for efficient learning and inference. We perform extensive experimental verification, and the comparative results suggest that the proposed method is accurate and computationally efficient for document retrieval.

论文关键词:Dual wing harmonium,Term connection,Graph representation,Document retrieval,Multiple features

论文评审过程:Received 14 October 2008, Revised 16 February 2009, Accepted 15 March 2009, Available online 26 March 2009.

论文官网地址:https://doi.org/10.1016/j.patcog.2009.03.021