Dual pattern-enhanced representations model for query-focused multi-document summarisation

作者:

Highlights:

摘要

To address the problem of query-focused multi-document summarisation, we present a novel unsupervised pattern-enhanced approach for representing coherent topics across documents, as well as the query relevance, in order to generate topically coherent summaries that meet the information needs of users. The proposed model employs not only a pattern-enhanced topic model to generate discriminative and semantic rich representations for topics and documents, but also a pattern-based relevance model for the query relevance of sentences. With these dual pattern-based representations for sentences, we are able to integrate various indicative metrics, such as rational coverage of document topics and sentence relevance, into a unified model. When evaluated on the datasets of the document understanding conferences of 2006 and 2007, the proposed approach shows a performance improvement as compared to a number of state-of-the-art methods and unsupervised baseline systems.

论文关键词:Query-focused multi-document summarisation,Pattern mining,Topic modelling,Query expansion,Three-way decision theory,Unsupervised approach

论文评审过程:Received 17 January 2018, Revised 20 September 2018, Accepted 25 September 2018, Available online 12 October 2018, Version of Record 21 November 2018.

论文官网地址:https://doi.org/10.1016/j.knosys.2018.09.035