Single document summarization using the information from documents with the same topic
作者:
Highlights:
•
摘要
The essence of extractive summarization is to measure the importance of sentences in the document. When extracting summary from a single document, it is difficult to comprehensively and effectively evaluate the importance of sentences due to the lack of information. In this paper, we propose a kind of single document summarization method using information from documents under the same topic. This method integrates the topic information from neighborhood documents and statistical information from the target document to calculate the score of sentences. Then the scoring results are used as a prior scores for each sentence in the target document. After the target document is represented by the sentence graph, the final score of the sentences are obtained by the biased random walk algorithm. Finally, the Maximal Marginal Relevance (MMR) algorithm is used to select the sentences to form summary. The experimental results on the DUC2001 and DUC2002 datasets show that the effect of extracting summary is improved by incorporating information from the documents under the same topic.
论文关键词:Extractive summarization,Neighborhood documents,Graph model,Biased LexRank
论文评审过程:Received 27 June 2020, Revised 24 June 2021, Accepted 25 June 2021, Available online 27 June 2021, Version of Record 1 July 2021.
论文官网地址:https://doi.org/10.1016/j.knosys.2021.107265