Investigating sentence weighting components for automatic summarisation

作者:

Highlights:

摘要

The work described here initially formed part of a triangulation exercise to establish the effectiveness of the Query Term Order algorithm. It subsequently proved to be a reliable indicator for summarising English web documents. We utilised the human summaries from the Document Understanding Conference data, and generated queries automatically for testing the QTO algorithm. Six sentence weighting schemes that made use of Query Term Frequency and QTO were constructed to produce system summaries, and this paper explains the process of combining and balancing the weighting components. The summaries produced were evaluated by the ROUGE-1 metric, and the results showed that using QTO in a weighting combination resulted in the best performance. We also found that using a combination of more weighting components always produced improved performance compared to any single weighting component.

论文关键词:Query Term Order,Query Term Frequency,Sentence Location,Sentence Order,Sentence weighting scheme

论文评审过程:Received 29 January 2006, Revised 27 April 2006, Accepted 16 May 2006, Available online 5 July 2006.

论文官网地址:https://doi.org/10.1016/j.ipm.2006.05.010