Multi-candidate reduction: Sentence compression as a tool for document summarization tasks

作者:

Highlights:

摘要

This article examines the application of two single-document sentence compression techniques to the problem of multi-document summarization—a “parse-and-trim” approach and a statistical noisy-channel approach. We introduce the multi-candidate reduction (MCR) framework for multi-document summarization, in which many compressed candidates are generated for each source sentence. These candidates are then selected for inclusion in the final summary based on a combination of static and dynamic features. Evaluations demonstrate that sentence compression is a valuable component of a larger multi-document summarization framework.

论文关键词:Headline generation,Summarization,Parse-and-trim,Hidden Markov model

论文评审过程:Received 18 July 2006, Revised 3 January 2007, Accepted 8 January 2007, Available online 23 March 2007.

论文官网地址:https://doi.org/10.1016/j.ipm.2007.01.016