Supervised automatic evaluation for summarization with voted regression model

作者:

Highlights:

摘要

The high quality evaluation of generated summaries is needed if we are to improve automatic summarization systems. Although human evaluation provides better results than automatic evaluation methods, its cost is huge and it is difficult to reproduce the results. Therefore, we need an automatic method that simulates human evaluation if we are to improve our summarization system efficiently. Although automatic evaluation methods have been proposed, they are unreliable when used for individual summaries. To solve this problem, we propose a supervised automatic evaluation method based on a new regression model called the voted regression model (VRM). VRM has two characteristics: (1) model selection based on ‘corrected AIC’ to avoid multicollinearity, (2) voting by the selected models to alleviate the problem of overfitting. Evaluation results obtained for TSC3 and DUC2004 show that our method achieved error reductions of about 17–51% compared with conventional automatic evaluation methods. Moreover, our method obtained the highest correlation coefficients in several different experiments.

论文关键词:Automatic summarization,Automatic evaluation,Text summarization challenge,Document understanding conference,Regression model

论文评审过程:Received 17 July 2006, Revised 10 January 2007, Accepted 10 January 2007, Available online 7 March 2007.

论文官网地址:https://doi.org/10.1016/j.ipm.2007.01.012