Beyond keyword and cue-phrase matching: A sentence-based abstraction technique for information extraction

作者:

Highlights:

摘要

With the explosion in the quantity of on-line text and multimedia information in recent years, there has been a renewed interest in the automated extraction of knowledge and information in various disciplines. In this paper, we provide a novel quantitative model for the creation of a summary by extracting a set of sentences that represent the most salient content of a text. The model is based on a shallow linguistic extraction technique. What distinguishes it from previous research is that it does not work on the detection of specific keywords or cue-phrases to evaluate the relevance of the sentence concerned. Instead, the attention is focused on the identification of the main factors in the textual continuity. Simulation experiments suggest that this technique is useful because it moves away from a purely keyword-based method of textual information extraction and its associated limitations.

论文关键词:Information extraction,Automatic summary,Shallow text processing,Connectionist model

论文评审过程:Received 1 June 2003, Accepted 1 November 2004, Available online 28 June 2005.

论文官网地址:https://doi.org/10.1016/j.dss.2004.11.017