KP-Miner: A keyphrase extraction system for English and Arabic documents

作者:

Highlights:

摘要

Automatic keyphrase extraction has many important applications including but not limited to summarization, cataloging/indexing, feature extraction for clustering and classification, and data mining. This paper presents the KP-Miner system, and demonstrates through experimentation and comparison with widely used systems that it is effective and efficient in extracting keyphrases from both English and Arabic documents of varied length. Unlike other existing keyphrase extraction systems, the KP-Miner system does not need to be trained on a particular document set in order to achieve its task. It also has the advantage of being configurable as the rules and heuristics adopted by the system are related to the general nature of documents and keyphrases. This implies that the users of this system can use their understanding of the document(s) being input into the system to fine-tune it to their particular needs.

论文关键词:Keyphrase extraction,Heuristic rules,Automatic indexing

论文评审过程:Received 25 February 2008, Accepted 14 May 2008, Available online 29 May 2008.

论文官网地址:https://doi.org/10.1016/j.is.2008.05.002