Efficient stemmer generation

作者:

Highlights:

摘要

This paper presents an algorithm for generating stemmers from text stemmer specification files. A small study shows that the generated stemmers are computationally efficient, often running faster than stemmers custom written to implement particular stemming algorithms. The stemmer specification files are easily written and modified by non-programmers, making it much easier to create a stemmer, or tune a stemmer's performance, than would be the case with a custom stemmer program. Stemmer generation is thus also human-resource efficient.

论文关键词:Stemming,Automatic indexing,Algorithms

论文评审过程:Received 11 February 2001, Accepted 6 August 2001, Available online 14 March 2002.

论文官网地址:https://doi.org/10.1016/S0306-4573(01)00047-4