A modular approach for lexical normalization applied to Spanish tweets

作者:

Highlights:

• An extensible and modular approach for normalizing Spanish tweets is proposed.

• We make use of lightweight resources build with low manual effort.

• System performance is also analyzed module-wise and phenomenon-wise.

• The domain adaptability of our proposed system is easy and successful.

• The performance increases if a classifier-based reranking process is introduced.

摘要

•An extensible and modular approach for normalizing Spanish tweets is proposed.•We make use of lightweight resources build with low manual effort.•System performance is also analyzed module-wise and phenomenon-wise.•The domain adaptability of our proposed system is easy and successful.•The performance increases if a classifier-based reranking process is introduced.

论文关键词:Twitter,Text normalization,Domain adaptation

论文评审过程:Available online 14 February 2015.

论文官网地址:https://doi.org/10.1016/j.eswa.2015.02.003