Part-of-speech tagger for Ainu language based on higher order Hidden Markov Model

作者:

Highlights:

摘要

This paper presents POST-AL, the first part-of-speech tagger for Ainu language. The system uses a hand-crafted dictionary based on Ainu narratives “yukar”. The system provides three types of information: word/token, part of speech, and translation of the token (in Japanese). Evaluation on a training set provided positive results. The system could be useful in a great number of tasks related to the research on Ainu language, such as content analysis or translation, which till now have been done mostly manually.

论文关键词:Part-of-speech tagging,Natural speech processing,Natural language processing,Hidden Markov Model,Ainu language

论文评审过程:Available online 18 April 2012.

论文官网地址:https://doi.org/10.1016/j.eswa.2012.04.031