Train-O-Matic: Supervised Word Sense Disambiguation with no (manual) effort

作者:

摘要

Word Sense Disambiguation (WSD) is the task of associating the correct meaning with a word in a given context. WSD provides explicit semantic information that is beneficial to several downstream applications, such as question answering, semantic parsing and hypernym extraction. Unfortunately, WSD suffers from the well-known knowledge acquisition bottleneck problem: it is very expensive, in terms of both time and money, to acquire semantic annotations for a large number of sentences. To address this blocking issue we present Train-O-Matic, a knowledge-based and language-independent approach that is able to provide millions of training instances annotated automatically with word meanings. The approach is fully automatic, i.e., no human intervention is required, and the only type of human knowledge used is a task-independent WordNet-like resource.

论文关键词:Word Sense Disambiguation,Corpus Generation,Word Sense Distribution learning,Multilinguality

论文评审过程:Received 24 November 2018, Revised 11 November 2019, Accepted 15 November 2019, Available online 26 November 2019, Version of Record 4 December 2019.

论文官网地址:https://doi.org/10.1016/j.artint.2019.103215