Paraphrase-focused learning to rank for domain-specific frequently asked questions retrieval

作者:

Highlights:

• We study the potential of supervised learning to rank for FAQ retrieval.

• Supervised models offer performance improvements for this task.

• We explored low-effort paraphrase-based data labeling strategies.

• Paraphrase-based labeling was effective for the best models on two FAQ data collections.

• We make a new FAQ retrieval data set publicly available.

摘要

•We study the potential of supervised learning to rank for FAQ retrieval.•Supervised models offer performance improvements for this task.•We explored low-effort paraphrase-based data labeling strategies.•Paraphrase-based labeling was effective for the best models on two FAQ data collections.•We make a new FAQ retrieval data set publicly available.

论文关键词:Question answering,FAQ retrieval,Learning to rank,ListNET,LambdaMART,Convolutional neural network

论文评审过程:Received 14 June 2017, Revised 11 September 2017, Accepted 12 September 2017, Available online 12 September 2017, Version of Record 5 October 2017.

论文官网地址:https://doi.org/10.1016/j.eswa.2017.09.031