Medical query generation by term–category correlation

作者:

Highlights:

摘要

Natural language descriptions are helpful for users to precisely describe medical information needs. However search engines often operate on keyword-based queries. Generating keyword-based queries from the descriptions is thus essential. Its goal lies in retrieving more relevant information that may be ranked high for easy access. In response to the goal, we present a technique MQG (Medical Query Generator) that, given an information need description, generates a query by selecting (from the description) those terms having stronger correlation to medical categories. Empirical evaluation on a medical text database OHSUMED shows that MQG greatly outperforms several state-of-the-art techniques, including those that expand queries by a complete dictionary of medical terms and their equivalence terms in retrieval. Moreover, it reduces the load incurred to the text ranker by retrieving fewer documents for ranking. It also reduces the load incurred to the search engines by using fewer terms in the queries.

论文关键词:Medical information need,Natural language,Query generation,Term selection,Term–category correlation

论文评审过程:Received 4 February 2009, Revised 9 April 2010, Accepted 30 April 2010, Available online 26 May 2010.

论文官网地址:https://doi.org/10.1016/j.ipm.2010.04.003