The AMTEx approach in the medical document indexing and retrieval application
Abstract
AMTEx is a medical document indexing method designed for the automatic indexing of documents in large medical collections such as MEDLINE, the premier bibliographic database of the US National Library of Medicine (NLM). AMTEx combines MeSH, the terminological thesaurus of NLM, with a well-established term extraction method, the C/NC-value method. The performance of two AMTEx configurations is evaluated against the current state of the art, the MetaMap Transfer (MMTx) method, in four experiments using two types of corpora: a subset of the MEDLINE (PMC) full-document corpus for the indexing task and a subset of MEDLINE (OHSUMED) abstracts for the retrieval task. The experimental results show that AMTEx indexes better than MMTx while requiring only 20–50% of its processing time, and that in the retrieval task AMTEx performs better on the full-text (PMC) corpus.
Keywords: Document indexing, Medical document retrieval, Term extraction, MMTx, AMTEx
Article history: Received 10 July 2008, Revised 31 October 2008, Accepted 25 November 2008, Available online 10 December 2008.
Paper URL: https://doi.org/10.1016/j.datak.2008.11.002