A document classification and retrieval system for R&D in semiconductor industry – A hybrid approach

作者:

Highlights:

摘要

In this paper, a hybrid methodology with a vector space model (VSM) and process-oriented attributes for document management is proposed. The VSM is fine-tuned for classifying documents generated during R&D processes. The document correlation values are computed with the VSM for efficient retrieval. Only documents with high correlation values are presented to meet the specific retrieval purpose, which results in efficient and effective document retrieval. We further design a document classification and retrieval prototype system. The prototype is implemented to facilitate R&D document management in semiconductor industries.

论文关键词:Document classification and retrieval,Vector space model,Document management system

论文评审过程:Available online 17 June 2008.

论文官网地址:https://doi.org/10.1016/j.eswa.2008.06.024