Associative Naïve Bayes classifier: Automated linking of gene ontology to medline documents

Authors:

Highlights:

Abstract

We present a text-mining method, the associative Naïve Bayes (ANB) classifier, for automatically linking MEDLINE documents to the Gene Ontology (GO). The approach is a nontrivial extension of document classification methodology from a fixed set of classes C={c1,c2,…,cn} to a knowledge hierarchy such as GO. Because GO is complex, we introduce a knowledge representation structure and build the ANB classifier on top of it. To evaluate performance, we compare the ANB classifier with several well-known classifiers on our datasets: the NB classifier, the large Bayes classifier, and the support vector machine. The results indicate that the ANB classifier is practically useful.
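For readers unfamiliar with the NB baseline named in the abstract, the following is a minimal sketch of a plain multinomial Naïve Bayes document classifier over a fixed set of classes. It is not the ANB classifier, whose details the abstract does not give. The function names `train_nb` and `classify_nb`, the toy documents, and the two class labels are all hypothetical and chosen only for illustration; add-one (Laplace) smoothing is an assumption.

```python
import math
from collections import Counter, defaultdict


def train_nb(docs):
    """Collect counts from (tokens, label) pairs; returns a simple model tuple."""
    class_counts = Counter()            # number of documents per class
    word_counts = defaultdict(Counter)  # token frequencies per class
    vocab = set()
    for tokens, label in docs:
        class_counts[label] += 1
        word_counts[label].update(tokens)
        vocab.update(tokens)
    return class_counts, word_counts, vocab


def classify_nb(model, tokens):
    """Return the class with the highest log posterior for the given tokens."""
    class_counts, word_counts, vocab = model
    n_docs = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label, count in class_counts.items():
        total = sum(word_counts[label].values())
        score = math.log(count / n_docs)  # log prior
        for tok in tokens:
            if tok not in vocab:
                continue  # ignore tokens never seen in training
            # Add-one smoothing (an assumption, not taken from the paper)
            score += math.log((word_counts[label][tok] + 1) / (total + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical toy corpus; the labels stand in for GO terms.
docs = [
    (["cell", "death", "caspase"], "apoptosis"),
    (["caspase", "apoptotic", "signal"], "apoptosis"),
    (["dna", "polymerase", "replication"], "dna_replication"),
    (["replication", "fork", "dna"], "dna_replication"),
]
model = train_nb(docs)
predicted = classify_nb(model, ["caspase", "death"])
```

This flat formulation treats the classes as unrelated labels, which is the limitation the abstract says the paper addresses by moving to a knowledge hierarchy.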

Keywords: Data mining, Knowledge discovery, Gene ontology, Document classification

Review history: Received 30 April 2008, Revised 6 January 2009, Accepted 19 January 2009, Available online 30 January 2009.

Paper URL: https://doi.org/10.1016/j.patcog.2009.01.020