Learning accurate and concise naïve Bayes classifiers from attribute value taxonomies and data

作者:J. Zhang, D.-K. Kang, A. Silvescu, V. Honavar

摘要

In many application domains, there is a need for learning algorithms that can effectively exploit attribute value taxonomies (AVT)—hierarchical groupings of attribute values—to learn compact, comprehensible and accurate classifiers from data—including data that are partially specified. This paper describes AVT-NBL, a natural generalization of the naïve Bayes learner (NBL), for learning classifiers from AVT and data. Our experimental results show that AVT-NBL is able to generate classifiers that are substantially more compact and more accurate than those produced by NBL on a broad range of data sets with different percentages of partially specified values. We also show that AVT-NBL is more efficient in its use of training data: AVT-NBL produces classifiers that outperform those produced by NBL using substantially fewer training examples.

论文关键词:Attribute value taxonomies, AVT-based naïve Bayes learner, Partially specified data

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10115-005-0211-z