The Use of Background Knowledge in Decision Tree Induction

作者:Marlon Núñez

摘要

At present, algorithms of the ID3 family are not based on background knowledge. For that reason, most of the time they are neither logical nor understandable to experts. These algorithms cannot perform different types of generalization as others can do (Michalski, 1983; Kodratoff, 1983), nor can they can reduce the cost of classifications. The algorithm presented in this paper tries to generate more logical and understandable decision trees than those generated by ID3-like algorithms; it executes various types of generalization and at the same time reduces the classification cost by means of background knowledge. The background knowledge contains the ISA hierarchy and the measurement cost associated with each attribute. The user can define the degrees of economy and generalization. These data will influence directly the quantity of search that the algorithm must undertake. This algorithm, which is an attribute version of the EG2 method (Núñez, 1988a, 1988b), has been implemented and the results appear in this paper comparing them with other methods.

论文关键词:Decision trees, knowledge acquisition, economic induction, background knowledge

论文评审过程:

论文官网地址:https://doi.org/10.1023/A:1022609710832