A multistrategy approach to classification learning in databases
作者:
Highlights:
•
摘要
This paper proposes a hybrid classification learning system for databases that integrates rule induction and lazy learning. For rule induction learning, we use an entropy function based on Hellinger divergence to measure the amount of information each inductive rule contains. For lazy learning, we also use the Hellinger measure to automatically generate attribute weights and to compute similarities between data values of non-numeric data types. Our system has been implemented and tested extensively on a number of well-known machine learning data sets. The performance of our system was favorable compared to those of other well-known classification learning techniques based on monostrategic learning methods.
论文关键词:Rule induction,Lazy learning,Database,Data mining,Hellinger divergence
论文评审过程:Received 14 October 1998, Revised 17 March 1999, Accepted 14 April 1999, Available online 3 August 1999.
论文官网地址:https://doi.org/10.1016/S0169-023X(99)00018-X