Knowledge discovery from numerical data

作者:

Highlights:

摘要

One of the authors previously presented an algorithm for discovering understandable propositions from numerical data. The algorithm consists of normalization, multiple regression analysis and the approximation of multilinear functions by continuous Boolean functions. Continuous Boolean functions are included in the space of multilinear functions. The space of multilinear functions can be corresponded to the space of probability distributions, using the principle of indifference. The distance between two probability distributions is described by Kullback-Leibler (KL) information. Thus, the distance between multilinear functions and continuous Boolean functions is described using KL information. However, since the approximation algorithm is exponential in computational complexity, this paper presents a polynomial approximation algorithm.

论文关键词:Continuous Boolean functions,Multilinear functions,Kullback-Leibler information

论文评审过程:Received 11 August 1997, Accepted 3 November 1997, Available online 10 August 1998.

论文官网地址:https://doi.org/10.1016/S0950-7051(98)00040-9