Learning data discretization via convex optimization

作者：Vojtech Franc, Ondrej Fikar, Karel Bartos, Michal Sofka

摘要

Discretization of continuous input functions into piecewise constant or piecewise linear approximations is needed in many mathematical modeling problems. It has been shown that choosing the length of the piecewise segments adaptively based on data samples leads to improved accuracy of the subsequent processing such as classification. Traditional approaches are often tied to a particular classification model which results in local greedy optimization of a criterion function. This paper proposes a technique for learning the discretization parameters along with the parameters of a decision function in a convex optimization of the true objective. The general formulation is applicable to a wide range of learning problems. Empirical evaluation demonstrates that the proposed convex algorithms yield models with fewer number of parameters with comparable or better accuracy than the existing methods.

论文关键词：Piecewise constant embedding, Piecewise linear embedding, Parameter discretization, Convex optimization, Classification, Histograms

论文评审过程：

论文官网地址：https://doi.org/10.1007/s10994-017-5654-4