An aggregate and iterative disaggregate algorithm with proven optimality in machine learning

Authors: Young Woong Park, Diego Klabjan

Abstract

We propose a clustering-based iterative algorithm for solving certain optimization problems in machine learning. The algorithm starts by aggregating the original data and solving the problem on the aggregated data, and then gradually disaggregates the aggregated data in subsequent steps. We apply the algorithm to common machine learning problems such as least absolute deviation regression, support vector machines, and semi-supervised support vector machines, and we derive model-specific data aggregation and disaggregation procedures. We also show optimality, convergence, and the optimality gap of the approximated solution in each iteration. A computational study is provided.
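
The aggregate, solve, disaggregate loop described in the abstract can be illustrated on a deliberately tiny case. The sketch below is not the authors' procedure: it assumes an intercept-only least absolute deviation problem (minimize the sum of |y_i - b| over b), aggregates each cluster to its mean weighted by cluster size, and splits any cluster whose members have residuals of mixed sign. The function names `weighted_median` and `aid_median`, the chunk-based initial clustering, and the split rule are all assumptions made for illustration. In this toy setting, once no cluster has mixed-sign residuals the aggregated objective equals the original objective at the current solution, and since the aggregated objective never exceeds the original one, the solution is optimal for the original data.

```python
from typing import List, Sequence, Tuple


def weighted_median(points: Sequence[Tuple[float, int]]) -> float:
    """Return a minimizer of sum(w * |v - b|) over b (a lower weighted median)."""
    ordered = sorted(points)
    half = sum(weight for _, weight in ordered) / 2.0
    running = 0
    for value, weight in ordered:
        running += weight
        if running >= half:
            return value
    raise ValueError("weighted_median needs at least one point")


def aid_median(y: Sequence[float], n_clusters: int = 2) -> Tuple[float, int]:
    """Toy aggregate-and-iterative-disaggregate loop (hypothetical helper).

    Solves min_b sum_i |y_i - b| and returns (b, number of iterations).
    """
    # Aggregate: arbitrary initial clusters, here chunks in input order.
    size = max(1, len(y) // n_clusters)
    clusters: List[List[float]] = [list(y[i:i + size]) for i in range(0, len(y), size)]

    iterations = 0
    while True:
        iterations += 1
        # Solve on aggregated data: one point per cluster (its mean),
        # weighted by the number of original observations it represents.
        aggregated = [(sum(c) / len(c), len(c)) for c in clusters]
        b = weighted_median(aggregated)

        # Disaggregate: split every cluster whose residuals y_i - b have mixed signs.
        next_clusters: List[List[float]] = []
        split_happened = False
        for c in clusters:
            below = [v for v in c if v < b]
            above = [v for v in c if v > b]
            if below and above:
                at_b = [v for v in c if v == b]
                next_clusters.append(below)
                next_clusters.append(above + at_b)
                split_happened = True
            else:
                next_clusters.append(c)

        if not split_happened:
            # No mixed-sign cluster remains, so b is optimal for the original data.
            return b, iterations
        clusters = next_clusters


if __name__ == "__main__":
    data = [5.0, 1.0, 9.0, 2.0, 100.0, 3.0, 4.0]
    solution, n_iter = aid_median(data, n_clusters=2)
    print(solution, n_iter)
```

Each split strictly increases the number of clusters, so the loop stops after at most as many iterations as there are observations. The regression and SVM settings named in the abstract need their own aggregation and split rules, which this sketch does not attempt to reproduce.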

Keywords: Optimization, Machine learning, Data aggregation, Least absolute deviation regression, Support vector machine, Semi-supervised support vector machine, Aggregate and iterative disaggregate, AID

Paper URL: https://doi.org/10.1007/s10994-016-5562-z