Improving kernel online learning with a snapshot memory

Authors: Trung Le, Khanh Nguyen, Dinh Phung

Abstract

We propose Stochastic Variance-reduced Gradient Descent for Kernel Online Learning (DualSVRG), which attains an \(\varepsilon\)-approximate linear convergence rate and is not vulnerable to the curse of kernelization. Our approach uses a variance reduction technique to reduce the variance of the full-gradient estimate, and further exploits recent work on dual space gradient descent for online learning to achieve model optimality. This is achieved by introducing the concept of an instant memory, a snapshot that stores the most recent incoming data instances, and by proposing three transformer oracles: the budget, coverage, and always-move oracles. We further develop a rigorous theoretical analysis showing that the proposed approach attains the \(\varepsilon\)-approximate linear convergence rate while maintaining model sparsity, which in turn encourages fast training. We conduct extensive experiments on several benchmark datasets to compare DualSVRG with state-of-the-art baselines in both batch and online settings. The experimental results show that DualSVRG yields superior predictive performance while requiring training time comparable to that of the baselines.
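To illustrate the variance reduction idea the abstract refers to, the following is a minimal sketch of generic SVRG on a small linear least-squares problem. It is not the authors' DualSVRG: there is no kernel, no instant memory, and no budget, coverage, or always-move oracle. The function name `svrg_least_squares`, its parameters, and the toy data are all assumptions made for illustration. Each outer pass stores a snapshot of the weights, computes the full gradient at that snapshot, and then uses it to correct the single-sample gradients in the inner loop.

```python
import random


def svrg_least_squares(xs, ys, step=0.02, epochs=50, inner_steps=100, seed=0):
    """Generic SVRG for f(w) = (1/n) * sum_i 0.5 * (w . x_i - y_i)^2.

    Illustrative only: a plain linear model, not the kernelized DualSVRG.
    """
    rng = random.Random(seed)
    n, d = len(xs), len(xs[0])
    w = [0.0] * d

    def grad_i(weights, i):
        # Gradient of the i-th loss term: (weights . x_i - y_i) * x_i
        residual = sum(wj * xj for wj, xj in zip(weights, xs[i])) - ys[i]
        return [residual * xj for xj in xs[i]]

    for _ in range(epochs):
        # Snapshot of the current weights and the full gradient at that point.
        snapshot = list(w)
        full = [0.0] * d
        for i in range(n):
            g = grad_i(snapshot, i)
            full = [f + gj / n for f, gj in zip(full, g)]

        for _ in range(inner_steps):
            i = rng.randrange(n)
            g_w = grad_i(w, i)         # stochastic gradient at current weights
            g_s = grad_i(snapshot, i)  # same sample, evaluated at the snapshot
            # Variance-reduced direction: g_i(w) - g_i(snapshot) + full gradient
            w = [wj - step * (a - b + f)
                 for wj, a, b, f in zip(w, g_w, g_s, full)]
    return w


# Toy data generated exactly by y = 2 + 3 * x (first feature is a bias term).
xs = [[1.0, 0.0], [1.0, 1.0], [1.0, 2.0], [1.0, -1.0]]
ys = [2.0, 5.0, 8.0, -1.0]
w_hat = svrg_least_squares(xs, ys)
print(w_hat)  # expected to be close to [2.0, 3.0]
```

The correction term `g_i(w) - g_i(snapshot)` has zero mean offset from the true gradient and shrinks as the iterate approaches the snapshot, which is what allows a constant step size here. The step size and loop lengths are conservative choices for this toy problem, not values taken from the paper.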

Keywords: Kernel online learning, Incremental stochastic gradient descent, Online learning, Kernel methods, Stochastic optimization

Official paper page: https://doi.org/10.1007/s10994-021-06075-7