Derandomizing Stochastic Prediction Strategies

作者：V. Vovk

摘要

In this paper we continue study of the games of prediction with expert advice with uncountably many experts. A convenient interpretation of such games is to construe the pool of experts as one “stochastic predictor”, who chooses one of the experts in the pool at random according to the prior distribution on the experts and then replicates the (deterministic ) predictions of the chosen expert. We notice that if the stochastic predictor's total loss is at most L with probability at least p then the learner's loss can be bounded by cL + aln \(\frac{1}{{\text{P}}}\) for the usual constants c and a. This interpretation is used to revamp known results and obtain new results on tracking the best expert. It is also applied to merging overconfident experts and to fitting polynomials to data.

论文关键词：on-line learning, prediction with expert advice, tracking the best expert, regression

论文评审过程：

论文官网地址：https://doi.org/10.1023/A:1007595032382