Potential-Based Algorithms in On-Line Prediction and Game Theory

作者:Nicolò Cesa-Bianchi, Gábor Lugosi

摘要

In this paper we show that several known algorithms for sequential prediction problems (including Weighted Majority and the quasi-additive family of Grove, Littlestone, and Schuurmans), for playing iterated games (including Freund and Schapire's Hedge and MW, as well as the Λ-strategies of Hart and Mas-Colell), and for boosting (including AdaBoost) are special cases of a general decision strategy based on the notion of potential. By analyzing this strategy we derive known performance bounds, as well as new bounds, as simple corollaries of a single general theorem. Besides offering a new and unified view on a large family of algorithms, we establish a connection between potential-based analysis in learning and their counterparts independently developed in game theory. By exploiting this connection, we show that certain learning problems are instances of more general game-theoretic problems. In particular, we describe a notion of generalized regret andshow its applications in learning theory.

论文关键词:universal prediction, on-line learning, Blackwell's strategy, perceptron algorithm, weighted average predictors, internal regret, boosting

论文评审过程:

论文官网地址:https://doi.org/10.1023/A:1022901500417