Potential-Based Algorithms in On-Line Prediction and Game Theory
作者:Nicolò Cesa-Bianchi, Gábor Lugosi
摘要
In this paper we show that several known algorithms for sequential prediction problems (including Weighted Majority and the quasi-additive family of Grove, Littlestone, and Schuurmans), for playing iterated games (including Freund and Schapire's Hedge and MW, as well as the Λ-strategies of Hart and Mas-Colell), and for boosting (including AdaBoost) are special cases of a general decision strategy based on the notion of potential. By analyzing this strategy we derive known performance bounds, as well as new bounds, as simple corollaries of a single general theorem. Besides offering a new and unified view on a large family of algorithms, we establish a connection between potential-based analysis in learning and their counterparts independently developed in game theory. By exploiting this connection, we show that certain learning problems are instances of more general game-theoretic problems. In particular, we describe a notion of generalized regret andshow its applications in learning theory.
论文关键词:universal prediction, on-line learning, Blackwell's strategy, perceptron algorithm, weighted average predictors, internal regret, boosting
论文评审过程:
论文官网地址:https://doi.org/10.1023/A:1022901500417