Generalized mirror descents in congestion games

作者:

摘要

Different types of dynamics have been studied in repeated game play, and one of them which has received much attention recently consists of those based on “no-regret” algorithms from the area of machine learning. It is known that dynamics based on generic no-regret algorithms may not converge to Nash equilibria in general, but to a larger set of outcomes, namely coarse correlated equilibria. Moreover, convergence results based on generic no-regret algorithms typically use a weaker notion of convergence: the convergence of the average plays instead of the actual plays. Some work has been done showing that when using a specific no-regret algorithm, the well-known multiplicative updates algorithm, convergence of actual plays to equilibria can be shown and better quality of outcomes in terms of the price of anarchy can be reached for atomic congestion games and load balancing games. Are there more cases of natural no-regret dynamics that perform well in suitable classes of games in terms of convergence and quality of outcomes that the dynamics converge to?

论文关键词:Mirror-descent algorithm,No-regret dynamics,Convergence,Bandit model

论文评审过程:Received 12 April 2015, Revised 5 September 2016, Accepted 12 September 2016, Available online 29 September 2016, Version of Record 5 October 2016.

论文官网地址:https://doi.org/10.1016/j.artint.2016.09.002