Adapting attackers and defenders patrolling strategies: A reinforcement learning approach for Stackelberg security games

作者：

Highlights：

• We consider a Reinforcement Learning process for Stackelberg security games.

• The architecture involves the Adaptive Primary Learning and the Actor–critic modules.

• We provide a game-theoretic formulation method based on Markov decision process.

• We design the extraproximal method to find the Strong Lp-Stackelberg/Nash equilibrium.

• We provide an efficient algorithm that accelerate the reinforcement learning process.

摘要

•We consider a Reinforcement Learning process for Stackelberg security games.•The architecture involves the Adaptive Primary Learning and the Actor–critic modules.•We provide a game-theoretic formulation method based on Markov decision process.•We design the extraproximal method to find the Strong Lp-Stackelberg/Nash equilibrium.•We provide an efficient algorithm that accelerate the reinforcement learning process.

论文关键词：Security games,Reinforcement learning,Stackelberg games,Behavioral games,Multiple players,Strong Stackelberg/Nash equilibrium

论文评审过程：Received 8 February 2017, Revised 11 October 2017, Accepted 26 December 2017, Available online 9 January 2018, Version of Record 30 April 2018.

论文官网地址：https://doi.org/10.1016/j.jcss.2017.12.004