The complexity of synchronizing Markov decision processes
作者:
Highlights:
•
摘要
We consider Markov decision processes (MDP) as generators of sequences of probability distributions over states. A probability distribution is p-synchronizing if the probability mass is at least p in a single state, or in a given set of states. We consider four temporal synchronizing modes: a sequence of probability distributions is always p-synchronizing, eventually p-synchronizing, weakly p-synchronizing, or strongly p-synchronizing if, respectively, all, some, infinitely many, or all but finitely many distributions in the sequence are p-synchronizing. We provide tight results on the expressiveness, decidability, complexity, and memory requirement for winning strategies for all synchronizing modes in MDPs.
论文关键词:Markov decision processes,Complexity,Sequences of probability distributions,Synchronization
论文评审过程:Received 6 May 2016, Revised 20 March 2018, Accepted 20 September 2018, Available online 9 October 2018, Version of Record 19 November 2018.
论文官网地址:https://doi.org/10.1016/j.jcss.2018.09.004