Discovering hidden structure in factored MDPs

作者:

摘要

Markov Decision Processes (MDPs) describe a wide variety of planning scenarios ranging from military operations planning to controlling a Mars rover. However, todayʼs solution techniques scale poorly, limiting MDPsʼ practical applicability. In this work, we propose algorithms that automatically discover and exploit the hidden structure of factored MDPs. Doing so helps solve MDPs faster and with less memory than state-of-the-art techniques.

论文关键词:Markov Decision Process,MDP,Planning under uncertainty,Generalization,Abstraction,Basis function,Nogood,Heuristic,Dead end

论文评审过程:Received 1 August 2010, Revised 8 April 2012, Accepted 9 May 2012, Available online 15 May 2012.

论文官网地址:https://doi.org/10.1016/j.artint.2012.05.002