A weakly monotonic backward induction algorithm on finite bounded subsets of vector lattices

作者:

Highlights:

摘要

We present a new efficient and robust backward induction algorithm, which is weakly monotonic, working on bounded subsets without holes of lattices. We prove all its properties, give examples of applications, and illustrate its behavior, comparing it with the natural extension of the unidimensional algorithm presented in Puterman (Markov Decision Processes: Discrete Stochastic Dynamic Programming, Wiley, New York, 1994), in the sense of Topkis (Frontiers of Economic Research Series, Princeton University Press, Princeton, NJ, 1998) and White (Recent Developments in Markov Decision Processes, Academic Press, New York, 1980, 261) and showing, also experimentally, that it is much more efficient.

论文关键词:Discrete-time controlled dynamical systems,Nonstationary Markov decision processes,Nonchaotic weakly monotonic optimal policies,Partial order

论文评审过程:Received 10 September 2002, Available online 23 February 2004.

论文官网地址:https://doi.org/10.1016/j.cam.2003.11.007