Article contents
An analysis of transient Markov decision processes
Published online by Cambridge University Press: 14 July 2016
Abstract
This paper is concerned with the analysis of Markov decision processes in which a natural form of termination ensures that the expected future costs are bounded, at least under some policies. Whereas most previous analyses have restricted attention to the case where the set of states is finite, this paper analyses the case where the set of states is not necessarily finite or even countable. It is shown that all the existence, uniqueness, and convergence results of the finite-state case hold when the set of states is a general Borel space, provided we make the additional assumption that the optimal value function is bounded below. We give a sufficient condition for the optimal value function to be bounded below which holds, in particular, if the set of states is countable.
Keywords
MSC classification
- Type
- Research Article
- Information
- Copyright
- © Applied Probability Trust 2006
References
- 8
- Cited by