Published online by Cambridge University Press: 14 July 2016
This work concerns controlled Markov chains with denumerable state space, (possibly) unbounded cost function, and an expected average cost criterion. Under a Lyapunov function condition, together with mild continuity-compactness assumptions, a simple necessary and sufficient criterion is given so that the relative value functions and differential costs produced by the value iteration scheme converge pointwise to the solution of the optimality equation; this criterion is applied to obtain convergence results when the cost function is bounded below or bounded above.
Research supported by a U.S.-México Collaborative Research Program funded by the National Science Foundation under Grant NSF-INT 9201430, and by the Consejo Nacional de Ciencia y Tecnología (CONACyT), México.
The work of Cavazos-Cadena was partially supported by the PSF Organization under Grant No. 200–150/94–01, and by the MAXTOR Foundation for Applied Probability and Statistics (MAXFAPS) under Grant No. 01–01–56/01–94.