Crossref Citations
This article has been cited by the following publications. This list is generated based on data provided by
Crossref.
Whittle, P.
1980.
Stability and characterisation conditions in negative programming.
Journal of Applied Probability,
Vol. 17,
Issue. 03,
p.
635.
Stidham, S.
1981.
On the convergence of successive approximations in dynamic programming with non-zero terminal reward.
Zeitschrift für Operations Research,
Vol. 25,
Issue. 3,
p.
57.
van Dawen, Rolf
1984.
DGOR.
Vol. 1983,
Issue. ,
p.
475.
Haurie, A.
and
L'Ecuyer, P.
1986.
Approximation and bounds in discrete event dynamic programming.
IEEE Transactions on Automatic Control,
Vol. 31,
Issue. 3,
p.
227.
Puterman, Martin L.
1990.
Stochastic Models.
Vol. 2,
Issue. ,
p.
331.
1994.
Markov Decision Processes.
p.
613.
Ramakrishnan, S.
and
Sudderth, W.
1998.
Geometric Convergence of Algorithms in Gambling Theory.
Mathematics of Operations Research,
Vol. 23,
Issue. 3,
p.
568.
Stidham, Shaler
2000.
Computational Probability.
Vol. 24,
Issue. ,
p.
325.
Yu, Huizhen
2015.
On Convergence of Value Iteration for a Class of Total Cost Markov Decision Processes.
SIAM Journal on Control and Optimization,
Vol. 53,
Issue. 4,
p.
1982.
Yu, Huizhen
and
Bertsekas, Dimitri P.
2015.
A Mixed Value and Policy Iteration Method for Stochastic Control with Universally Measurable Policies.
Mathematics of Operations Research,
Vol. 40,
Issue. 4,
p.
926.