Crossref Citations
This article has been cited by the following publications. This list is generated based on data provided by Crossref.
Hübner, G. 1988. A unified approach to adaptive control of average reward Markov decision processes. OR Spektrum, Vol. 10, Issue. 3, p. 161.
Jalali, A. and Ferguson, M. 1989. Computationally efficient adaptive control algorithms for Markov chains. p. 1283.
Jalali, Ahmad and Ferguson, Michael 1990. Adaptive control of Markov chains with local updates. Systems & Control Letters, Vol. 14, Issue. 3, p. 209.
Hübner, G. and Schäl, M. 1991. Adaptive policy-iteration and policy-value-iteration for discounted Markov decision processes. ZOR Zeitschrift für Operations Research Methods and Models of Operations Research, Vol. 35, Issue. 6, p. 491.
Jalali, A. and Ferguson, M.J. 1992. Computationally efficient algorithms for on-line optimization of Markov decision processes. Automatica, Vol. 28, Issue. 1, p. 107.
Prieto-Rumeau, Tomás 2003. Statistical inference for a finite optimal stopping problem with unknown transition probabilities. Test, Vol. 12, Issue. 1, p. 215.
Prieto-Rumeau, Tomás 2005. Central limit theorem for the estimator of the value of an optimal stopping problem. Test, Vol. 14, Issue. 1, p. 215.