
Optimal stationary policies for denumerable Markov chains in continuous time

Published online by Cambridge University Press: 01 July 2016

John Bather*
Affiliation:
University of Sussex

Abstract

This paper is concerned with the problem of selecting the transition intensities for a Markov chain in continuous time so as to minimise the long-term average cost. Sufficient conditions are established for an optimal stationary policy using unbounded solutions of the optimality equation. This is a development of recent work on Markovian decision processes in discrete time. The theory is illustrated by considering a simple birth and death process with controlled immigration.
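For orientation, the optimality equation mentioned in the abstract can be written, in one standard formulation for a controlled chain with transition intensities q_ij(a) and cost rates c(i, a), as g = min_a [ c(i, a) + Σ_j q_ij(a) h(j) ] for every state i, where g is the minimal long-term average cost and h is a relative value function that need not be bounded. The sketch below is not the paper's argument: it truncates a birth and death chain with controlled immigration to a finite state space, uniformizes it, and finds a stationary policy by relative value iteration; all rates, costs, and names are illustrative assumptions.

```python
import numpy as np

# Illustrative sketch only: a truncated birth-and-death chain with controlled
# immigration, solved by relative value iteration after uniformization.  The
# parameters and cost structure below are assumptions for the example, not
# taken from the paper.
lam, mu = 1.0, 0.4          # immigration rate when allowed; per-individual death rate
hold, block = 1.0, 2.0      # holding cost per individual; penalty rate while blocking
N = 40                      # truncation of the denumerable state space {0, 1, 2, ...}
ACTIONS = (0, 1)            # 0 = block immigration, 1 = allow immigration
Lam = lam + mu * N          # uniformization rate, bounding every total exit rate

def step_probs(i, a):
    """One-step probabilities of the uniformized chain from state i under action a."""
    up = lam * a if i < N else 0.0
    down = mu * i
    p = np.zeros(N + 1)
    if i < N:
        p[i + 1] = up / Lam
    if i > 0:
        p[i - 1] = down / Lam
    p[i] = 1.0 - (up + down) / Lam
    return p

cost = {a: np.array([hold * i + block * (1 - a) for i in range(N + 1)]) for a in ACTIONS}
P = {a: np.array([step_probs(i, a) for i in range(N + 1)]) for a in ACTIONS}

# Relative value iteration on  g + h(i) = min_a [ c(i,a) + sum_j P_a(i,j) h(j) ].
# The uniformized chain has the same stationary distribution as the
# continuous-time chain, so g is also the long-run average cost per unit time.
h, g = np.zeros(N + 1), 0.0
for _ in range(200000):
    Q = np.column_stack([cost[a] + P[a] @ h for a in ACTIONS])
    Th = Q.min(axis=1)
    g = Th[0] - h[0]                  # current estimate of the minimal average cost
    h_new = Th - Th[0]                # normalise so that h(0) = 0
    if np.max(np.abs(h_new - h)) < 1e-9:
        h = h_new
        break
    h = h_new

policy = Q.argmin(axis=1)             # stationary policy: action chosen in each state
print("estimated minimal average cost:", g)
print("immigration allowed in states:", [i for i in range(N + 1) if policy[i] == 1])
```

For values like these one would expect the printed policy to be of threshold form, allowing immigration in low states and blocking it once holding costs outweigh the blocking penalty, but that is a property of this illustrative model rather than a result of the paper.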

Type
Research Article
Copyright
Copyright © Applied Probability Trust 1976 
