Search

3 results

Optimal decision procedures for finite Markov chains. Part III: General convex systems
John Bather
Journal:

Advances in Applied Probability / Volume 5 / Issue 3 / December 1973

Published online by Cambridge University Press:

01 July 2016, pp. 541-553

Print publication:

December 1973
- Article
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
This paper is concerned with the general problem of finding an optimal transition matrix for a finite Markov chain, where the probabilities for each transition must be chosen from a given convex family of distributions. The immediate cost is determined by this choice, but it is required to minimise the average expected cost in the long run. The problem is investigated by classifying the states according to the accessibility relations between them. If an optimal policy exists, it can be found by considering the convex subsystems associated with the states at different levels in the classification scheme.

Optimal decision procedures for finite Markov chains. Part II: Communicating systems
John Bather
Journal:

Advances in Applied Probability / Volume 5 / Issue 3 / December 1973

Published online by Cambridge University Press:

01 July 2016, pp. 521-540

Print publication:

December 1973
- Article
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
A Markov process in discrete time with a finite state space is controlled by choosing the transition probabilities from a given convex family of distributions depending on the present state. The immediate cost is prescribed for each choice and it is required to minimise the average expected cost over an infinite future. The paper considers a special case of this general problem and provides the foundation for a general solution. The main result is that an optimal policy exists if each state of the system can be reached with positive probability from any other state by choosing a suitable policy.

Optimal decision procedures for finite markov chains. Part I: Examples
John Bather
Journal:

Advances in Applied Probability / Volume 5 / Issue 2 / August 1973

Published online by Cambridge University Press:

01 July 2016, pp. 328-339

Print publication:

August 1973
- Article
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
A Markov process in discrete time with a finite state space is controlled by choosing the transition probabilities from a prescribed set depending on the state occupied at any time. Given the immediate cost for each choice, it is required to minimise the expected cost over an infinite future, without discounting. Various techniques are reviewed for the case when there is a finite set of possible transition matrices and an example is given to illustrate the unpredictable behaviour of policy sequences derived by backward induction. Further examples show that the existing methods may break down when there is an infinite family of transition matrices. A new approach is suggested, based on the idea of classifying the states according to their accessibility from one another.