Hostname: page-component-cd9895bd7-dzt6s Total loading time: 0 Render date: 2024-12-19T01:06:51.936Z Has data issue: false hasContentIssue false

A forwards induction approach to candidate drug selection

Published online by Cambridge University Press:  01 July 2016

S. Qu*
Affiliation:
University of Oxford
J. C. Gittins*
Affiliation:
University of Oxford
*
Postal address: Department of Statistics, University of Oxford, 1 South Parks Road, Oxford OX1 3TG, UK.
Postal address: Department of Statistics, University of Oxford, 1 South Parks Road, Oxford OX1 3TG, UK.
Rights & Permissions [Opens in a new window]

Abstract

Core share and HTML view are not available for this content. However, as you have access to this content, a full PDF is available via the ‘Save PDF’ action button.

A forwards induction policy is a type of greedy algorithm for Markov decision processes. It is straightforward to implement and is optimal for a large class of models, especially in stochastic resource allocation. In this paper we consider a model for the optimal allocation of resources in pre-clinical pharmaceutical research. We show that although they are not always strictly optimal, forwards induction policies perform well.

Type
General Applied Probability
Copyright
Copyright © Applied Probability Trust 2011 

References

Brealey, R. and Myers, S. (2000). Principles of Corporate Finance, 6th edn. McGraw Hill, New York.Google Scholar
Casella, G. and Berger, R. L. (2001). Statistical Inference, 2nd edition. Duxbury Press.Google Scholar
Charalambous, C. (2009). Models and software for improving the profitability of pharmaceutical research. , University of Oxford.Google Scholar
Charalambous, C. and Gittins, J. C. (2008). Optimal selection policies for a sequence of candidate drugs. Adv. Appl. Prob. 40, 359376.Google Scholar
Gittins, J. C. (1979). Bandit processes and dynamic allocation indices. J. R. Statist. Soc. B 41, 148177.Google Scholar
Gittins, J., Glazebrook, K. and Weber, R. (2011a). Multi-Armed Bandit Allocation Indices, 2nd edn. John Wiley, Chichester.Google Scholar
Gittins, J. C. et al. (2011b). OPRRA User Guide. Available at www.stats.ox.ac.uk/people/academic_staff/john_gittins.Google Scholar
Glazebrook, K. D. (1995). Stochastic scheduling and forwards induction. Discrete Appl. Math. 57, 145165.CrossRefGoogle Scholar
Glazebrook, K. D. and Gittins, J. C. (1993). The performance of forwards induction policies. Stoch. Process. Appl. 46, 301326.Google Scholar
Press, W. H., Teukolsky, S. A., Vetterling, W. T. and Flannery, B. P. (2007). Numerical Recipes, 3rd edn. Cambridge University Press.Google Scholar
Puterman, M. L. (2005). Markov Decision Processes, 2nd edn. John Wiley, New York.Google Scholar
Yu, J. Y. and Gittins, J. C. (2008). Models and software for improving the profitability of pharmaceutical research. Europ. J. Operat. Res. 189, 459475.Google Scholar