Crossref Citations
This article has been cited by the following publications. This list is generated based on data provided by
Crossref.
Andradóttir, Sigrún
1996.
A Global Search Method for Discrete Stochastic Optimization.
SIAM Journal on Optimization,
Vol. 6,
Issue. 2,
p.
513.
Auer, P.
2000.
Using upper confidence bounds for online learning.
p.
270.
Cesa-Bianchi, Nicolò
2002.
MULTIARMED BANDITS IN THE WORST CASE.
IFAC Proceedings Volumes,
Vol. 35,
Issue. 1,
p.
91.
Chang, Hyeong Soo
Fu, Michael C.
Hu, Jiaqiao
and
Marcus, Steven I.
2005.
An Adaptive Sampling Algorithm for Solving Markov Decision Processes.
Operations Research,
Vol. 53,
Issue. 1,
p.
126.
Andradóttir, Sigrún
2006.
Simulation optimization with countably infinite feasible regions.
ACM Transactions on Modeling and Computer Simulation,
Vol. 16,
Issue. 4,
p.
357.
Pandey, Sandeep
Chakrabarti, Deepayan
and
Agarwal, Deepak
2007.
Multi-armed bandit problems with dependent arms.
p.
721.
Audibert, Jean-Yves
Munos, Rémi
and
Szepesvári, Csaba
2007.
Algorithmic Learning Theory.
Vol. 4754,
Issue. ,
p.
150.
Alaya-Feki, Afef Ben Hadj
Moulines, Eric
and
LeCornec, Alain
2008.
Dynamic spectrum access with non-stationary Multi-Armed Bandit.
p.
416.
Jouini, Wassim
Ernst, Damien
Moy, Christophe
and
Palicot, Jacques
2009.
Multi-armed bandit based policies for cognitive radio's decision making issues.
p.
1.
Mersereau, A.J.
Rusmevichientong, P.
and
Tsitsiklis, J.N.
2009.
A Structured Multiarmed Bandit Problem and the Greedy Policy.
IEEE Transactions on Automatic Control,
Vol. 54,
Issue. 12,
p.
2787.
Audibert, Jean-Yves
Munos, Rémi
and
Szepesvári, Csaba
2009.
Exploration–exploitation tradeoff using variance estimates in multi-armed bandits.
Theoretical Computer Science,
Vol. 410,
Issue. 19,
p.
1876.
Liu, Keqin
and
Zhao, Qing
2010.
Decentralized multi-armed bandit with multiple distributed players.
p.
1.
Anandkumar, Animashree
Michael, Nithin
and
Tang, Ao
2010.
Opportunistic Spectrum Access with Multiple Users: Learning under Competition.
p.
1.
Scott, Steven L.
2010.
A modern Bayesian look at the multi‐armed bandit.
Applied Stochastic Models in Business and Industry,
Vol. 26,
Issue. 6,
p.
639.
Hussain, Zakria
Leung, Alex P.
Pasupa, Kitsuchart
Hardoon, David R.
Auer, Peter
and
Shawe-Taylor, John
2010.
Machine Learning and Knowledge Discovery in Databases.
Vol. 6321,
Issue. ,
p.
554.
Kim, Song-Ju
Aono, Masashi
and
Hara, Masahiko
2010.
Unconventional Computation.
Vol. 6079,
Issue. ,
p.
69.
Rusmevichientong, Paat
and
Tsitsiklis, John N.
2010.
Linearly Parameterized Bandits.
Mathematics of Operations Research,
Vol. 35,
Issue. 2,
p.
395.
Gai, Yi
Krishnamachari, Bhaskar
and
Jain, Rahul
2010.
Learning Multiuser Channel Allocations in Cognitive Radio Networks: A Combinatorial Multi-Armed Bandit Formulation.
p.
1.
Auer, Peter
and
Ortner, Ronald
2010.
UCB revisited: Improved regret bounds for the stochastic multi-armed bandit problem.
Periodica Mathematica Hungarica,
Vol. 61,
Issue. 1-2,
p.
55.
Liu, Keqin
and
Zhao, Qing
2010.
Distributed Learning in Multi-Armed Bandit With Multiple Players.
IEEE Transactions on Signal Processing,
Vol. 58,
Issue. 11,
p.
5667.