Minimax optimal algorithms for adversarial bandit problem with multiple plays

Vural, Nuri Mert; Gökçesu, Hakan; Gökçesu, K.; Kozat, Süleyman Serdar

Minimax optimal algorithms for adversarial bandit problem with multiple plays

Files

Minimax_Optimal_Algorithms_for_Adversarial_Bandit_Problem_With_Multiple_Plays.pdf (997.51 KB)

Date

2019

Authors

Vural, Nuri Mert

Gökçesu, Hakan

Gökçesu, K.

Kozat, Süleyman Serdar

BUIR Usage Stats

2
views

40
downloads

Citation Stats

Abstract

We investigate the adversarial bandit problem with multiple plays under semi-bandit feedback. We introduce a highly efficient algorithm that asymptotically achieves the performance of the best switching m-arm strategy with minimax optimal regret bounds. To construct our algorithm, we introduce a new expert advice algorithm for the multiple-play setting. By using our expert advice algorithm, we additionally improve the best-known high-probability bound for the multi-play setting by O(√(m)). Our results are guaranteed to hold in an individual sequence manner since we have no statistical assumption on the bandit arm gains. Through an extensive set of experiments involving synthetic and real data, we demonstrate significant performance gains achieved by the proposed algorithm with respect to the state-of-the-art algorithms.

Source Title

IEEE Transactions on Signal Processing

Publisher

IEEE

Keywords

Adversarial multi-armed bandit, Multiple plays, Switching bandit, Minimax optimal, Individual sequence manner

Permalink

http://hdl.handle.net/11693/75951

Published Version (Please cite this version)

https://dx.doi.org/10.1109/TSP.2019.2928952

Collections

Scholarly Publications - Electrical and Electronics Engineering

Language

English

Type

Article

Full item page

Minimax optimal algorithms for adversarial bandit problem with multiple plays

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Collections

Language

Type

Minimax optimal algorithms for adversarial bandit problem with multiple plays

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Share

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Collections

Language

Type