Minimax optimal algorithms for adversarial bandit problem with multiple plays
buir.contributor.author | Vural, Nuri Mert | |
buir.contributor.author | Gökçesu, Hakan | |
buir.contributor.author | Kozat, Süleyman Serdar | |
dc.citation.epage | 4398 | en_US |
dc.citation.issueNumber | 16 | en_US |
dc.citation.spage | 4383 | en_US |
dc.citation.volumeNumber | 67 | en_US |
dc.contributor.author | Vural, Nuri Mert | |
dc.contributor.author | Gökçesu, Hakan | |
dc.contributor.author | Gökçesu, K. | |
dc.contributor.author | Kozat, Süleyman Serdar | |
dc.date.accessioned | 2021-03-17T12:05:56Z | |
dc.date.available | 2021-03-17T12:05:56Z | |
dc.date.issued | 2019 | |
dc.department | Department of Electrical and Electronics Engineering | en_US |
dc.description.abstract | We investigate the adversarial bandit problem with multiple plays under semi-bandit feedback. We introduce a highly efficient algorithm that asymptotically achieves the performance of the best switching m-arm strategy with minimax optimal regret bounds. To construct our algorithm, we introduce a new expert advice algorithm for the multiple-play setting. By using our expert advice algorithm, we additionally improve the best-known high-probability bound for the multi-play setting by O(√(m)). Our results are guaranteed to hold in an individual sequence manner since we have no statistical assumption on the bandit arm gains. Through an extensive set of experiments involving synthetic and real data, we demonstrate significant performance gains achieved by the proposed algorithm with respect to the state-of-the-art algorithms. | en_US |
dc.description.provenance | Submitted by Onur Emek (onur.emek@bilkent.edu.tr) on 2021-03-17T12:05:56Z No. of bitstreams: 1 Minimax_Optimal_Algorithms_for_Adversarial_Bandit_Problem_With_Multiple_Plays.pdf: 1021452 bytes, checksum: 213618bafbf2ced0516a8b3f17332249 (MD5) | en |
dc.description.provenance | Made available in DSpace on 2021-03-17T12:05:56Z (GMT). No. of bitstreams: 1 Minimax_Optimal_Algorithms_for_Adversarial_Bandit_Problem_With_Multiple_Plays.pdf: 1021452 bytes, checksum: 213618bafbf2ced0516a8b3f17332249 (MD5) Previous issue date: 2019 | en |
dc.identifier.doi | 10.1109/TSP.2019.2928952 | en_US |
dc.identifier.issn | 1053-587X | |
dc.identifier.uri | http://hdl.handle.net/11693/75951 | |
dc.language.iso | English | en_US |
dc.publisher | IEEE | en_US |
dc.relation.isversionof | https://dx.doi.org/10.1109/TSP.2019.2928952 | en_US |
dc.source.title | IEEE Transactions on Signal Processing | en_US |
dc.subject | Adversarial multi-armed bandit | en_US |
dc.subject | Multiple plays | en_US |
dc.subject | Switching bandit | en_US |
dc.subject | Minimax optimal | en_US |
dc.subject | Individual sequence manner | en_US |
dc.title | Minimax optimal algorithms for adversarial bandit problem with multiple plays | en_US |
dc.type | Article | en_US |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Minimax_Optimal_Algorithms_for_Adversarial_Bandit_Problem_With_Multiple_Plays.pdf
- Size:
- 997.51 KB
- Format:
- Adobe Portable Document Format
- Description:
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 1.71 KB
- Format:
- Item-specific license agreed upon to submission
- Description: