Vector optimization with stochastic bandit feedback
buir.contributor.author | Ararat, Çağın | |
buir.contributor.author | Tekin, Cem | |
buir.contributor.orcid | Ararat, Çağın|0000-0002-6985-7665 | |
buir.contributor.orcid | Tekin, Cem|0000-0003-4361-4021 | |
dc.citation.epage | 2190 | en_US |
dc.citation.spage | 2165 | |
dc.citation.volumeNumber | 206 | |
dc.contributor.author | Ararat, Çağın | |
dc.contributor.author | Tekin, Cem | |
dc.contributor.editor | Ruiz, F. | |
dc.contributor.editor | Dy J. | |
dc.contributor.editor | Van de Meent, J-W. | |
dc.coverage.spatial | Valencia, Spain | |
dc.date.accessioned | 2024-03-08T12:47:10Z | |
dc.date.available | 2024-03-08T12:47:10Z | |
dc.date.issued | 2023-03-07 | |
dc.department | Department of Industrial Engineering | |
dc.department | Department of Electrical and Electronics Engineering | |
dc.description | Conference Name: 26th International Conference on Artificial Intelligence and Statistics, AISTATS 2023 | |
dc.description | Date of Conference: 25 April 2023 - 27 April 2023 | |
dc.description.abstract | We introduce vector optimization problems with stochastic bandit feedback, in which preferences among designs are encoded by a polyhedral ordering cone C. Our setup generalizes the best arm identification problem to vector-valued rewards by extending the concept of Pareto set beyond multi-objective optimization. We characterize the sample complexity of (ϵ, δ)-PAC Pareto set identification by defining a new cone-dependent notion of complexity, called the ordering complexity. In particular, we provide gap-dependent and worst-case lower bounds on the sample complexity and show that, in the worst-case, the sample complexity scales with the square of ordering complexity. Furthermore, we investigate the sample complexity of the naïve elimination algorithm and prove that it nearly matches the worst-case sample complexity. Finally, we run experiments to verify our theoretical results and illustrate how C and sampling budget affect the Pareto set, the returned (ϵ, δ)-PAC Pareto set, and the success of identification. Copyright © 2023 by the author(s) | |
dc.identifier.issn | 2640-3498 | |
dc.identifier.uri | https://hdl.handle.net/11693/114415 | |
dc.language.iso | en_US | |
dc.publisher | ML Research Press | |
dc.source.title | Proceedings of Machine Learning Research | |
dc.subject | Artificial intelligence | |
dc.subject | Budget control | |
dc.subject | Stochastic systems | |
dc.title | Vector optimization with stochastic bandit feedback | |
dc.type | Conference Paper |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Vector_optimization_with_stochastic_bandit_feedback.pdf
- Size:
- 617.89 KB
- Format:
- Adobe Portable Document Format
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 2.01 KB
- Format:
- Item-specific license agreed upon to submission
- Description: