Learning the pareto set under incomplete preferences: pure exploration in vector bandits
buir.contributor.author | Karagözlü, Efe Mert | |
buir.contributor.author | Yıldırım, Yaşar Cahit | |
buir.contributor.author | Ararat, Çagın | |
buir.contributor.author | Tekin, Cem | |
buir.contributor.orcid | Tekin, Cem|0000-0003-4361-4021 | |
dc.citation.volumeNumber | 238 | |
dc.contributor.author | Karagözlü, Efe Mert | |
dc.contributor.author | Yıldırım, Yaşar Cahit | |
dc.contributor.author | Ararat, Çagın | |
dc.contributor.author | Tekin, Cem | |
dc.contributor.editor | Dasgupta, S | |
dc.contributor.editor | Mandt, S | |
dc.contributor.editor | Li, Y | |
dc.coverage.spatial | Valencia, Spain | |
dc.date.accessioned | 2025-02-28T13:54:00Z | |
dc.date.available | 2025-02-28T13:54:00Z | |
dc.date.issued | 2024-11-26 | |
dc.department | Department of Industrial Engineering | |
dc.department | Department of Electrical and Electronics Engineering | |
dc.description | Conference Name: 27th International Conference on Artificial Intelligence and Statistics (AISTATS) | |
dc.description | Date of Conference: May 02-04, 2024 | |
dc.description.abstract | We study pure exploration in bandit problems with vector-valued rewards, where the goal is to (approximately) identify the Pareto set of arms given incomplete preferences induced by a polyhedral convex cone. We address the open problem of designing sampleefficient learning algorithms for such problems. We propose Pareto Vector Bandits (PaVeBa), an adaptive elimination algorithm that nearly matches the gap-dependent and worst-case lower bounds on the sample complexity of (., d)-PAC Pareto set identification. Finally, we provide an in-depth numerical investigation of PaVeBa and its heuristic vari-ants by comparing them with the state-of-the-art multi-objective and vector optimization algorithms on several real-world datasets with conflicting objectives. | |
dc.identifier.issn | 2640-3498 | |
dc.identifier.uri | https://hdl.handle.net/11693/117032 | |
dc.language.iso | English | |
dc.relation.ispartofseries | Proceedings of Machine Learning Research | |
dc.source.title | International conference on artificial intelligence and statistics | |
dc.subject | Optimization | |
dc.subject | Design | |
dc.title | Learning the pareto set under incomplete preferences: pure exploration in vector bandits | |
dc.type | Conference Paper |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Learning_the_Pareto_Set_Under_Incomplete_Preferences_Pure_Exploration_in_Vector_Bandits.pdf
- Size:
- 5.77 MB
- Format:
- Adobe Portable Document Format
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 1.71 KB
- Format:
- Item-specific license agreed upon to submission
- Description: