Learning the pareto set under incomplete preferences: pure exploration in vector bandits

Karagözlü, Efe Mert; Yıldırım, Yaşar Cahit; Ararat, Çagın; Tekin, Cem

Learning the pareto set under incomplete preferences: pure exploration in vector bandits

buir.contributor.author	Karagözlü, Efe Mert
buir.contributor.author	Yıldırım, Yaşar Cahit
buir.contributor.author	Ararat, Çagın
buir.contributor.author	Tekin, Cem
buir.contributor.orcid	Tekin, Cem\|0000-0003-4361-4021
dc.citation.volumeNumber	238
dc.contributor.author	Karagözlü, Efe Mert
dc.contributor.author	Yıldırım, Yaşar Cahit
dc.contributor.author	Ararat, Çagın
dc.contributor.author	Tekin, Cem
dc.contributor.editor	Dasgupta, S
dc.contributor.editor	Mandt, S
dc.contributor.editor	Li, Y
dc.coverage.spatial	Valencia, Spain
dc.date.accessioned	2025-02-28T13:54:00Z
dc.date.available	2025-02-28T13:54:00Z
dc.date.issued	2024-11-26
dc.department	Department of Industrial Engineering
dc.department	Department of Electrical and Electronics Engineering
dc.description	Conference Name: 27th International Conference on Artificial Intelligence and Statistics (AISTATS)
dc.description	Date of Conference: May 02-04, 2024
dc.description.abstract	We study pure exploration in bandit problems with vector-valued rewards, where the goal is to (approximately) identify the Pareto set of arms given incomplete preferences induced by a polyhedral convex cone. We address the open problem of designing sampleefficient learning algorithms for such problems. We propose Pareto Vector Bandits (PaVeBa), an adaptive elimination algorithm that nearly matches the gap-dependent and worst-case lower bounds on the sample complexity of (., d)-PAC Pareto set identification. Finally, we provide an in-depth numerical investigation of PaVeBa and its heuristic vari-ants by comparing them with the state-of-the-art multi-objective and vector optimization algorithms on several real-world datasets with conflicting objectives.
dc.identifier.issn	2640-3498
dc.identifier.uri	https://hdl.handle.net/11693/117032
dc.language.iso	English
dc.relation.ispartofseries	Proceedings of Machine Learning Research
dc.source.title	International conference on artificial intelligence and statistics
dc.subject	Optimization
dc.subject	Design
dc.title	Learning the pareto set under incomplete preferences: pure exploration in vector bandits
dc.type	Conference Paper

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Learning_the_Pareto_Set_Under_Incomplete_Preferences_Pure_Exploration_in_Vector_Bandits.pdf
Size:: 5.77 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Scholarly Publications - Industrial Engineering
Scholarly Publications - Electrical and Electronics Engineering