Learning the pareto set under incomplete preferences: pure exploration in vector bandits

buir.contributor.authorKaragözlü, Efe Mert
buir.contributor.authorYıldırım, Yaşar Cahit
buir.contributor.authorArarat, Çagın
buir.contributor.authorTekin, Cem
buir.contributor.orcidTekin, Cem|0000-0003-4361-4021
dc.citation.volumeNumber238
dc.contributor.authorKaragözlü, Efe Mert
dc.contributor.authorYıldırım, Yaşar Cahit
dc.contributor.authorArarat, Çagın
dc.contributor.authorTekin, Cem
dc.contributor.editorDasgupta, S
dc.contributor.editorMandt, S
dc.contributor.editorLi, Y
dc.coverage.spatialValencia, Spain
dc.date.accessioned2025-02-28T13:54:00Z
dc.date.available2025-02-28T13:54:00Z
dc.date.issued2024-11-26
dc.departmentDepartment of Industrial Engineering
dc.departmentDepartment of Electrical and Electronics Engineering
dc.descriptionConference Name: 27th International Conference on Artificial Intelligence and Statistics (AISTATS)
dc.descriptionDate of Conference: May 02-04, 2024
dc.description.abstractWe study pure exploration in bandit problems with vector-valued rewards, where the goal is to (approximately) identify the Pareto set of arms given incomplete preferences induced by a polyhedral convex cone. We address the open problem of designing sampleefficient learning algorithms for such problems. We propose Pareto Vector Bandits (PaVeBa), an adaptive elimination algorithm that nearly matches the gap-dependent and worst-case lower bounds on the sample complexity of (., d)-PAC Pareto set identification. Finally, we provide an in-depth numerical investigation of PaVeBa and its heuristic vari-ants by comparing them with the state-of-the-art multi-objective and vector optimization algorithms on several real-world datasets with conflicting objectives.
dc.identifier.issn2640-3498
dc.identifier.urihttps://hdl.handle.net/11693/117032
dc.language.isoEnglish
dc.relation.ispartofseriesProceedings of Machine Learning Research
dc.source.titleInternational conference on artificial intelligence and statistics
dc.subjectOptimization
dc.subjectDesign
dc.titleLearning the pareto set under incomplete preferences: pure exploration in vector bandits
dc.typeConference Paper

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Learning_the_Pareto_Set_Under_Incomplete_Preferences_Pure_Exploration_in_Vector_Bandits.pdf
Size:
5.77 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: