Combinatorial multi-armed bandit problem with probabilistically triggered arms: a case with bounded regret

Sarıtaç, A. Ömer; Tekin, Cem

Combinatorial multi-armed bandit problem with probabilistically triggered arms: a case with bounded regret

Files

Combinatorial_multi_armed_bandit_problem_with_probabilistically_triggered_arms_a_case_with_bounded_regret.pdf (342.89 KB)

Date

2017-11

Authors

Sarıtaç, A. Ömer

Tekin, Cem

BUIR Usage Stats

3
views

52
downloads

Citation Stats

Abstract

In this paper, we study the combinatorial multi-armed bandit problem (CMAB) with probabilistically triggered arms (PTAs). Under the assumption that the arm triggering probabilities (ATPs) are positive for all arms, we prove that a simple greedy policy, named greedy CMAB (G-CMAB), achieves bounded regret. This improves the result in previous work, which shows that the regret is O (log T) under no such assumption on the ATPs. Then, we numerically show that G-CMAB achieves bounded regret in a real-world movie recommendation problem, where the action corresponds to recommending a set of movies, arms correspond to the edges between movies and users, and the goal is to maximize the total number of users that are attracted by at least one movie. In addition to this problem, our results directly apply to the online influence maximization (OIM) problem studied in numerous prior works.

Source Title

2017 IEEE Global Conference on Signal and Information Processing (GlobalSIP)

Publisher

IEEE

Keywords

Bounded regret, Combinatorial multi-armed bandit, Online learning, Probabilistically triggered arms

Permalink

http://hdl.handle.net/11693/50176

Published Version (Please cite this version)

https://doi.org/10.1109/GlobalSIP.2017.8308614

Collections

Scholarly Publications - Industrial Engineering
Scholarly Publications - Electrical and Electronics Engineering

Language

English

Type

Conference Paper

Full item page

Combinatorial multi-armed bandit problem with probabilistically triggered arms: a case with bounded regret

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Collections

Language

Type

Combinatorial multi-armed bandit problem with probabilistically triggered arms: a case with bounded regret

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Share

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Collections

Language

Type