Combinatorial multi-armed bandit problem with probabilistically triggered arms: a case with bounded regret

Sarıtaç, A. Ömer; Tekin, Cem

dc.citation.epage	115	en_US
dc.citation.spage	111	en_US
dc.contributor.author	Sarıtaç, A. Ömer	en_US
dc.contributor.author	Tekin, Cem	en_US
dc.coverage.spatial	Montreal, QC, Canada
dc.date.accessioned	2019-02-21T16:04:18Z
dc.date.available	2019-02-21T16:04:18Z
dc.date.issued	2017-11	en_US
dc.department	Department of Industrial Engineering	en_US
dc.department	Department of Electrical and Electronics Engineering	en_US
dc.description	Date of Conference: 14-16 Nov. 2017
dc.description	Conference name: 2017 IEEE Global Conference on Signal and Information Processing (GlobalSIP)
dc.description.abstract	In this paper, we study the combinatorial multi-armed bandit (CMAB) problem with probabilistically triggered arms (PTAs). Under the assumption that the arm triggering probabilities (ATPs) are positive for all arms, we prove that a simple greedy policy, named greedy CMAB (G-CMAB), achieves bounded regret. This improves on the result in previous work, which shows that the regret is O(log T) without any such assumption on the ATPs. We then show numerically that G-CMAB achieves bounded regret in a real-world movie recommendation problem, where an action corresponds to recommending a set of movies, arms correspond to the edges between movies and users, and the goal is to maximize the total number of users attracted by at least one movie. In addition to this problem, our results apply directly to the online influence maximization (OIM) problem studied in numerous prior works.
dc.identifier.doi	10.1109/GlobalSIP.2017.8308614
dc.identifier.uri	http://hdl.handle.net/11693/50176
dc.language.iso	English
dc.publisher	IEEE
dc.relation.isversionof	https://doi.org/10.1109/GlobalSIP.2017.8308614
dc.source.title	2017 IEEE Global Conference on Signal and Information Processing (GlobalSIP)	en_US
dc.subject	Bounded regret	en_US
dc.subject	Combinatorial multi-armed bandit	en_US
dc.subject	Online learning	en_US
dc.subject	Probabilistically triggered arms	en_US
dc.title	Combinatorial multi-armed bandit problem with probabilistically triggered arms: a case with bounded regret	en_US
dc.type	Conference Paper	en_US
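
The greedy idea described in the abstract can be illustrated with a small simulation. This is a minimal sketch under stated assumptions, not the authors' G-CMAB implementation: the function name `greedy_cmab`, the linear reward (sum of the means of the chosen arms), the top-k oracle, and the single trigger probability `p_trigger` for unchosen arms are all hypothetical choices made for illustration. It only shows how a purely greedy policy can keep learning about every arm when each arm has a positive probability of being triggered.

```python
import random


def greedy_cmab(means, k, p_trigger, horizon, seed=0):
    """Toy greedy policy for a combinatorial bandit with triggered arms.

    Assumptions (illustrative, not taken from the paper):
    - arms are Bernoulli with the given means,
    - an action is a set of k arms and its expected reward is the
      sum of the means of those arms,
    - chosen arms are always triggered, every other arm is triggered
      independently with probability p_trigger > 0.
    """
    rng = random.Random(seed)
    m = len(means)
    counts = [0] * m          # number of observations per arm
    estimates = [0.0] * m     # empirical mean per arm
    pulls = []                # action played in each round

    for _ in range(horizon):
        # Greedy step: play the best action under the current estimates.
        # For a linear reward the exact oracle is "top-k by estimate".
        action = sorted(range(m), key=lambda i: estimates[i], reverse=True)[:k]
        pulls.append(action)

        for i in range(m):
            triggered = i in action or rng.random() < p_trigger
            if triggered:
                outcome = 1.0 if rng.random() < means[i] else 0.0
                counts[i] += 1
                # Incremental update of the empirical mean.
                estimates[i] += (outcome - estimates[i]) / counts[i]

    return estimates, counts, pulls


if __name__ == "__main__":
    est, cnt, pulls = greedy_cmab(
        [0.9, 0.8, 0.2, 0.1, 0.3], k=2, p_trigger=0.2, horizon=5000, seed=1
    )
    print("estimates:", [round(e, 2) for e in est])
    print("observations per arm:", cnt)
    print("last action:", sorted(pulls[-1]))
```

Because every arm is observed with positive probability in each round, the estimates of unchosen arms keep improving without any explicit exploration term, which is the intuition behind dropping the exploration bonus in this setting.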

Files

Original bundle

Name: Combinatorial_multi_armed_bandit_problem_with_probabilistically_triggered_arms_a_case_with_bounded_regret.pdf
Size: 342.89 KB
Format: Adobe Portable Document Format
Description: Full printable version

Collections

Scholarly Publications - Industrial Engineering
Scholarly Publications - Electrical and Electronics Engineering