Online contextual influence maximization with costly observations

buir.contributor.author: Sarıtaç, Anıl Ömer
buir.contributor.author: Karakurt, Altuğ
buir.contributor.author: Tekin, Cem
dc.citation.epage: 289
dc.citation.issueNumber: 2
dc.citation.spage: 273
dc.citation.volumeNumber: 5
dc.contributor.author: Sarıtaç, Anıl Ömer
dc.contributor.author: Karakurt, Altuğ
dc.contributor.author: Tekin, Cem
dc.date.accessioned: 2020-01-29T10:11:34Z
dc.date.available: 2020-01-29T10:11:34Z
dc.date.issued: 2019-06
dc.department: Department of Electrical and Electronics Engineering
dc.department: Department of Industrial Engineering
dc.description.abstract: In the online contextual influence maximization problem with costly observations, the learner faces a series of epochs in each of which a different influence spread process takes place over a network. At the beginning of each epoch, the learner exogenously influences (activates) a set of seed nodes in the network. Then, the influence spread process takes place over the network, through which other nodes get influenced. The learner has the option to observe the spread of influence by paying an observation cost. The goal of the learner is to maximize its cumulative reward, which is defined as the expected total number of influenced nodes over all epochs minus the observation costs. We depart from the prior work in three aspects: 1) the learner does not know how the influence spreads over the network, i.e., it is unaware of the influence probabilities; 2) influence probabilities depend on the context; and 3) observing influence is costly. We consider two different influence observation settings: costly edge-level feedback, in which the learner freely observes the set of influenced nodes, but pays to observe the influence outcomes on the edges of the network; and costly node-level feedback, in which the learner pays to observe whether a node is influenced or not. Since the offline influence maximization problem itself is NP-hard, for these settings, we develop online learning algorithms that use an approximation algorithm as a subroutine to obtain the set of seed nodes in each epoch. When the influence probabilities are Hölder continuous functions of the context, we prove that these algorithms achieve sublinear regret (for any sequence of contexts) with respect to an approximation oracle that knows the influence probabilities for all contexts. Our numerical results on several networks illustrate that the proposed algorithms perform on par with the state-of-the-art methods even when the observations are cost free.
dc.identifier.doi: 10.1109/TSIPN.2018.2866334
dc.identifier.issn: 2373-776X
dc.identifier.uri: http://hdl.handle.net/11693/52895
dc.language.iso: English
dc.publisher: IEEE
dc.relation.isversionof: https://doi.org/10.1109/TSIPN.2018.2866334
dc.source.title: IEEE Transactions on Signal and Information Processing over Networks
dc.subject: Influence maximization
dc.subject: Combinatorial bandits
dc.subject: Social networks
dc.subject: Approximation algorithms
dc.subject: Costly observations
dc.subject: Regret bounds
dc.title: Online contextual influence maximization with costly observations
dc.type: Article
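
The abstract describes the learner's interaction protocol: in each epoch the learner activates a set of seed nodes, an influence spread unfolds over the network, and the reward is the number of influenced nodes minus any observation costs paid for feedback. The Python sketch below simulates a single epoch of this setup under an independent-cascade-style spread with costly edge-level feedback. It is only an illustration of the problem setting, not the authors' algorithm; the spread model, the function simulate_epoch, and all parameter names (edge_probs, cost_per_edge) are assumptions introduced here.

import random

def simulate_epoch(edge_probs, seeds, observe_edges=False, cost_per_edge=0.1):
    """Simulate one influence spread from `seeds` over a directed graph.

    edge_probs maps (u, v) -> probability that an influenced u influences v
    (in the paper these probabilities depend on the epoch's context).
    Returns (reward, observed): the number of influenced nodes minus the
    total cost paid for edge-level observations, and the observed outcomes.
    """
    influenced = set(seeds)
    frontier = list(seeds)
    observed, cost = {}, 0.0
    while frontier:
        next_frontier = []
        for u in frontier:
            for (a, b), p in edge_probs.items():
                if a != u or b in influenced:
                    continue  # only newly influenced nodes attempt activations
                success = random.random() < p
                if observe_edges:  # costly edge-level feedback
                    observed[(a, b)] = success
                    cost += cost_per_edge
                if success:
                    influenced.add(b)
                    next_frontier.append(b)
        frontier = next_frontier
    return len(influenced) - cost, observed

# Toy usage on a 4-node path graph with a single seed node.
probs = {(0, 1): 0.8, (1, 2): 0.5, (2, 3): 0.5}
reward, feedback = simulate_epoch(probs, seeds={0}, observe_edges=True)
print(reward, feedback)

In the contextual setting studied in the paper, the edge probabilities would instead be unknown functions of the epoch's context, which the learner must estimate online while balancing influence spread against observation costs.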

Files

Original bundle

Name: Online_Contextual_Influence_Maximization_with_Costly_Observations.pdf
Size: 1.01 MB
Format: Adobe Portable Document Format