Distributed multi-agent online learning based on global feedback

Tekin, C.; Zhang, S.; Schaar, Mihaela van der

dc.citation.epage	2238	en_US
dc.citation.issueNumber	9	en_US
dc.citation.spage	2225	en_US
dc.citation.volumeNumber	63	en_US
dc.contributor.author	Tekin, C.	en_US
dc.contributor.author	Zhang, S.	en_US
dc.contributor.author	Schaar, Mihaela van der	en_US
dc.date.accessioned	2019-02-13T08:18:04Z
dc.date.available	2019-02-13T08:18:04Z
dc.date.issued	2015-05-01	en_US
dc.department	Department of Electrical and Electronics Engineering	en_US
dc.description.abstract	In this paper, we develop online learning algorithms that enable agents to cooperatively learn, without exchanging any information among themselves, how to maximize the overall reward in scenarios where only noisy global feedback is available. We prove that our algorithms' learning regrets (the losses incurred by the algorithms due to uncertainty) increase logarithmically in time, and thus the time-averaged reward converges to the optimal average reward. We also illustrate how the regret depends on the size of the action space, and we show that this relationship is influenced by the informativeness of the reward structure with regard to each agent's individual action. When the overall reward is fully informative, regret is shown to be linear in the total number of actions of all the agents. When the reward function is not informative, regret is linear in the number of joint actions. Our analytic and numerical results show that the proposed learning algorithms significantly outperform existing online learning solutions in terms of regret and learning speed. We illustrate how our theoretical framework can be used in practice by applying it to online Big Data mining using distributed classifiers.	en_US
dc.identifier.doi	10.1109/TSP.2015.2403288	en_US
dc.identifier.eissn	1941-0476
dc.identifier.issn	1053-587X
dc.identifier.uri	http://hdl.handle.net/11693/49389
dc.language.iso	English	en_US
dc.publisher	Institute of Electrical and Electronics Engineers	en_US
dc.relation.isversionof	http://doi.org/10.1109/TSP.2015.2403288	en_US
dc.source.title	IEEE Transactions on Signal Processing	en_US
dc.subject	Big data mining	en_US
dc.subject	Distributed cooperative learning	en_US
dc.subject	Multiagent learning	en_US
dc.subject	Multiarmed bandits	en_US
dc.subject	Online learning	en_US
dc.subject	Reward informativeness	en_US
dc.title	Distributed multi-agent online learning based on global feedback	en_US
dc.type	Article	en_US

Files

Original bundle

Name: Distributed_Multi_Agent_Online_Learning.pdf
Size: 2.11 MB
Format: Adobe Portable Document Format
Description: Full printable version

Collections

Scholarly Publications - Electrical and Electronics Engineering