Distributed multi-agent online learning based on global feedback
dc.citation.epage | 2238 | en_US |
dc.citation.issueNumber | 9 | en_US |
dc.citation.spage | 2225 | en_US |
dc.citation.volumeNumber | 63 | en_US |
dc.contributor.author | Tekin, C. | en_US |
dc.contributor.author | Zhang, S. | en_US |
dc.contributor.author | Schaar, Mihaela van der | en_US |
dc.date.accessioned | 2019-02-13T08:18:04Z | |
dc.date.available | 2019-02-13T08:18:04Z | |
dc.date.issued | 2015-05-01 | en_US |
dc.department | Department of Electrical and Electronics Engineering | en_US |
dc.description.abstract | Abstract—In this paper, we develop online learning algorithms that enable the agents to cooperatively learn how to maximize the overall reward in scenarios where only noisy global feedback is available without exchanging any information among themselves. We prove that our algorithms' learning regrets—the losses incurred by the algorithms due to uncertainty—are logarithmically increasing in time and thus the time average reward converges to the optimal average reward. Moreover, we also illustrate how the regret depends on the size of the action space, and we show that this relationship is influenced by the informativeness of the reward structure with regard to each agent's individual action. When the overall reward is fully informative, regret is shown to be linear in the total number of actions of all the agents. When the reward function is not informative, regret is linear in the number of joint actions. Our analytic and numerical results show that the proposed learning algorithms significantly outperform existing online learning solutions in terms of regret and learning speed. We illustrate how our theoretical framework can be used in practice by applying it to online Big Data mining using distributed classifiers. | en_US |
dc.description.provenance | Submitted by Betül Özen (ozen@bilkent.edu.tr) on 2019-02-13T08:18:04Z No. of bitstreams: 1 Distributed_Multi_Agent_Online_Learning.pdf: 2217111 bytes, checksum: e07098d0e005d9b60695e8fce9575068 (MD5) | en |
dc.description.provenance | Made available in DSpace on 2019-02-13T08:18:04Z (GMT). No. of bitstreams: 1 Distributed_Multi_Agent_Online_Learning.pdf: 2217111 bytes, checksum: e07098d0e005d9b60695e8fce9575068 (MD5) Previous issue date: 2015-05-01 | en |
dc.identifier.doi | 10.1109/TSP.2015.2403288 | en_US |
dc.identifier.eissn | 1941-0476 | |
dc.identifier.issn | 1053-587X | |
dc.identifier.uri | http://hdl.handle.net/11693/49389 | |
dc.language.iso | English | en_US |
dc.publisher | Institute of Electrical and Electronics Engineers | en_US |
dc.relation.isversionof | http://doi.org/10.1109/TSP.2015.2403288 | en_US |
dc.source.title | IEEE Transactions on Signal Processing | en_US |
dc.subject | Big data mining | en_US |
dc.subject | Distributed cooperative learning | en_US |
dc.subject | Multiagent learning | en_US |
dc.subject | Multiarmed bandits | en_US |
dc.subject | Online learning | en_US |
dc.subject | Reward informativeness | en_US |
dc.title | Distributed multi-agent online learning based on global feedback | en_US |
dc.type | Article | en_US |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Distributed_Multi_Agent_Online_Learning.pdf
- Size:
- 2.11 MB
- Format:
- Adobe Portable Document Format
- Description:
- Full printable version
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 1.71 KB
- Format:
- Item-specific license agreed upon to submission
- Description: