Strategizing against q-learners: a control-theoretical approach

buir.contributor.authorArslantaş, Yüksel
buir.contributor.authorYüceel, Ege
buir.contributor.authorSayın, Muhammed O.
buir.contributor.orcidArslantaş, Yüksel|0009-0007-7516-4850
buir.contributor.orcidYüceel, Ege|0009-0008-6765-3598
buir.contributor.orcidSayın, Muhammed O.|0000-0001-5779-3986
dc.citation.epage1738
dc.citation.spage1733
dc.citation.volumeNumber8
dc.contributor.authorArslantaş, Yüksel
dc.contributor.authorYüceel, Ege
dc.contributor.authorSayın, Muhammed O.
dc.date.accessioned2025-02-18T13:04:38Z
dc.date.available2025-02-18T13:04:38Z
dc.date.issued2024-06-18
dc.departmentDepartment of Electrical and Electronics Engineering
dc.description.abstractIn this letter, we explore the susceptibility of the independent Q-learning algorithms (a classical and widely used multi-agent reinforcement learning method) to strategic manipulation of sophisticated opponents in normal-form games played repeatedly. We quantify how much strategically sophisticated agents can exploit naive Q-learners if they know the opponents' Q-learning algorithm. To this end, we formulate the strategic actors' interactions as a stochastic game (whose state encompasses Q-function estimates of the Q-learners) as if the Q-learning algorithms are the underlying dynamical system. We also present a quantization-based approximation scheme to tackle the continuum state space and analyze its performance for two competing strategic actors and a single strategic actor both analytically and numerically.
dc.identifier.doi10.1109/LCSYS.2024.3416240
dc.identifier.eissn2475-1456
dc.identifier.urihttps://hdl.handle.net/11693/116383
dc.language.isoEnglish
dc.publisherInstitute of Electrical and Electronics Engineers
dc.relation.isversionofhttps://dx.doi.org/10.1109/LCSYS.2024.3416240
dc.rightsCC BY 4.0 DEED (Attribution 4.0 International)
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/deed.tr
dc.source.titleIEEE Control Systems Letters
dc.subjectReinforcement learning
dc.subjectGame theory
dc.subjectMarkov processes
dc.titleStrategizing against q-learners: a control-theoretical approach
dc.typeArticle

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Strategizing_against_q-learners_a_control-theoretical_approach.pdf
Size:
2.34 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: