Fictitious play in Markov games with single controller

buir.contributor.authorSayın, Muhammed Ömer
dc.citation.epage936en_US
dc.citation.spage919en_US
dc.contributor.authorSayın, Muhammed Ömer
dc.contributor.authorZhang, Kaiqing
dc.contributor.authorOzdaglar, Asuman
dc.coverage.spatialBoulder, USAen_US
dc.date.accessioned2023-02-26T10:45:29Z
dc.date.available2023-02-26T10:45:29Z
dc.date.issued2022-07-12
dc.departmentDepartment of Electrical and Electronics Engineeringen_US
dc.descriptionConference Name: 23rd ACM Conference on Economics and Computation, EC 2022en_US
dc.descriptionDate of Conference: 11-15 July 2022en_US
dc.description.abstractCertain but important classes of strategic-form games, including zero-sum and identical-interest games, have thefictitious-play-property (FPP), i.e., beliefs formed in fictitious play dynamics always converge to a Nash equilibrium (NE) in the repeated play of these games. Such convergence results are seen as a (behavioral) justification for the game-theoretical equilibrium analysis. Markov games (MGs), also known as stochastic games, generalize the repeated play of strategic-form games to dynamic multi-state settings with Markovian state transitions. In particular, MGs are standard models for multi-agent reinforcement learning - a reviving research area in learning and games, and their game-theoretical equilibrium analyses have also been conducted extensively. However, whether certain classes of MGs have the FPP or not (i.e., whether there is a behavioral justification for equilibrium analysis or not) remains largely elusive. In this paper, we study a new variant of fictitious play dynamics for MGs and show its convergence to an NE in n-player identical-interest MGs in which a single player controls the state transitions. Such games are of interest in communications, control, and economics applications. Our result together with the recent results in [42] establishes the FPP of two-player zero-sum MGs and n-player identical-interest MGs with a single controller (standing at two different ends of the MG spectrum from fully competitive to fully cooperative). © 2022 Owner/Author.en_US
dc.identifier.doi10.1145/3490486.3538289en_US
dc.identifier.isbn978145039150-4
dc.identifier.urihttp://hdl.handle.net/11693/111763
dc.language.isoEnglishen_US
dc.publisherAssociation for Computing Machinery, Incen_US
dc.relation.isversionofhttps://dx.doi.org/10.1145/3490486.3538289en_US
dc.source.titleProceedings of the 23rd ACM Conference on Economics and Computationen_US
dc.subjectFictitious playen_US
dc.subjectIdentical-interest gamesen_US
dc.subjectMarkov gamesen_US
dc.subjectZero-sum gamesen_US
dc.titleFictitious play in Markov games with single controlleren_US
dc.typeConference Paperen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Fictitious_Play_in_Markov_Games_with_Single_Controller.pdf
Size:
1.17 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.69 KB
Format:
Item-specific license agreed upon to submission
Description: