(Smooth) Fictitious-play in identical-interest stochastic games with independent continuation-payoff estimates

Zhang, K.Q.; Sayın, Muhammed Ömer; Ozdaglar, A.

(Smooth) Fictitious-play in identical-interest stochastic games with independent continuation-payoff estimates

buir.contributor.author	Sayın, Muhammed Ömer
buir.contributor.orcid	Sayın, Muhammed Ömer\|0000-0001-5779-3986
dc.citation.epage	391
dc.citation.issueNumber	3
dc.citation.spage	366
dc.citation.volumeNumber	23
dc.contributor.author	Zhang, K.Q.
dc.contributor.author	Sayın, Muhammed Ömer
dc.contributor.author	Ozdaglar, A.
dc.date.accessioned	2025-02-28T05:27:28Z
dc.date.available	2025-02-28T05:27:28Z
dc.date.issued	2024
dc.department	Department of Electrical and Electronics Engineering
dc.description.abstract	In this paper, we study fictitious-play-type dynamics for identical-interest stochastic games (SGs) and show their convergence to the Nash equilibrium. We develop off-policy and on-policy dynamics, and generalize these learning dynamics and convergence results to the smooth fictitious play variant when the smooth best-response is used in the updates. One key feature of our dynamics is the independent estimates of the continuation payoffs among agents. While this feature makes the dynamics more natural and uncoupled, it also leads to the challenge that the auxiliary stage games encountered during learning can become non-identical-interest anymore. We handle such a deviation from the identical-interest setting by either focusing on specific structures, e.g., the single-controller or symmetric SGs, or studying specific sublinear stepsizes to characterize the convergence rate of such a deviation as timestep evolves.
dc.identifier.doi	10.30546/1683-6154.23.3.2024.366
dc.identifier.eissn	1683-6154
dc.identifier.issn	1683-3511
dc.identifier.uri	https://hdl.handle.net/11693/116955
dc.language.iso	English
dc.publisher	Natural Sciences Publishing Corporation
dc.relation.isversionof	https://dx.doi.org/10.30546/1683-6154.23.3.2024.366
dc.source.title	Applied and Computational Mathematics
dc.subject	Stochastic/Markov games
dc.subject	Fictitious-play
dc.subject	Uncoupled learning
dc.subject	Nash equilibrium
dc.subject	Independent estimates
dc.subject	Learning dynamics
dc.title	(Smooth) Fictitious-play in identical-interest stochastic games with independent continuation-payoff estimates
dc.type	Article

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Bilkent-research-paper.pdf
Size:: 145.41 KB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Scholarly Publications - Electrical and Electronics Engineering