(Smooth) Fictitious-play in identical-interest stochastic games with independent continuation-payoff estimates

buir.contributor.authorSayın, Muhammed Ömer
buir.contributor.orcidSayın, Muhammed Ömer|0000-0001-5779-3986
dc.citation.epage391
dc.citation.issueNumber3
dc.citation.spage366
dc.citation.volumeNumber23
dc.contributor.authorZhang, K.Q.
dc.contributor.authorSayın, Muhammed Ömer
dc.contributor.authorOzdaglar, A.
dc.date.accessioned2025-02-28T05:27:28Z
dc.date.available2025-02-28T05:27:28Z
dc.date.issued2024
dc.departmentDepartment of Electrical and Electronics Engineering
dc.description.abstractIn this paper, we study fictitious-play-type dynamics for identical-interest stochastic games (SGs) and show their convergence to the Nash equilibrium. We develop off-policy and on-policy dynamics, and generalize these learning dynamics and convergence results to the smooth fictitious play variant when the smooth best-response is used in the updates. One key feature of our dynamics is the independent estimates of the continuation payoffs among agents. While this feature makes the dynamics more natural and uncoupled, it also leads to the challenge that the auxiliary stage games encountered during learning can become non-identical-interest anymore. We handle such a deviation from the identical-interest setting by either focusing on specific structures, e.g., the single-controller or symmetric SGs, or studying specific sublinear stepsizes to characterize the convergence rate of such a deviation as timestep evolves.
dc.identifier.doi10.30546/1683-6154.23.3.2024.366
dc.identifier.eissn1683-6154
dc.identifier.issn1683-3511
dc.identifier.urihttps://hdl.handle.net/11693/116955
dc.language.isoEnglish
dc.publisherNatural Sciences Publishing Corporation
dc.relation.isversionofhttps://dx.doi.org/10.30546/1683-6154.23.3.2024.366
dc.source.titleApplied and Computational Mathematics
dc.subjectStochastic/Markov games
dc.subjectFictitious-play
dc.subjectUncoupled learning
dc.subjectNash equilibrium
dc.subjectIndependent estimates
dc.subjectLearning dynamics
dc.title(Smooth) Fictitious-play in identical-interest stochastic games with independent continuation-payoff estimates
dc.typeArticle

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Bilkent-research-paper.pdf
Size:
145.41 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: