(Smooth) Fictitious-play in identical-interest stochastic games with independent continuation-payoff estimates
buir.contributor.author | Sayın, Muhammed Ömer | |
buir.contributor.orcid | Sayın, Muhammed Ömer|0000-0001-5779-3986 | |
dc.citation.epage | 391 | |
dc.citation.issueNumber | 3 | |
dc.citation.spage | 366 | |
dc.citation.volumeNumber | 23 | |
dc.contributor.author | Zhang, K.Q. | |
dc.contributor.author | Sayın, Muhammed Ömer | |
dc.contributor.author | Ozdaglar, A. | |
dc.date.accessioned | 2025-02-28T05:27:28Z | |
dc.date.available | 2025-02-28T05:27:28Z | |
dc.date.issued | 2024 | |
dc.department | Department of Electrical and Electronics Engineering | |
dc.description.abstract | In this paper, we study fictitious-play-type dynamics for identical-interest stochastic games (SGs) and show their convergence to the Nash equilibrium. We develop off-policy and on-policy dynamics, and generalize these learning dynamics and convergence results to the smooth fictitious play variant when the smooth best-response is used in the updates. One key feature of our dynamics is the independent estimates of the continuation payoffs among agents. While this feature makes the dynamics more natural and uncoupled, it also leads to the challenge that the auxiliary stage games encountered during learning can become non-identical-interest anymore. We handle such a deviation from the identical-interest setting by either focusing on specific structures, e.g., the single-controller or symmetric SGs, or studying specific sublinear stepsizes to characterize the convergence rate of such a deviation as timestep evolves. | |
dc.identifier.doi | 10.30546/1683-6154.23.3.2024.366 | |
dc.identifier.eissn | 1683-6154 | |
dc.identifier.issn | 1683-3511 | |
dc.identifier.uri | https://hdl.handle.net/11693/116955 | |
dc.language.iso | English | |
dc.publisher | Natural Sciences Publishing Corporation | |
dc.relation.isversionof | https://dx.doi.org/10.30546/1683-6154.23.3.2024.366 | |
dc.source.title | Applied and Computational Mathematics | |
dc.subject | Stochastic/Markov games | |
dc.subject | Fictitious-play | |
dc.subject | Uncoupled learning | |
dc.subject | Nash equilibrium | |
dc.subject | Independent estimates | |
dc.subject | Learning dynamics | |
dc.title | (Smooth) Fictitious-play in identical-interest stochastic games with independent continuation-payoff estimates | |
dc.type | Article |