(Smooth) Fictitious-play in identical-interest stochastic games with independent continuation-payoff estimates

Zhang, K.Q.; Sayın, Muhammed Ömer; Ozdaglar, A.

(Smooth) Fictitious-play in identical-interest stochastic games with independent continuation-payoff estimates

Files

Bilkent-research-paper.pdf (145.41 KB)

Date

2024

Authors

Zhang, K.Q.

Sayın, Muhammed Ömer

Ozdaglar, A.

BUIR Usage Stats

27
views

271
downloads

Citation Stats

Abstract

In this paper, we study fictitious-play-type dynamics for identical-interest stochastic games (SGs) and show their convergence to the Nash equilibrium. We develop off-policy and on-policy dynamics, and generalize these learning dynamics and convergence results to the smooth fictitious play variant when the smooth best-response is used in the updates. One key feature of our dynamics is the independent estimates of the continuation payoffs among agents. While this feature makes the dynamics more natural and uncoupled, it also leads to the challenge that the auxiliary stage games encountered during learning can become non-identical-interest anymore. We handle such a deviation from the identical-interest setting by either focusing on specific structures, e.g., the single-controller or symmetric SGs, or studying specific sublinear stepsizes to characterize the convergence rate of such a deviation as timestep evolves.

Source Title

Applied and Computational Mathematics

Publisher

Natural Sciences Publishing Corporation

Keywords

Stochastic/Markov games, Fictitious-play, Uncoupled learning, Nash equilibrium, Independent estimates, Learning dynamics

Permalink

https://hdl.handle.net/11693/116955

Published Version (Please cite this version)

https://dx.doi.org/10.30546/1683-6154.23.3.2024.366

Collections

Scholarly Publications - Electrical and Electronics Engineering

Language

English

Type

Article

Full item page

(Smooth) Fictitious-play in identical-interest stochastic games with independent continuation-payoff estimates

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Collections

Language

Type

(Smooth) Fictitious-play in identical-interest stochastic games with independent continuation-payoff estimates

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Share

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Collections

Language

Type