Fictitious play in zero-sum stochastic games

Sayin, Muhammed O.; Parise, Francesca; Ozdaglar, Asuman

Fictitious play in zero-sum stochastic games

buir.contributor.author	Sayin, Muhammed O.
buir.contributor.orcid	Sayin, Muhammed O. \| 0000-0003-2661-0918
dc.citation.epage	2114	en_US
dc.citation.issueNumber	4	en_US
dc.citation.spage	2095	en_US
dc.citation.volumeNumber	60	en_US
dc.contributor.author	Sayin, Muhammed O.
dc.contributor.author	Parise, Francesca
dc.contributor.author	Ozdaglar, Asuman
dc.date.accessioned	2023-02-23T08:56:52Z
dc.date.available	2023-02-23T08:56:52Z
dc.date.issued	2022
dc.department	Department of Electrical and Electronics Engineering	en_US
dc.description.abstract	We present a novel variant of fictitious play dynamics combining classical fictitiousplay with Q-learning for stochastic games and analyze its convergence properties in two-player zero-sum stochastic games. Our dynamics involves players forming beliefs on the opponent strategyand their own continuation payoff (Q-function), and playing a greedy best response by using theestimated continuation payoffs. Players update their beliefs from observations of opponent actions.A key property of the learning dynamics is that update of the beliefs onQ-functions occurs at aslower timescale than update of the beliefs on strategies. We show that in both the model-based andmodel-free cases (without knowledge of player payoff functions and state transition probabilities),the beliefs on strategies converge to a stationary mixed Nash equilibrium of the zero-sum stochasticgame.	en_US
dc.identifier.doi	10.1137/21M1426675	en_US
dc.identifier.eissn	1095-7138
dc.identifier.issn	0363-0129
dc.identifier.uri	http://hdl.handle.net/11693/111622
dc.language.iso	English	en_US
dc.publisher	Society for Industrial and Applied Mathematics	en_US
dc.relation.isversionof	https://www.doi.org/10.1137/21M1426675	en_US
dc.source.title	SIAM Journal on Control and Optimization	en_US
dc.subject	Stochastic games	en_US
dc.subject	fictitious play	en_US
dc.subject	Q-learning	en_US
dc.subject	two-timescale learning	en_US
dc.title	Fictitious play in zero-sum stochastic games	en_US
dc.type	Article	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Fictitious_play_in_zero-sum_stochastic_games.pdf
Size:: 475.52 KB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.69 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Scholarly Publications - Electrical and Electronics Engineering