Fictitious play in zero-sum stochastic games
buir.contributor.author | Sayin, Muhammed O. | |
buir.contributor.orcid | Sayin, Muhammed O. | 0000-0003-2661-0918 | |
dc.citation.epage | 2114 | en_US |
dc.citation.issueNumber | 4 | en_US |
dc.citation.spage | 2095 | en_US |
dc.citation.volumeNumber | 60 | en_US |
dc.contributor.author | Sayin, Muhammed O. | |
dc.contributor.author | Parise, Francesca | |
dc.contributor.author | Ozdaglar, Asuman | |
dc.date.accessioned | 2023-02-23T08:56:52Z | |
dc.date.available | 2023-02-23T08:56:52Z | |
dc.date.issued | 2022 | |
dc.department | Department of Electrical and Electronics Engineering | en_US |
dc.description.abstract | We present a novel variant of fictitious play dynamics combining classical fictitiousplay with Q-learning for stochastic games and analyze its convergence properties in two-player zero-sum stochastic games. Our dynamics involves players forming beliefs on the opponent strategyand their own continuation payoff (Q-function), and playing a greedy best response by using theestimated continuation payoffs. Players update their beliefs from observations of opponent actions.A key property of the learning dynamics is that update of the beliefs onQ-functions occurs at aslower timescale than update of the beliefs on strategies. We show that in both the model-based andmodel-free cases (without knowledge of player payoff functions and state transition probabilities),the beliefs on strategies converge to a stationary mixed Nash equilibrium of the zero-sum stochasticgame. | en_US |
dc.description.provenance | Submitted by Mandana Moftakhari (mandana.mir@bilkent.edu.tr) on 2023-02-23T08:56:52Z No. of bitstreams: 1 Fictitious_play_in_zero-sum_stochastic_games.pdf: 486928 bytes, checksum: 7a2713e251dee8dac5bbac18268d0090 (MD5) | en |
dc.description.provenance | Made available in DSpace on 2023-02-23T08:56:52Z (GMT). No. of bitstreams: 1 Fictitious_play_in_zero-sum_stochastic_games.pdf: 486928 bytes, checksum: 7a2713e251dee8dac5bbac18268d0090 (MD5) Previous issue date: 2022 | en |
dc.identifier.doi | 10.1137/21M1426675 | en_US |
dc.identifier.eissn | 1095-7138 | |
dc.identifier.issn | 0363-0129 | |
dc.identifier.uri | http://hdl.handle.net/11693/111622 | |
dc.language.iso | English | en_US |
dc.publisher | Society for Industrial and Applied Mathematics | en_US |
dc.relation.isversionof | https://www.doi.org/10.1137/21M1426675 | en_US |
dc.source.title | SIAM Journal on Control and Optimization | en_US |
dc.subject | Stochastic games | en_US |
dc.subject | fictitious play | en_US |
dc.subject | Q-learning | en_US |
dc.subject | two-timescale learning | en_US |
dc.title | Fictitious play in zero-sum stochastic games | en_US |
dc.type | Article | en_US |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Fictitious_play_in_zero-sum_stochastic_games.pdf
- Size:
- 475.52 KB
- Format:
- Adobe Portable Document Format
- Description:
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 1.69 KB
- Format:
- Item-specific license agreed upon to submission
- Description: