• About
  • Policies
  • What is open access
  • Library
  • Contact
Advanced search
      View Item 
      •   BUIR Home
      • Scholarly Publications
      • Faculty of Engineering
      • Department of Electrical and Electronics Engineering
      • View Item
      •   BUIR Home
      • Scholarly Publications
      • Faculty of Engineering
      • Department of Electrical and Electronics Engineering
      • View Item
      JavaScript is disabled for your browser. Some features of this site may not work without it.

      Fictitious play in zero-sum stochastic games

      Thumbnail
      View / Download
      475.5 Kb
      Author(s)
      Sayin, Muhammed O.
      Parise, Francesca
      Ozdaglar, Asuman
      Date
      2022
      Source Title
      SIAM Journal on Control and Optimization
      Print ISSN
      0363-0129
      Electronic ISSN
      1095-7138
      Publisher
      Society for Industrial and Applied Mathematics
      Volume
      60
      Issue
      4
      Pages
      2095 - 2114
      Language
      English
      Type
      Article
      Item Usage Stats
      31
      views
      2
      downloads
      Abstract
      We present a novel variant of fictitious play dynamics combining classical fictitiousplay with Q-learning for stochastic games and analyze its convergence properties in two-player zero-sum stochastic games. Our dynamics involves players forming beliefs on the opponent strategyand their own continuation payoff (Q-function), and playing a greedy best response by using theestimated continuation payoffs. Players update their beliefs from observations of opponent actions.A key property of the learning dynamics is that update of the beliefs onQ-functions occurs at aslower timescale than update of the beliefs on strategies. We show that in both the model-based andmodel-free cases (without knowledge of player payoff functions and state transition probabilities),the beliefs on strategies converge to a stationary mixed Nash equilibrium of the zero-sum stochasticgame.
      Keywords
      Stochastic games
      fictitious play
      Q-learning
      two-timescale learning
      Permalink
      http://hdl.handle.net/11693/111622
      Published Version (Please cite this version)
      https://www.doi.org/10.1137/21M1426675
      Collections
      • Department of Electrical and Electronics Engineering 4011
      Show full item record

      Browse

      All of BUIRCommunities & CollectionsTitlesAuthorsAdvisorsBy Issue DateKeywordsTypeDepartmentsCoursesThis CollectionTitlesAuthorsAdvisorsBy Issue DateKeywordsTypeDepartmentsCourses

      My Account

      Login

      Statistics

      View Usage StatisticsView Google Analytics Statistics

      Bilkent University

      If you have trouble accessing this page and need to request an alternate format, contact the site administrator. Phone: (312) 290 2976
      © Bilkent University - Library IT

      Contact Us | Send Feedback | Off-Campus Access | Admin | Privacy