Computational methods for risk-averse undiscounted transient markov models
Date
2014
Authors
Çavuş, O.
Ruszczyński, A.
Editor(s)
Advisor
Supervisor
Co-Advisor
Co-Supervisor
Instructor
Source Title
Operations Research
Print ISSN
0030-364X
Electronic ISSN
1526-5463
Publisher
Institute for Operations Research and the Management Sciences (I N F O R M S)
Volume
62
Issue
2
Pages
401 - 417
Language
English
Type
Journal Title
Journal ISSN
Volume Title
Citation Stats
Attention Stats
Usage Stats
2
views
views
57
downloads
downloads
Series
Abstract
The total cost problem for discrete-time controlled transient Markov models is considered. The objective functional is a Markov dynamic risk measure of the total cost. Two solution methods, value and policy iteration, are proposed, and their convergence is analyzed. In the policy iteration method, we propose two algorithms for policy evaluation: the nonsmooth Newton method and convex programming, and we prove their convergence. The results are illustrated on a credit limit control problem.