Reward-rate maximization in sequential identification under a stochastic deadline

Dayanık, S.; Yu, A. J.

Reward-rate maximization in sequential identification under a stochastic deadline

dc.citation.epage	2948	en_US
dc.citation.issueNumber	4	en_US
dc.citation.spage	2922	en_US
dc.citation.volumeNumber	51	en_US
dc.contributor.author	Dayanık, S.	en_US
dc.contributor.author	Yu, A. J.	en_US
dc.date.accessioned	2016-02-08T11:03:43Z
dc.date.available	2016-02-08T11:03:43Z
dc.date.issued	2013	en_US
dc.department	Department of Industrial Engineering	en_US
dc.department	Department of Mathematics	en_US
dc.description.abstract	Any intelligent system performing evidence-based decision making under time pressure must negotiate a speed-accuracy trade-off. In computer science and engineering, this is typically modeled as minimizing a Bayes-risk functional that is a linear combination of expected decision delay and expected terminal decision loss. In neuroscience and psychology, however, it is often modeled as maximizing the long-term reward rate, or the ratio of expected terminal reward and expected decision delay. The two approaches have opposing advantages and disadvantages. While Bayes-risk minimization can be solved with powerful dynamic programming techniques unlike reward-rate maximization, it also requires the explicit specification of the relative costs of decision delay and error, which is obviated by reward-rate maximization. Here, we demonstrate that, for a large class of sequential multihypothesis identification problems under a stochastic deadline, the reward-rate maximization is equivalent to a special case of Bayes-risk minimization, in which the optimal policy that attains the minimal risk when the unit sampling cost is exactly the maximal reward rate is also the policy that attains maximal reward rate. We show that the maximum reward rate is the unique unit sampling cost for which the expected total observation cost and expected terminal reward break even under every Bayes-risk optimal decision rule. This interplay between reward-rate maximization and Bayesrisk minimization formulations allows us to show that maximum reward rate is always attained. We can compute the policy that maximizes reward rate by solving an inverse Bayes-risk minimization problem, whereby we know the Bayes risk of the optimal policy and need to find the associated unit sampling cost parameter. Leveraging this equivalence, we derive an iterative dynamic programming procedure for solving the reward-rate maximization problem exponentially fast, thus incorporating the advantages of both the reward-rate maximization and Bayes-risk minimization formulations. As an illustration, we will apply the procedure to a two-hypothesis identification example.	en_US
dc.identifier.doi	10.1137/100818005	en_US
dc.identifier.eissn	1095-7138
dc.identifier.issn	0363-0129
dc.identifier.uri	http://hdl.handle.net/11693/26706
dc.language.iso	English	en_US
dc.relation.isversionof	http://dx.doi.org/10.1137/100818005	en_US
dc.source.title	SIAM Journal on Control and Optimization	en_US
dc.subject	Bayes-risk minimization	en_US
dc.subject	Dynamic programming	en_US
dc.subject	Reward-rate maximization	en_US
dc.subject	Sequential multihypothesis testing	en_US
dc.subject	Speed-accuracy trade off	en_US
dc.subject	Bayes-risk minimization	en_US
dc.subject	Computer science and engineerings	en_US
dc.subject	Dynamic programming techniques	en_US
dc.subject	Identification problem	en_US
dc.subject	Iterative Dynamic Programming	en_US
dc.subject	Multi-hypothesis testing	en_US
dc.subject	Optimal decision-rule	en_US
dc.subject	Trade off	en_US
dc.subject	Costs	en_US
dc.subject	Dynamic programming	en_US
dc.subject	Economic and social effects	en_US
dc.subject	Equivalence classes	en_US
dc.subject	Intelligent systems	en_US
dc.subject	Inverse problems	en_US
dc.subject	Iterative methods	en_US
dc.subject	Optimization	en_US
dc.subject	Stochastic systems	en_US
dc.subject	Decision making	en_US
dc.title	Reward-rate maximization in sequential identification under a stochastic deadline	en_US
dc.type	Article	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Reward-rate_maximization_in_sequential_identification_under_a_stochastic_deadline.pdf
Size:: 2.82 MB
Format:: Adobe Portable Document Format
Description:: Full printable version

Download

Collections

Scholarly Publications - Industrial Engineering
Scholarly Publications - Mathematics