Abstract.
This work concerns controlled Markov chains with denumerable state space and discrete time parameter. The reward function is assumed to be≤0 and the performance of a control policy is measured by the expected total-reward criterion. Within this context, sufficient conditions are given so that the existence of a stationary policy which is ε-optimal at every state is guaranteed.
Similar content being viewed by others
Author information
Authors and Affiliations
Additional information
Manuscript received: December 1997/final version received: December 1998
Rights and permissions
About this article
Cite this article
Cavazos-Cadena, R., Montes-De-Oca, R. Nearly optimal stationary policies in negative dynamic programming. Mathematical Methods of OR 49, 441–456 (1999). https://doi.org/10.1007/s001860050060
Issue Date:
DOI: https://doi.org/10.1007/s001860050060