Abstract
In this paper we investigate denumerable state semi-Markov decision chains with small interest rates. We consider average and Blackwell optimality and allow for multiple closed sets and unbounded immediate rewards. Our analysis uses the existence of a Laurent series expansion for the total discounted rewards and the continuity of its terms. The assumptions are expressed in terms of a weighted supremum norm. Our method is based on an algebraic treatment of Laurent series; it constructs an appropriate linear space with a lexicographic ordering. Using two operators and a positiveness property we establish the existence of bounded solutions to optimality equations. The theory is illustrated with an example of a K-dimensional queueing system. This paper is strongly based on the work of Denardo [11] and Dekker and Hordijk [7].
Original language | English |
---|---|
Pages (from-to) | 185-211 |
Number of pages | 27 |
Journal | Annals of Operations Research |
Volume | 28 |
Issue number | 1 |
DOIs | |
Publication status | Published - Dec 1991 |