Guo, X and Zhang, Y ORCID: 0000-0002-3200-6306
(2016)
Optimality of mixed policies for average continuous-time Markov decision processes with constraints.
Mathematics of Operations Research, 41 (4).
pp. 1161-1207.
Text
GZMORAccepted.pdf - Author Accepted Manuscript Download (148kB) |
Abstract
This article concerns the average criteria for continuous-time Markov decision processes with N constraints. We show the following; (a) every extreme point of the space of performance vectors corresponding to the set of stable measures is generated by a deterministic stationary policy; and (b) there exists a mixed optimal policy, where the mixture is over no more than N + 1 deterministic stationary policies.
Item Type: | Article |
---|---|
Additional Information: | ## TULIP Type: Articles/Papers (Journal) ## |
Uncontrolled Keywords: | continuous-time Markov decision processes, average criteria, mixed policy, constrained optimality |
Depositing User: | Symplectic Admin |
Date Deposited: | 01 Jul 2016 15:17 |
Last Modified: | 19 Jan 2023 07:35 |
DOI: | 10.1287/moor.2015.0777 |
Related URLs: | |
URI: | https://livrepository.liverpool.ac.uk/id/eprint/3001984 |