Publicación:
Absorbing Markov decision processes

Fecha
2024-02-09
Autores
Dufour, François
Editor/a
Director/a
Tutor/a
Coordinador/a
Prologuista
Revisor/a
Ilustrador/a
Derechos de acceso
info:eu-repo/semantics/openAccess
Título de la revista
ISSN de la revista
Título del volumen
Editor
EDP Sciences
Proyectos de investigación
Unidades organizativas
Número de la revista
Resumen
In this paper, we study discrete-time absorbing Markov Decision Processes (MDP) with measurable state space and Borel action space with a given initial distribution. For such models, solutions to the characteristic equation that are not occupation measures may exist. Several necessary and sufficient conditions are provided to guarantee that any solution to the characteristic equation is an occupation measure. Under the so-called continuity-compactness conditions, we first show that a measure is precisely an occupation measure if and only if it satisfies the characteristic equation and an additional absolute continuity condition. Secondly, it is shown that the set of occupation measures is compact in the weak-strong topology if and only if the model is uniformly absorbing. Several examples are provided to illustrate our results.
Descripción
Categorías UNESCO
Palabras clave
Markov decision processes, absorbing model, occupation measures, characteristic equation, phantom measures, compactness of the set of occupation measures
Citación
Absorbing Markov decision processes. François Dufour, Tomás Prieto-Rumeau. ESAIM: COCV 30 5 (2024) https://www.esaim-cocv.org/articles/cocv/abs/2024/01/cocv230200/cocv230200.html https://doi.org/10.1051/cocv/2024002
Centro
Facultades y escuelas::Facultad de Ciencias
Departamento
Estadística, Investigación Operativa y Cálculo Numérico
Grupo de investigación
Grupo de innovación
Programa de doctorado
Cátedra