Publicación: Discrete-time control with non-constant discount factor
dc.contributor.author | Jasso Fuentes, Héctor | |
dc.contributor.author | Menaldi, José Luis | |
dc.contributor.author | Prieto Rumeau, Tomás | |
dc.date.accessioned | 2024-11-08T08:29:07Z | |
dc.date.available | 2024-11-08T08:29:07Z | |
dc.date.issued | 2020-06-20 | |
dc.description.abstract | This paper deals with discrete-time Markov decision processes (MDPs) with Borel state and action spaces, and total expected discounted cost optimality criterion. We assume that the discount factor is not constant: it may depend on the state and action; moreover, it can even take the extreme values zero or one. We propose sufficient conditions on the data of the model ensuring the existence of optimal control policies and allowing the characterization of the optimal value function as a solution to the dynamic programming equation. As a particular case of these MDPs with varying discount factor, we study MDPs with stopping, as well as the corresponding optimal stopping times and contact set. We show applications to switching MDPs models and, in particular, we study a pollution accumulation problem. | en |
dc.description.version | Versión final | |
dc.identifier.citation | Jasso-Fuentes, H., Menaldi, JL. & Prieto-Rumeau, T. Discrete-time control with non-constant discount factor. Math Meth Oper Res 92, 377–399 (2020). https://link.springer.com/article/10.1007/s00186-020-00716-8 https://doi.org/10.1007/s00186-020-00716-8 | |
dc.identifier.doi | https://doi.org/10.1007/s00186-020-00716-8 | |
dc.identifier.issn | 1432-2994; e-ISSN: 1432-5217 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14468/24308 | |
dc.journal.title | Mathematical Methods of Operations Research | |
dc.journal.volume | 92 | |
dc.language.iso | en | |
dc.page.final | 399 | |
dc.page.initial | 377 | |
dc.publisher | Springer Nature | |
dc.relation.center | Facultades y escuelas::Facultad de Ciencias | |
dc.relation.department | Estadística, Investigación Operativa y Cálculo Numérico | |
dc.rights | info:eu-repo/semantics/openAccess | |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/deed.es | |
dc.subject | 12 Matemáticas | |
dc.subject | 12 Matemáticas::1209 Estadística | |
dc.subject | 53 Ciencias Económicas::5311 Organización y dirección de empresas ::5311.07 Investigación operativa | |
dc.subject.keywords | Markov decision processes | en |
dc.subject.keywords | dynamic programming | en |
dc.subject.keywords | optimal stopping problems | en |
dc.title | Discrete-time control with non-constant discount factor | en |
dc.type | artículo | es |
dc.type | journal article | en |
dspace.entity.type | Publication | |
relation.isAuthorOfPublication | 6f28d560-9dfd-4c43-ac89-b1064aedac5c | |
relation.isAuthorOfPublication.latestForDiscovery | 6f28d560-9dfd-4c43-ac89-b1064aedac5c |
Archivos
Bloque original
1 - 1 de 1
Cargando...
- Nombre:
- PrietoRumeau_Tomas_DiscreteTimeControlWithNon.pdf
- Tamaño:
- 339.99 KB
- Formato:
- Adobe Portable Document Format
Bloque de licencias
1 - 1 de 1
No hay miniatura disponible
- Nombre:
- license.txt
- Tamaño:
- 3.62 KB
- Formato:
- Item-specific license agreed to upon submission
- Descripción: