A Counterexample on Sample-Path Optimality in Stable Markov Decision Chains with the Average Reward Criterion

Počet záznamů: 1

1.

SYSNO ASEP	0432661
Druh ASEP	J - Článek v odborném periodiku
Zařazení RIV	J - Článek v odborném periodiku
Poddruh J	Článek ve WOS
Název	A Counterexample on Sample-Path Optimality in Stable Markov Decision Chains with the Average Reward Criterion
Tvůrce(i)	Cavazos-Cadena, R. (MX) Montes-de-Oca, R. (MX) Sladký, Karel (UTIA-B)_RID
Celkový počet autorů	3
Zdroj.dok.	Journal of Optimization Theory and Applications. - : Springer - ISSN 0022-3239 Roč. 163, č. 2 (2014), s. 674-684
Poč.str.	11 s.
Forma vydání	Tištěná - P
Jazyk dok.	eng - angličtina
Země vyd.	US - Spojené státy americké
Klíč. slova	Strong sample-path optimality ; Lyapunov function condition ; Stationary policy ; Expected average reward criterion
Vědní obor RIV	BB - Aplikovaná statistika, operační výzkum
Institucionální podpora	UTIA-B - RVO:67985556
UT WOS	000343160900017
DOI	10.1007/s10957-013-0474-6
Anotace	This note deals with Markov decision chains evolving on a denumerable state space. Under standard continuity compactness requirements, an explicit example is provided to show that, with respect to a strong sample-path average reward criterion, the Lyapunov function condition does not ensure the existence of an optimal stationary policy.
Pracoviště	Ústav teorie informace a automatizace
Kontakt	Markéta Votavová, votavova@utia.cas.cz, Tel.: 266 052 201.
Rok sběru	2015

Počet záznamů: 1