Identification of Optimal Policies in Markov Decision Processes

Sladký, Karel

Počet záznamů: 1

Identification of Optimal Policies in Markov Decision Processes

1.

SYSNO ASEP	0346161
Druh ASEP	J - Článek v odborném periodiku
Zařazení RIV	J - Článek v odborném periodiku
Poddruh J	Článek ve WOS
Název	Identification of Optimal Policies in Markov Decision Processes
Tvůrce(i)	Sladký, Karel (UTIA-B)_RID
Zdroj.dok.	Kybernetika. - : Ústav teorie informace a automatizace AV ČR, v. v. i. - ISSN 0023-5954 46 2010, č. 3 (2010), s. 558-570
Poč.str.	13 s.
Akce	International Conference on Mathematical Methods in Economy and Industry
Datum konání	15.06.2009-18.06.2009
Místo konání	České Budějovice
Země	CZ - Česká republika
Typ akce	CST
Jazyk dok.	eng - angličtina
Země vyd.	CZ - Česká republika
Klíč. slova	finite state Markov decision processes ; discounted and average costs ; elimination of suboptimal policies
Vědní obor RIV	BB - Aplikovaná statistika, operační výzkum
CEP	GA402/08/0107 GA ČR - Grantová agentura ČR
	GA402/07/1113 GA ČR - Grantová agentura ČR
CEZ	AV0Z10750506 - UTIA-B (2005-2011)
UT WOS	000280425000019
Anotace	In this note we focus attention on identifying optimal policies and on elimination suboptimal policies minimizing optimality criteria in discrete-time Markov decision processes with finite state space and compact action set. We present unified approach to value iteration algorithms that enables to generate lower and upper bounds on optimal values, as well as on the current policy. Using the modified value iterations it is possible to eliminate suboptimal actions and to identify an optimal policy or nearly optimal policies in a finite number of steps without knowing precise values of the performance function.
Pracoviště	Ústav teorie informace a automatizace
Kontakt	Markéta Votavová, votavova@utia.cas.cz, Tel.: 266 052 201.
Rok sběru	2011

Počet záznamů: 1