Second Order Optimality in Transient and Discounted Markov Decision Chains

Sladký, Karel

Počet záznamů: 1

Second Order Optimality in Transient and Discounted Markov Decision Chains

1.

SYSNO ASEP	0448938
Druh ASEP	C - Konferenční příspěvek (mezinárodní konf.)
Zařazení RIV	D - Článek ve sborníku
Název	Second Order Optimality in Transient and Discounted Markov Decision Chains
Tvůrce(i)	Sladký, Karel (UTIA-B)_RID
Celkový počet autorů	1
Zdroj.dok.	Procedings of the 33rd International Conference Mathematical Methods in Economics MME 2015. - Plzeň : University of West Bohemia, Plzeň, 2015 - ISBN 978-80-261-0539-8
Rozsah stran	s. 731-736
Poč.str.	6 s.
Forma vydání	Tištěná - P
Akce	Mathematical Methods in Economics 2015 /33./
Datum konání	09.09.2015-11.09.2015
Místo konání	Cheb
Země	CZ - Česká republika
Typ akce	EUR
Jazyk dok.	eng - angličtina
Země vyd.	CZ - Česká republika
Klíč. slova	dynamic programming ; discounted and transient Markov reward chains ; reward-variance optimality
Vědní obor RIV	BC - Teorie a systémy řízení
CEP	GA13-14445S GA ČR - Grantová agentura ČR
	GA15-10331S GA ČR - Grantová agentura ČR
Institucionální podpora	UTIA-B - RVO:67985556
UT WOS	000387898900125
Anotace	The article is devoted to second order optimality in Markov decision processes. Attention is primarily focused on the reward variance for discounted models and undiscounted transient models (i.e. where the spectral radius of the transition probability matrix is less than unity). Considering the second order optimality criteria means that in the class of policies maximizing (or minimizing) total expected discounted reward (or undiscounted reward for the transient model) we choose the policy minimizing the total variance. Explicit formulae for calculating the variances for transient and discounted models are reported along with sketches of algoritmic procedures for finding second order optimal policies.
Pracoviště	Ústav teorie informace a automatizace
Kontakt	Markéta Votavová, votavova@utia.cas.cz, Tel.: 266 052 201.
Rok sběru	2016

Počet záznamů: 1