Number of the records: 1  

Balancing Exploitation and Exploration via Fully Probabilistic Design of Decision Policies

  1. 1.
    SYSNO ASEP0495875
    Document TypeV - Research Report
    R&D Document TypeThe record was not marked in the RIV
    TitleBalancing Exploitation and Exploration via Fully Probabilistic Design of Decision Policies
    Author(s) Kárný, Miroslav (UTIA-B) RID, ORCID
    Hůla, František (UTIA-B)
    Number of authors2
    Issue dataPraha: ÚTIA AV ČR, v.v.i, 2018
    SeriesResearch Report
    Series number2376
    Number of pages13 s.
    Publication formPrint - P
    Languageeng - English
    CountryCZ - Czech Republic
    KeywordsExploitation ; Exploration ; Bayesian estimation ; Adaptive systems ; Fully probabilistic design ; Kullback-Leibler divergence ; Decision policy ; Markov decision process
    Subject RIVBC - Control Systems Theory
    OECD categoryComputer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
    R&D ProjectsGA16-09848S GA ČR - Czech Science Foundation (CSF)
    GA18-15970S GA ČR - Czech Science Foundation (CSF)
    Institutional supportUTIA-B - RVO:67985556
    AnnotationAdaptive decision making learns an environment model serving a design of a decision policy. The policy-generated actions influence both the acquired reward and the future knowledge. The optimal policy properly balances exploitation with exploration. The inherent dimensionality
    curse of decision making under incomplete knowledge prevents the realisation of the optimal design.
    WorkplaceInstitute of Information Theory and Automation
    ContactMarkéta Votavová, votavova@utia.cas.cz, Tel.: 266 052 201.
    Year of Publishing2019
Number of the records: 1  

  This site uses cookies to make them easier to browse. Learn more about how we use cookies.