Number of the records: 1  

Balancing Exploitation and Exploration via Fully Probabilistic Design of Decision Policies

  1.
    0495875 - ÚTIA 2019 CZ eng V - Research Report
    Kárný, Miroslav - Hůla, František
    Balancing Exploitation and Exploration via Fully Probabilistic Design of Decision Policies.
    Praha: ÚTIA AV ČR, v.v.i., 2018. 13 pp. Research Report, 2376.
    R&D Projects: GA ČR GA16-09848S; GA ČR(CZ) GA18-15970S
    Institutional support: RVO:67985556
    Keywords : Exploitation * Exploration * Bayesian estimation * Adaptive systems * Fully probabilistic design * Kullback-Leibler divergence * Decision policy * Markov decision process
    OECD category: Computer sciences, information science, bioinformatics (hardware development to be 2.2, social aspect to be 5.8)
    http://library.utia.cas.cz/separaty/2018/AS/karny-0495875.pdf

    Adaptive decision making learns an environment model that serves the design of a decision policy. The actions generated by the policy influence both the acquired reward and the future knowledge. The optimal policy properly balances exploitation with exploration. The inherent curse of dimensionality in decision making under incomplete knowledge prevents the realisation of the optimal design.
    Permanent Link: http://hdl.handle.net/11104/0288947

     
    File: 0495875.pdf
    Download/Size: 1515 KB
    Version: Other
    Access: open-access
     