Batch reinforcement learning with state importance

被引:0
|
作者
Li, LH [1 ]
Bulitko, V [1 ]
Greiner, R [1 ]
机构
[1] Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2E8, Canada
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We investigate the problem of using function approximation in rein forcement learning where the agent's policy is represented as a classifier mapping states to actions. High classification accuracy is usually deemed to correlate with high policy quality. But this is not necessarily the case as increasing classification accuracy can actually decrease the policy's quality. This phenomenon takes place when the learning process begins to focus on classifying less "important" states. In this paper, we introduce a measure of state's decision-making importance that can be used to improve policy learning. As a result, the focused learning process is shown to converge faster to better policies(1).
引用
收藏
页码:566 / 568
页数:3
相关论文
共 50 条
  • [1] Learning state importance for preference-based reinforcement learning
    Zhang, Guoxi
    Kashima, Hisashi
    MACHINE LEARNING, 2023, 113 (4) : 1885 - 1901
  • [2] Learning state importance for preference-based reinforcement learning
    Guoxi Zhang
    Hisashi Kashima
    Machine Learning, 2024, 113 : 1885 - 1901
  • [3] Reinforcement learning in batch processes
    Wilson, JA
    Martinez, EC
    APPLICATION OF NEURAL NETWORKS AND OTHER LEARNING TECHNOLOGIES IN PROCESS ENGINEERING, 2001, : 269 - 286
  • [4] Causal explanation for reinforcement learning: quantifying state and temporal importance
    Xiaoxiao Wang
    Fanyu Meng
    Xin Liu
    Zhaodan Kong
    Xin Chen
    Applied Intelligence, 2023, 53 : 22546 - 22564
  • [5] Causal explanation for reinforcement learning: quantifying state and temporal importance
    Wang, Xiaoxiao
    Meng, Fanyu
    Liu, Xin
    Kong, Zhaodan
    Chen, Xin
    APPLIED INTELLIGENCE, 2023, 53 (19) : 22546 - 22564
  • [6] Reinforcement Learning for Batch-to-Batch Bioprocess Optimisation
    Petsagkourakis, P.
    Sandoval, I. Orson
    Bradford, E.
    Zhang, D.
    del Rio-Chanona, E. A.
    29TH EUROPEAN SYMPOSIUM ON COMPUTER AIDED PROCESS ENGINEERING, PT A, 2019, 46 : 919 - 924
  • [7] Batch Reinforcement Learning from Crowds
    Zhang, Guoxi
    Kashima, Hisashi
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT IV, 2023, 13716 : 38 - 51
  • [8] Batch Reinforcement Learning with Hyperparameter Gradients
    Lee, Byung-Jun
    Lee, Jongmin
    Vrancx, Peter
    Kim, Dongho
    Kim, Kee-Eung
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [9] Reinforcement learning for batch bioprocess optimization
    Petsagkourakis, P.
    Sandoval, I. O.
    Bradford, E.
    Zhang, D.
    del Rio-Chanona, E. A.
    COMPUTERS & CHEMICAL ENGINEERING, 2020, 133
  • [10] Small batch deep reinforcement learning
    Obando-Ceron, Johan
    Bellemare, Marc G.
    Castro, Pablo Samuel
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,