Explainable Reinforcement Learning via a Causal World Model

Authors
Yu, Zhongwei [1]
Ruan, Jingqing [1]
Xing, Dengpeng [1]
Affiliations
[1] Chinese Academy of Sciences, Institute of Automation, Beijing, People's Republic of China
Funding
National Key Research and Development Program of China
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Generating explanations for reinforcement learning (RL) is challenging because actions may have long-term effects on the future. In this paper, we develop a novel framework for explainable RL by learning a causal world model without prior knowledge of the causal structure of the environment. The model captures the influence of actions, allowing us to interpret their long-term effects through causal chains, which show how actions influence environmental variables and ultimately lead to rewards. Unlike most explanatory models, which suffer from low accuracy, our model remains accurate while improving explainability, making it applicable to model-based learning. As a result, we demonstrate that our causal model can serve as a bridge between explainability and learning.
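To make the notion of a causal chain concrete, the following is a minimal sketch that is not taken from the paper. It assumes the learned causal structure is available as a directed acyclic graph stored as an adjacency dictionary, and it lists every path from an action to the reward. The variable names (`fuel`, `position`, `speed`), the graph, and the helper `causal_chains` are all hypothetical and serve only as illustration.

```python
from typing import Dict, List

# Hypothetical causal graph: each key directly influences the listed variables.
CausalGraph = Dict[str, List[str]]


def causal_chains(graph: CausalGraph, source: str, target: str) -> List[List[str]]:
    """Enumerate every directed path from source to target (graph assumed acyclic)."""
    if source == target:
        return [[target]]
    chains: List[List[str]] = []
    for child in graph.get(source, []):
        # Extend each chain found from the child with the current node.
        for tail in causal_chains(graph, child, target):
            chains.append([source] + tail)
    return chains


# Toy structure with invented variables (an assumption, not the paper's environment).
toy_graph: CausalGraph = {
    "action": ["fuel", "position"],
    "fuel": ["speed"],
    "position": ["reward"],
    "speed": ["reward"],
}

chains = causal_chains(toy_graph, "action", "reward")
for chain in chains:
    print(" -> ".join(chain))
```

Each printed line is one candidate explanation of how the action could reach the reward through intermediate variables. How the paper itself learns the graph and selects chains is not specified in this record.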
Pages: 4540 - 4548
Page count: 9