Causal Influence Detection for Improving Efficiency in Reinforcement Learning

被引:0
|
作者
Seitzer, Maximilian [1 ]
Schoelkopf, Bernhard [1 ]
Martius, Georg [1 ]
机构
[1] MPI Intelligent Syst, Tubingen, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many reinforcement learning (RL) environments consist of independent entities that interact sparsely. In such environments, RL agents have only limited influence over other entities in any particular situation. Our idea in this work is that learning can be efficiently guided by knowing when and what the agent can influence with its actions. To achieve this, we introduce a measure of situation-dependent causal influence based on conditional mutual information and show that it can reliably detect states of influence. We then propose several ways to integrate this measure into RL algorithms to improve exploration and off-policy learning. All modified algorithms show strong increases in data efficiency on robotic manipulation tasks.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Improving the Efficiency of Bayesian Inverse Reinforcement Learning
    Michini, Bernard
    How, Jonathan P.
    2012 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2012, : 3651 - 3656
  • [2] Improving the Stability of Intrusion Detection With Causal Deep Learning
    Zeng, Zengri
    Peng, Wei
    Zeng, Detian
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2022, 19 (04): : 4750 - 4763
  • [3] A Survey on Causal Reinforcement Learning
    Zeng, Yan
    Cai, Ruichu
    Sun, Fuchun
    Huang, Libo
    Hao, Zhifeng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (04) : 5942 - 5962
  • [4] CAUSAL DISCOVERY WITH REINFORCEMENT LEARNING
    Huawei Noah's Ark Lab
    不详
    Int. Conf. Learn. Represent., ICLR,
  • [5] Improving the efficiency of reinforcement learning for a spacecraft powered descent with Q-learning
    Wilson, Callum
    Riccardi, Annalisa
    OPTIMIZATION AND ENGINEERING, 2023, 24 (01) : 223 - 255
  • [6] Improving the efficiency of reinforcement learning for a spacecraft powered descent with Q-learning
    Callum Wilson
    Annalisa Riccardi
    Optimization and Engineering, 2023, 24 : 223 - 255
  • [7] Improving the Accuracy of Network Intrusion Detection with Causal Machine Learning
    Zeng, Zengri
    Peng, Wei
    Zhao, Baokang
    SECURITY AND COMMUNICATION NETWORKS, 2021, 2021
  • [8] Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings
    Chen, Lili
    Lee, Kimin
    Srinivas, Aravind
    Abbeel, Pieter
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [9] Reward tampering problems and solutions in reinforcement learning: a causal influence diagram perspective
    Everitt, Tom
    Hutter, Marcus
    Kumar, Ramana
    Krakovna, Victoria
    SYNTHESE, 2021, 198 (SUPPL 27) : 6435 - 6467
  • [10] Reward tampering problems and solutions in reinforcement learning: a causal influence diagram perspective
    Tom Everitt
    Marcus Hutter
    Ramana Kumar
    Victoria Krakovna
    Synthese, 2021, 198 : 6435 - 6467