Causal Influence Detection for Improving Efficiency in Reinforcement Learning

被引：0

作者：

Seitzer, Maximilian ^{[1
]}

Schoelkopf, Bernhard ^{[1
]}

Martius, Georg ^{[1
]}

机构：

[1] MPI Intelligent Syst, Tubingen, Germany

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021) | 2021年 / 34卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Many reinforcement learning (RL) environments consist of independent entities that interact sparsely. In such environments, RL agents have only limited influence over other entities in any particular situation. Our idea in this work is that learning can be efficiently guided by knowing when and what the agent can influence with its actions. To achieve this, we introduce a measure of situation-dependent causal influence based on conditional mutual information and show that it can reliably detect states of influence. We then propose several ways to integrate this measure into RL algorithms to improve exploration and off-policy learning. All modified algorithms show strong increases in data efficiency on robotic manipulation tasks.

引用

页数：14

共 50 条

[1] Improving the Efficiency of Bayesian Inverse Reinforcement Learning
Michini, Bernard
How, Jonathan P.
2012 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2012, : 3651 - 3656
[2] Improving the Stability of Intrusion Detection With Causal Deep Learning
Zeng, Zengri
Peng, Wei
Zeng, Detian
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2022, 19 (04): : 4750 - 4763
[3] A Survey on Causal Reinforcement Learning
Zeng, Yan
Cai, Ruichu
Sun, Fuchun
Huang, Libo
Hao, Zhifeng
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (04) : 5942 - 5962
[4] CAUSAL DISCOVERY WITH REINFORCEMENT LEARNING
Huawei Noah's Ark Lab
不详
Int. Conf. Learn. Represent., ICLR,
[5] Improving the efficiency of reinforcement learning for a spacecraft powered descent with Q-learning
Wilson, Callum
Riccardi, Annalisa
OPTIMIZATION AND ENGINEERING, 2023, 24 (01) : 223 - 255
[6] Improving the efficiency of reinforcement learning for a spacecraft powered descent with Q-learning
Callum Wilson
Annalisa Riccardi
Optimization and Engineering, 2023, 24 : 223 - 255
[7] Improving the Accuracy of Network Intrusion Detection with Causal Machine Learning
Zeng, Zengri
Peng, Wei
Zhao, Baokang
SECURITY AND COMMUNICATION NETWORKS, 2021, 2021
[8] Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings
Chen, Lili
Lee, Kimin
Srinivas, Aravind
Abbeel, Pieter
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[9] Reward tampering problems and solutions in reinforcement learning: a causal influence diagram perspective
Everitt, Tom
Hutter, Marcus
Kumar, Ramana
Krakovna, Victoria
SYNTHESE, 2021, 198 (SUPPL 27) : 6435 - 6467
[10] Reward tampering problems and solutions in reinforcement learning: a causal influence diagram perspective
Tom Everitt
Marcus Hutter
Ramana Kumar
Victoria Krakovna
Synthese, 2021, 198 : 6435 - 6467

← 1 2 3 4 5 →