Model tree methods for explaining deep reinforcement learning agents in real-time robotic applications

被引:11
|
作者
Gjaerum, Vilde B. [1 ]
Strumke, Inga [2 ]
Lover, Jakob [3 ]
Miller, Timothy [4 ]
Lekkas, Anastasios M. [1 ]
机构
[1] Norwegian Univ Sci & Technol, Dept Engn Cybernet, N-7034 Trondheim, Norway
[2] Norwegian Univ Sci & Technol, Dept Comp Sci, N-7034 Trondheim, Norway
[3] Norwegian Univ Sci & Technol, Dept Engn Cybernet, N-7052 Trondheim, Norway
[4] Univ Melbourne, Sch Comp & Informat Syst, Melbourne, Vic 3010, Australia
关键词
Explainable artificial intelligence; Model trees; Reinforcement learning; Robotics;
D O I
10.1016/j.neucom.2022.10.014
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep reinforcement learning has shown useful in the field of robotics but the black-box nature of deep neural networks impedes the applicability of deep reinforcement learning agents for real-world tasks. This is addressed in the field of explainable artificial intelligence, by developing explanation methods that aim to explain such agents to humans. Model trees as surrogate models have proven useful for producing explanations for black-box models used in real-world robotic applications, in particular, due to their capability of providing explanations in real time. In this paper, we provide an overview and analysis of available methods for building model trees for explaining deep reinforcement learning agents solving robotics tasks. We find that multiple outputs are important for the model to be able to grasp the dependencies of coupled output features, i.e. actions. Additionally, our results indicate that introducing domain knowledge via a hierarchy among the input features during the building process results in higher accuracies and a faster building process. (c) 2022 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
引用
收藏
页码:133 / 144
页数:12
相关论文
共 50 条
  • [31] Real-Time Neural MPC: Deep Learning Model Predictive Control for Quadrotors and Agile Robotic Platforms
    Salzmann, Tim
    Kaufmann, Elia
    Arrizabalaga, Jon
    Pavone, Marco
    Scaramuzza, Davide
    Ryll, Markus
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (04) : 2397 - 2404
  • [32] A Deep Reinforcement Learning Real-Time Recommendation Model Based on Long and Short-Term Preference
    Yan-e Hou
    Wenbo Gu
    WeiChuan Dong
    Lanxue Dang
    International Journal of Computational Intelligence Systems, 16
  • [33] Real-Time Model-Free Deep Reinforcement Learning for Force Control of a Series Elastic Actuator
    Sambhus, Ruturaj
    Gokce, Aydin
    Welch, Stephen
    Herron, Connor W.
    Leonessa, Alexander
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 5645 - 5652
  • [34] A Deep Reinforcement Learning Real-Time Recommendation Model Based on Long and Short-Term Preference
    Hou, Yan-e
    Gu, Wenbo
    Dong, WeiChuan
    Dang, Lanxue
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2023, 16 (01)
  • [35] Real-time model for wave attenuation using active plate breakwater based on deep reinforcement learning
    Liang, Hongjian
    Qin, Hao
    Mu, Lin
    Su, Haowen
    OCEAN ENGINEERING, 2023, 277
  • [36] Real-time systems for mobile robotic applications based on a behavioural model
    Buendía, F
    Hassan, H
    Simó, J
    Crespo, A
    REAL TIME PROGRAMMING 1999 (WRTP'99), 1999, : 153 - 158
  • [37] Deep Reinforcement Learning for Resource Protection and Real-Time Detection in IoT Environment
    Liang, Wei
    Huang, Weihong
    Long, Jing
    Zhang, Ke
    Li, Kuan-Ching
    Zhang, Dafang
    IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (07) : 6392 - 6401
  • [38] Deep reinforcement learning in real-time strategy games: a systematic literature review
    Barros e Sa, Gabriel Caldas
    Madeira, Charles Andrye Galvao
    APPLIED INTELLIGENCE, 2025, 55 (03)
  • [39] Real-Time Channel Management in WLANs: Deep Reinforcement Learning versus Heuristics
    Iacoboaiea, Ovidiu
    Krolikowski, Jonatan
    Ben Houidi, Zied
    Rossi, Dario
    2021 IFIP NETWORKING CONFERENCE AND WORKSHOPS (IFIP NETWORKING), 2021,
  • [40] Real-Time Trajectory Adaptation for Quadrupedal Locomotion using Deep Reinforcement Learning
    Gangapurwala, Siddhant
    Geisert, Mathieu
    Orsolino, Romeo
    Fallon, Maurice
    Havoutis, Ioannis
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 5973 - 5979