A Closer Look at Reward Decomposition for High-level Robotic Explanations

Cited by: 3
Authors
Lu, Wenhao [1]
Zhao, Xufeng [1]
Magg, Sven [2]
Gromniak, Martin [1, 3]
Li, Mengdi [1]
Wermter, Stefan [1]
Affiliations
[1] Univ Hamburg, Dept Informat, Knowledge Technol Grp, Hamburg, Germany
[2] Hamburger Informat Technol Ctr HITeC, Hamburg, Germany
[3] ZAL Ctr Appl Aeronaut Res, Hamburg, Germany
Source
2023 IEEE INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING, ICDL | 2023
Keywords
DOI
10.1109/ICDL55364.2023.10364407
Chinese Library Classification number
B84 [Psychology]; C [Social Sciences, General]; Q98 [Anthropology];
Discipline classification code
03; 0303; 030303; 04; 0402
Abstract
Explaining the behaviour of intelligent agents learned by reinforcement learning (RL) to humans is challenging yet crucial due to their incomprehensible proprioceptive states, variational intermediate goals, and resultant unpredictability. Moreover, one-step explanations for RL agents can be ambiguous as they fail to account for the agent's future behaviour at each transition, adding to the complexity of explaining robot actions. By leveraging abstracted actions that map to task-specific primitives, we avoid explanations on the movement level. To further improve the transparency and explainability of robotic systems, we propose an explainable Q-Map learning framework that combines reward decomposition (RD) with abstracted action spaces, allowing for non-ambiguous and high-level explanations based on object properties in the task. We demonstrate the effectiveness of our framework through quantitative and qualitative analysis of two robotic scenarios, showcasing visual and textual explanations, from output artefacts of RD explanations, that are easy for humans to comprehend. Additionally, we demonstrate the versatility of integrating these artefacts with large language models (LLMs) for reasoning and interactive querying.
Pages: 429 - 436
Number of pages: 8
Related papers
50 in total
  • [21] Learning high-level robotic manipulation actions with visual predictive model
    Anji Ma
    Guoyi Chi
    Serena Ivaldi
    Lipeng Chen
    Complex & Intelligent Systems, 2024, 10 : 811 - 823
  • [22] Tautomerism and Thermal Decomposition of Tetrazole: High-Level ab Initio Study
    Kiselev, Vitaly G.
    Cheblakov, Pavel B.
    Gritsan, Nina P.
    JOURNAL OF PHYSICAL CHEMISTRY A, 2011, 115 (09) : 1743 - 1753
  • [23] Catalytic decomposition of sodium tetraphenylborate in high-level nuclear waste.
    Barnes, MJ
    Crawford, CL
    Fink, SD
    Fondeur, FF
    Hobbs, DT
    Peterson, RA
    Walker, DD
    Wilmarth, WR
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2000, 219 : U762 - U763
  • [24] High-level synthesis of asynchronous systems by data-driven decomposition
    Wong, CG
    Martin, AJ
    40TH DESIGN AUTOMATION CONFERENCE, PROCEEDINGS 2003, 2003 : 508 - 513
  • [25] Extending a net splitting operation for decomposition of high-level Petri nets
    Moutinho, Filipe
    Gomes, Luis
    38TH ANNUAL CONFERENCE ON IEEE INDUSTRIAL ELECTRONICS SOCIETY (IECON 2012), 2012 : 6120 - 6125
  • [26] System-level design merits a closer look
    Moretti, G
    EDN, 2002, 47 (04) : 43 - +
  • [27] REAL-TIME ANALYZER FURNISHES HIGH-LEVEL LOOK AT SOFTWARE OPERATION
    ABLEIDINGER, B
    AGARWAL, N
    NOBLES, C
    ELECTRONIC DESIGN, 1985, 33 (22) : 117 - &
  • [28] A NEW LOOK AT PARTIAL FRACTION EXPANSION FROM A HIGH-LEVEL LANGUAGE VIEWPOINT
    CHEN, CF
    LEUNG, KK
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 1981, 7 (05) : 361 - 367
  • [29] The effect of IAS/IFRS adoption on earnings management (smoothing): A closer look at competing explanations
    Capkun, Vedran
    Collins, Dan
    Jeanjean, Thomas
    JOURNAL OF ACCOUNTING AND PUBLIC POLICY, 2016, 35 (04) : 352 - 394
  • [30] Decentralized Control of Robotic Swarms from High-Level Temporal Logic Specifications
    Moarref, Salar
    Kress-Gazit, Hadas
    2017 INTERNATIONAL SYMPOSIUM ON MULTI-ROBOT AND MULTI-AGENT SYSTEMS (MRS), 2017,