HEX: Human-in-the-loop explainability via deep reinforcement learning

Cited by: 0
Authors
Lash, Michael T. [1]
Affiliations
[1] Univ Kansas, Sch Business, Analyt Informat & Operat Area, 1654 Naismith Dr, Lawrence, KS 66045 USA
Keywords
Explainability; Interpretability; Human-in-the-loop; Deep reinforcement learning; Machine learning; Behavioral machine learning; Decision support; EXPLANATIONS; ALGORITHMS; MODELS;
DOI
10.1016/j.dss.2024.114304
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The use of machine learning (ML) models in decision-making contexts, particularly high-stakes ones, is fraught with issues and peril, since a person, not a machine, must ultimately be held accountable for the consequences of decisions made using such systems. Machine learning explainability (MLX) promises to provide decision-makers with prediction-specific rationale, assuring them that the model-elicited predictions are made for the right reasons and are thus reliable. Few works explicitly consider this key human-in-the-loop (HITL) component, however. In this work we propose HEX, a human-in-the-loop deep reinforcement learning approach to MLX. HEX incorporates 0-distrust projection to synthesize decider-specific explainers that produce explanations strictly in terms of a decider's preferred explanatory features using any classification model. Our formulation explicitly considers the decision boundary of the ML model in question using a proposed explanatory point mode of explanation, thus ensuring explanations are specific to that model. We empirically evaluate HEX against competing methods, finding that HEX is competitive with the state-of-the-art and outperforms other methods in human-in-the-loop scenarios. We conduct a randomized, controlled laboratory experiment utilizing actual explanations elicited from both HEX and competing methods. We causally establish that our method increases deciders' trust and their tendency to rely on trusted features.
Pages: 12
Related papers
50 in total
  • [31] A survey on active learning and human-in-the-loop deep learning for medical image analysis
    Budd, Samuel
    Robinson, Emma C.
    Kainz, Bernhard
    MEDICAL IMAGE ANALYSIS, 2021, 71
  • [32] Reinforcement Learning Control of Robotic Knee With Human-in-the-Loop by Flexible Policy Iteration
    Gao, Xiang
    Si, Jennie
    Wen, Yue
    Li, Minhan
    Huang, He
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (10) : 5873 - 5887
  • [33] A human-in-the-loop deep learning paradigm for synergic visual evaluation in children
    Zhang, Kai
    Li, Xiaoyan
    He, Lin
    Guo, Chong
    Yang, Yahan
    Dong, Zhou
    Yang, Haoqing
    Zhu, Yi
    Chen, Chuan
    Zhou, Xiaojing
    Li, Wangting
    Liu, Zhenzhen
    Wu, Xiaohang
    Liu, Xiyang
    Lin, Haotian
    NEURAL NETWORKS, 2020, 122 : 163 - 173
  • [34] Fast Human-in-the-Loop Control for HVAC Systems via Meta-Learning and Model-Based Offline Reinforcement Learning
    Chen, Liangliang
    Meng, Fei
    Zhang, Ying
    IEEE TRANSACTIONS ON SUSTAINABLE COMPUTING, 2023, 8 (03) : 504 - 521
  • [35] Human as AI mentor: Enhanced human-in-the-loop reinforcement learning for safe and efficient autonomous driving
    Huang, Zilin
    Sheng, Zihao
    Ma, Chengyuan
    Chen, Sikai
    COMMUNICATIONS IN TRANSPORTATION RESEARCH, 2024, 4
  • [36] A unified microstructure segmentation approach via human-in-the-loop machine learning
    Na, Juwon
    Kim, Se-Jong
    Kim, Heekyu
    Kang, Seong-Hoon
    Lee, Seungchul
    ACTA MATERIALIA, 2023, 255
  • [37] A survey of human-in-the-loop for machine learning
    Wu, Xingjiao
    Xiao, Luwei
    Sun, Yixuan
    Zhang, Junhang
    Ma, Tianlong
    He, Liang
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 135 : 364 - 381
  • [38] Human-in-the-loop Applied Machine Learning
    Brodley, Carla E.
    2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, : 1 - 1
  • [39] Enabling Autonomous Medical Image Data Annotation: A human-in-the-loop Reinforcement Learning Approach
    da Cruz, Leonardo C.
    Sierra-Franco, Cesar A.
    Silva-Calpa, Greis Francy M.
    Raposo, Alberto Barbosa
    PROCEEDINGS OF THE 2021 16TH CONFERENCE ON COMPUTER SCIENCE AND INTELLIGENCE SYSTEMS (FEDCSIS), 2021, : 271 - 279
  • [40] Safety-Aware Human-in-the-Loop Reinforcement Learning With Shared Control for Autonomous Driving
    Huang, Wenhui
    Liu, Haochen
    Huang, Zhiyu
    Lv, Chen
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 16181 - 16192