HEX: Human-in-the-loop explainability via deep reinforcement learning

Cited by: 0
Author
Lash, Michael T. [1]
Affiliation
[1] Univ Kansas, Sch Business, Analyt Informat & Operat Area, 1654 Naismith Dr, Lawrence, KS 66045 USA
Keywords
Explainability; Interpretability; Human-in-the-loop; Deep reinforcement learning; Machine learning; Behavioral machine learning; Decision support; EXPLANATIONS; ALGORITHMS; MODELS
DOI
10.1016/j.dss.2024.114304
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The use of machine learning (ML) models in decision-making contexts, particularly those used in high-stakes decision-making, is fraught with issues and peril, since a person, not a machine, must ultimately be held accountable for the consequences of decisions made using such systems. Machine learning explainability (MLX) promises to provide decision-makers with prediction-specific rationale, assuring them that the model-elicited predictions are made for the right reasons and are thus reliable. Few works explicitly consider this key human-in-the-loop (HITL) component, however. In this work we propose HEX, a human-in-the-loop deep reinforcement learning approach to MLX. HEX incorporates 0-distrust projection to synthesize decider-specific explainers that produce explanations strictly in terms of a decider's preferred explanatory features using any classification model. Our formulation explicitly considers the decision boundary of the ML model in question using a proposed explanatory point mode of explanation, thus ensuring explanations are specific to that model. We empirically evaluate HEX against competing methods, finding that HEX is competitive with the state-of-the-art and outperforms other methods in human-in-the-loop scenarios. We conduct a randomized, controlled laboratory experiment utilizing actual explanations elicited from both HEX and competing methods. We causally establish that our method increases deciders' trust and tendency to rely on trusted features.
Pages: 12
Related papers
50 in total
  • [41] Deep Reinforcement Learning Policy in Hex Game System
    Lu, Mengxuan
    Li, Xuejun
    PROCEEDINGS OF THE 30TH CHINESE CONTROL AND DECISION CONFERENCE (2018 CCDC), 2018, : 6623 - 6626
  • [42] Human-in-the-Loop Deep Learning Retinal Image Classification with Customized Loss Function
    Shakya, Suhev
    Vasquez, Mariana
    Wang, Yiyang
    Tchoua, Roselyne
    Furst, Jacob
    Raicu, Daniela
    MEDICAL IMAGING 2022: COMPUTER-AIDED DIAGNOSIS, 2022, 12033
  • [43] Uncertainty-Based Human-in-the-Loop Deep Learning for Land Cover Segmentation
    Garcia Rodriguez, Carlos
    Vitria, Jordi
    Mora, Oscar
    REMOTE SENSING, 2020, 12 (22) : 1 - 14
  • [44] Explainability in Deep Reinforcement Learning: A Review into Current Methods and Applications
    Hickling, Thomas
    Zenati, Abdelhafid
    Aouf, Nabil
    Spencer, Phillippa
    ACM COMPUTING SURVEYS, 2024, 56 (05)
  • [45] Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation
    Chen, Xiaoyu
    Zhong, Han
    Yang, Zhuoran
    Wang, Zhaoran
    Wang, Liwei
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022
  • [46] Learning to Navigate in Human Environments via Deep Reinforcement Learning
    Gao, Xingyuan
    Sun, Shiying
    Zhao, Xiaoguang
    Tan, Min
    NEURAL INFORMATION PROCESSING (ICONIP 2019), PT I, 2019, 11953 : 418 - 429
  • [47] Performance-Based Human-in-the-Loop Optimal Bipartite Consensus Control for Multi-Agent Systems via Reinforcement Learning
    Huang, Zongsheng
    Li, Tieshan
    Long, Yue
    Yang, Hanqing
    2024 14th International Conference on Information Science and Technology, ICIST 2024, 2024, : 497 - 502
  • [48] Continual learning classification method with human-in-the-loop
    Liu, Jia
    Li, Dong
    Shan, Wangweiyi
    Liu, Shulin
    METHODSX, 2023, 11
  • [49] Human-in-the-Loop Learning for Dynamic Congestion Games
    Li, Hongbo
    Duan, Lingjie
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (12) : 11159 - 11171
  • [50] Active Learning for Human-in-the-Loop Customs Inspection
    Kim, Sundong
    Mai, Tung-Duong
    Han, Sungwon
    Park, Sungwon
    Nguyen, D. K. Thi
    So, Jaechan
    Singh, Karandeep
    Cha, Meeyoung
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (12) : 12039 - 12052