ATTEXPLAINER: Explain Transformer via Attention by Reinforcement Learning

被引:0
|
作者
Niu, Runliang [1 ]
Wei, Zhepei [1 ]
Wang, Yan [1 ,2 ]
Wang, Qi [1 ]
机构
[1] Jilin Univ, Sch Artificial Intelligence, Changchun, Peoples R China
[2] Jilin Univ, Minist Educ, Coll Comp Sci & Technol, Key Lab Symbol Computat & Knowledge Engn, Changchun, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Transformer and its variants, built based on attention mechanisms, have recently achieved remarkable performance in many NLP tasks. Most existing works on Transformer explanation tend to reveal and utilize the attention matrix with human subjective intuitions in a qualitative manner. However, the huge size of dimensions directly challenges these methods to quantitatively analyze the attention matrix. Therefore, in this paper, we propose a novel reinforcement learning (RL) based framework for Transformer explanation via attention matrix, namely ATTEXPLAINER. The RL agent learns to perform step-by-step masking operations by observing the change in attention matrices. We have adapted our method to two scenarios, perturbation-based model explanation and text adversarial attack. Experiments on three widely used text classification benchmarks validate the effectiveness of the proposed method compared to state-of-the-art baselines. Additional studies show that our method is highly transferable and consistent with human intuition. The code of this paper is available at https://github.com/niuzaisheng/AttExplainer.
引用
收藏
页码:724 / 731
页数:8
相关论文
共 50 条
  • [41] Graph Transformer with Reinforcement Learning for Vehicle Routing Problem
    Fellek, Getu
    Farid, Ahmed
    Gebreyesus, Goytom
    Fujimura, Shigeru
    Yoshie, Osamu
    IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2023, 18 (05) : 701 - 713
  • [42] Transformer in reinforcement learning for decision-making: a survey
    Yuan, Weilin
    Chen, Jiaxing
    Chen, Shaofei
    Feng, Dawei
    Hu, Zhenzhen
    Li, Peng
    Zhao, Weiwei
    FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2024, 25 (06) : 763 - 790
  • [43] Transformer Dissection: An Unified Understanding for Transformer's Attention via the Lens of Kernel
    Tsai, Yao-Hung Hubert
    Bai, Shaojie
    Yamada, Makoto
    Morency, Louis-Philippe
    Salakhutdinov, Ruslan
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 4344 - 4353
  • [44] Dynamic Job-Shop Scheduling via Graph Attention Networks and Deep Reinforcement Learning
    Liu, Chien-Liang
    Tseng, Chun-Jan
    Weng, Po-Hao
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (06) : 8662 - 8672
  • [45] MEDIRL: Predicting the Visual Attention of Drivers via Maximum Entropy Deep Inverse Reinforcement Learning
    Baee, Sonia
    Pakdamanian, Erfan
    Kim, Inki
    Feng, Lu
    Ordonez, Vicente
    Barnes, Laura
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 13158 - 13168
  • [46] Flexible Job Shop Scheduling via Dual Attention Network-Based Reinforcement Learning
    Wang, Runqing
    Wang, Gang
    Sun, Jian
    Deng, Fang
    Chen, Jie
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 3091 - 3102
  • [47] Attention-Based Highway Safety Planner for Autonomous Driving via Deep Reinforcement Learning
    Chen, Guoxi
    Zhang, Ya
    Li, Xinde
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (01) : 162 - 175
  • [48] Stochastic Economic Lot Scheduling via Self-Attention Based Deep Reinforcement Learning
    Song, Wen
    Mi, Nan
    Li, Qiqiang
    Zhuang, Jing
    Cao, Zhiguang
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (02) : 1457 - 1468
  • [49] An Efficient Message Dissemination Scheme for Cooperative Drivings via Cooperative Hierarchical Attention Reinforcement Learning
    Liu, Bingyi
    Han, Weizhen
    Wang, Enshu
    Xiong, Shengwu
    Qiao, Chunming
    Wang, Jianping
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (05) : 5527 - 5542
  • [50] Explain Reinforcement Learning Agents Through Fuzzy Rule Reconstruction
    Ou, Liang
    Chang, Yu-Chen
    Wang, Yu-Kai
    Lin, Chin-Teng
    2023 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, FUZZ, 2023,