ATTEXPLAINER: Explain Transformer via Attention by Reinforcement Learning

被引：0

作者：

Niu, Runliang ^{[1
]}

Wei, Zhepei ^{[1
]}

Wang, Yan ^{[1
,2
]}

Wang, Qi ^{[1
]}

机构：

[1] Jilin Univ, Sch Artificial Intelligence, Changchun, Peoples R China

[2] Jilin Univ, Minist Educ, Coll Comp Sci & Technol, Key Lab Symbol Computat & Knowledge Engn, Changchun, Peoples R China

来源：

PROCEEDINGS OF THE THIRTY-FIRST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2022 | 2022年

基金：

中国国家自然科学基金;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Transformer and its variants, built based on attention mechanisms, have recently achieved remarkable performance in many NLP tasks. Most existing works on Transformer explanation tend to reveal and utilize the attention matrix with human subjective intuitions in a qualitative manner. However, the huge size of dimensions directly challenges these methods to quantitatively analyze the attention matrix. Therefore, in this paper, we propose a novel reinforcement learning (RL) based framework for Transformer explanation via attention matrix, namely ATTEXPLAINER. The RL agent learns to perform step-by-step masking operations by observing the change in attention matrices. We have adapted our method to two scenarios, perturbation-based model explanation and text adversarial attack. Experiments on three widely used text classification benchmarks validate the effectiveness of the proposed method compared to state-of-the-art baselines. Additional studies show that our method is highly transferable and consistent with human intuition. The code of this paper is available at https://github.com/niuzaisheng/AttExplainer.

引用

页码：724 / 731

页数：8

共 50 条

[41] Graph Transformer with Reinforcement Learning for Vehicle Routing Problem
Fellek, Getu
Farid, Ahmed
Gebreyesus, Goytom
Fujimura, Shigeru
Yoshie, Osamu
IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2023, 18 (05) : 701 - 713
[42] Transformer in reinforcement learning for decision-making: a survey
Yuan, Weilin
Chen, Jiaxing
Chen, Shaofei
Feng, Dawei
Hu, Zhenzhen
Li, Peng
Zhao, Weiwei
FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2024, 25 (06) : 763 - 790
[43] Transformer Dissection: An Unified Understanding for Transformer's Attention via the Lens of Kernel
Tsai, Yao-Hung Hubert
Bai, Shaojie
Yamada, Makoto
Morency, Louis-Philippe
Salakhutdinov, Ruslan
2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 4344 - 4353
[44] Dynamic Job-Shop Scheduling via Graph Attention Networks and Deep Reinforcement Learning
Liu, Chien-Liang
Tseng, Chun-Jan
Weng, Po-Hao
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (06) : 8662 - 8672
[45] MEDIRL: Predicting the Visual Attention of Drivers via Maximum Entropy Deep Inverse Reinforcement Learning
Baee, Sonia
Pakdamanian, Erfan
Kim, Inki
Feng, Lu
Ordonez, Vicente
Barnes, Laura
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 13158 - 13168
[46] Flexible Job Shop Scheduling via Dual Attention Network-Based Reinforcement Learning
Wang, Runqing
Wang, Gang
Sun, Jian
Deng, Fang
Chen, Jie
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 3091 - 3102
[47] Attention-Based Highway Safety Planner for Autonomous Driving via Deep Reinforcement Learning
Chen, Guoxi
Zhang, Ya
Li, Xinde
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (01) : 162 - 175
[48] Stochastic Economic Lot Scheduling via Self-Attention Based Deep Reinforcement Learning
Song, Wen
Mi, Nan
Li, Qiqiang
Zhuang, Jing
Cao, Zhiguang
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (02) : 1457 - 1468
[49] An Efficient Message Dissemination Scheme for Cooperative Drivings via Cooperative Hierarchical Attention Reinforcement Learning
Liu, Bingyi
Han, Weizhen
Wang, Enshu
Xiong, Shengwu
Qiao, Chunming
Wang, Jianping
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (05) : 5527 - 5542
[50] Explain Reinforcement Learning Agents Through Fuzzy Rule Reconstruction
Ou, Liang
Chang, Yu-Chen
Wang, Yu-Kai
Lin, Chin-Teng
2023 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, FUZZ, 2023,

← 1 2 3 4 5 →