Local Interpretations for Explainable Natural Language Processing: A Survey

被引:7
|
作者
Luo, Siwen [1 ]
Ivison, Hamish [2 ]
Han, Soyeon Caren [3 ]
Poon, Josiah [4 ]
机构
[1] Univ Western Australia, 35 Stirling Hwy, Perth, WA 6009, Australia
[2] Univ Washington, 3800 E Stevens Way NE, Seattle, WA 98195 USA
[3] Univ Melbourne, 700 Swanston St, Melbourne, Vic 3010, Australia
[4] Univ Sydney, 1 Cleveland St, Darlington, NSW 2008, Australia
关键词
Deep neural networks; explainable AI; local interpretation; natural language processing; PREDICTION;
D O I
10.1145/3649450
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
As the use of deep learning techniques has grown across various fields over the past decade, complaints about the opaqueness of the black-box models have increased, resulting in an increased focus on transparency in deep learning models. This work investigates various methods to improve the interpretability of deep neural networks for Natural Language Processing (NLP) tasks, including machine translation and sentiment analysis. We provide a comprehensive discussion on the definition of the term interpretability and its various aspects at the beginning of this work. The methods collected and summarised in this survey are only associated with local interpretation and are specifically divided into three categories: (1) interpreting the model's predictions through related input features; (2) interpreting through natural language explanation; (3) probing the hidden states of models and word representations.
引用
收藏
页数:36
相关论文
共 50 条
  • [41] Pre-trained models for natural language processing: A survey
    QIU XiPeng
    SUN TianXiang
    XU YiGe
    SHAO YunFan
    DAI Ning
    HUANG XuanJing
    Science China(Technological Sciences), 2020, 63 (10) : 1872 - 1897
  • [42] 241Computational Politeness in Natural Language Processing: A Survey
    Priya, Priyanshu
    Firdaus, Mauajama
    Ekbal, Asif
    ACM COMPUTING SURVEYS, 2024, 56 (09)
  • [43] Natural Language Processing Meets Quantum Physics: A Survey and Categorization
    Wu, Sixuan
    Li, Jian
    Zhang, Peng
    Zhang, Yue
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 3172 - 3182
  • [44] A Survey on the Integration of Blockchain Smart Contracts and Natural Language Processing
    Song, Zikai
    Shen, Pengxu
    Liu, Chuan
    Liu, Chao
    Gao, Haoyu
    Lei, Hong
    PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND NETWORKS, VOL III, CENET 2023, 2024, 1127 : 467 - 477
  • [45] Natural Language Processing (NLP) based Text Summarization - A Survey
    Awasthi, Ishitva
    Gupta, Kuntal
    Bhogal, Prabjot Singh
    Anand, Sahejpreet Singh
    Soni, Piyush Kumar
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT 2021), 2021, : 1310 - 1317
  • [46] Natural language processing for similar languages, varieties, and dialects: A survey
    Zampieri, Marcos
    Nakov, Preslav
    Scherrer, Yves
    NATURAL LANGUAGE ENGINEERING, 2020, 26 (06) : 595 - 612
  • [47] Identification of Causal Dependencies by using Natural Language Processing: A Survey
    Nazaruka, Erika
    PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON EVALUATION OF NOVEL APPROACHES TO SOFTWARE ENGINEERING (ENASE), 2019, : 603 - 613
  • [48] Pre-trained models for natural language processing: A survey
    Qiu XiPeng
    Sun TianXiang
    Xu YiGe
    Shao YunFan
    Dai Ning
    Huang XuanJing
    SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2020, 63 (10) : 1872 - 1897
  • [49] XNLP: A Living Survey for XAI Research in Natural Language Processing
    Qian, Kun
    Danilevsky, Marina
    Katsis, Yannis
    Kawas, Ban
    Oduor, Erick
    Popa, Lucian
    Li, Yunyao
    26TH INTERNATIONAL CONFERENCE ON INTELLIGENT USER INTERFACES (IUI '21 COMPANION), 2021, : 78 - 80
  • [50] A Survey of Natural Language Processing Implementation for Data Query Systems
    Wong, Albert
    Joiner, Dakota
    Chiu, Chunyin
    Elsayed, Mohamed
    Pereira, Keegan
    Khmelevsky, Youry
    Mahony, Joe
    IEEE INTERNATIONAL CONFERENCE ON RECENT ADVANCES IN SYSTEMS SCIENCE AND ENGINEERING (IEEE RASSE 2021), 2021,