Interpretability and explainability of AI are becoming increasingly important in light of the rapid development of large language models (LLMs). This paper investigates the interpretability of LLMs in the context of knowledge-based question answering. The central hypothesis of the study is that correct and incorrect model behavior can be distinguished at the level of hidden states. The quantized models LLaMA-2-7B-Chat, Mistral-7B, and Vicuna-7B, together with the MuSeRC question-answering dataset, are used to test this hypothesis. The results of the analysis support the hypothesis. We also identify the layers that have a negative effect on the model's behavior. As a practical application of the hypothesis, we propose additionally fine-tuning such "weak" layers in order to improve the quality of the task solution.
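The abstract does not spell out an implementation, but the probing setup it describes can be sketched as follows: extract per-layer hidden states for QA prompts and fit a simple classifier per layer to separate correct from incorrect model behavior, with low-accuracy layers flagged as "weak." In the sketch below, the model name, last-token pooling, placeholder data, and the logistic-regression probe are all illustrative assumptions, not the authors' exact method; the paper's models are quantized, which would require passing a quantization config at load time.

```python
# Minimal per-layer probing sketch (assumptions: last-token pooling,
# logistic-regression probes, placeholder data instead of real MuSeRC items).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from sklearn.linear_model import LogisticRegression

MODEL = "meta-llama/Llama-2-7b-chat-hf"  # one of the three models studied

tokenizer = AutoTokenizer.from_pretrained(MODEL)
model = AutoModelForCausalLM.from_pretrained(
    MODEL,
    torch_dtype=torch.float16,
    device_map="auto",
    output_hidden_states=True,  # expose activations of every layer
)
model.eval()

@torch.no_grad()
def last_token_states(prompt: str):
    """Return one vector per layer: the hidden state of the final token."""
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    hidden = model(**inputs).hidden_states  # tuple: embeddings + one entry per layer
    return [h[0, -1].float().cpu().numpy() for h in hidden]

# Placeholders: in practice, render MuSeRC passage/question pairs as prompts
# and label each item 1 if the model answered it correctly, else 0.
prompts = ["<passage + question 1>", "<passage + question 2>"]
labels = [1, 0]

# Transpose to per-layer feature lists: per_layer[i] holds one vector per example.
per_layer = list(zip(*[last_token_states(p) for p in prompts]))

# Fit a probe per layer; with real data, score on a held-out split instead.
for layer_idx, feats in enumerate(per_layer):
    probe = LogisticRegression(max_iter=1000).fit(list(feats), labels)
    print(f"layer {layer_idx:2d}: probe accuracy = {probe.score(list(feats), labels):.2f}")
```

Under this reading, layers whose probes separate correct from incorrect behavior poorly would be the "weak" layers that the abstract proposes to fine-tune further.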