Towards Analysis and Interpretation of Large Language Models for Arithmetic Reasoning

被引:0
|
作者
Akter, Mst Shapna [1 ]
Shahriar, Hossain [2 ]
Cuzzocrea, Alfredo [3 ,4 ]
机构
[1] Univ West Florida, Dept Intelligent Syst & Robot, Pensacola, FL 32514 USA
[2] Univ West Florida, Ctr Cybersecur, Pensacola, FL USA
[3] Univ Calabria, iDEA Lab, Arcavacata Di Rende, Italy
[4] Univ Paris City, Dept Comp Sci, Paris, France
关键词
LLMs; Arithmetic Reasoning; Causal Mediation Analysis;
D O I
10.1109/SDS60720.2024.00049
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large Language Models (LLMs) have recently conquered the research scene, with particular regards to the Transformer architecture in the context of arithmetic reasoning. In this so-delineated scenario, this paper puts the basis for a causal mediation analysis about the approach of Transformer-based LLMs to complex arithmetic problems. In particular, we try to discover which parameters are crucial for complex reasoning tasks such as model activations. Our preliminary results state that, for complex arithmetic operations, information is channeled from mid-layer activations to the final token through enhanced attention mechanisms. Preliminary experiments are reported.
引用
收藏
页码:267 / 270
页数:4
相关论文
共 50 条
  • [1] A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis
    Stolfo, Alessandro
    Belinkov, Yonatan
    Sachan, Mrinmaya
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 7035 - 7052
  • [2] Towards Reasoning in Large Language Models: A Survey
    Huang, Jie
    Chang, Kevin Chen-Chuan
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 1049 - 1065
  • [3] Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models
    Hou, Yifan
    Li, Jiaoda
    Fei, Yu
    Stolfo, Alessandro
    Zhou, Wangchunshu
    Zeng, Guangtao
    Bosselut, Antoine
    Sachan, Mrinmaya
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 4902 - 4919
  • [4] Towards Benchmarking and Improving the Temporal Reasoning Capability of Large Language Models
    Tan, Qingyu
    Ng, Hwee Tou
    Bing, Lidong
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 14820 - 14835
  • [5] LogicBench: Towards Systematic Evaluation of Logical Reasoning Ability of Large Language Models
    Parmar, Mihir
    Patel, Nisarg
    Varshney, Neeraj
    Nakamura, Mutsumi
    Luo, Man
    Mashetty, Santosh
    Mitra, Arindam
    Baral, Chitta
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 13679 - 13707
  • [6] Large Language Models Are Reasoning Teachers
    Ho, Namgyu
    Schmid, Laura
    Yun, Se-Young
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 14852 - 14882
  • [7] Benchmarking Large Language Models for Log Analysis, Security, and Interpretation
    Karlsen, Egil
    Luo, Xiao
    Zincir-Heywood, Nur
    Heywood, Malcolm
    JOURNAL OF NETWORK AND SYSTEMS MANAGEMENT, 2024, 32 (03)
  • [8] An Evaluation of Reasoning Capabilities of Large Language Models in Financial Sentiment Analysis
    Du, Kelvin
    Xing, Frank
    Mao, Rui
    Cambria, Erik
    2024 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI 2024, 2024, : 189 - 194
  • [9] Are Large Language Models Capable of Causal Reasoning for Sensing Data Analysis?
    Hu, Zhizhang
    Zhang, Yue
    Rossi, Ryan
    Yu, Tong
    Kim, Sungchul
    Pan, Shijia
    PROCEEDINGS OF THE 2024 WORKSHOP ON EDGE AND MOBILE FOUNDATION MODELS, EDGEFM 2024, 2024, : 24 - 29
  • [10] Uncovering the Interpretation of Large Language Models
    Akter, Mst Shapna
    Shahriar, Hossain
    Cuzzocrea, Alfredo
    Wu, Fan
    2024 IEEE 48TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE, COMPSAC 2024, 2024, : 1057 - 1066