Exploring the reversal curse and other deductive logical reasoning in BERT and GPT-based large language models

被引:0
|
作者
Wu, Da [1 ,2 ]
Yang, Jingye [1 ,2 ]
Wang, Kai [1 ,3 ]
机构
[1] Childrens Hosp Philadelphia, Raymond G Perelman Ctr Cellular & Mol Therapeut, Philadelphia, PA 19104 USA
[2] Univ Penn, Dept Math, Philadelphia, PA 19104 USA
[3] Univ Penn, Dept Pathol & Lab Med, Philadelphia, PA 19104 USA
来源
PATTERNS | 2024年 / 5卷 / 09期
关键词
BACKWARD RECALL;
D O I
10.1016/j.patter.2024.101030
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The "Reversal Curse"describes the inability of autoregressive decoder large language models (LLMs) to deduce "B is A"from "A is B,"assuming that B and A are distinct and can be uniquely identified from each other. This logical failure suggests limitations in using generative pretrained transformer (GPT) models for tasks like constructing knowledge graphs. Our study revealed that a bidirectional LLM, bidirectional encoder representations from transformers (BERT), does not suffer from this issue. To investigate further, we focused on more complex deductive reasoning by training encoder and decoder LLMs to perform union and intersection operations on sets. While both types of models managed tasks involving two sets, they struggled with operations involving three sets. Our findings underscore the differences between encoder and decoder models in handling logical reasoning. Thus, selecting BERT or GPT should depend on the task's specific needs, utilizing BERT's bidirectional context comprehension or GPT's sequence prediction strengths.
引用
收藏
页数:12
相关论文
共 49 条
  • [1] Enhancing Neural Decoding with Large Language Models: A GPT-Based Approach
    Lee, Dong Hyeok
    Chung, Chun Kee
    2024 12TH INTERNATIONAL WINTER CONFERENCE ON BRAIN-COMPUTER INTERFACE, BCI 2024, 2024,
  • [2] Exploring Reversal Mathematical Reasoning Ability for Large Language Models
    Guo, Pei
    You, Wangjie
    Li, Juntao
    Yan, Bowen
    Zhang, Min
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 13671 - 13685
  • [3] Exploring a GPT-based large language model for variable autonomy in a VR-based human-robot teaming simulation
    Lakhnati, Younes
    Pascher, Max
    Gerken, Jens
    FRONTIERS IN ROBOTICS AND AI, 2024, 11
  • [4] NPGPT: natural product-like compound generation with GPT-based chemical language models
    Sakano, Koh
    Furui, Kairi
    Ohue, Masahito
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (01):
  • [5] Testing the General Deductive Reasoning Capacity of Large Language Models Using OOD Examples
    Saparov, Abulhair
    Pang, Richard Yuanzhe
    Padmakumar, Vishakh
    Joshi, Nitish
    Kazemi, Seyed Mehran
    Kim, Najoung
    He, He
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [6] Supporting Qualitative Analysis with Large Language Models: Combining Codebook with GPT-3 for Deductive Coding
    Xiao, Ziang
    Yuan, Xingdi
    Liao, Q. Vera
    Abdelghani, Rania
    Oudeyer, Pierre-Yves
    COMPANION PROCEEDINGS OF 2023 28TH ANNUAL CONFERENCE ON INTELLIGENT USER INTERFACES, IUI 2023 COMPANION, 2023, : 75 - 78
  • [7] ChatGPT: Where Is a Silver Lining? Exploring the realm of GPT and large language models
    Tikhonova, Elena
    Raitskaya, Lilia
    JOURNAL OF LANGUAGE AND EDUCATION, 2023, 9 (03): : 5 - 11
  • [8] LogicBench: Towards Systematic Evaluation of Logical Reasoning Ability of Large Language Models
    Parmar, Mihir
    Patel, Nisarg
    Varshney, Neeraj
    Nakamura, Mutsumi
    Luo, Man
    Mashetty, Santosh
    Mitra, Arindam
    Baral, Chitta
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 13679 - 13707
  • [9] Case-Based Reasoning with Language Models for Classification of Logical Fallacies
    Sourati, Zhivar
    Ilievski, Filip
    Sandlin, Hong-An
    Mermoud, Alain
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 5188 - 5196
  • [10] KG-GPT: A General Framework for Reasoning on Knowledge Graphs Using Large Language Models
    Kim, Jiho
    Kwon, Yeonsu
    Jo, Yohan
    Choi, Edward
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 9410 - 9421