Exploring the reversal curse and other deductive logical reasoning in BERT and GPT-based large language models

被引:0
|
作者
Wu, Da [1 ,2 ]
Yang, Jingye [1 ,2 ]
Wang, Kai [1 ,3 ]
机构
[1] Childrens Hosp Philadelphia, Raymond G Perelman Ctr Cellular & Mol Therapeut, Philadelphia, PA 19104 USA
[2] Univ Penn, Dept Math, Philadelphia, PA 19104 USA
[3] Univ Penn, Dept Pathol & Lab Med, Philadelphia, PA 19104 USA
来源
PATTERNS | 2024年 / 5卷 / 09期
关键词
BACKWARD RECALL;
D O I
10.1016/j.patter.2024.101030
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The "Reversal Curse"describes the inability of autoregressive decoder large language models (LLMs) to deduce "B is A"from "A is B,"assuming that B and A are distinct and can be uniquely identified from each other. This logical failure suggests limitations in using generative pretrained transformer (GPT) models for tasks like constructing knowledge graphs. Our study revealed that a bidirectional LLM, bidirectional encoder representations from transformers (BERT), does not suffer from this issue. To investigate further, we focused on more complex deductive reasoning by training encoder and decoder LLMs to perform union and intersection operations on sets. While both types of models managed tasks involving two sets, they struggled with operations involving three sets. Our findings underscore the differences between encoder and decoder models in handling logical reasoning. Thus, selecting BERT or GPT should depend on the task's specific needs, utilizing BERT's bidirectional context comprehension or GPT's sequence prediction strengths.
引用
收藏
页数:12
相关论文
共 49 条
  • [41] ChatCoT: Tool-Augmented Chain-of-Thought Reasoning on Chat-based Large Language Models
    Chen, Zhipeng
    Zhou, Kun
    Zhang, Beichen
    Gong, Zheng
    Zhao, Wayne Xin
    Wen, Ji-Rong
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 14777 - 14790
  • [42] Revolutionizing Cyber Threat Detection With Large Language Models: A Privacy-Preserving BERT-Based Lightweight Model for IoT/IIoT Devices
    Ferrag, Mohamed Amine
    Ndhlovu, Mthandazo
    Tihanyi, Norbert
    Cordeiro, Lucas C.
    Debbah, Merouane
    Lestable, Thierry
    Thandi, Narinderjit Singh
    IEEE ACCESS, 2024, 12 : 23733 - 23750
  • [43] Exploring the Application of Large Language Models Based AI Agents in Leakage Detection of Natural Gas Valve Chambers
    Wei, Qian
    Sun, Hongjun
    Xu, Yin
    Pang, Zisheng
    Gao, Feixiang
    ENERGIES, 2024, 17 (22)
  • [44] Large Language Models are Few-shot Testers: Exploring LLM-based General Bug Reproduction
    Kang, Sungmin
    Yoon, Juyeon
    Yoo, Shin
    2023 IEEE/ACM 45TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ICSE, 2023, : 2312 - 2323
  • [45] Exploring Large-Scale Language Models to Evaluate EEG-Based Multimodal Data for Mental Health
    Hu, Yongquan
    Zhang, Shuning
    Dang, Ting
    Jia, Hong
    Salim, Flora D.
    Hu, Wen
    Quigley, Aaron J.
    COMPANION OF THE 2024 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING, UBICOMP COMPANION 2024, 2024, : 412 - 417
  • [46] Evaluating large language models for surgical chart review of second stage implant-based breast reconstruction: a comparative analysis of manual review, GPT-3.5 Turbo, and GPT-4 Turbo
    Lakhlani, Devi
    Dadhania, Dhruv
    Nazerali, Rahim
    EUROPEAN JOURNAL OF PLASTIC SURGERY, 2025, 48 (01)
  • [47] Exploring large language models for the generation of synthetic training samples for aspect-based sentiment analysis in low resource settings
    Hellwig, Nils Constantin
    Fehle, Jakob
    Wolff, Christian
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 261
  • [48] Exploring automated energy optimization with unstructured building data: A multi-agent based framework leveraging large language models
    Xiao, Tong
    Xu, Peng
    ENERGY AND BUILDINGS, 2024, 322
  • [49] Exploring the ability of emerging large language models to detect cyberbullying in social posts through new prompt-based classification approaches
    Cirillo, Stefano
    Desiato, Domenico
    Polese, Giuseppe
    Solimando, Giandomenico
    Sugumaran, Vijayan
    Sundaramurthy, Shanmugam
    INFORMATION PROCESSING & MANAGEMENT, 2025, 62 (03)