Exploring the reversal curse and other deductive logical reasoning in BERT and GPT-based large language models

被引：0

作者：

Wu, Da ^{[1
,2
]}

Yang, Jingye ^{[1
,2
]}

Wang, Kai ^{[1
,3
]}

机构：

[1] Childrens Hosp Philadelphia, Raymond G Perelman Ctr Cellular & Mol Therapeut, Philadelphia, PA 19104 USA

[2] Univ Penn, Dept Math, Philadelphia, PA 19104 USA

[3] Univ Penn, Dept Pathol & Lab Med, Philadelphia, PA 19104 USA

来源：

PATTERNS | 2024年 / 5卷 / 09期

关键词：

BACKWARD RECALL;

D O I：

10.1016/j.patter.2024.101030

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The "Reversal Curse"describes the inability of autoregressive decoder large language models (LLMs) to deduce "B is A"from "A is B,"assuming that B and A are distinct and can be uniquely identified from each other. This logical failure suggests limitations in using generative pretrained transformer (GPT) models for tasks like constructing knowledge graphs. Our study revealed that a bidirectional LLM, bidirectional encoder representations from transformers (BERT), does not suffer from this issue. To investigate further, we focused on more complex deductive reasoning by training encoder and decoder LLMs to perform union and intersection operations on sets. While both types of models managed tasks involving two sets, they struggled with operations involving three sets. Our findings underscore the differences between encoder and decoder models in handling logical reasoning. Thus, selecting BERT or GPT should depend on the task's specific needs, utilizing BERT's bidirectional context comprehension or GPT's sequence prediction strengths.

引用

页数：12

共 49 条

[1] Enhancing Neural Decoding with Large Language Models: A GPT-Based Approach
Lee, Dong Hyeok
Chung, Chun Kee
2024 12TH INTERNATIONAL WINTER CONFERENCE ON BRAIN-COMPUTER INTERFACE, BCI 2024, 2024,
[2] Exploring Reversal Mathematical Reasoning Ability for Large Language Models
Guo, Pei
You, Wangjie
Li, Juntao
Yan, Bowen
Zhang, Min
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 13671 - 13685
[3] Exploring a GPT-based large language model for variable autonomy in a VR-based human-robot teaming simulation
Lakhnati, Younes
Pascher, Max
Gerken, Jens
FRONTIERS IN ROBOTICS AND AI, 2024, 11
[4] NPGPT: natural product-like compound generation with GPT-based chemical language models
Sakano, Koh
Furui, Kairi
Ohue, Masahito
JOURNAL OF SUPERCOMPUTING, 2025, 81 (01):
[5] Testing the General Deductive Reasoning Capacity of Large Language Models Using OOD Examples
Saparov, Abulhair
Pang, Richard Yuanzhe
Padmakumar, Vishakh
Joshi, Nitish
Kazemi, Seyed Mehran
Kim, Najoung
He, He
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[6] Supporting Qualitative Analysis with Large Language Models: Combining Codebook with GPT-3 for Deductive Coding
Xiao, Ziang
Yuan, Xingdi
Liao, Q. Vera
Abdelghani, Rania
Oudeyer, Pierre-Yves
COMPANION PROCEEDINGS OF 2023 28TH ANNUAL CONFERENCE ON INTELLIGENT USER INTERFACES, IUI 2023 COMPANION, 2023, : 75 - 78
[7] ChatGPT: Where Is a Silver Lining? Exploring the realm of GPT and large language models
Tikhonova, Elena
Raitskaya, Lilia
JOURNAL OF LANGUAGE AND EDUCATION, 2023, 9 (03): : 5 - 11
[8] LogicBench: Towards Systematic Evaluation of Logical Reasoning Ability of Large Language Models
Parmar, Mihir
Patel, Nisarg
Varshney, Neeraj
Nakamura, Mutsumi
Luo, Man
Mashetty, Santosh
Mitra, Arindam
Baral, Chitta
PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 13679 - 13707
[9] Case-Based Reasoning with Language Models for Classification of Logical Fallacies
Sourati, Zhivar
Ilievski, Filip
Sandlin, Hong-An
Mermoud, Alain
PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 5188 - 5196
[10] KG-GPT: A General Framework for Reasoning on Knowledge Graphs Using Large Language Models
Kim, Jiho
Kwon, Yeonsu
Jo, Yohan
Choi, Edward
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 9410 - 9421

← 1 2 3 4 5 →