Tables in the wild are usually not relationalized, making them difficult to query. To relationalize tables, recent work designed seven transformation operators, and deep neural networks were trained to automatically predict the sequence of operators to apply, achieving an accuracy of 57.0%. In comparison, earlier large language models such as GPT-3.5 reached only 13.1%; however, those results were obtained with naive prompts. Moreover, GPT-4, which is substantially larger and more performant, has recently become available. This study examines how the choice of model (GPT-3.5 vs. GPT-4) and prompting strategies such as Chain-of-Thought and task decomposition affect accuracy. The main finding is that GPT-4, combined with task decomposition and Chain-of-Thought, attains a remarkable accuracy of 74.6%. Further analysis of GPT-4's errors reveals that about half of them stem not from the model's shortcomings but from ambiguities in the benchmarks; once these benchmarks are disambiguated, GPT-4's accuracy improves to 86.9%.
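To make the combined strategy concrete, the following is a minimal Python sketch of how task decomposition and Chain-of-Thought prompting might be wired together: rather than asking the model for the full operator sequence at once, the task is decomposed into one operator prediction per step, with the model prompted to reason before answering. The function `call_llm`, the parsing logic, and the operator names are illustrative assumptions, not the study's actual implementation.

```python
# Sketch: task decomposition + Chain-of-Thought for operator prediction.
# The model predicts one transformation operator at a time; the prompt asks
# for step-by-step reasoning before the final answer.

OPERATORS = ["stack", "transpose", "pivot", "none"]  # illustrative subset, not the full set of seven

COT_PROMPT = """You are given a table (first rows shown below):
{table_preview}

Think step by step about the table's layout, then answer with exactly one
operator from {operators} that moves the table closer to relational form,
or "none" if it is already relational.
Reasoning:"""


def call_llm(prompt: str) -> str:
    """Placeholder for an LLM call (e.g., GPT-4 via an API client)."""
    raise NotImplementedError


def parse_operator(response: str) -> str:
    """Take the last operator mentioned in the CoT response as the answer."""
    mentioned = [op for op in OPERATORS if op in response.lower()]
    return mentioned[-1] if mentioned else "none"


def predict_operator_sequence(table_preview: str, max_steps: int = 5) -> list[str]:
    """Decompose the task: predict one operator per step until 'none'."""
    sequence = []
    for _ in range(max_steps):
        prompt = COT_PROMPT.format(table_preview=table_preview, operators=OPERATORS)
        op = parse_operator(call_llm(prompt))
        if op == "none":
            break
        sequence.append(op)
        # A full pipeline would apply the operator here and re-serialize the
        # transformed table into table_preview before the next step.
    return sequence
```

The design choice illustrated is that each LLM call handles a single, simpler subproblem (choose one operator for the current table state), which is the essence of task decomposition, while the Chain-of-Thought instruction elicits intermediate reasoning before the answer.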