Relationalizing Tables with Large Language Models: The Promise and Challenges

被引:0
|
作者
Huang, Zezhou [1 ]
Wu, Eugene [2 ]
机构
[1] Columbia Univ, New York, NY 10027 USA
[2] Columbia Univ, DSI, New York, NY 10027 USA
基金
美国国家科学基金会;
关键词
Large Language Model; Data Transformation; Prompt Engineering; Data Management;
D O I
10.1109/ICDEW61823.2024.00045
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Tables in the wild are usually not relationalized, making querying them difficult. To relationalize tables, recent works designed seven transformation operators, and deep neural networks were adopted to automatically find the sequence of operators, achieving an accuracy of 57.0%. In comparison, earlier versions of large language models like GPT-3.5 only reached 13.1%. However, these results were obtained using naive prompts. Furthermore, GPT-4 is recently available, which is substantially larger and more performant. This study examines how the selection of models, specifically GPT-3.5 and GPT-4, and various prompting strategies, such as Chain-of-Thought and task decomposition, affect accuracy. The main finding is that GPT-4, combined with Task Decomposition and Chain-of-Thought, attains a remarkable accuracy of 74.6%. Further analysis of errors made by GPT-4 shows the challenges that about half of the errors are not due to the model's shortcomings, but rather to ambiguities in the benchmarks. When these benchmarks are disambiguated, GPT-4's accuracy improves to 86.9%.
引用
收藏
页码:305 / 309
页数:5
相关论文
共 50 条
  • [21] Large Language Models in Medicine: Addressing Ethical Challenges
    Chang, Yin-Hsi
    Ong, Jasmine Chiat Ling
    William, Wasswa
    Butte, Atul J.
    Shah, Nigam H.
    Chew, Lita Sui Tjien
    Liu, Nan
    Doshi-Velez, Finale
    Lu, Wei
    Savulescu, Julian
    Ting, Daniel
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2024, 65 (07)
  • [22] Open-Domain Question Answering over Tables with Large Language Models
    Liang, Xinyi
    Hu, Rui
    Liu, Yu
    Zhu, Konglin
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT XII, ICIC 2024, 2024, 14873 : 347 - 358
  • [23] How Well Do Large Language Models Understand Tables in Materials Science?
    Circi, Defne
    Khalighinejad, Ghazal
    Chen, Anlan
    Dhingra, Bhuwan
    Brinson, L. Catherine
    INTEGRATING MATERIALS AND MANUFACTURING INNOVATION, 2024, 13 (03) : 669 - 687
  • [24] Advancing Patient Education in Idiopathic Intracranial Hypertension The Promise of Large Language Models
    Dihan, Qais A.
    Brown, Andrew D.
    Zaldivar, Ana T.
    Chauhan, Muhammad Z.
    Eleiwa, Taher K.
    Hassan, Amr K.
    Solyman, Omar
    Gise, Ryan
    Phillips, Paul H.
    Sallam, Ahmed B.
    Elhusseiny, Abdelrahman M.
    NEUROLOGY-CLINICAL PRACTICE, 2025, 15 (01)
  • [25] The Promise and Challenge of Large Language Models for Knowledge Engineering: Insights from a Hackathon
    Walker, Johanna
    Koutsiana, Elisavet
    Nwachukwu, Michelle
    Merono-Penuela, Albert
    Simperl, Elena
    EXTENDED ABSTRACTS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, CHI 2024, 2024,
  • [26] Embracing Large Language Models for Medical Applications: Opportunities and Challenges
    Karabacak, Mert
    Margetis, Konstantinos
    CUREUS JOURNAL OF MEDICAL SCIENCE, 2023, 15 (05)
  • [27] Large Language Models in Health Systems: Governance, Challenges, and Solutions
    Tripathi, Satvik
    Mongeau, Kyle
    Alkhulaifat, Dana
    Elahi, Ameena
    Cook, Tessa S.
    ACADEMIC RADIOLOGY, 2025, 32 (03) : 1189 - 1191
  • [28] Large language models for building energy applications: Opportunities and challenges
    Liu, Mingzhe
    Zhang, Liang
    Chen, Jianli
    Chen, Wei-An
    Yang, Zhiyao
    Lo, L. James
    Wen, Jin
    O'Neill, Zheng
    BUILDING SIMULATION, 2025, 18 (02) : 225 - 234
  • [29] Large Language Models for Networking: Applications, Enabling Techniques, and Challenges
    Huang, Yudong
    Du, Hongyang
    Zhang, Xinyuan
    Niyato, Dusit
    Kang, Jiawen
    Xiong, Zehui
    Wang, Shuo
    Huang, Tao
    IEEE NETWORK, 2025, 39 (01): : 235 - 242
  • [30] Utilizing Large Language Models in Ophthalmology: The Current Landscape and Challenges
    Chotcomwongse, Peranut
    Ruamviboonsuk, Paisan
    Grzybowski, Andrzej
    OPHTHALMOLOGY AND THERAPY, 2024, 13 (10) : 2543 - 2558