Relationalizing Tables with Large Language Models: The Promise and Challenges

被引:0
|
作者
Huang, Zezhou [1 ]
Wu, Eugene [2 ]
机构
[1] Columbia Univ, New York, NY 10027 USA
[2] Columbia Univ, DSI, New York, NY 10027 USA
基金
美国国家科学基金会;
关键词
Large Language Model; Data Transformation; Prompt Engineering; Data Management;
D O I
10.1109/ICDEW61823.2024.00045
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Tables in the wild are usually not relationalized, making querying them difficult. To relationalize tables, recent works designed seven transformation operators, and deep neural networks were adopted to automatically find the sequence of operators, achieving an accuracy of 57.0%. In comparison, earlier versions of large language models like GPT-3.5 only reached 13.1%. However, these results were obtained using naive prompts. Furthermore, GPT-4 is recently available, which is substantially larger and more performant. This study examines how the selection of models, specifically GPT-3.5 and GPT-4, and various prompting strategies, such as Chain-of-Thought and task decomposition, affect accuracy. The main finding is that GPT-4, combined with Task Decomposition and Chain-of-Thought, attains a remarkable accuracy of 74.6%. Further analysis of errors made by GPT-4 shows the challenges that about half of the errors are not due to the model's shortcomings, but rather to ambiguities in the benchmarks. When these benchmarks are disambiguated, GPT-4's accuracy improves to 86.9%.
引用
收藏
页码:305 / 309
页数:5
相关论文
共 50 条
  • [1] Open-source large language models in medical education: Balancing promise and challenges
    Ray, Partha Pratim
    ANATOMICAL SCIENCES EDUCATION, 2024, 17 (06) : 1361 - 1362
  • [2] The promise of large language models in health care
    Arora, Anmol
    Arora, Ananya
    LANCET, 2023, 401 (10377): : 641 - 642
  • [3] The promise of AI Large Language Models for Epilepsy care
    Landais, Raphaelle
    Sultan, Mustafa
    Thomas, Rhys H.
    EPILEPSY & BEHAVIOR, 2024, 154
  • [4] From promise to practice: challenges and pitfalls in the evaluation of large language models for data extraction in evidence synthesis
    Gartlehner, Gerald
    Kahwati, Leila
    Nussbaumer-Streit, Barbara
    Crotty, Karen
    Hilscher, Rainer
    Kugley, Shannon
    Viswanathan, Meera
    Thomas, Ian
    Konet, Amanda
    Booth, Graham
    Chew, Robert
    BMJ EVIDENCE-BASED MEDICINE, 2024,
  • [5] Art or Artifice? Large Language Models and the False Promise of Creativity
    Chakrabarty, Tuhin
    Laban, Philippe
    Agarwal, Divyansh
    Muresan, Smaranda
    Wu, Chien-Sheng
    PROCEEDINGS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYTEMS (CHI 2024), 2024,
  • [6] Benchmarking Large Language Models: Opportunities and Challenges
    Hodak, Miro
    Ellison, David
    Van Buren, Chris
    Jiang, Xiaotong
    Dholakia, Ajay
    PERFORMANCE EVALUATION AND BENCHMARKING, TPCTC 2023, 2024, 14247 : 77 - 89
  • [7] Ethical and Theological Challenges of Large Language Models
    Strahornik, Vojko
    BOGOSLOVNI VESTNIK-THEOLOGICAL QUARTERLY-EPHEMERIDES THEOLOGICAE, 2023, 83 (04): : 839 - 852
  • [8] MULTILINGUAL JAILBREAK CHALLENGES IN LARGE LANGUAGE MODELS
    Deng, Yue
    Zhang, Wenxuan
    Pan, Sinno Jialin
    Bing, Lidong
    arXiv, 2023,
  • [9] Large language models in psychiatry: Opportunities and challenges
    Volkmer, Sebastian
    Meyer-Lindenberg, Andreas
    Schwarz, Emanuel
    PSYCHIATRY RESEARCH, 2024, 339
  • [10] Harnessing the potential of large language models in medical education: promise and pitfalls
    Benitez, Trista M.
    Xu, Yueyuan
    Boudreau, J. Donald
    Kow, Alfred Wei Chieh
    Bello, Fernando
    Phuoc, Le Van
    Wang, Xiaofei
    Sun, Xiaodong
    Leung, Gilberto Ka-Kit
    Lan, Yanyan
    Wang, Yaxing
    Cheng, Davy
    Tham, Yih-Chung
    Wong, Tien Yin
    Chung, Kevin C.
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2024, 31 (03) : 776 - 783