Towards Text-to-SQL over Aggregate Tables

被引:1
|
作者
Li, Shuqin [1 ]
Zhou, Kaibin [2 ]
Zhuang, Zeyang [2 ]
Wang, Haofen [1 ]
Ma, Jun [3 ]
机构
[1] Tongji Univ, Coll Design & Innovat, Shanghai 200092, Peoples R China
[2] Tongji Univ, Sch Software, Shanghai 201804, Peoples R China
[3] Tongji Univ, Sch Automot Studies, Shanghai 201804, Peoples R China
关键词
Text-to-SQL; Question Answering; Business Intelligence; Deep Learning;
D O I
10.1162/dint_a_00194
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Text-to-SQL aims at translating textual questions into the corresponding SQL queries. Aggregate tables are widely created for high-frequent queries. Although text-to-SQL has emerged as an important task, recent studies paid little attention to the task over aggregate tables. The increased aggregate tables bring two challenges: (1) mapping of natural language questions and relational databases will suffer from more ambiguity, (2) modern models usually adopt self-attention mechanism to encode database schema and question. The mechanism is of quadratic time complexity, which will make inferring more time-consuming as input sequence length grows. In this paper, we introduce a novel approach named WAGG for text-to-SQL over aggregate tables. To effectively select among ambiguous items, we propose a relation selection mechanism for relation computing. To deal with high computation costs, we introduce a dynamical pruning strategy to discard unrelated items that are common for aggregate tables. We also construct a new large-scale dataset SpiderwAGG extended from Spider dataset for validation, where extensive experiments show the effectiveness and efficiency of our proposed method with 4% increase of accuracy and 15% decrease of inference time w.r.t a strong baseline RAT-SQL.
引用
收藏
页码:457 / 474
页数:18
相关论文
共 50 条
  • [41] Graph Reasoning Enhanced Language Models for Text-to-SQL
    Gong, Zheng
    Sun, Ying
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 2447 - 2451
  • [42] Exploring Chain of Thought Style Prompting for Text-to-SQL
    Tai, Chang-You
    Chen, Ziru
    Zhang, Tianshu
    Deng, Xiang
    Sun, Huan
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023, 2023, : 5376 - 5393
  • [43] A Review of Cross-Domain Text-to-SQL Models
    Gan, Yujian
    Purver, Matthew
    Woodward, John R.
    AACL-IJCNLP 2020: THE 1ST CONFERENCE OF THE ASIA-PACIFIC CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 10TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING: PROCEEDINGS OF THE STUDENT RESEARCH WORKSHOP, 2020, : 101 - 108
  • [44] Thai Question Text-To-SQL Parsing Using Transformer
    Tungruethaipak, Natthawat
    Prom-on, Santitham
    2024 21ST INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING, JCSSE 2024, 2024, : 631 - 637
  • [45] Text-to-SQL Error Correction with Language Models of Code
    Chen, Ziru
    Chen, Shijie
    White, Michael
    Mooney, Raymond
    Payani, Ali
    Srinivasa, Jayanth
    Su, Yu
    Sun, Huan
    61ST CONFERENCE OF THE THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 2, 2023, : 1359 - 1372
  • [46] Multitask Pretraining with Structured Knowledge for Text-to-SQL Generation
    Giaquinto, Robert
    Zhang, Dejiao
    Kleiner, Benjamin
    Li, Yang
    Tan, Ming
    Bhatia, Parminder
    Nallapati, Ramesh
    Ma, Xiaofei
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 11067 - 11083
  • [47] Bridging the Generalization Gap in Text-to-SQL Parsing with Schema Expansion
    Zhao, Chen
    Su, Yu
    Pauls, Adam
    Platanios, Emmanouil Antonios
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 5568 - 5578
  • [48] A Heterogeneous Graph to Abstract Syntax Tree Framework for Text-to-SQL
    Cao, Ruisheng
    Chen, Lu
    Li, Jieyu
    Zhang, Hanchong
    Xu, Hongshen
    Zhang, Wangyou
    Yu, Kai
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) : 13796 - 13813
  • [49] Synthesizing Text-to-SQL Data from Weak and Strong LLMs
    Yang, Jiaxi
    Hui, Binyuan
    Yang, Min
    Yang, Jian
    Lin, Junyang
    Zhou, Chang
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 7864 - 7875
  • [50] CoSQL: A Conversational Text-to-SQL Challenge Towards Cross-Domain Natural Language Interfaces to Databases
    Yu, Tao
    Zhang, Rui
    Er, He Yang
    Li, Suyi
    Xue, Eric
    Pang, Bo
    Lin, Xi Victoria
    Tan, Yi Chern
    Shi, Tianze
    Li, Zihan
    Jiang, Youxuan
    Yasunaga, Michihiro
    Shim, Sungrok
    Chen, Tao
    Fabbri, Alexander
    Li, Zifan
    Chen, Luyao
    Zhang, Yuwen
    Dixit, Shreya
    Zhang, Vincent
    Xiong, Caiming
    Socher, Richard
    Lasecki, Walter S.
    Radev, Dragomir
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 1962 - 1979