Towards Text-to-SQL over Aggregate Tables

被引:0
|
作者
Shuqin Li [1 ]
Kaibin Zhou [2 ]
Zeyang Zhuang [2 ]
Haofen Wang [1 ]
Jun Ma [3 ]
机构
[1] College of Design and Innovation, Tongji University
[2] School of Software, Tongji University
[3] School of Automotive Studies, Tongji
关键词
D O I
暂无
中图分类号
TP391.1 [文字信息处理]; TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text-to-SQL aims at translating textual questions into the corresponding SQL queries. Aggregate tables are widely created for high-frequent queries. Although text-to-SQL has emerged as an important task, recent studies paid little attention to the task over aggregate tables. The increased aggregate tables bring two challenges:(1) mapping of natural language questions and relational databases will suffer from more ambiguity,(2) modern models usually adopt self-attention mechanism to encode database schema and question. The mechanism is of quadratic time complexity, which will make inferring more time-consuming as input sequence length grows. In this paper, we introduce a novel approach named WAGG for text-to-SQL over aggregate tables. To effectively select among ambiguous items, we propose a relation selection mechanism for relation computing. To deal with high computation costs, we introduce a dynamical pruning strategy to discard unrelated items that are common for aggregate tables. We also construct a new large-scale dataset Spiderw AGG extended from Spider dataset for validation, where extensive experiments show the effectiveness and efficiency of our proposed method with 4% increase of accuracy and 15% decrease of inference time w.r.t a strong baseline RAT-SQL.
引用
收藏
页码:457 / 474
页数:18
相关论文
共 50 条
  • [21] Uncovering and Categorizing Social Biases in Text-to-SQL
    Liu, Yan
    Gao, Yan
    Su, Zhe
    Chen, Xiaokang
    Ash, Elliott
    Lou, Jian-Guang
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 13573 - 13584
  • [22] Improving Text-to-SQL with a Hybrid Decoding Method
    Jeong, Geunyeong
    Han, Mirae
    Kim, Seulgi
    Lee, Yejin
    Lee, Joosang
    Park, Seongsik
    Kim, Harksoo
    ENTROPY, 2023, 25 (03)
  • [23] Integrating Question Answering and Text-to-SQL in Portuguese
    Jose, Marcos Menon
    Jose, Marcelo Archanjo
    Maua, Denis Deratani
    Cozman, Fabio Gagliardi
    COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2022, 2022, 13208 : 278 - 287
  • [24] Error Detection for Text-to-SQL Semantic Parsing
    Chen, Shijie
    Chen, Ziru
    Sun, Huan
    Su, Yu
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 11730 - 11743
  • [25] An Exploratory Study on Model Compression for Text-to-SQL
    Sun, Shuo
    Gao, Yuze
    Zhang, Yuchen
    Su, Jian
    Bin Chen
    Lin, Yingzhan
    Sun, Shuqi
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 11647 - 11654
  • [26] Structure-Grounded Pretraining for Text-to-SQL
    Deng, Xiang
    Awadallah, Ahmed Hassan
    Meek, Christopher
    Polozov, Oleksandr
    Sun, Huan
    Richardson, Matthew
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 1337 - 1350
  • [27] An In-Depth Benchmarking of Text-to-SQL Systems
    Gkini, Orest
    Belmpas, Theofilos
    Koutrika, Georgia
    Ioannidis, Yannis
    SIGMOD '21: PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2021, : 632 - 644
  • [28] A survey on deep learning approaches for text-to-SQL
    George Katsogiannis-Meimarakis
    Georgia Koutrika
    The VLDB Journal, 2023, 32 : 905 - 936
  • [29] Ar-Spider: Text-to-SQL in Arabic
    Almohaimeed, Saleh
    Almohaimeed, Saad
    Al Ghanim, Mansour
    Wang, Liqiang
    39TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2024, 2024, : 1024 - 1030
  • [30] Towards Robustness of Text-to-SQL Models Against Natural and Realistic Adversarial Table Perturbation
    Pi, Xinyu
    Wang, Bing
    Gao, Yan
    Guo, Jiaqi
    Li, Zhoujun
    Lou, Jian-Guang
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 2007 - 2022