Structure-Grounded Pretraining for Text-to-SQL

被引:0
|
作者
Deng, Xiang [1 ,2 ]
Awadallah, Ahmed Hassan [2 ]
Meek, Christopher [2 ]
Polozov, Oleksandr [2 ]
Sun, Huan [1 ]
Richardson, Matthew [2 ]
机构
[1] Ohio State Univ, Columbus, OH 43210 USA
[2] Microsoft Res, Redmond, WA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning to capture text-table alignment is essential for tasks like text-to-SQL. A model needs to correctly recognize natural language references to columns and values and to ground them in the given database schema. In this paper, we present a novel weakly supervised Structure-Grounded pretraining framework (STRUG) for text-to-SQL that can effectively learn to capture text-table alignment based on a parallel text-table corpus. We identify a set of novel pretraining tasks: column grounding, value grounding and column-value mapping, and leverage them to pretrain a text-table encoder. Additionally, to evaluate different methods under more realistic text-table alignment settings, we create a new evaluation set Spider-Realistic based on Spider dev set with explicit mentions of column names removed, and adopt eight existing textto-SQL datasets for cross-database evaluation. S TRuG brings significant improvement over BERTLARGE in all settings. Compared with existing pretraining methods such as GRAPPA, S TRuG achieves similar performance on Spider, and outperforms all baselines on more realistic sets. All the code and data used in this work is public available at https://aka.ms/strug.
引用
收藏
页码:1337 / 1350
页数:14
相关论文
共 50 条
  • [21] An Exploratory Study on Model Compression for Text-to-SQL
    Sun, Shuo
    Gao, Yuze
    Zhang, Yuchen
    Su, Jian
    Bin Chen
    Lin, Yingzhan
    Sun, Shuqi
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 11647 - 11654
  • [22] An In-Depth Benchmarking of Text-to-SQL Systems
    Gkini, Orest
    Belmpas, Theofilos
    Koutrika, Georgia
    Ioannidis, Yannis
    SIGMOD '21: PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2021, : 632 - 644
  • [23] A survey on deep learning approaches for text-to-SQL
    George Katsogiannis-Meimarakis
    Georgia Koutrika
    The VLDB Journal, 2023, 32 : 905 - 936
  • [24] Towards Text-to-SQL over Aggregate Tables
    Shuqin Li
    Kaibin Zhou
    Zeyang Zhuang
    Haofen Wang
    Jun Ma
    Data Intelligence, 2023, 5 (02) : 457 - 474
  • [25] Ar-Spider: Text-to-SQL in Arabic
    Almohaimeed, Saleh
    Almohaimeed, Saad
    Al Ghanim, Mansour
    Wang, Liqiang
    39TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2024, 2024, : 1024 - 1030
  • [26] SADGA: Structure-Aware Dual Graph Aggregation Network for Text-to-SQL
    Cai, Ruichu
    Yuan, Jinjie
    Xu, Boyan
    Hao, Zhifeng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [27] Bridging the gap between text-to-SQL research and real-world applications: A unified all-in-one framework for text-to-SQL
    Han, Mirae
    Park, Seongsik
    Kim, Harksoo
    Kim, Seulgi
    KNOWLEDGE-BASED SYSTEMS, 2024, 306
  • [28] Selective Demonstrations for Cross-domain Text-to-SQL
    Chang, Shuaichen
    Fosler-Lussier, Eric
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 14174 - 14189
  • [29] Small, Medium, and Large Language Models for Text-to-SQL
    Oliveira, Aiko
    Nascimento, Eduardo
    Pinheiro, Joao
    Avila, Caio Viktor S.
    Coelho, Gustavo
    Feijo, Lucas
    Izquierdo, Yenier
    Garcia, Grettel
    Paes Leme, Luiz Andre P.
    Lemos, Melissa
    Casanova, Marco A.
    CONCEPTUAL MODELING, ER 2024, 2025, 15238 : 276 - 294
  • [30] Semantic Evaluation for Text-to-SQL with Distilled Test Suites
    Zhong, Ruiqi
    Yu, Tao
    Klein, Dan
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 396 - 411