Towards Enhancing Database Education: Natural Language Generation Meets Query Execution Plans

被引:11
|
作者
Wang, Weiguo [1 ,2 ]
Bhowmick, Sourav S. [1 ]
Li, Hui [2 ]
Joty, Shafiq [1 ]
Liu, Siyuan [1 ]
Chen, Peng [2 ]
机构
[1] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
[2] Xidian Univ, Sch Cyber Engn, Xian, Peoples R China
基金
中国国家自然科学基金;
关键词
REPETITION; BOREDOM;
D O I
10.1145/3448016.3452822
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The database systems course is offered as part of an undergraduate computer science degree program in many major universities. A key learning goal of learners taking such a course is to understand how SQL queries are processed in a RDBMS in practice. Since a query execution plan (QEP) describes the execution steps of a query, learners can acquire the understanding by perusing the QEPS generated by a RDBMS. Unfortunately, in practice, it is often daunting for a learner to comprehend these QEPS containing vendor-specific implementation details, hindering her learning process. In this paper, we present a novel, end-to-end, generic system called LANTERN that generates a natural language description of a QEP to facilitate understanding of the query execution steps. It takes as input an SQL query and its QEP, and generates a natural language description of the execution strategy deployed by the underlying RDBMS. Specifically, it deploys a declarative framework called POOL that enables subject matter experts to efficiently create and maintain natural language descriptions of physical operators used in QEPS. A rule-based framework called RULE-LANTERN is proposed that exploits POOL to generate natural language descriptions of QEPS. Despite the high accuracy of RULE-LANTERN, our engagement with learners reveal that, consistent with existing psychology theories, perusing such rule-based descriptions lead to boredom due to repetitive statements across different QEPS. To address this issue, we present a novel deep learning-based language generation framework called NEURAL-LANTERN that infuses language variability in the generated description by exploiting a set of paraphrasing tools and word embedding. Our experimental study with real learners shows the effectiveness of LANTERN in facilitating comprehension of QEPS.
引用
收藏
页码:1933 / 1945
页数:13
相关论文
共 50 条
  • [1] LANTERN: Boredom-conscious Natural Language Description Generation of Query Execution Plans for Database Education
    Chen, Peng
    Li, Hui
    Bhowmick, Sourav S.
    Joty, Shafiq R.
    Wang, Weiguo
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA (SIGMOD '22), 2022, : 2413 - 2416
  • [2] NEURON: Query Execution Plan Meets Natural Language Processing For Augmenting DB Education
    Liu, Siyuan
    Bhowmick, Sourav S.
    Zhang, Wanlu
    Wang, Shu
    Huang, Wanyi
    Joty, Shafiq
    SIGMOD '19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2019, : 1953 - 1956
  • [3] MOCHA: A Tool for Visualizing Impact of Operator Choices in Query Execution Plans for Database Education
    Tan, Jess
    Yeo, Desmond
    Neoh, Rachael
    Chua, Huey-Eng
    Bhowmick, Sourav S.
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2022, 15 (12): : 3602 - 3605
  • [4] A Study on Database Intrusion Detection Based on Query Execution Plans
    Morzy, Tadeusz
    Zakrzewicz, Maciej
    BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, DAWAK 2024, 2024, 14912 : 353 - 358
  • [5] Enhancing Natural Language Query to SQL Query Generation Through Classification-Based Table Selection
    Chopra, Ankush
    Azam, Rauful
    ENGINEERING APPLICATIONS OF NEURAL NETWORKS, EANN 2024, 2024, 2141 : 152 - 165
  • [6] Research on the Query Condition of Natural Language in Database
    Zheng Fengbin
    Zheng Shanshan
    Ge Qiang
    ICCSE 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION, 2008, : 389 - 393
  • [7] Strong natural language query generation
    Liu, Binsheng
    Lu, Xiaolu
    Culpepper, J. Shane
    INFORMATION RETRIEVAL JOURNAL, 2021, 24 (4-5): : 322 - 346
  • [8] Strong natural language query generation
    Binsheng Liu
    Xiaolu Lu
    J. Shane Culpepper
    Information Retrieval Journal, 2021, 24 : 322 - 346
  • [9] Towards Predicting Query Execution Time for Concurrent and Dynamic Database Workloads
    Wu, Wentao
    Chi, Yun
    Hacigumus, Hakan
    Naughton, Jeffrey F.
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (10): : 925 - 936
  • [10] LOGER: A Learned Optimizer towards Generating Efficient and Robust Query Execution Plans
    Chen, Tianyi
    Gao, Jun
    Chen, Hedui
    Tu, Yaofeng
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2023, 16 (07): : 1777 - 1789