Towards Enhancing Database Education: Natural Language Generation Meets Query Execution Plans

被引：11

作者：

Wang, Weiguo ^{[1
,2
]}

Bhowmick, Sourav S. ^{[1
]}

Li, Hui ^{[2
]}

Joty, Shafiq ^{[1
]}

Liu, Siyuan ^{[1
]}

Chen, Peng ^{[2
]}

机构：

[1] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore

[2] Xidian Univ, Sch Cyber Engn, Xian, Peoples R China

来源：

SIGMOD '21: PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA | 2021年

基金：

中国国家自然科学基金;

关键词：

REPETITION; BOREDOM;

D O I：

10.1145/3448016.3452822

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The database systems course is offered as part of an undergraduate computer science degree program in many major universities. A key learning goal of learners taking such a course is to understand how SQL queries are processed in a RDBMS in practice. Since a query execution plan (QEP) describes the execution steps of a query, learners can acquire the understanding by perusing the QEPS generated by a RDBMS. Unfortunately, in practice, it is often daunting for a learner to comprehend these QEPS containing vendor-specific implementation details, hindering her learning process. In this paper, we present a novel, end-to-end, generic system called LANTERN that generates a natural language description of a QEP to facilitate understanding of the query execution steps. It takes as input an SQL query and its QEP, and generates a natural language description of the execution strategy deployed by the underlying RDBMS. Specifically, it deploys a declarative framework called POOL that enables subject matter experts to efficiently create and maintain natural language descriptions of physical operators used in QEPS. A rule-based framework called RULE-LANTERN is proposed that exploits POOL to generate natural language descriptions of QEPS. Despite the high accuracy of RULE-LANTERN, our engagement with learners reveal that, consistent with existing psychology theories, perusing such rule-based descriptions lead to boredom due to repetitive statements across different QEPS. To address this issue, we present a novel deep learning-based language generation framework called NEURAL-LANTERN that infuses language variability in the generated description by exploiting a set of paraphrasing tools and word embedding. Our experimental study with real learners shows the effectiveness of LANTERN in facilitating comprehension of QEPS.

引用

页码：1933 / 1945

页数：13

共 50 条

[1] LANTERN: Boredom-conscious Natural Language Description Generation of Query Execution Plans for Database Education
Chen, Peng
Li, Hui
Bhowmick, Sourav S.
Joty, Shafiq R.
Wang, Weiguo
PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA (SIGMOD '22), 2022, : 2413 - 2416
[2] NEURON: Query Execution Plan Meets Natural Language Processing For Augmenting DB Education
Liu, Siyuan
Bhowmick, Sourav S.
Zhang, Wanlu
Wang, Shu
Huang, Wanyi
Joty, Shafiq
SIGMOD '19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2019, : 1953 - 1956
[3] MOCHA: A Tool for Visualizing Impact of Operator Choices in Query Execution Plans for Database Education
Tan, Jess
Yeo, Desmond
Neoh, Rachael
Chua, Huey-Eng
Bhowmick, Sourav S.
PROCEEDINGS OF THE VLDB ENDOWMENT, 2022, 15 (12): : 3602 - 3605
[4] A Study on Database Intrusion Detection Based on Query Execution Plans
Morzy, Tadeusz
Zakrzewicz, Maciej
BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, DAWAK 2024, 2024, 14912 : 353 - 358
[5] Enhancing Natural Language Query to SQL Query Generation Through Classification-Based Table Selection
Chopra, Ankush
Azam, Rauful
ENGINEERING APPLICATIONS OF NEURAL NETWORKS, EANN 2024, 2024, 2141 : 152 - 165
[6] Research on the Query Condition of Natural Language in Database
Zheng Fengbin
Zheng Shanshan
Ge Qiang
ICCSE 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION, 2008, : 389 - 393
[7] Strong natural language query generation
Liu, Binsheng
Lu, Xiaolu
Culpepper, J. Shane
INFORMATION RETRIEVAL JOURNAL, 2021, 24 (4-5): : 322 - 346
[8] Strong natural language query generation
Binsheng Liu
Xiaolu Lu
J. Shane Culpepper
Information Retrieval Journal, 2021, 24 : 322 - 346
[9] Towards Predicting Query Execution Time for Concurrent and Dynamic Database Workloads
Wu, Wentao
Chi, Yun
Hacigumus, Hakan
Naughton, Jeffrey F.
PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (10): : 925 - 936
[10] LOGER: A Learned Optimizer towards Generating Efficient and Robust Query Execution Plans
Chen, Tianyi
Gao, Jun
Chen, Hedui
Tu, Yaofeng
PROCEEDINGS OF THE VLDB ENDOWMENT, 2023, 16 (07): : 1777 - 1789

← 1 2 3 4 5 →