Learning-Based Sample Tuning for Approximate Query Processing in Interactive Data Exploration

Authors
Zhang, Hanbing [1]
Jing, Yinan [1]
He, Zhenying [1]
Zhang, Kai [1]
Wang, X. Sean [1]
Affiliations
[1] Fudan Univ, Sch Comp Sci, Shanghai 200437, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Measurement; Adaptation models; Costs; Tuners; Accuracy; Q-learning; Query processing; Optimization; Synthetic data; Approximate query processing; Interactive data exploration; Data analysis
DOI
10.1109/TKDE.2023.3341451
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
For interactive data exploration, approximate query processing (AQP) is a useful approach that typically uses samples to answer queries quickly, trading off some query accuracy for response time. Existing AQP systems often materialize samples in memory so that they can be reused to speed up query processing. How to tune these samples according to the workload is one of the key problems in AQP. However, because data exploration workloads are too complex to predict accurately, existing sample tuning approaches adapt poorly to changing workloads. To address this problem, this paper proposes a deep reinforcement learning-based sample tuner, RL-STuner. When tuning samples, RL-STuner considers workload changes from a global perspective and uses a Deep Q-learning Network (DQN) model to select an optimal sample set that has the maximum utility for the current workload. In addition, this paper proposes a set of optimization mechanisms to reduce the sample tuning cost. Experimental results on both real-world and synthetic datasets show that RL-STuner outperforms existing sample tuning approaches and achieves 1.6x-5.2x improvements in query accuracy with a low tuning cost.
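As a rough illustration of the sample-based AQP idea the abstract starts from (answering a query from a sample instead of the full data, at some cost in accuracy), the following is a minimal sketch. It is not RL-STuner and does not reflect the paper's implementation; the function name `approximate_sum`, the uniform sampling scheme, and the toy data are all assumptions made for illustration.

```python
import random


def approximate_sum(values, sample_fraction, predicate, seed=0):
    """Estimate SUM(v) over rows matching `predicate` from a uniform sample.

    Hypothetical helper for illustration only: draws a uniform random
    sample without replacement and scales the sample sum by n / k.
    """
    rng = random.Random(seed)           # fixed seed keeps the sketch reproducible
    n = len(values)
    k = max(1, int(n * sample_fraction))
    sample = rng.sample(values, k)      # uniform sample without replacement
    scale = n / k                       # each sampled row stands for n/k rows
    return scale * sum(v for v in sample if predicate(v))


def is_even(v):
    return v % 2 == 0


# Toy "table" with one numeric column (assumed data, not from the paper).
data = list(range(1, 100_001))

exact = sum(v for v in data if is_even(v))       # full scan
approx = approximate_sum(data, 0.01, is_even)    # reads only 1% of the rows
relative_error = abs(approx - exact) / exact
print(f"exact={exact}, approx={approx:.0f}, relative error={relative_error:.3%}")
```

A smaller `sample_fraction` reads fewer rows but generally gives a noisier estimate. Which samples to keep materialized for a given workload is the tuning problem the paper addresses.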
Pages: 6532-6546
Page count: 15