Learning-Based Sample Tuning for Approximate Query Processing in Interactive Data Exploration

Cited by: 0
Authors
Zhang, Hanbing [1]
Jing, Yinan [1]
He, Zhenying [1]
Zhang, Kai [1]
Wang, X. Sean [1]
Affiliations
[1] Fudan Univ, Sch Comp Sci, Shanghai 200437, People's Republic of China
Funding
National Natural Science Foundation of China
Keywords
Measurement; Adaptation models; Costs; Tuners; Accuracy; Q-learning; Query processing; Optimization; Synthetic data; Approximate query processing; interactive data exploration; data analysis
DOI
10.1109/TKDE.2023.3341451
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
For interactive data exploration, approximate query processing (AQP) is a useful approach that typically uses samples to answer queries quickly at the cost of some query accuracy. Existing AQP systems often materialize samples in memory for reuse to speed up query processing. How to tune the samples according to the workload is one of the key problems in AQP. However, because the data exploration workload is too complex to be predicted accurately, existing sample tuning approaches cannot adapt well to the changing workload. To address this problem, this paper proposes a deep reinforcement learning-based sample tuner, RL-STuner. When tuning samples, RL-STuner considers the workload changes from a global perspective and uses a Deep Q-learning Network (DQN) model to select an optimal sample set that has the maximum utility for the current workload. In addition, this paper proposes a set of optimization mechanisms to reduce the sample tuning cost. Experimental results on both real-world and synthetic datasets show that RL-STuner outperforms the existing sample tuning approaches and achieves 1.6x-5.2x improvements in query accuracy with a low tuning cost.
Pages: 6532-6546
Number of pages: 15
Related papers
50 items in total
  • [21] Learning-based query optimization for multi-probe approximate nearest neighbor search
    Zhang, Pengcheng
    Yao, Bin
    Gao, Chao
    Wu, Bin
    He, Xiao
    Li, Feifei
    Lu, Yuanfei
    Zhan, Chaoqun
    Tang, Feilong
    VLDB JOURNAL, 2023, 32 (03) : 623 - 645
  • [22] Learning-based query optimization for multi-probe approximate nearest neighbor search
    Pengcheng Zhang
    Bin Yao
    Chao Gao
    Bin Wu
    Xiao He
    Feifei Li
    Yuanfei Lu
    Chaoqun Zhan
    Feilong Tang
    The VLDB Journal, 2023, 32 : 623 - 645
  • [23] Approximate query processing based on wavelet transform
    Xiong, Yun-Bo
    Liu, Bing
    Hu, Yun-Fa
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006 : 1337 - +
  • [24] Compressed data cube for approximate OLAP query processing
    Yu Feng
    Shan Wang
    Journal of Computer Science and Technology, 2002, 17 : 625 - 635
  • [25] Benchmark for Evaluating Approximate Query Processing on Data Streams
    Di Tria, Francesco
    Lefons, Ezio
    Tangorra, Filippo
    2017 IEEE 26TH INTERNATIONAL CONFERENCE ON ENABLING TECHNOLOGIES - INFRASTRUCTURE FOR COLLABORATIVE ENTERPRISES (WETICE), 2017 : 191 - 196
  • [26] Approximate Query Processing for Big Data in Heterogeneous Databases
    Muniswamaiah, Manoj
    Agerwala, Tilak
    Tappert, Charles C.
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020 : 5765 - 5767
  • [27] Distributed data summaries for approximate query processing in PDMS
    Hose, Katja
    Klan, Daniel
    Sattler, Kai-Uwe
    10TH INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS, 2006 : 37 - 44
  • [28] Compressed data cube for approximate OLAP query processing
    Feng, Y
    Wang, S
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2002, 17 (05) : 625 - 635
  • [29] Fine-Tuning Data Structures for Query Processing
    Shaikhha, Amir
    Kelepeshis, Marios
    Ghorbani, Mahdi
    PROCEEDINGS OF THE 21ST ACM/IEEE INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION, CGO 2023, 2023 : 149 - 161
  • [30] Learning-based Declarative Query Optimization
    Tripathi, Shivani
    CODS-COMAD 2021: PROCEEDINGS OF THE 3RD ACM INDIA JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA (8TH ACM IKDD CODS & 26TH COMAD), 2021 : 416 - 416