Intelligent Automated Workload Analysis for Database Replatforming

Cited by: 3
Authors
Aleyasen, Amirhossein [1, 2]
Morcos, Mark [1]
Antova, Lyublena [1]
Sugiyama, Marc [1]
Korablev, Dmitri [1]
Patvarczki, Jozsef [1]
Mutreja, Rima [1]
Duller, Michael [1]
Waas, Florian M. [1]
Winslett, Marianne [2]
Affiliations
[1] Datometry Inc, San Francisco, CA 94105 USA
[2] Univ Illinois, San Francisco, CA 94105 USA
Source
PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA (SIGMOD '22) | 2022
Keywords
workload analysis; data warehousing; porting complexity; database replatforming; adaptive data virtualization
DOI
10.1145/3514221.3526050
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Performing a detailed workload analysis is a crucial step in determining the feasibility, timeline, and cost of a major data warehouse replatforming project, i.e., migration from one platform to another. A large company's data warehouse applications may include millions of queries, some of which will use features that are unsupported or have different semantics in the new warehouse, or may have poor performance there. In this paper, we present qInsight, a workload analyzer that Datometry has used in data warehouse replatforming efforts for dozens of major clients. qInsight leverages Datometry's Hyper-Q to obtain insights from a workload, including SQL features and workload structural information that could not be obtained without deep query analysis. qInsight uses the identified features and a weighting scheme based on human expert judgments to assess the difficulty of rewriting each application in the workload via traditional migration methods. Datometry's clients find this information useful in planning their projects, including the order in which to migrate applications. We present a qInsight-based data warehouse usage analysis of over 1.7 billion queries from real-world workloads.
Pages: 2273 - 2285
Page count: 13
Related papers
50 in total
  • [21] Intelligent hypermedia database
    Wu, XY
    Qi, FH
    ICSP '96 - 1996 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1996, : 1258 - 1261
  • [22] A Comprehensive I/O Knowledge Cycle for Modular and Automated HPC Workload Analysis
    Zhu, Zhaobin
    Neuwirth, Sarah
    Lippert, Thomas
    2022 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER 2022), 2022, : 581 - 588
  • [23] Design and Implementation of Intelligent Analysis Module in Database Audit System
    Li, Wei
    Liu, Chenying
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON EDUCATION TECHNOLOGY, MANAGEMENT AND HUMANITIES SCIENCE (ETMHS 2015), 2015, 27 : 398 - 402
  • [24] NoWog: A Workload Generator for Database Performance Benchmarking
    Ameri, Parinaz
    Schlitter, Nico
    Meyer, Joerg
    Streit, Achim
    2016 IEEE 14TH INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, 14TH INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, 2ND INTL CONF ON BIG DATA INTELLIGENCE AND COMPUTING AND CYBER SCIENCE AND TECHNOLOGY CONGRESS (DASC/PICOM/DATACOM/CYBERSC, 2016, : 666 - 673
  • [25] Workload Management in Database Management Systems: A Taxonomy
    Zhang, Mingyi
    Martin, Patrick
    Powley, Wendy
    Chen, Jianjun
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2018, 30 (07) : 1386 - 1402
  • [26] A Grid Workload Modeling Approach for Intelligent Grid
    Liu, Wei
    Cunha, Jose
    Duarte, Vitor
    Luo, Tiejian
    2009 IEEE INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL, VOLS 1 AND 2, 2009, : 801 - +
  • [27] Database Workload Characterization with Query Plan Encoders
    Paul, Debjyoti
    Cao, Jie
    Li, Feifei
    Srikumar, Vivek
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2021, 15 (04) : 923 - 935
  • [28] QPSEncoder: A Database Workload Encoder with Deep Learning
    Yang, Jianwen
    Zhang, Qiuhong
    Yan, Jin
    Ding, Zhiming
    Zhu, Meiling
    Lv, Xinjie
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PT I, DEXA 2024, 2024, 14910 : 108 - 123
  • [29] Study of Workload Interference with Intelligent Routing on Dragonfly
    Kang, Yao
    Wang, Xin
    Lan, Zhiling
    SC22: INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2022
  • [30] A Simulator for Intelligent Workload Managers in Heterogeneous Clusters
    Herrera, Adrian
    Ibanez, Mario
    Stafford, Esteban
    Bosque, Jose Luis
    21ST IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2021), 2021, : 196 - 205