AUTOPLACER: Scalable Self-Tuning Data Placement in Distributed Key-Value Stores

被引:27
|
作者
Paiva, Joao [1 ]
Ruivo, Pedro [2 ]
Romano, Paolo [1 ]
Rodrigues, Luis [1 ]
机构
[1] Univ Lisbon, Inst Super Tecn, INESC ID, P-1000029 Lisbon, Portugal
[2] Red Hat Inc, London, England
关键词
Performance; Distributed data management; data placement; probabilistic algorithms; machine learning;
D O I
10.1145/2641573
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article addresses the problem of self-tuning the data placement in replicated key-value stores. The goal is to automatically optimize replica placement in a way that leverages locality patterns in data accesses, such that internode communication is minimized. To do this efficiently is extremely challenging, as one needs not only to find lightweight and scalable ways to identify the right assignment of data replicas to nodes but also to preserve fast data lookup. The article introduces new techniques that address these challenges. The first challenge is addressed by optimizing, in a decentralized way, the placement of the objects generating the largest number of remote operations for each node. The second challenge is addressed by combining the usage of consistent hashing with a novel data structure, which provides efficient probabilistic data placement. These techniques have been integrated in a popular open-source key-value store. The performance results show that the throughput of the optimized system can be six times better than a baseline system employing the widely used static placement based on consistent hashing.
引用
收藏
页数:30
相关论文
共 50 条
  • [41] Scalable Multi-Versioning Ordered Key-Value Stores with Persistent Memory Support
    Nicolae, Bogdan
    2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2022), 2022, : 93 - 103
  • [42] Characterizing and Adapting the Consistency-Latency Tradeoff in Distributed Key-Value Stores
    Rahman, Muntasir Raihan
    Tseng, Lewis
    Nguyen, Son
    Gupta, Indranil
    Vaidya, Nitin
    ACM TRANSACTIONS ON AUTONOMOUS AND ADAPTIVE SYSTEMS, 2017, 11 (04)
  • [43] FMKe: a Real-World Benchmark for Key-Value Data Stores
    Tomas, Goncalo
    Zeller, Peter
    Balegas, Valter
    Akkoorath, Deepthi
    Bieniusa, Annette
    Leitao, Joao
    Preguica, Nuno
    PROCEEDINGS OF THE 3RD INTERNATIONAL WORKSHOP ON PRINCIPLES AND PRACTICE OF CONSISTENCY FOR DISTRIBUTED DATA (PAPOC 17), 2017,
  • [44] Adaptive and Flexible Key-Value Stores Through Soft Data Partitioning
    Hong, Byungchul
    Kwon, Yongkee
    Ahn, Jung Ho
    Kim, John
    PROCEEDINGS OF THE 34TH IEEE INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD), 2016, : 296 - 303
  • [45] K-Indistinguishable Data Access for Encrypted Key-Value Stores
    Zhang, Chen
    Xie, Qingyuan
    Miao, Yinbin
    Jia, Xiaohua
    2022 IEEE 42ND INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2022), 2022, : 1144 - 1154
  • [46] Poster: Load Balancing for In-Memory Key-Value Data Stores
    Azqueta-Alzuaz, Ainhoa
    Pahno-Martinez, Marta
    2024 IEEE 44TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, ICDCS 2024, 2024, : 1442 - 1443
  • [47] Towards Elastic Key-value Stores on IaaS
    Li, Han
    2013 IEEE 29TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW), 2013, : 302 - 305
  • [48] Index Shipping for Efficient Replication in LSM Key-Value Stores with Hybrid KV Placement
    Stylianakis, Giorgos
    Saloustros, Giorgos
    Chiotakis, Orestis
    Xanthakis, Giorgos
    Forth, Angelos Bilas
    ACM TRANSACTIONS ON STORAGE, 2024, 20 (03)
  • [49] Integrity Verification in Cloud Key-Value Stores
    Weintraub, Grisha
    Rise, Leonid
    Kadosh, Alon
    PROCEEDINGS OF THE 15TH ACM INTERNATIONAL CONFERENCE ON SYSTEMS AND STORAGE, SYSTOR 2022, 2022, : 140 - 140
  • [50] Privacy-Preserving Similarity Search With Efficient Updates in Distributed Key-Value Stores
    Lin, Wanyu
    Cui, Helei
    Li, Baochun
    Wang, Cong
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 32 (05) : 1072 - 1084