GL-Cache: Group-level learning for efficient and high-performance caching

Cited by: 0
Authors
Yang, Juncheng [1]
Mao, Ziming [2]
Yue, Yao [3]
Rashmi, K. V. [1]
Affiliations
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] Yale Univ, New Haven, CT USA
[3] Pelikan Fdn, San Francisco, CA USA
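Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract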
Web applications rely heavily on software caches to achieve low-latency, high-throughput services. To adapt to changing workloads, three types of learned caches (learned evictions) have been designed in recent years: object-level learning, learning-from-distribution, and learning-from-simple-experts. However, we argue that the learning granularity in existing approaches is either too fine (object-level), incurring significant computation and storage overheads, or too coarse (workload or expert-level) to capture the differences between objects, leaving a considerable efficiency gap. In this work, we propose a new approach for learning in caches ("group-level learning"), which clusters similar objects into groups and performs learning and eviction at the group level. Learning at the group level accumulates more signals for learning, leverages more features with adaptive weights, and amortizes overheads over objects, thereby achieving both high efficiency and high throughput. We designed and implemented GL-Cache on an open-source production cache to demonstrate group-level learning. Evaluations on 118 production block I/O and CDN cache traces show that GL-Cache has a higher hit ratio and higher throughput than state-of-the-art designs. Compared to LRB (object-level learning), GL-Cache improves throughput by 228x and hit ratio by 7% on average across cache sizes. For 10% of the traces (P90), GL-Cache provides a 25% hit ratio increase over LRB. Compared to the best of all learned caches, GL-Cache achieves a 64% higher throughput, a 3% higher hit ratio on average, and a 13% hit ratio increase at the P90.
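The idea of evicting at the group level can be illustrated with a toy sketch. This is not the GL-Cache implementation: the class name `GroupLevelCache`, the grouping rule (consecutive insertions share a group), and the utility score (hits per object) are all assumptions made for illustration, and the hit-count heuristic stands in for the learned model the abstract describes.

```python
class GroupLevelCache:
    """Toy cache that tracks statistics and evicts per group, not per object.

    Assumptions (not from the paper): objects inserted consecutively share
    a group, and a group's utility is its hit count divided by its size.
    """

    def __init__(self, capacity, group_size):
        self.capacity = capacity      # maximum number of cached objects
        self.group_size = group_size  # objects per group
        self.groups = {}              # group id -> {"keys": set, "hits": int}
        self.key_to_group = {}        # object key -> group id
        self.open_gid = None          # group currently accepting insertions
        self.next_gid = 0

    def _utility(self, gid):
        group = self.groups[gid]
        return group["hits"] / max(len(group["keys"]), 1)

    def _evict_group(self):
        # Prefer closed groups; fall back to the open one if it is all we have.
        candidates = [g for g in self.groups if g != self.open_gid]
        if not candidates:
            candidates = list(self.groups)
        victim = min(candidates, key=self._utility)
        for key in self.groups.pop(victim)["keys"]:
            del self.key_to_group[key]
        if victim == self.open_gid:
            self.open_gid = None

    def get(self, key):
        gid = self.key_to_group.get(key)
        if gid is None:
            return False              # miss
        self.groups[gid]["hits"] += 1  # one counter per group, not per object
        return True

    def put(self, key):
        if key in self.key_to_group:
            return
        while len(self.key_to_group) >= self.capacity:
            self._evict_group()
        if self.open_gid is None:
            self.open_gid = self.next_gid
            self.next_gid += 1
            self.groups[self.open_gid] = {"keys": set(), "hits": 0}
        gid = self.open_gid
        self.groups[gid]["keys"].add(key)
        self.key_to_group[key] = gid
        if len(self.groups[gid]["keys"]) >= self.group_size:
            self.open_gid = None      # group is full; close it


cache = GroupLevelCache(capacity=4, group_size=2)
for k in ["a", "b", "c", "d"]:
    cache.put(k)                      # groups: {a, b} and {c, d}
cache.get("a")                        # the {a, b} group now has one hit
cache.put("e")                        # cache is full: the {c, d} group is evicted
```

Because the per-group counter is shared by every object in the group, the bookkeeping cost is amortized over the group, which is the overhead argument the abstract makes; a learned model would replace `_utility` in a real design.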
Pages: 115-133
Page count: 19