GL-Cache: Group-level learning for efficient and high-performance caching

被引:0
|
作者
Yang, Juncheng [1 ]
Mao, Ziming [2 ]
Yue, Yao [3 ]
Rashmi, K. V. [1 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] Yale Univ, New Haven, CT USA
[3] Pelikan Fdn, San Francisco, CA USA
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Web applications rely heavily on software caches to achieve low-latency, high-throughput services. To adapt to changing workloads, three types of learned caches (learned evictions) have been designed in recent years: object-level learning, learning-from-distribution, and learning-from-simple-experts. However, we argue that the learning granularity in existing approaches is either too fine (object-level), incurring significant computation and storage overheads, or too coarse (workload or expert-level) to capture the differences between objects and leaves a considerable efficiency gap. In this work, we propose a new approach for learning in caches ("group-level learning"), which clusters similar objects into groups and performs learning and eviction at the group level. Learning at the group level accumulates more signals for learning, leverages more features with adaptive weights, and amortizes overheads over objects, thereby achieving both high efficiency and high throughput. We designed and implemented GL-Cache on an opensource production cache to demonstrate group-level learning. Evaluations on 118 production block I/O and CDN cache traces show that GL-Cache has a higher hit ratio and higher throughput than state-of-the-art designs. Compared to LRB (object-level learning), GL-Cache improves throughput by 228x and hit ratio by 7% on average across cache sizes. For 10% of the traces (P90), GL-Cache provides a 25% hit ratio increase from LRB. Compared to the best of all learned caches, GL-Cache achieves a 64% higher throughput, a 3% higher hit ratio on average, and a 13% hit ratio increase at the P90.
引用
收藏
页码:115 / 133
页数:19
相关论文
共 50 条
  • [1] High-Performance and Endurable Cache Management for Flash-Based Read Caching
    Xia, Qianbin
    Xiao, Weijun
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2016, 27 (12) : 3518 - 3531
  • [2] Group-level brain decoding with deep learning
    Csaky, Richard
    van Es, Mats W. J.
    Jones, Oiwi Parker
    Woolrich, Mark
    HUMAN BRAIN MAPPING, 2023, 44 (17) : 6105 - 6119
  • [3] BP-NUCA: CACHE PRESSURE-AWARE MIGRATION FOR HIGH-PERFORMANCE CACHING IN CMPS
    Jia, Xiaomin
    Jiang, Jiang
    Wang, Yongwen
    Qi, Shubo
    Zhao, Tianlei
    Fu, Guitao
    Zhang, Minxuan
    COMPUTING AND INFORMATICS, 2011, 30 (05) : 1037 - 1060
  • [4] MOLAR: A Cost-Efficient, High-Performance Hybrid Storage Cache
    Liu, Yi
    Ge, Xiongzi
    Huang, Xiaoxia
    Du, David H. C.
    2013 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2013,
  • [5] NB-Cache: Non-Blocking In-Network Caching for High-Performance Content Routers
    Pan, Tian
    Lin, Xingchen
    Song, Enge
    Xu, Cheng
    Zhang, Jiao
    Li, Hao
    Lv, Jianhui
    Huang, Tao
    Liu, Bin
    Zhang, Beichuan
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2021, 29 (05) : 1976 - 1989
  • [6] LEVEL-2 CACHE FOR HIGH-PERFORMANCE /390 MU-PROCESSORS
    BARSUHN, H
    LOEHLEIN, W
    WENDEL, D
    WILLE, U
    COPPENS, P
    MICROPROCESSING AND MICROPROGRAMMING, 1992, 35 (1-5): : 303 - 310
  • [7] The Effects of Individual-Level and Group-Level Trust on Willingness to Communicate in the Group Language Learning
    Ito, Takehiko
    Furuyabu, Mariko
    Toews-Shimizu, Jennifer
    FRONTIERS IN EDUCATION, 2022, 7
  • [8] Hybrid Nonvolatile Disk Cache for Energy-Efficient and High-Performance Systems
    Shi, Liang
    Li, Jianhua
    Xue, Chun Jason
    Zhou, Xuehai
    ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2013, 18 (01)
  • [9] Cooperative Partitioning: Energy-Efficient Cache Partitioning for High-Performance CMPs
    Sundararajan, Karthik T.
    Porpodas, Vasileios
    Jones, Timothy M.
    Topham, Nigel P.
    Franke, Bjoern
    2012 IEEE 18TH INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA), 2012, : 311 - 322
  • [10] Relationships among satisfaction, commitment, and performance: A group-level analysis
    Leung, K
    APPLIED PSYCHOLOGY-AN INTERNATIONAL REVIEW-PSYCHOLOGIE APPLIQUEE-REVUE INTERNATIONALE, 1997, 46 (02): : 199 - 205