Transparent Dual Memory Compression Architecture

被引:9
|
作者
Kim, Seikwon [1 ,2 ]
Lee, Seonyoung [1 ]
Kim, Taehoon [1 ]
Huh, Jaehyuk [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Sch Comp, Daejeon, South Korea
[2] Samsung Elect Co Ltd, SW Ctr, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
memory compression; dual compression technique; OS transparency; locality awareness;
D O I
10.1109/PACT.2017.12
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The increasing memory requirements of big data applications have been driving the precipitous growth of memory capacity in server systems. To maximize the efficiency of external memory, HW-based memory compression techniques have been proposed to increase effective memory capacity. Although such memory compression techniques can improve the memory efficiency significantly, a critical trade-off exists in the HW-based compression techniques. As the memory blocks need to be decompressed as quickly as possible to serve cache misses, latency-optimized techniques apply compression at the cacheline granularity, achieving the decompression latency of less than a few cycles. However, such latency-optimized techniques can lose the potential high compression ratios of capacity-optimized techniques, which compress larger memory blocks with longer latency algorithms. Considering the fundamental trade-off in the memory compression, this paper proposes a transparent dual memory compression (DMC) architecture, which selectively uses two compression algorithms with distinct latency and compression characteristics. Exploiting the locality of memory accesses, the proposed architecture compresses less frequently accessed blocks with a capacity-optimized compression algorithm, while keeping recently accessed blocks compressed with a latency-optimized one. Furthermore, instead of relying on the support from the virtual memory system to locate compressed memory blocks, the study advocates a HW-based translation between the uncompressed address space and compressed physical space. This OS-transparent approach eliminates conflicts between compression efficiency and large page support adopted to reduce TLB misses. The proposed compression architecture is applied to the Hybrid Memory Cube (HMC) with a logic layer under the stacked DRAMs. The experimental results show that the proposed compression architecture provides 54% higher compression ratio than the state-of-the-art latency-optimized technique, with no performance degradation over the baseline system without compression.
引用
收藏
页码:206 / 218
页数:13
相关论文
共 50 条
  • [31] A combined memory compression and hierarchical motion estimation architecture for video encoding in embedded systems
    Osorio, Roberto R.
    Bruguera, Javier D.
    DSD 2006: 9TH EUROMICRO CONFERENCE ON DIGITAL SYSTEM DESIGN: ARCHITECTURES, METHODS AND TOOLS, PROCEEDINGS, 2006, : 269 - +
  • [32] Design of Memory Shifting System Based on Dual-Space Storage Architecture
    Wang, Zhehe
    Li, Shuang
    Jiang, Jiabao
    Wang, Chunteng
    Wang, Xianchao
    Zhu, Taiyu
    IEEE ACCESS, 2022, 10 : 91897 - 91905
  • [33] A DUAL SYMBOL ARITHMETIC CODER ARCHITECTURE WITH REDUCED MEMORY FOR JPEG2000
    Liu, Kai
    Li, YunSong
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 513 - 516
  • [34] Design and implementation of dual-mode configurable memory architecture for CNN accelerator
    Shan, Rui
    Li, Xiaoshuo
    Gao, Xu
    Huo, Ziqing
    High Technology Letters, 2024, 30 (02) : 211 - 220
  • [35] Architecture Territories informed and Transparent A Wiki in the School of Architecture
    Fernandez Garcia, Javier
    REDU-REVISTA DE DOCENCIA UNIVERSITARIA, 2009, 7 (05):
  • [36] Design and implementation of dual-mode configurable memory architecture for CNN accelerator
    山蕊
    LI Xiaoshuo
    GAO Xu
    HUO Ziqing
    High Technology Letters, 2024, (02) : 211 - 220
  • [37] Design and Implementation of Dual Port SRAM Memory Architecture using MOSFET's
    Srinivas, Seema
    Gudi, Anandthirtha B.
    2017 INTERNATIONAL CONFERENCE ON SMART GRIDS, POWER AND ADVANCED CONTROL ENGINEERING (ICSPACE), 2017, : 357 - 362
  • [38] Configurable in-memory computing architecture based on dual-port SRAM
    Zhao, Yue
    Liu, Yunlong
    Zheng, Jian
    Tong, Zhongzhen
    Wang, Xin
    Yu, Runru
    Wu, Xiulong
    Zhou, Yongliang
    Peng, Chunyu
    Lu, Wenjuan
    Zhao, Qiang
    Lin, Zhiting
    MICROELECTRONICS JOURNAL, 2024, 147
  • [39] Implementation of a primal-dual method for SDP on a shared memory parallel architecture
    Borchers, Brian
    Young, Joseph G.
    COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2007, 37 (03) : 355 - 369
  • [40] Architecture Territories informed and Transparent A Wiki in the School of Architecture
    Fernandez Garcia, Javier
    RED-REVISTA DE EDUCACION A DISTANCIA, 2010, (22):