DaeMon: Architectural Support for Efficient Data Movement in Fully Disaggregated Systems

被引:9
|
作者
Giannoula, Christina [1 ,2 ]
Huang, Kailong [1 ]
Tang, Jonathan [1 ]
Koziris, Nectarios [2 ]
Goumas, Georgios [2 ]
Chishti, Zeshan [3 ]
Vijaykumar, Nandita [1 ]
机构
[1] Univ Toronto, Toronto, ON, Canada
[2] Natl Tech Univ Athens, Athens, Greece
[3] Intel Corp, Mountain View, CA USA
关键词
data movement; data access; memory access; hardware support; hardware mechanism; high performance; memory systems; memory disaggregation; resource disaggregation; disaggregated systems; workload characterization; benchmarking; performance characterization;
D O I
10.1145/3579445
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Resource disaggregation offers a cost effective solution to resource scaling, utilization, and failure-handling in data centers by physically separating hardware devices in a server. Servers are architected as pools of processor, memory, and storage devices, organized as independent failure-isolated components interconnected by a high-bandwidth network. A critical challenge, however, is the high performance penalty of accessing data from a remote memory module over the network. Addressing this challenge is difficult as disaggregated systems have high runtime variability in network latencies/bandwidth, and page migration can significantly delay critical path cache line accesses in other pages. This paper conducts a characterization analysis on different data movement strategies in fully disaggregated systems, evaluates their performance overheads in a variety of workloads, and introduces DaeMon, the first software-transparent mechanism to significantly alleviate data movement overheads in fully disaggregated systems. First, to enable scalability to multiple hardware components in the system, we enhance each compute and memory unit with specialized engines that transparently handle data migrations. Second, to achieve high performance and provide robustness across various network, architecture and application characteristics, we implement a synergistic approach of bandwidth partitioning, link compression, decoupled data movement of multiple granularities, and adaptive granularity selection in data movements. We evaluate DaeMon in a wide variety of workloads at different network and architecture configurations using a state-of-the-art simulator. DaeMon improves system performance and data access costs by 2.39x and 3.06x, respectively, over the widely-adopted approach of moving data at page granularity.
引用
收藏
页数:35
相关论文
共 50 条
  • [11] Enabling Efficient Erasure Coding in Disaggregated Memory Systems
    Li, Qiliang
    Xu, Liangliang
    Li, Yongkun
    Lyu, Min
    Wang, Wei
    Zuo, Pengfei
    Xu, Yinlong
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2024, 35 (01) : 154 - 168
  • [12] Energy Efficient Disaggregated Servers for Future Data Centers
    Ali, Howraa M. Mohammad
    Lawey, Ahmed Q.
    El-Gorashi, Taisir E. H.
    Elmirghani, Jaafar M. H.
    2015 20TH EUROPEAN CONFERENCE ON NETWORKS AND OPTICAL COMMUNICATIONS (NOC), 2015,
  • [13] HICAMP: Architectural Support for Efficient Concurrency-safe Shared Structured Data Access
    Cheriton, David
    Firoozshahian, Amin
    Solomatnikov, Alex
    Stevenson, John P.
    Azizi, Omid
    ACM SIGPLAN NOTICES, 2012, 47 (04) : 287 - 299
  • [14] HICAMP: Architectural Support for Efficient Concurrency-safe Shared Structured Data Access
    Cheriton, David
    Firoozshahian, Amin
    Solomatnikov, Alex
    Stevenson, John P.
    Azizi, Omid
    ASPLOS XVII: SEVENTEENTH INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS, 2012, : 287 - 299
  • [15] SYSTEMS AND ARCHITECTURAL SUPPORT FOR OPEN DATA PRINCIPLES: A MARINE EARTH OBSERVATION PERSPECTIVE
    Sibolla, Bolelang
    Molapo, Raymond
    Vhengani, Lufuno
    Mdakane, Lizwe
    GEOSPATIAL WEEK 2023, VOL. 48-1, 2023, : 1005 - 1012
  • [16] Architectural support for efficient multicasting in irregular networks
    Sivaram, R
    Kesavan, R
    Panda, DK
    Stunkel, CB
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2001, 12 (05) : 489 - 513
  • [17] iWatcher: Efficient architectural support for software debugging
    Zhou, P
    Qin, F
    Liu, W
    Zhou, YY
    Torrellas, J
    31ST ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE, PROCEEDINGS, 2004, : 224 - 235
  • [18] ON THE ARCHITECTURAL SUPPORT FOR LOGICAL MACHINE SYSTEMS
    TSAI, SR
    TSENG, LM
    CHEN, CN
    MICROPROCESSING AND MICROPROGRAMMING, 1988, 22 (02): : 81 - 96
  • [19] Skadi: Building a Distributed Runtime for Data Systems in Disaggregated Data Centers
    Hu, Cunchen
    Wang, Chenxi
    Wang, Sa
    Sun, Ninghui
    Bao, Yungang
    Zhao, Jieru
    Kashyap, Sanidhya
    Zuo, Pengfei
    Chen, Xusheng
    Xu, Liangliang
    Zhang, Qin
    Feng, Hao
    Shan, Yizhou
    PROCEEDINGS OF THE 19TH WORKSHOP ON HOT TOPICS IN OPERATING SYSTEMS, HOTOS 2023, 2023, : 94 - 102
  • [20] Optimizing Data-intensive Systems in Disaggregated Data Centers with TELEPORT
    Zhang, Qizhen
    Chen, Xinyi
    Sankhe, Sidharth
    Zheng, Zhilei
    Zhong, Ke
    Angel, Sebastian
    Chen, Ang
    Liu, Vincent
    Loo, Boon Thau
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA (SIGMOD '22), 2022, : 1345 - 1359