Parallelized In-Network Aggregation for Failure Repair in Erasure-Coded Storage Systems

被引:3
|
作者
Xia, Junxu [1 ]
Luo, Lailong [1 ]
Sun, Bowen [1 ]
Cheng, Geyao [1 ]
Guo, Deke [2 ]
机构
[1] Natl Univ Def Technol, Sci & Technol Informat Syst Engn Lab, Changsha 410073, Hunan, Peoples R China
[2] Xiangjiang Lab, Changsha 410073, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
Erasure code; distributed storage system; programmable switch; fault tolerance;
D O I
10.1109/TNET.2024.3367995
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
To repair a failed block in the erasure-coded storage system, multiple related blocks have to be retrieved from other storage nodes across the network. Such a process can lead to significant incast-type repair traffics and delays. The existing efforts mainly try to schedule the transmission of the requested blocks across different storage nodes to avoid network congestion. At their cores, they utilize part of the involved hosts to rely on or aggregate the file blocks from others. While we notice that, the programmability and capability of today's network devices (i.e., routers and switches) bring a great opportunity to further speed up the repair progress by aggregating the file blocks with such devices. By mitigating the aggregation operations from the network edges to network cores, it is possible to save more time and bandwidth. With this intuition, we propose Paint, a parallelized in-network aggregation framework for failure repair. Paint utilizes programmable switches to aggregate relevant data and improves the repair performance by implementing multiple parallelized repair pipelines. We propose a series of novel and time-friendly algorithms to construct the routing paths for Paint and design the Aggregation Control Protocol to implement Paint in production clusters. For all we know, this is the first work to explore and implement parallelized in-network repair with programmable switches. The extensive experiments on the prototype system and real-world datasets indicate that Paint can significantly improve repair performance while effectively reducing bandwidth overhead.
引用
收藏
页码:2888 / 2903
页数:16
相关论文
共 50 条
  • [1] An Efficient Failure Reconstruction Based on In-Network Computing for Erasure-Coded Storage Systems
    Tang Y.
    Wang F.
    Xie Y.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2019, 56 (04): : 767 - 778
  • [2] Fast Repair for Single Failure in Erasure-coded Distributed Storage Systems
    Zhang, Huayu
    Li, Hui
    Zhu, Bing
    Chen, Jun
    2014 IEEE 33RD INTERNATIONAL SYMPOSIUM ON RELIABLE DISTRIBUTED SYSTEMS (SRDS), 2014, : 146 - 151
  • [3] Repair Tree: Fast Repair for Single Failure in Erasure-Coded Distributed Storage Systems
    Zhang, Huayu
    Li, Hui
    Li, Shuo-Yen Robert
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (06) : 1728 - 1739
  • [4] Repair Pipelining for Erasure-Coded Storage
    Li, Runhui
    Li, Xiaolu
    Lee, Patrick P. C.
    Huang, Qun
    2017 USENIX ANNUAL TECHNICAL CONFERENCE (USENIX ATC '17), 2017, : 567 - 579
  • [5] Aggregation Decoding for Multi-Failure Recovery in Erasure-Coded Storage
    Zhang, Jing
    Li, Shanshan
    Liao, Xiangke
    Liu, Xiaodong
    2014 IEEE 17TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE), 2014, : 1326 - 1331
  • [6] Fast Predictive Repair in Erasure-Coded Storage
    Shen, Zhirong
    Li, Xiaolu
    Lee, Patrick P. C.
    2019 49TH ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS (DSN 2019), 2019, : 556 - 567
  • [7] XORInc: Optimizing Data Repair and Update for Erasure-Coded Systems with XOR-Based In-Network Computation
    Wang, Fang
    Tang, Yingjie
    Xie, Yanwen
    Tang, Xuehai
    2019 35TH SYMPOSIUM ON MASS STORAGE SYSTEMS AND TECHNOLOGIES (MSST 2019), 2019, : 244 - 256
  • [8] Repair Pipelining for Erasure-coded Storage: Algorithms and Evaluation
    Li, Xiaolu
    Yang, Zuoru
    Li, Jinhong
    Li, Runhui
    Lee, Patrick P. C.
    Huang, Qun
    Hu, Yuchong
    ACM TRANSACTIONS ON STORAGE, 2021, 17 (02)
  • [9] FullRepair: Towards Optimal Repair Pipelining in Erasure-Coded Clustered Storage Systems
    Zhang, Yuzuo
    Tu, Xinyuan
    Wang, Lin
    Hu, Yuchong
    Wang, Fang
    Wang, Ye
    2023 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING, CLUSTER, 2023, : 107 - 117
  • [10] PivotRepair: Fast Pipelined Repair for Erasure-Coded Hot Storage
    Yao, Qiaori
    Hu, Yuchong
    Tu, Xinyuan
    Lee, Patrick P. C.
    Feng, Dan
    Zhu, Xia
    Zhang, Xiaoyang
    Yao, Zhen
    Wei, Wenjia
    2022 IEEE 42ND INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2022), 2022, : 614 - 624