ObjDedup: High-Throughput Object Storage Layer for Backup Systems With Block-Level Deduplication

被引:3
|
作者
Jackowski, Andrzej [1 ]
Slusarczyk, Lukasz [1 ]
Lichota, Krzysztof [1 ]
Welnicki, Michal [1 ]
Wijata, Rafal [1 ]
Kielar, Mateusz [1 ]
Kopec, Tadeusz [1 ]
Dubnicki, Cezary [1 ]
Iwanicki, Konrad [2 ]
机构
[1] LLC 9LivesData, PL-02796 Warsaw, Poland
[2] Univ Warsaw, Fac Math Informat & Mech, PL-00927 Warsaw, Poland
关键词
Metadata; Engines; Throughput; Object recognition; Cloud computing; Aerospace electronics; Quality of service; Backup storage; deduplication; object storage; secondary storage;
D O I
10.1109/TPDS.2023.3250501
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The immense popularity of object storage is also affecting the market of backup. Not only have novel backup solutions emerged that utilize cloud-based object storage as backends, but also support for object storage interfaces is increasingly expected from traditional dedicated backup appliances. This latter trend especially concerns systems with data deduplication, as they can offer compelling gains in storage capacity and throughput. However, such systems have been designed for interfaces and workloads that are markedly different from those encountered in object storage. Notably, they expect data to be written in portions that are orders of magnitude longer than those in the novel object-storage-oriented backup applications. In this light, we contribute twofold. First, contrasting the properties of object storage interfaces with usage patterns from 686 commercial deployments of backup appliances, we identify specific issues an implementation of such an interface has to address to offer adequate performance in a backup system with block-level deduplication. In particular, we show that a major challenge is efficient metadata management. Second, we present distributed data structures and algorithms to handle object metadata in backup systems with block-level deduplication. Subsequently, we implement them as an object storage layer for our HYDRAstor backup system. In comparison to object storage without in-line deduplication, our solution achieves 1.8-3.93x higher write throughput. Compared to object storage on top of a state-of-the-art file-based backup system, it processes 5.26-11.34x more object put operations per time unit.
引用
收藏
页码:2180 / 2197
页数:18
相关论文
共 30 条
  • [21] Systems level high-throughput and multiparametric analyses to elucidate cell death associated molecules involved in Pemphigus Vulgaris
    Cirillo, N.
    Lanza, A.
    Prime, S. S.
    JOURNAL OF INVESTIGATIVE DERMATOLOGY, 2010, 130 : S11 - S11
  • [22] Study of Cross-Contamination in Multi-Chamber PVD Systems used for High-Throughput Seed Layer Deposition
    Carazzetti, Patrik
    Drechsel, Carl
    Rettenmeier, Roland
    Weichart, Jurgen
    Viehweger, Kay
    Strolz, Ewald
    2024 IEEE 10TH ELECTRONICS SYSTEM-INTEGRATION TECHNOLOGY CONFERENCE, ESTC 2024, 2024,
  • [23] Macrocell Builder: IP-Block-Based Design Environment for High-Throughput VLSI Dedicated Digital Signal Processing Systems
    Nacer-Eddine Zergainoh
    Ludovic Tambour
    Pascal Urard
    Ahmed Amine Jerraya
    EURASIP Journal on Advances in Signal Processing, 2006
  • [24] Macrocell builder: IP-block-based design environment for high-throughput VLSI dedicated digital signal processing systems
    Zergainoh, Nacer-Eddine
    Tambour, Ludovic
    Urard, Pascal
    Jerraya, Ahmed Amine
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2006, 2006 (1)
  • [25] An Approach for Systems-Level Understanding of Prostate Cancer from High-Throughput Data Integration to Pathway Modeling and Simulation
    Mobashir, Mohammad
    Turunen, S. Pauliina
    Izhari, Mohammad Asrar
    Ashankyty, Ibraheem Mohammed
    Helleday, Thomas
    Lehti, Kaisa
    CELLS, 2022, 11 (24)
  • [26] Getting the big picture of cell-matrix interactions: High-throughput biomaterial platforms and systems-level measurements
    Lei, Ruoxing
    Kumar, Sanjay
    CURRENT OPINION IN SOLID STATE & MATERIALS SCIENCE, 2020, 24 (06):
  • [27] A novel two layer-integrated microfluidic device for high-throughput yeast proteomic dynamics analysis at the single-cell level
    Chen, Kaiyue
    Rong, Nan
    Wang, Shujing
    Luo, Chunxiong
    INTEGRATIVE BIOLOGY, 2020, 12 (10) : 241 - 249
  • [28] Solving the picker routing problem in multi-block high-level storage systems using metaheuristics
    Alejandro Cano, Jose
    Cortes, Pablo
    Munuzuri, Jesus
    Correa-Espinal, Alexander
    FLEXIBLE SERVICES AND MANUFACTURING JOURNAL, 2023, 35 (02) : 376 - 415
  • [29] Solving the picker routing problem in multi-block high-level storage systems using metaheuristics
    Jose Alejandro Cano
    Pablo Cortés
    Jesús Muñuzuri
    Alexander Correa-Espinal
    Flexible Services and Manufacturing Journal, 2023, 35 : 376 - 415
  • [30] A novel two-layer-integrated microfluidic device for high-throughput yeast proteomic dynamics analysis at the single-cell level (vol 12, pg 241, 2020)
    Chen, Kaiyue
    Rong, Nan
    Wang, Shujing
    Luo, Chunxiong
    INTEGRATIVE BIOLOGY, 2021, 13 (10) : 258 - 258