Design and implementation of a Bloom filter-based data deduplication algorithm for efficient data management

被引:2
|
作者
Jang Y.-H. [1 ]
Lee N.-U. [1 ]
Kim H.-J. [1 ]
Park S.-C. [1 ]
机构
[1] Department of IT Convergence Engineering, Gachon University, Seongnam
关键词
Backup data; Bloom filters; Fast identification; Hash value; Removing duplicate data; Source-based deduplication;
D O I
10.1007/s12652-018-0893-1
中图分类号
学科分类号
摘要
Recently, the amount of data being stored has increased dramatically, and the amount of data backed up on servers increases yearly. However, the share of duplicate data in that backup data is also increasing, and because of this, the time spent on duplicate data processing is greatly increasing. Therefore, in this paper, we design a Bloom filter-based data deduplication algorithm for fast identification and removal of duplicate data. The results from evaluation of the implemented algorithm show that execution time is 17% less than with an existing deduplication algorithm. © Springer-Verlag GmbH Germany, part of Springer Nature 2018.
引用
收藏
页码:1387 / 1393
页数:6
相关论文
共 50 条
  • [21] An efficient provable data possession scheme based on counting bloom filter for dynamic data in the cloud storage
    Jung E.
    Jeong J.
    2016, Science and Engineering Research Support Society (11): : 9 - 16
  • [22] Bloom Filter-Based Keyword Search over XML Data in Structured Peer-to-Peer Systems
    He, Weimin
    Lv, Teng
    PROCEEDINGS 2010 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, (ICCSIT 2010), VOL 1, 2010, : 177 - 181
  • [23] A Bloom Filter-based Approach for Efficient MapReduce Query Processing on Ordered Datasets
    Chen, Zhijian
    Wu, Dan
    Xie, Wenyan
    Zeng, Jiazhi
    He, Jian
    Wu, Di
    2013 INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2013, : 93 - 98
  • [24] Forwarding Anomalies in Bloom Filter-based Multicast
    Saerelae, Mikko
    Rothenberg, Christian Esteve
    Aura, Tuomas
    Zahemszky, Andras
    Nikander, Pekka
    Ott, Joerg
    2011 PROCEEDINGS IEEE INFOCOM, 2011, : 2399 - 2407
  • [25] Privacy-Preserving Bloom Filter-Based Keyword Search Over Large Encrypted Cloud Data
    Liang, Yanrong
    Ma, Jianfeng
    Miao, Yinbin
    Kuang, Da
    Meng, Xiangdong
    Deng, Robert H.
    IEEE TRANSACTIONS ON COMPUTERS, 2023, 72 (11) : 3086 - 3098
  • [26] Sum-Up Counting Bloom Filter-Based Name Lookup Method for Named Data Networking
    Wu, Tingting
    Zhang, Lang
    Lei, Jianyun
    Hou, Rui
    Song, Zhongshan
    RECENT ADVANCES IN ELECTRICAL & ELECTRONIC ENGINEERING, 2018, 11 (02) : 176 - 180
  • [27] An efficient secure data deduplication method using radix trie with bloom filter (SDD-RT-BF) in cloud environment
    Ebinazer, Silambarasan Elkana
    Savarimuthu, Nickolas
    Bhanu, Mary Saira S.
    PEER-TO-PEER NETWORKING AND APPLICATIONS, 2021, 14 (04) : 2443 - 2451
  • [28] A survey on the roles of Bloom Filter in implementation of the Named Data Networking
    Nayak, Sabuzima
    Patgiri, Ripon
    Borah, Angana
    COMPUTER NETWORKS, 2021, 196
  • [29] Scaling Bloom filter-based multicast via filter switching
    Tsilopoulos, Christos
    Xylomenos, George
    2013 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2013,
  • [30] Design and Implementation of an Efficient Electronic Bank Management Information System Based Data Warehouse and Data Mining Processing
    Luo, Jia
    Xu, Junping
    Aldosari, Obaid
    Althubiti, Sara A.
    Deebani, Wejdan
    INFORMATION PROCESSING & MANAGEMENT, 2022, 59 (06)