LEC-Codec: Learning-Based Genome Data Compression

被引:0
|
作者
Sun, Zhenhao [1 ]
Wang, Meng [2 ]
Wang, Shiqi [1 ]
Kwong, Sam [2 ]
机构
[1] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[2] Lingnan Univ, Sch Data Sci, Hong Kong, Peoples R China
关键词
Genomics; Bioinformatics; Encoding; Context modeling; Symbols; Predictive models; Codecs; Computational modeling; Complexity theory; Termination of employment; Data compression; learning-based method; lossless genome compression; non-reference method;
D O I
10.1109/TCBB.2024.3473899
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
In this paper, we propose a Learning-based gEnome Codec (LEC), which is designed for high efficiency and enhanced flexibility. The LEC integrates several advanced technologies, including Group of Bases (GoB) compression, multi-stride coding and bidirectional prediction, all of which are aimed at optimizing the balance between coding complexity and performance in lossless compression. The model applied in our proposed codec is data-driven, based on deep neural networks to infer probabilities for each symbol, enabling fully parallel encoding and decoding with configured complexity for diverse applications. Based upon a set of configurations on compression ratios and inference speed, experimental results show that the proposed method is very efficient in terms of compression performance and provides improved flexibility in real-world applications.
引用
收藏
页码:2447 / 2458
页数:12
相关论文
共 50 条
  • [1] LMDC: Learning a multiple description codec for deep learning-based image compression
    Zhao, Lijun
    Zhang, Jinjing
    Bai, Huihui
    Wang, Anhong
    Zhao, Yao
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (10) : 13889 - 13910
  • [2] LMDC: Learning a multiple description codec for deep learning-based image compression
    Lijun Zhao
    Jinjing Zhang
    Huihui Bai
    Anhong Wang
    Yao Zhao
    Multimedia Tools and Applications, 2022, 81 : 13889 - 13910
  • [3] Complexity-Configurable Learning-based Genome Compression
    Sun, Zhenhao
    Wang, Meng
    Wang, Shiqi
    Kwong, Sam
    2021 PICTURE CODING SYMPOSIUM (PCS), 2021, : 241 - 245
  • [4] A Universal Optimization Framework for Learning-based Image Codec
    Zhao, Jing
    Li, Bin
    Li, Jiahao
    Xiong, Ruiqin
    Lu, Yan
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (01)
  • [5] Editorial for Special Issue on Deep Learning-Based Data Compression
    Gao, Wei
    Wang, Shiqi
    Zhang, Xinfeng
    Kwong, Sam
    APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2024, 13 (06)
  • [6] A Video Dataset for Learning-based Visual Data Compression and Analysis
    Xu, Xiaozhong
    Liu, Shan
    Li, Zeqiang
    2021 INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2021,
  • [7] Learning-based Visual Compression
    Ji, Ruolei
    Karam, Lina J.
    FOUNDATIONS AND TRENDS IN COMPUTER GRAPHICS AND VISION, 2023, 15 (01): : 1 - 112
  • [8] DICTIONARY LEARNING-BASED IMAGE COMPRESSION
    Wang, Hao
    Xia, Yong
    Wang, Zhiyong
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 3235 - 3239
  • [9] Learning-Based Conditional Image Compression
    Shen, Tianma
    Peng, Wen-Hsiao
    Shih, Huang -Chia
    Liu, Ying
    2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,
  • [10] Green Image Codec: A Lightweight Learning-based Image Coding Method
    Wang, Yifan
    Mei, Zhanxuan
    Zhou, Qingyang
    Katsavounidis, Ioannis
    Kuo, C-C Jay
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XLV, 2022, 12226