Constrained Channel Capacity for DNA-Based Data Storage Systems

被引:2
|
作者
Fan, Kaixin [1 ]
Wu, Huaming [1 ]
Yan, Zihui [1 ]
机构
[1] Tianjin Univ, Ctr Appl Math, Tianjin 300072, Peoples R China
基金
国家重点研发计划;
关键词
DNA-based storage systems; constrained channels; channel capacity; CODES;
D O I
10.1109/LCOMM.2022.3212200
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Deoxyribonucleic acid (DNA)-based data storage has grown rapidly due to its advantages with the increase in infrequently large amounts of data. However, when the maximum homopolymer runlength (RLL) of the DNA strand is large and the GC-content is either too high or too low, the DNA synthesis and sequencing processes are prone to substitution, deletion and insertion errors. To reduce errors in DNA synthesis and sequencing, we require that the DNA storage channel satisfies both k-RLL and strong-(l,d)-locally-GC-balanced constraints, where the former refers to the maximum homopolymer runlength in each sequence is at most k, and the latter refers to the number of G and C of every length-(l' >= l) subsequence is bounded between [ (2)/(l') - delta,(2)/(l') + delta]. This constrained channel allows DNA data storage system to be less prone to errors during synthesis and sequencing and improves the success rate of Polymerase Chain Reaction (PCR) amplification. We propose a method to calculate the channel capacity. In particular, we provide a relationship between the 4-ary constrained channel capacity and the 2-ary constrained channel capacity, which makes it simpler to calculate the 4-ary constrained channel capacity.
引用
收藏
页码:70 / 74
页数:5
相关论文
共 50 条
  • [21] Improved Coding Over Sets for DNA-Based Data Storage
    Wei, Hengjia
    Schwartz, Moshe
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2022, 68 (01) : 118 - 129
  • [22] An Epigenetics-Inspired DNA-Based Data Storage System
    Mayer, Clemens
    McInroy, Gordon R.
    Murat, Pierre
    Van Delft, Pieter
    Balasubramanian, Shankar
    ANGEWANDTE CHEMIE-INTERNATIONAL EDITION, 2016, 55 (37) : 11144 - 11148
  • [23] Portable and Error-Free DNA-Based Data Storage
    S. M. Hossein Tabatabaei Yazdi
    Ryan Gabrys
    Olgica Milenkovic
    Scientific Reports, 7
  • [24] Portable and Error-Free DNA-Based Data Storage
    Yazdi, S. M. Hossein Tabatabaei
    Gabrys, Ryan
    Milenkovic, Olgica
    SCIENTIFIC REPORTS, 2017, 7
  • [25] On the Efficient Digital Code Representation in DNA-based Data Storage
    Cevallos, Yesenia
    Tello-Oquendo, Luis
    Inca, Deysi
    Samaniego, Nicolay
    Santillan, Ivone
    Shirazi, Amin Zadeh
    Gomez, Guillermo A.
    PROCEEDINGS OF THE 7TH ACM INTERNATIONAL CONFERENCE ON NANOSCALE COMPUTING AND COMMUNICATION - NANOCOM 2020, 2020,
  • [26] Sequencing Coverage Analysis for Combinatorial DNA-Based Storage Systems
    Preuss, Inbal
    Galili, Ben
    Yakhini, Zohar
    Anavy, Leon
    IEEE TRANSACTIONS ON MOLECULAR BIOLOGICAL AND MULTI-SCALE COMMUNICATIONS, 2024, 10 (02): : 297 - 316
  • [27] Expanding the Molecular Alphabet of DNA-Based Data Storage Systems with Neural Network Nanopore Readout
    Tabatabaei, S. Kasra
    Pham, Bach
    Pan, Chao
    Liu, Jingqian
    Chandak, Shubham
    Shorkey, Spencer A.
    Hernandez, Alvaro G.
    Aksimentiev, Aleksei
    Chen, Min
    Schroeder, Charles M.
    Milenkovic, Olgica
    NANO LETTERS, 2022, 22 (05) : 1905 - 1914
  • [28] DNA-Based Storage of RDF Graph Data: A Futuristic Approach to Data Analytics
    Usmani, Asad
    Wiese, Lena
    IEEE ACCESS, 2023, 11 (129931-129944): : 129931 - 129944
  • [29] Channel Capacity Analysis of DNA-based Molecular Communication with Length Encoding Mechanism
    Xie, Jialin
    Liu, Qiang
    Yang, Kun
    Lin, Lin
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2021, 15 (08): : 2923 - 2943
  • [30] Clover: tree structure-based efficient DNA clustering for DNA-based data storage
    Qu, Guanjin
    Yan, Zihui
    Wu, Huaming
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (05)