An Information Theory for Out-of-Order Media With Applications in DNA Data Storage

被引:0
|
作者
Ravi, Aditya Narayan [1 ]
Vahid, Alireza [2 ]
Shomorony, Ilan [1 ]
机构
[1] Univ Illinois, Elect & Comp Engn Dept, Urbana, IL 61801 USA
[2] Rochester Inst Technol, Elect & Microelect Engn Dept, Rochester, NY 14623 USA
基金
美国国家科学基金会;
关键词
DNA; Sequential analysis; Out of order; Encoding; Codes; Channel capacity; Symbols; Biological information theory; Channel coding; Decoding; DIGITAL INFORMATION; CAPACITY; CODES; CHANNEL; RECONSTRUCTION; ROBUST; BOUNDS;
D O I
10.1109/TMBMC.2024.3403759
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recent advancements in DNA-based storage prototypes focus on encoding information across multiple DNA molecules. This approach utilizes high-throughput sequencing technologies, leading to outputs that are out-of-order. We study the shuffling channel, where input codewords are split into fixed-size fragments. We show that achieving channel capacity uses index-based coding, which assigns unique indices to each fragment. We also introduce two more complex channels, which aim to model popular sequencing strategies in DNA sequencing. In the torn-paper channel, the input codeword is torn up into fragments of random sizes, while in the shotgun sequencing channel, fixed-length random substrings of the input codeword are observed at the output. In both of these channels, the lack of ordering cannot be circumvented by simply adding unique indices to the fragments. We show how the capacity of both of these channels can be achieved using random codes. We introduce and analyze code constructions based on index sequences. While these codes are computationally efficient, they are not capacity-achieving, and we leave the questions of finding efficient capacity-achieving codes for these settings as open problems.
引用
收藏
页码:334 / 348
页数:15
相关论文
共 50 条
  • [1] Recycling Data Slack in Out-of-Order Cores
    Ravi, Gokul Subramanian
    Lipasti, Mikko H.
    2019 25TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA), 2019, : 545 - 557
  • [2] Out-of-order event processing in kinetic data structures
    Abam, Mohammad Ali
    Agarwal, Pankaj K.
    de Berg, Mark
    Yu, Hai
    ALGORITHMS - ESA 2006, PROCEEDINGS, 2006, 4168 : 624 - 635
  • [3] Out-of-Order Event Processing in Kinetic Data Structures
    Mohammad Ali Abam
    Pankaj K. Agarwal
    Mark de Berg
    Hai Yu
    Algorithmica, 2011, 60 : 250 - 273
  • [4] Poster: Generating Reproducible Out-of-Order Data Streams
    Grulich, Philipp M.
    Traub, Jonas
    Bress, Sebastian
    Katsifodimos, Asterios
    Markl, Volker
    Rabl, Tilmann
    DEBS'19: PROCEEDINGS OF THE 13TH ACM INTERNATIONAL CONFERENCE ON DISTRIBUTED AND EVENT-BASED SYSTEMS, 2019, : 256 - 257
  • [5] Out-of-Order Event Processing in Kinetic Data Structures
    Abam, Mohammad Ali
    Agarwal, Pankaj K.
    de Berg, Mark
    Yu, Hai
    ALGORITHMICA, 2011, 60 (02) : 250 - 273
  • [6] Out-Of-Order Execution of Synchronous Data-Flow Networks
    Baudisch, Daniel
    Brandt, Jens
    Schneider, Klaus
    2012 INTERNATIONAL CONFERENCE ON EMBEDDED COMPUTER SYSTEMS (SAMOS): ARCHITECTURES, MODELING AND SIMULATION, 2012, : 168 - 175
  • [7] D-DOG: Securing Sensitive Data in Distributed Storage Space by Data Division and Out-of-order keystream Generation
    Feng, Jun
    Chen, Yu
    Ku, Wei-Shinn
    Su, Zhou
    2010 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2010,
  • [8] DNA Merge-Sort: A Family of Nested Varshamov-Tenengolts Reassembly Codes for Out-of-Order Media
    Nassirpour, Sajjad
    Shomorony, Ilan
    Vahid, Alireza
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2024, 72 (03) : 1303 - 1317
  • [9] Runtime Verification of Temporal Properties over Out-of-Order Data Streams
    Basin, David
    Klaedtke, Felix
    Zalinescu, Eugen
    COMPUTER AIDED VERIFICATION, CAV 2017, PT I, 2017, 10426 : 356 - 376
  • [10] An Improved BP Algorithm over Out-of-order Streams for Big Data
    Wang, Kun
    Zhuo, Linchao
    Lu, Heng
    Guo, Huang
    Xu, Lili
    Zhang, Yuhua
    2013 8TH INTERNATIONAL ICST CONFERENCE ON COMMUNICATIONS AND NETWORKING IN CHINA (CHINACOM), 2013, : 840 - 845